INTERNATIONAL ENCYCLOPEDIA 
OF TAHE 


SOCIAL SCIENCES 


aur: “es Begga 


Daie 


. 
” 


LAS 


sasener 


i 


GRh 


Ase No. 


International 
Encyclopedia of the 
SO GEAL 
SCLENGES 


Volumes 11 and 12 
Complete and Unabridged 


Associate Editors 


Heinz Eulau, Political Science 
Lloyd A. Fallers, Anthropology 
William H. Kruskal, Statistics 
Gardner Lindzey, Psychology 
Albert Rees, Economics 

Albert J. Reiss, Jr., Sociology 
Edward Shils, Social Thought 


Special Editors 
Elinor G. Barber, Biographies 
John G, Darley, Applied Psychology 
Bert F. Hoselitz, Economic Development 
Clifford T. Morgan, Experimental Psychology 
Robert H. Strotz, Econometrics 


Editorial Staff 


Marjorie A. Bassett, Economics 

P. G. Bock, Political Science 

Robert M. Coen, Econometrics 

J. M. B. Edwards, Sociology 

David S. Gochman, Psychology 
George Lowy, Bibliographies 
Judith M. Tanur, Statistics 

Judith M. Treistman, Anthropology 


Alvin Johnson 


HONORARY EDITOR 


W. Allen Wallis 


CHAIRMAN, EDITORIAL ADVISORY BOARD 


International / 
Encyclopedia of the 
SOCIAL 
SCIENGES 


DAVID L. SILLS EDITOR 
VOLUME 11 


The Macmillan Company & The Free Press, New Yo a 
COLLIER-MACMILLAN PUBLISHERS, LONDO 


N 
O) 


Coryricut © 1968 BY CROWELL COLLIER AND MACMILLAN, INC. 


ALL RIGHTS RESERVED UNDER THE INTERNATIONAL COPYRIGHT UNION, 
THE INTER-AMERICAN COPYRIGHT UNION, AND UNDER THE PAN-AMERICAN 
COPYRIGHT CONVENTIONS. 


NO PART OF THIS BOOK MAY BE REPRODUCED OR TRANSMITTED IN 

ANY FORM OR BY ANY MEANS, ELECTRONIC OR MECHANICAL, INCLUDING 
PHOTOCOPYING, RECORDING, OR BY ANY INFORMATION STORAGE AND 
RETRIEVAL SYSTEM, WITHOUT PERMISSION IN WRITING FROM 
CROWELL COLLIER AND MACMILLAN, INC. 


MANUFACTURED IN THE UNITED STATES OF AMERICA 


REPRINT EDITION 1972 


International 
Encyclopedia of the 
SOCIAL 
SCIENCES 


NADEL, S. F. 


Siegfried Frederick Nadel (1903-1956) was 
born and raised in Austria. His early interests and 
training were in the field of music, but he earned 
his doctoral degree in psychology and philosophy 
at the University of Vienna, studying with Karl 
Bühler and Moritz Schlick. Before he began formal 
training in anthropology, Nadel had already pub- 
lished on the psychology and philosophy of music, 
written a book on musical typology and a biog- 
raphy of F. B. Busoni, produced programs of exotic 
music for Radio Vienna, and toured Czechoslovakia 
with his own opera company. 

In 1932 a fellowship from the Rockefeller Foun- 
dation and the International Institute of African 
Languages and Cultures brought Nadel to the Lon- 
don School of Economics, He studied with Bronis- 
law Malinowski, and with C. G. Seligman until the 
latter left for Africa, and for several years partici- 
pated in Malinowski’s seminar whenever possible. 
In 1934 Nadel began what was to be a career of 
active field work. He worked with Nupe and related 
groups in northern Nigeria between 1934 and 
1936 and among the Nuba of the Anglo—Egyptian 
Sudan between 1938 and 1940. 

A deeply moral man, Nadel wanted to take part 
personally in the destruction of the Nazi social 
system. He enlisted in the Sudan Defense Force in 
1941 and was later transferred to the East African 
Command of the British Army. His army career 
included service in Eritrea from 1942 to 1945 and 
in Tripolitania from 1945 to 1946. In Tripolitania 
he held the office of secretary for native affairs 
in the British Military Administration. He was 
given a gazetted commendation “for outstanding 


service” and achieved the rank of lieutenant 
colonel. 

Research and ethnography. Nadel had mas- 
tered Hausa before he arrived in Nigeria, and with- 
in six months of beginning field work he acquired 
proficiency in the difficult, tonal language of the 
Nupe. His linguistic talents, sociability, psycholog- 
ical insights, and adventurous spirit made him a 
first-rate field worker. 

Nadel’s frequent shifts between ethnography 
and theory enhance the interest of his ethnographic 
writings and the value of his theoretical treatises. 
In A Black Byzantium (1942) he made explicit 
the theoretical basis for the method of ethnographic 
analysis, and in The Nuba (1947) he brilliantly 
summarized a large number of structural constan- 
cies found among ten tribal groups. His paper 
“Witchcraft in Four African Societies” (1952) 
cleverly interlaced ethnography, theory, and the 
method of covariant analysis to arrive at conclu- 
sions about the causes of witchcraft. The paper is 
a model of the comparative method. The challeng- 
ing and elusive concepts that define the world of 
the supernatural are ably handled in Nupe Religion 
(1954). The book contains theoretical discussions 
that add to an understanding of the specific beliefs, 
rituals, and dogmas of the religious life of the 
Nupe. For Nadel, ethnography was not a simple 
listing of social and cultural traits; he was con- 
cerned with the “reality” that existed for the groups 
he studied and with explaining why given bits of 
that reality “hung together.” 

Theoretical writings. Nadel’s major contribu- 
tions to anthropology lie not in ethnography but 
in theory. Although he is frequently referred to as 
a leading representative of British social anthropol- 
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ogy, his work shows varied influences: Malinow- 
ski's and Radcliffe-Brown’s in anthropology; Max 
Weber’s and Parsons’ in sociology; Whitehead’s in 
philosophy; and Köhler’s, Koffka’s, and Lewin’s in 
psychology. He was also familiar with the work of 
a large number of American cultural anthropolo- 
gists. The breadth of Nadel’s theoretical approach 
makes it difficult to identify him with any particu- 
lar school. 

The Foundations of Social Anthropology, the 
first of Nadel’s two major theoretical works, con- 
cerns itself with “the logical premises that under- 
lie our knowledge of societies . . . and with the pre- 
requisites, conceptual and technical, of any enquiry 
meant to lead to this knowledge” (1951, p. v). 
Nadel argued in Foundations that anthropology is 
a science and, as such, must go beyond description 
to explanation. The subject matter of anthropology 
is behavior that is aim-controlled—tasks that have 
“consciousness” and exhibit purpose, Following 
Weber and Parsons, Nadel stipulated that purpose- 
ful actions can be understood in terms of logic, 
cause, and purpose. Moreover, he appears to have 
attributed equality, or parity, to sociological and 
psychological phenomena. For example, he wrote 
that to understand society one needs to think of 
social systems as having an “ulterior purpose” that 
comes from without these systems. “Since our sub- 
ject matter is man-acting-in-society, this ulterior 
purpose can refer only to one of two things: to the 
nature of the human organism and its require- 
ments (or ‘needs’); and to the nature of society 
and its requirements (the ‘necessities of group ex- 
istence’)” (ibid., p. 368). Here, then, the two as- 
pects of ulterior purpose are facts about individuals 
(psychological and organic facts) and facts about 
groups (sociological facts, facts about the needs 
of society). 

Yet Nadel also argued that to arrive at complete 
explanations it is necessary to enter the psycholog- 
ical realms of “consciousness” and “motivation.” 
Although in Foundations Nadel nowhere “proved” 
the priority of psychological causation over socio- 
logical causation, he did apparently believe that 
sociological facts are “emergent” from psycholog- 
ical ones and that the latter therefore have a kind 
of priority. And in his second major work, The 
Theory of Social Structure, he made the following 
statement: “...the interrelation of sub-groups is 
coterminous with the relationships of persons. . . . 
Let me emphasize that in reducing the interrelation 
of sub-groups to the relationships of actors in their 
roles I am not merely trying to simplify the picture 
.., there is no other way to give meaning to the 
conception of interrelations between groups” 


(1957, p. 95). In brief, although the ulterior pur- 
poses of social systems can be considered in terms 
of satisfying group needs, as well as in terms of 
satisfying individual needs, group needs can ulti- 
mately be reduced to the needs of actors in role 
relationships, that is, to the needs of individuals, 

In Foundations Nadel also included close analyt- 
ical treatment of such basic concepts as culture, 
society, institution, group, function, pattern, con- 
figuration, ethos, and eidos. He had gone through 
the difficult task of translating his own raw field 
data into these concepts, and he attempted to pro- 
vide some rationale for his particular mode of 
“translation.” Nadel did not present generally ac- 
cepted viewpoints in this book; rather, he delved 
deeply into the basic conceptual and methodolog- 
ical problems which confront all anthropologists. 

Nadel’s Theory of Social Structure is one of the 
outstanding theoretical works of twentieth-century 
social science. Roles, according to Nadel, are 
“modes of action ‘allocated to individuals’ by the 
norms of society” and lie at the very heart of social- 
structural analysis. To describe social structure, 
one must consider three aspects of roles: the allo- 
cation principles that provide actors with roles; the 
degree to which given roles have authority over 
other roles—the command-over-action aspect; and 
the degree to which given roles command the var- 
ious resources and benefits of the social system. 
In applauding this important contribution to social 
theory, Janowitz has written that Nadel’s formal 
presentation of the core of social structure paral- 
lels “the classical concerns of sociology and politi- 
cal science. Naturally, it remains for the social 
scientist to elaborate these definitions with a va- 
riety of concepts and taxonomies which give sub- 
stantive content to such an approach” (1963, 
p. 152). Such an elaboration would bring the con- 
cept of social structure into the world of empirical 
research, Nadel himself began the task of elabora- 
tion, in his treatment of a role player’s command 
over resources and benefits. Nadel’s social struc- 
tural concepts have been further elaborated by 
Freilich (1964), who has also attempted to opera- 
tionalize them. 


Students of Africa will certainly continue to find 
Nadel’s ethnographic writings of great importance. 
All anthropologists, however, must take into ac- 
count Nadel’s ideas about anthropological research 
and explanation. He raised crucial questions that 
will remain with anthropology for many years: 
whether anthropological analysis should consider 
only one level—cultural, social, or psychological 
—or whether two or more of these levels are appro- 


priate; whether psychological explanations have 
causal priority over sociological ones; whether it is 
possible to do “experimental anthropology”; and 
whether correlational approaches are more useful 
than “systems” analyses. Any professional prepar- 
ing to write on these issues, as well as on the na- 
ture of social structure or on structural analysis, 
must first acquire a thorough familiarity with 
Nadel’s writings. 

Morris FREILICH 


[See also ANTHROPOLOGY, article on SOCIAL ANTHRO- 
POLOGY; SOCIAL STRUCTURE. Other relevant material 
may be found in AFRICAN SOCIETY, article on SUB- 
SAHARAN AFRICA, and in the biographies of Mari- 
NOWSKI; RADCLIFFE-BROWN; SELIGMAN, C. G.] 
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Sir Lewis Namier (1888-1960), British histo- 
rian, was born Ludwik Bernstein at Vola Oksheyska 
in Russian Poland. He was the son of Joseph Bern- 
stein (who later resumed the older family name 
of Namierowski), a lawyer trained at the Univer- 
sity of Warsaw and subsequently a landed proprie- 
tor in Austrian East Galicia. The family had moved 
to Galicia by 1890, and Namier was raised on his 
maternal grandfather’s estate near Tarnopol. His 
parents were of Jewish origin and in Namier’s 
youth they were freethinkers, but during World 
War 1, long after Namier had left home, they be- 
came Roman Catholics. Their style of life was pat- 
terned on that of the Polish landed gentry. Until 
he was ten, Namier frequently attended the Uniat 
(Roman Catholic Eastern Rite) church with a 
nurse, but subsequently he passed the compulsory 
Gymnasium religious examination in Judaism. 
Throughout his life he demonstrated a strong 
Jewish ethnic, even racial—but not religious—con- 
sciousness; yet at the same time he was evidently 
struggling with the problem of group identification. 
(In his first entry in Who's Who, he rather self- 
consciously described himself as “a Russian subject 
by birth, naturalised British; a Jew by race.”) The 
cultural and social ambiguity of his parents’ posi- 
tion, in part repeated in his own life, gives retro- 
spective poignancy to the emphasis in his politics 
and his scholarship upon the importance of social 
and geographical roots. 

After completing his education at the Tarnopol 
Classical Gymnasium in 1906, he studied briefly 
at the University of Lausanne, where he attended 
lectures by Vilfredo Pareto, and at the London 
School of Economics, before entering Balliol Col- 
lege, Oxford, in 1908. At Balliol, Namier initially 
read law under Edward Hillier but soon switched 
to history under Arthur Lionel Smith and F. F. 
Urquhart. The Balliol years were crucial in his de- 
velopment: it was then he discovered English his- 
tory as a field of intensely satisfying if dispassion- 
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ate intellectual interest (quite different from the 
warm engagement he felt in Polish nationalism 
and the Jewish question); then too he began to 
identify strongly with English life: he became an 
Anglican (a full communicant in 1947) and a 
British subject, and in 1910 he changed his name 
to Lewis Bernstein Naymier (later Namier), after 
his family’s older name. In all this acculturation, 
however, he never lost the advantage of the out- 
sider’s point of view, which sometimes makes it 
possible to perceive relationships obscure to the 
native. 

After taking first-class honors in modern history 
in 1911, Namier stayed on at Oxford for another 
year. He shared the Beit prize in imperial history 
for 1912-1913 but failed to win a fellowship (re- 
search appointment) at All Souls College. After a 
visit to his parents in 1913, when he experienced 
at first hand the tensions that preceded the coming 
war, he spent a year in New York on business. 
Visiting in England at the outbreak of the war in 
1914, he concealed his poor eyesight and enlisted 
in the 20th Royal Fusiliers, The deception was dis- 
covered, and he was discharged in 1915. He spent 
the next few years as a civilian in public service: 
in the Propaganda Department from 1915 to 1917 
(his first publications date from these years); in 
the Political Intelligence Bureau of the Department 
of Information from 1917 to 1918; and in the Po- 
litical Intelligence Department of the Foreign Of- 
fice from 1918 to 1920. 

Namier’s return to scholarship involved many 
difficulties. He found that as a temporary lecturer 
at Balliol in 1920-1921 he had little time for his 
own research. On several occasions he was consid- 
ered for fellowships at various Oxford colleges, but 
without success; his extremely direct, even abra- 
sive, personality was usually blamed. After two 
disheartening years in business, 1921-1923, during 
which he represented British firms in Vienna and 
Czechoslovakia, he determined to return to scholar- 
ship. From 1923 to 1929 he devoted himself to re- 
search and writing, without academic affiliation 
and at the cost of considerable personal hardship; 
he was forced to depend in part on gifts and loans 
from friends. These were the most difficult years of 
his life, but they were also the most productive. 
It was during this period that he wrote the two 
great books on which his reputation is primarily 
based: The Structure of Politics at the Accession 
of George III (1929) and England in the Age of 
the American Revolution (1930). His early career 
was neither easy nor characteristic of the histori- 
cal profession, but it was not without its compensa- 
tions. As early as 1928 he could note with charac- 


teristic Olympianness, “Looking back at the time 
in the Army, the Civil Service, and in business, and 
at the influence those years have had on my his- 
torical work, I am able to appreciate Gibbon’s dic- 
tum that ‘the Captain of the Hampshire Grenadiers 
...has not been useless to the historian of the 
Roman Empire’” ([1929] 1957, p. x). 

From his school days, Namier had been deeply 
influenced by Pilsudski’s Polish national socialism 
and was for a long time emotionally engaged in 
the whole nationality struggle in eastern Europe, of 
which “the Jewish question” was seemingly only 
a part. By the early 1920s, however, he reacted to 
the anti-Semitic tone and activity of the new Polish 
state by becoming a deeply committed and active 
Zionist. From 1929 to 1931 he served as political 
secretary to the Jewish Agency for Palestine, the 
recognized Zionist administrative organization un- 
der the League of Nations mandate. His role in 
the Zionist movement was executive, rather than 
policy-making, for he was unconnected with any 
of the mass Zionist organizations and as a “cul- 
tural” or “national” Zionist was suspect to the reli- 
gious Zionists who formed the bulk of the move- 
ment, 

Namier’s most creative years were marked by 
an unusually well-balanced division of interest be- 
tween his political engagement and his scholarly 
commitment. His political engagement found vent 
in Zionist action, his scholarly commitment in writ- 
ing. This impressive compartmentalization broke 
down under the pressure of Hitler's rise and World 
War 1, and Namier shelved his eighteenth-century 
interests for about 15 years; only with the defeat 
of Hitler, the establishment of the state of Israel 
in 1949, and the consequent cooling of his Zionist 
and antifascist passions could he return to his 
Georgian studies. 

Namier was professor of history at the Univer- 
sity of Manchester from 1931 until his retirement 
in 1953, at the age of 65, Except for his Ford 
lectureship at Oxford in 1934, honorific recogni- 
tion came very late: fellow and Raleigh lecturer 
of the British Academy in 1944; Waynflete lecturer, 
Magdalen College, Oxford in 1946-1947; honorary 
fellow of Balliol College in 1948; knighthood in 
1952; Romanes lecturer, Oxford, and Creighton 
lecturer, London, in 1952; honorary doctorates 
(D. Litt.) from Durham in 1953, Oxford in 1955, 
Rome in 1956; an honorary D.c.L. from Oxford in 
1960; an honorary LITT. D. from Cambridge in 
1957; and the Leslie Stephen lectureship in 1958— 
1959. In 1956 a particularly distinguished Fest- 
schrift, edited by Richard Pares and A. J. P. Taylor, 
was presented to him. 


In 1947 Namier married Iulia de Beausobre, a 
Russian émigrée widow (see De Beausobre 1948). 
When he died in 1960, he received an Anglican 
burial. 


Contributions to historiography 


Namier’s career as a historian can be easily 
divided into four periods: (1) the years 1923- 
1930, when he worked exclusively on the eight- 
eenth century and published his two great works; 
(2) his first years at Manchester, when his eight- 
eenth-century work gradually yielded place to an 
increasing concentration on nineteenth-century 
central European history, culminating in 1848: The 
Revolution of the Intellectuals (delivered 1944, 
published 1946); (3) the later Manchester years, 
when he wrote on the diplomatic origins of the 
war of 1939; and (4) the quieter, honor-filled years 
of his retirement, 1953—1960, when he returned 
to the study of the eighteenth century. 

Namier discovered his general interest in Eng- 
lish history while he was at Balliol, but it was only 
gradually that he defined his special field of interest 
and his approach. In 1912, he started to work on 
a project he called “the imperial problem during 
the American revolution.” The topic is character- 
istic of the period when England, Oxford, and par- 
ticularly Balliol College were very imperial-minded. 
In the United States it was the heyday of the 
“imperial school” of North American colonial his- 
torians (G. L. Beer, H. L. Osgood, C. A. Van Tyne, 
C. McL. Andrews). The young Lawrence Gipson 
had returned from Oxford to start his lifework, the 
British Empire Before the American Revolution. At 
first Namier’s research centered primarily on Amer- 
ica, but during his stay in the United States in 
1913-1914, he met Andrews, who told him, “On 
this side, there are ever so many of us doing the 
work; why do you not contribute something from 
your own side?” Only slowly, however, did Namier’s 
attention move from America to England and from 
the amorphous “imperial problem” to the precise 
delineation of “the House of Commons which, in 
so ill-fated a manner, undertook the work of pre- 
serving the First British Empire.” 

The structure of parliaments. The problem 
Namier set himself was to explain how the eight- 
eenth-century parliaments, in trying to meet im- 
perial problems, lost the empire, But his documen- 
tary evidence revealed that the House of Commons 
was not, in fact, trying to meet imperial problems. 
It lacked a comprehensive collective identity or 
policy; instead, it was a collection of factions, of 
local and family interests. Few of these factions or 
interests had any great consciousness of empire or 
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imperial needs; most narrowly pursued their paro- 
chial or egotistical goals. Struggling with evidence 
of great volume and complexity, Namier, the ro- 
mantic conservative, dispassionately reconstructed 
a vanished social organism; Namier, the admirer 
of a landed patriciate, showed how negative was 
the role of the independent landed gentry in the 
eighteenth century; finally, Namier, the passionate 
Burkean, coldly revealed the ineptness of king and 
lords. 

Namier’s greatest success came in structural 
analysis. He related Parliament to its social con- 
text. He considered elections a register of social 
weight and analyzed the House of Commons that 
resulted from the electoral process for its economic 
and social composition, for geographic and family 
connections, and even for less obvious ties of 
friendship or dependence that brought one man 
under the influence of another. Finally, he showed 
that the traditional labels of Whig and Tory were 
relatively meaningless, compared with the stronger 
bonds of the family-based “connection.” 

Namier was less successful as a writer of sus- 
tained narrative. England in the Age of the Amer- 
ican Revolution is a collection of special studies on 
the political behavior of individuals and groups 
during the confused years 1760-1762, when the 
wartime Pitt-Newcastle coalition gave way to the 
peace ministry of Lord Bute; it is not, however, 
a comprehensive history of even those two years. 
Namier was so impatient of post hoc ergo propter 
hoc generalizations, so interested in the micro- 
scopic analysis of the political process, that he was 
little attracted to the grand narrative of the Ma- 
caulay tradition. 

Namier’s emphasis on quantification and on 
studying political behavior rather than constitu- 
tional theory probably made him the single most 
important influence in the introduction of modern 
social science methodology into British political 
history. His critics (usually to the left of him po- 
litically) have accused him of “taking the mind 
out of history,” that is, of failing to ascribe suffi- 
cient importance to rational purposiveness and 
idealism in individual behavior and to formal ide- 
ology in group behavior. Namier never denied 
these charges very strenuously. Most mass political 
action was to him, indeed, irrational. He became 
increasingly fascinated by the problem of irrational 
individual behavior as well, and he elegantly ana- 
lyzed such behavior in his essays “King George m1” 
and “Charles Townshend” (Crossroads of Power, 
pp. 124-140, 194-212). 

Diplomatic history. By the late 1930s, the im- 
pact of Hitler and of the coming world war had 
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deflected Namier from his Georgian studies to more 
modern central European history. Deeply dis- 
turbed by Hitlerite anti-Semitism, Namier seemed 
to be asking himself, “Where did Germany go 
wrong?” The most important work of this phase 
of his study was his 1944 Raleigh lecture, 1848: 
The Revolution of the Intellectuals, a perceptive if 
somewhat unbalanced essay, in which he charac- 
teristically exposed the lack of consistency between 
the formal ideology of the 1848 revolutionaries 
(particularly in central Europe) and their national 
and social prejudices and commitments. 

His important works on the diplomacy of the 
1930s were also controversial. Shortly after the 
outbreak of war in 1939, he began a searching 
analysis of the documentary collections that the 
different foreign powers published to justify their 
conduct. The resulting essays were published in 
the Political Quarterly during the war years, and 
the research that went into them led Namier into 
a major project: a diplomatic history of the origins 
of the war. He combined the wartime essays with 
new material in Diplomatic Prelude: 1938-1939 
(1948). The postwar flood of memoirs, war trial 
testimony, and documentary publications was too 
great to assimilate into this first attempt at a gen- 
eral history; several examples of the new evidence 
were, however, subjected to his characteristically 
rigorous scrutiny in a series of essays appended to 
the book. Although at one time Namier hoped to in- 
corporate these findings and subsequent analyses 
of new publications into a revised edition of Diplo- 
matic Prelude, he chose ultimately to return instead 
to parliamentary history. Before carrying out that 
decision, however, he contributed to diplomatic 
studies a series of penetrating essays, analytical 
reviews, and detailed commentaries in which he 
dissected various new publications. The best of 
these were collected in Europe in Decay (1950) 
and In the Nazi Era (1952a). In some ways these 
volumes provide the same sort of critical survey 
of the origins of World War 1 that the successive 
editions of G. P. Gooch’s Recent Revelations of Eu- 
ropean Diplomacy provide for the origins of World 
War 1. 

Namier’s contribution to recent diplomatic his- 
tory is fourfold. First, through personal interviews 
and correspondence and by revealing and publish- 
ing otherwise inaccessible documents, he added to 
the available evidence. Second, by his methodo- 
logically rigorous and sophisticated analysis, he 
demonstrated that it was possible to provide a 
scholarly, viable explanation of the events leading 
up to the war, even from limited data, His sharp 
perception suggested many tentative conclusions 


since substantiated by new evidence. In spite of 
some oversimplification, Diplomatic Prelude re- 
mains the starting point for the subject. Third, 
his generally caustic but always helpful reviews 
of, for example, French and German memoirs are 
of permanent use. By revealing the clumsy dis- 
tortions, contradictions, and fabrications, as well 
as the commoner sorts of inaccuracies, Namier 
attempted to warn the serious reader against both 
current apologists and future myth makers. Fourth, 
Namier’s work has had great influence in shaping 
the attitude of a whole generation of historians 
toward the diplomatic origins of the war. Not all 
share his severe judgments, e.g., his denunciation 
of appeasement. Yet his combination of meticulous 
dissection of evidence with a firm moral judgment 
has provided a secure conceptual framework for 
viewing the whole that is used even by those who 
come to some different conclusions. 

The “History of Parliament.” In his retirement 
years, 1953-1960, Namier returned to the eight- 
eenth century and served on the editorial board of 
the “History of Parliament” project. It was largely 
his advocacy that moved the British government to 
give considerable financial support to the compila- 
tion and publication of a biographical dictionary 
of the parliaments of England from the Middle 
Ages to 1832. (This project, first suggested by him 
in 1928, had been started by Col. Josiah Wedgwood 
with private support in the 1930s, but was stopped 
by the war.) The biographies for each forty- or 
fifty-year period are to be accompanied by an intro- 
ductory volume that statistically analyzes and 
interprets the social, economic, and political char- 
acter of the parliamentary membership and its 
voting record. When completed, the vast undertak- 
ing should become the primary tool for writing the 
political history of England. 

At the time of Namier’s death, the biographies 
for the period 1754—1790—for which he himself 
had assumed editorial responsibility—were sub- 
stantially completed; the accompanying interpreta- 
tive material was later prepared for publication by 
John Brooke (Namier & Brooke 1964a). If com- 
pleted substantially as planned, the “History of 
Parliament” will undoubtedly be the most lasting 
monument to Namier’s intellectual contribution to 
British historiography: that is, that the political 
historian should ask questions about status, in- 
terest, and behavior that can be answered only by 
minute observation and precise quantification. 


Namier’s work in modern European history has 
failed to produce a lastingly definable school; 
whether his more important eighteenth-century 


work will do so remains to be seen. His work was 
not in harmony with the ideological bent of the 
1930s; in the twenty years following the appear- 
ance of the Structure of Politics in 1929, only 
Robert Walcott’s work was directly influenced by 
Namier, although J. L. Neal was doing independ- 
ent analogous work on the sixteenth century. Not 
until the 1950s did Namier’s influence become 
visibly widespread in the work of John B. Owen, 
John Brooke, Jan Christie, Gerrit P. Judd 1y, 
Douglas Brunton, Donald H. Pennington, and 
Mary Freer Keeler, and more tangentially in that 
of Lucy S. Sutherland, Richard Pares, Norman 
Gash, H. J. Hanham, and others. Whether or not 
this influence will persist will depend largely on 
the fate of the “History of Parliament.” 


Jacos M. PRICE AND GERHARD L. WEINBERG 


[See also History.] 
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(1930) 1962 England in the Age of the American Revo- 
lution. 2d ed. New York: St. Martins. > A paperback 
edition was published in 1961. 
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NATION 


The nation has come to be accepted as the cen- 
tral political concept of recent times. In prevailing 
usage in English and other languages, a “nation” 
is either synonymous with a state or its inhabitants, 
or else it denotes a human group bound together 
by common solidarity—a group whose members 
place loyalty to the group as a whole over any con- 
flicting loyalties. This latter definition was first 
proposed by John Stuart Mill, except that he called 
the concept “nationality.” “A portion of mankind 
may be said to constitute a Nationality,” Mill wrote, 
“if they are united among themselves by common 
sympathies, which do not exist between them and 
any others—which make ‘them co-operate with 
each other more willingly than with other people, 
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desire to be under the same government, and de- 
sire that it should be government by themselves, 
or a portion of themselves, exclusively” (1861, 
chapter 16). 

The national ideal owes its universality to two 
historic movements: first, the French Revolution 
and the Napoleonic wars, which spread this ideal 
from France to other European countries, notably 
Germany, Italy, Spain, and Russia; second, the 
anticolonial movement of the twentieth century, 
which extended it to Asia, Africa, and other con- 
tinents. Because of the wide currency of the 
term “nation,” its meaning has become blurred, 
much as has that of the equally ubiquitous term 
“democracy.” In the nineteenth century some theo- 
rists (e.g., Mazzini, Mill, Renan) espoused the 
national ideal, and others (e.g., Acton) rejected it. 
In the twentieth century there have been fewer 
consistent critics (e.g., Toynbee, Kedourie) and 
advocates of supranational organization; most 
writers have taken the national ideal for granted 
while debating the “true” definition of the nation 
or the “real” meaning of nationalism. In short, the 
nation has become not only a key concept of po- 
litical theory but also a favorite weapon of political 
polemics. 


Development of the concept 


The word “nation” stems from the Latin verb 
nasci, “to be born,” and originally meant a group 
of people born in the same place, whether that 
place was thought of as a few dozen or many 
thousands of square miles. In the European uni- 
versities of the late Middle Ages, “nations” were 
groups of students who came from the same region 
or country. Somewhat later, a primary and a sec- 
ondary meaning evolved, political usage adopting 
the former and legal usage the latter. To French 
radical writers in the eighteenth century a nation 
meant the people of a given country, without dis- 
tinction of rank and often in contrast to the ruling 
monarch, In 1789-1793 the three French estates 
merged in the National Assembly, abolished the 
economic and political prerogatives of noblemen 
and clergy, transformed the monarchy into a re- 
public, and replaced the historic mosaic of regions 
and provinces with arbitrarily drawn départements. 
Nation was the slogan of champions of constitu- 
tionalism, secularism, equality, and centralization 
—of those who wished to modernize society and to 
rationalize its administrative structure. Whatever 
the word “nation” had meant to earlier genera- 
tions, its future meaning was profoundly affected 
by these revolutionary policies, In twentieth-cen- 
tury Asia, Africa, and Latin America, as we shall 


see, this link between the ideals of modernization 
and of nationhood is even closer. 

In its secondary usage, “nation” meant a strange 
people. The King James Bible distinguished be- 
tween the “people” of Israel and the “nations” of 
gentiles. English colonists in North America spoke 
of the Sioux, Cherokee, and other indigenous 
groups as “nations” of Indians, whereas Europeans 
of a later era (when nation had become associated 
with the modern nation-state) called corresponding 
African groups “tribes.” Similarly, sixteenth-cen- 
tury English lawyers translated the Latin phrase 
ius gentium as “law of nations,” for which Jeremy 
Bentham in 1780 substituted “international law.” 

The two meanings at first remained distinct 
enough. When political orators of the late eight- 
eenth century invoked the nation, they meant 
the people as supporters of popular government, 
whether in a sovereign state or in one of its sub- 
divisions, Palmer (1959-1964, vol. 1, p. 472) 
quotes such an appeal to the “nation” of Artois. 
To lawyers and diplomats, by contrast, a nation 
was any sovereign state, whatever its form of 
government. The victory of popular government 
in Europe in the period from 1789 to 1918 led to 
a blending of the two meanings, for nations now 
were proclaimed sovereign, But what if the boun- 
daries or indeed the existence of a state did not 
conform to the people’s wishes? This was the 
judgment expressed by Fichte when he appealed 
to a “German nation” over the heads of several 
dozen princes, by Mazzini when he advocated 
Italian unity, and by eastern European nationalists 
who plotted the downfall of the multilingual Otto- 
man, Hapsburg, and Romanov empires. 

Popular, or national, government thus threat- 
ened the legitimacy not only of monarchs but of 
their very realms. Mazzini, generalizing from the 
Italian example and steeped in the optimistic 
liberalism of the mid-nineteenth century, thought 
the answer to the question “Who are members of 
what nation?” obvious in all cases, and he re- 
mained confident that the nations of Europe could 
live in divinely ordained harmony once that answer 
was acknowledged. A similar optimism was im- 
plicit in Wilson’s proclamation of the principle of 
national self-determination. The peacemakers of 
1919 tried to apply this Mazzinian—Wilsonian doc- 
trine to eastern Europe; yet by 1941 all the newly 
created states had succumbed to Nazi German or 
Soviet Russian conquest. Clearly the redrawing of 
boundaries along national lines had held out no 
panacea for the ills of world politics. Where 
Mazzini and his contemporaries had blamed wars 
on the suppression of nationality, Western liberals 


a century later were more likely to blame them 
on an excess of nationalism. 

The League of Nations, including such coun- 
tries as China, Thailand, Ethiopia, Liberia, the 
Soviet Union, and Iraq, officially extended the term 
“nation” to all sovereign states inside and outside 
of Europe regardless of their form of government 
or the character of their social development. Mean- 
while, communist theorists and political leaders 
in non-European countries contributed to the same 
expansion of usage. Marx had rejected the nation 
as a bourgeois ideal likely to imperil the interna- 
tional solidarity of the proletariat. But later Marx- 
ists, such as Otto Bauer and Lenin, who witnessed 
the force of nationalism in Austria-Hungary and 
in the Russian Empire, accepted the idea. The 
formation of the Soviet Union in 1921 as a federa- 
tion of national republics, each dominated by its 
Bolshevik party, demonstrates this acceptance. 
Moreover, beginning with the Congress of the 
Peoples of the East at Baku in 1920, Russian 
communists tried to effect an alliance with the 
nationalist movements in colonial countries—an 
effort that led a generation later to the labeling 
of communist-supported guerrilla movements as 
“wars of national liberation,” 

Formation of the National Congress in India 
in 1885 and proclamation of the “National Pact” 
in Turkey in 1920 and of the National Revolu- 
tionary Party in Mexico in 1929 were landmarks 
in the spread of the national ideal to Asia, Africa, 
and Latin America. In India the aim was the modi- 
fication and later the end of colonial rule. In 
Turkey it was preservation of sovereignty for the 
rump of the defeated Ottoman Empire. In Mexico 
the goals were political unity after two decades 
of civil war and exclusion of United States po- 
litical and economic influences, But far more than 
political goals were implied: the fostering of 
cottage industries and the lowering of caste bar- 
riers in India, the adoption of secular law codes 
and a Latin script in Turkey, redistribution of land 
and a new pride in the Aztec heritage in Mexico. 
All three movements were examples of “compre- 
hensive-nationalist parties,” that is, of political 
organizations dedicated to external independence, 
internal unity, mass education, economic develop- 
ment, and secularization—to “a hastening of the 
process of modernization of which the rise of 
national consciousness itself is but one facet” 
(Rustow 1960, p. 400). 

The ideals of modernity and of nationhood are 
embodied today in many proclamations of the 
United Nations, an organization that has come 
close to achieving the universality already implicit 
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in the League of Nations. By virtue of its member- 
ship in the United Nations, a sovereign country 
as large as India or as small as the Maldive Islands 
may be considered a nation. Such usage does not 
simply represent a victory of the secondary-legal 
over the primary-political meaning; it also ex- 
presses the assumption (or the hope or pretense) 
that all member states are, or eventually will be, 
ruled according to popular principles, that nations 
in the legal sense will also be “nationalities” as 
defined by Mill. 


Elements of nationhood 


Nationalist writers have done little to clarify 
what they mean by nation or to explain how 
nations have come into existence. They have gen- 
erated more heat than light, and talk about the 
nation has been most emphatic where the sense 
of nationality was least developed, for example, 
among nineteenth-century Germans and Italians 
and twentieth-century Arabs and Africans, rather 
than among Englishmen, Frenchmen, or Japanese. 

Definition versus explanation, Scholars have 
added further ambiguities, notably by distinguish- 
ing “objective characteristics” of nationhood (e.g., 
geography, history, economic structure ) from “sub- 
jective characteristics” (e.g,, consciousness, loyalty, 
will)—a distinction that carries echoes both from 
German metaphysics and Marxist sociology. (See, 
for example, Wright 1942, vol. 2, p. 992.) In fact, 
the so-called subjective formulations are usually 
genuine attempts at definition, whereas the “ob- 
jective definitions” are generally more or less 
adequate attempts at explanation. For common 
language, common history, prolonged self-govern- 
ment, and other circumstances are likely to pro- 
mote feelings of nationality, but they are not 
among the defining characteristics of a nation. 
The Swiss are a nation although they speak three 
or four different languages; Israel is a nation even 
though Jewish communities had very distinctive 
histories in two millennia of the Diaspora; and the 
Poles remained a nation through a century and a 
half of partition. 

Nationhood as a variable. Even when defini- 
tion and explanation are separated, several diffi- 
culties remain. One is that nationhood, like any 
form of loyalty, is a matter of degree—a given 
people at a given time may be more or less a 
nation, while none fully approximates the ideal 
type. Moreover, a state’s boundaries may not coin- 
cide with the limits of national self-consciousness: 
the state may include ethnic minorities that do 
not feel the same national allegiance and exclude 
national groups beyond the borders (i.e., irreden- 
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tas) that do. Taking both variables into account, 
it may be easy enough to agree intuitively that 
Malaysia today is less of a nation-state than Al- 
geria, Algeria less than Turkey, and Turkey less 
than Sweden. Such assessments ultimately imply 
a series of predictions that under certain condi- 
tions loyalty will prevail, under others break down; 
hence many different measures might be devised, 
their accuracy depending on our foresight. Yet 
once nationality is acknowledged to be matter of 
degree, there is little profit in drawing a rigid line 
across the continuous spectrum and insisting that 
all peoples above that line are nations and those 
below are not. 

The leaders of a new nation quite naturally tend 
to assign to their people a higher rank on such a 
scale of nationhood than it may in fact merit; 
indeed they often assume the existence of a nation 
that is still to be created—as did Jinnah after 1940 
in his campaign for a Pakistan or Fichte in his 
Addresses to the German Nation (1808). Nation- 
alism (i.e., the desire to form or maintain a 
nation-state) has thus often preceded the emer- 
gence of a nation—or rather its ascent toward 
more perfect nationhood; yet the early nationalist 
will in all sincerity see in his program an effect 
rather than a cause of the existence of his nation. 

Like any other loyalty, national loyalty once 
formed may change its point of attachment, or it 
may dissipate; and several loyalties may conflict 
at a given time. Loyalty to the German and Italian 
nations began to supersede other, narrower loyal- 
ties in the nineteenth century. In Britain today no 
serious conflict is generally felt between a wider 
British and a more particular English, Welsh, or 
Scottish nationality; in Ireland the conflict be- 
came irreconcilable between 1846 and 1921. Since 
World War 11, common European loyalties have 
begun to compete with the national allegiances of 
the past, and to that extent the European countries 
are losing their character as nations; if the process 
should continue, our descendants may one day be 
able to speak of a European nation. Outside of 
Europe allegiances, in the absence of traditions of 
self-government, are often even more fluid. In 
1919 an Egyptian leader at the Paris Peace Con- 
ference curtly told a Syrian delegation that he 
could not be troubled about their “Arab” problems; 
but to Nasser’s generation the Egyptian problem 
itself has become an Arab problem. On the other 

. hand, in Syria today contemporary nationalists are 
caught in an intense and, as it were, concentric 
conflict of loyalties: to Syria in her present borders; 
to a Greater Syria including Lebanon, Jordan, and 
hopefully Palestine; to a Fertile Crescent commu- 


nity including Iraq as well; and to an Arab linguistic 
nationality from Morocco to Oman. [See LoyaLty.] 

Territorial identity. The modern nation-state 
is one particular form of the territorial state. The 
size of the territory must reconcile the imperatives 
of unity, which impose upper limits, with the re- 
quirements of a modern division of labor, which 
impose lower limits. But even within such a middle 
range there are innumerable ways in which boun- 
daries may be drawn, and it is in this context that 
the factors invoked by politicians as warrants of 
nationality or by scholars as “objective” charac- 
teristics of a nation may be examined. Geography, 
history, language, popular will, all these are sig- 
nificant in the formation of national identity, 
though not in the way it has often been assumed. 

Geography. The “natural frontier” clearly is a 
polemic rather than a scientific concept. The Pyre- 
nees have long divided Frenchmen from Spaniards, 
but the Alps have helped make the Swiss into a 
nation; portions of the Rhine Valley have at times 
marked off Frenchmen from Germans, but the Nile 
is the basis of Egypt's unity; and whereas insularity 
has helped preserve Japan’s distinctiveness it did 
not protect Britons from Anglo-Saxon, Danish, and 
Norman invasions—each of which made its con- 
tribution to British nationality. It is not mountains, 
valleys, or islands that constitute nations, it is their 
human inhabitants. [See GEOGRAPHY, article on 
POLITICAL GEOGRAPHY.] 

History. History is no less equivocal. In Europe 
there are few areas that have not changed political 
control many times over the centuries and to which 
three or four “historic claims” could not be readily 
contrived. In any case, nationalist attitudes to his- 
tory should not be taken at face value. History 
often serves as a reservoir of symbols from which 
nationalists instinctively select what suits their par- 
ticular purpose. For all their historical romanti- 
cism, they are usually straining for a break with 
their society's immediate past of dynastic frag- 
mentation or foreign subjection. Hence the glories 
of a remote past (real or mythical) become their 
allies against the recent past in the struggle for 
a better future. “To forget and—I will venture 
to say—to get one’s history wrong, are essential 
factors in the making of a nation” (Renan [1882] 
1939, p. 190). They generally will find those 
symbols most suitable that are respected among 
modern nations, that unite rather than divide their 
nation within the boundaries they propose, and to 
which their followers have a better claim than 
their rivals or antagonists. The Pharaohs, the 
Phoenicians, and the Assyrians held a natural 
appeal for Egyptian, Lebanese, and Iraqi nation- 


alists; Pan-Arabs, however, felt drawn to the early 
Caliphate; and promoters of the Egyptian—Syrian 
union of 1958-1961 were attracted to Saladin 
(who ruled the same two countries). Turkish secu- 
lar nationalists could not glorify Ottoman history 
while deposing the sultan, nor could they acknowl- 
edge their Byzantine legacy while battling Greek 
armies; central Asian motifs would have encour- 
aged a politically futile Pan-Turkism; but the 
Hittites in Anatolia seemed an ideal symbol as long 
as they could be alleged to be of Turkish ancestry 
(Lewis 1961, pp. 362 ff.). 

Language. No less a scholar than Toynbee has 
denounced the attempt to find “the criterion of 
Nationality in the shibboleth of Language” (1934— 
1961, vol. 8, p. 536); yet in the heyday of Eu- 
ropean nationalism language was more frequently 
invoked than any other criterion. Unlike geog- 
raphy, language is a human phenomenon; unlike 
history, which is continuous and may mean many 
things to many men, language does divide human 
beings into distinct groups. Language, moreover, 
is closely related to modernization: modernity 
means interdependence, and in modern societies 
more people talk and write to more others than 
ever before. Still, language is not an adequate cri- 
terion for nationality for several reasons. Language 
is not fixed, and politics shapes language as much 
as language shapes politics. European linguistic 
lines, for example, largely reflect the dynastic 
borders of the tenth to the fifteenth centuries. In 
many other parts of the world, language areas are 
either too small (tropical Africa) or too large 
(Latin America, the Arab Middle East) to provide 
a suitable setting for modern nation-states. If the 
linguistic map of the world were compared with 
political boundaries of the 1960s, it would turn out 
that the two coincide even roughly in only two 
dozen countries, the majority of them in Europe. 
In nearly half the countries of the world less than 
70 per cent of the population speak the same lan- 
guage, and in one out of four there is no linguistic 
majority. Throughout most of the world, if present 
states are to become nations, either linguistic 
identity will have to be consciously fostered or else 
some different basis of nationality must be found. 

Popular will. The last traditional determinant 
of nationality may be called the plebiscitary prin- 
ciple; it is suggested by Renan’s well-known 
metaphor of the existence of a nation being “un 
plébiscite de tous les jours.” As a practical expedi- 
ent, however, plebiscites can determine national 
boundaries only in marginal situations, and even 
then the choice needs to be defined, and the re- 
sult enforced, either by the common consent of 
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pre-existing neighboring states or else by a pre- 
dominant concert of outside powers. As Sir Ivor 
Jennings has written, “On the surface it seemed 
reasonable: let the people decide. It was in fact 
ridiculous because the people cannot decide until 
somebody decides who are the people” (1956, 
P. 56; cf. Kedourie 1960, pp. 125 ff.). 


The growth of nations 


The conventional search for determinants of 
nationality in history, geography, language, and 
popular will has posed as many logical problems 
as it has resolved. A more hopeful approach is 
represented by recent theories which connect the 
phenomenon of the nation with such processes as 
“modernization” and “social communication.” 

Modernization. Modernization may be defined 
as “expanding control over nature through closer 
interaction among men” (for similar formulations 
cf. Black 1966, pp. 1-9). As a sustained histori- 
cal process, modernization began in Renaissance 
Europe and spread to other continents mainly as 
a result of the European impact upon them. 
Although the nation and modernization have sepa- 
rate historical origins, their subsequent asspciation 
is more than accidental. 

The term “nation” has been most commonly 
applied to European peoples since the end of the 
Middle Ages and to others that came under their 
influence, but national consciousness as defined 
by Mill has existed in all parts of the world and 
in all eras—among peoples long subject to foreign 
rule, such as Poles, Finns, and Irishmen, among 
the conquering Arab tribes of the seventh century 
A.D., among many African ethnic groups before the 
European conquest. Modernization began in Eu- 
rope, in the Romanov and Ottoman empires, and 
in many of the colonial realms long before the 
advent of national consciousness; but continued 
modernization almost everywhere produced a 
wholesale realignment of borders and loyalties. 
Traditional feudal systems proved too cumbersome, 
dynastic and colonial empires too heterogeneous, 
and traditional tribes, principalities, towns, and 
villages too small. By contrast, nation-states of in- 
termediate size provided a political framework for 
equality of opportunity, for comprehensive division 
of labor, and hence for modern science and in- 
dustry. Conversely, premodern nations that made 
contact with an outside modernizing world could 
retain the loyalty of their members only by adopt- 
ing policies of modernization. Where nations had 
been rather the exception in traditional times, they 
became the universal ideal of the present era. In 
Europe itself, and more especially in Asia, Africa, 
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and Latin America, nationalism and the drive for 
modernity are today two facets of the same social, 
cultural, and political revolution. [See MODERNIZA- 
TION.] 

Social communication. The term “social com- 
munication” is elaborated primarily in the works 
of Karl Deutsch (1953; Deutsch et al. 1957), whose 
major premise would be obvious if it had not been 
so consistently ignored by the classic nationalist 
writers, Nationality is not an inborn characteristic 
but the result of a process of social learning and 
habit forming. Such learning in Europe and North 
America typically came about through the growth 
of an intense and continuous network of social 
communication (that is, of trade, travel, corre- 
spondence, and the like)—a network linking a 
number of nearby cities, each with its rural hinter- 
land. In short, a new style of life resulted from a 
process that Deutsch calls “social mobilization,” 
which is nothing but the social aspect of modern- 
ization. An outside challenge to that new way of 
life and the advent of a new generation then acted 
as catalysts in shaping a political consciousness 
of nationality. 

The process of political integration, according 
to Deutsch, usually began in the prenationalist or 
the nationalist era with the emergence of a core 
area—an area where administrative and economic 
capabilities exceeded the load of political demands. 
Tf a durable nation-state is to form around a core 
area (such as the Île-de-France, Prussia, or Pied- 
mont), there must be a compensating flow of ad- 
vantages and sacrifices linking the various regions, 
and rival plans of integration must be eliminated, 
often by force. The formation of political coalitions 
cutting across both classes and regions and the 
prospect of mutual economic advantages are help- 
ful in securing the final result. 

The nation and political tradition. Deutsch’s 
analysis here converges with the findings of Kohn 
(1944) and other, earlier students of nationalism. 
The sense of nationality in western and northern 
Europe that emerged in the eighteenth century was 
securely based on a continuity of political rule by 
dynasties and governing elites going back to the 
Middle Ages. Where nationality was not thus an- 
chored in political tradition, as in most of central 
and eastern Europe, the sense of nationality has 
been precarious, and hence, by a familiar dialectic, 
more aggressive. [See NATIONALISM] 

The same distinction applies outside of Europe. 
Generally, in postcolonial states the degree of po- 
litical integration has been roughly proportionate 
to the number of indigenous administrators and 


politicians trained under the colonial regime. Not 
only the political elites, but the very boundaries 
of the new states—that is, their prospective na- 
tional identities—are a colonial legacy. The British 
in India yielded to demands for a separate Pakistan 
but did not effectively support any separatist claims 
of the princely states. The French, eager to keep 
Africans within a French overseas community in 
the 1950s, divided West and Equatorial Africa into 
14 separate territories and confirmed their decision 
when they gave them independence in 1960. The 
British, by contrast, worked out a plan of federa- 
tion for independent Nigeria. The Belgians con- 
ferred independence on the Congo as a whole, but 
on Rwanda and Burundi separately. Except for the 
formation of Tanzania, the African boundaries es- 
tablished at the time of independence have re- 
mained virtually intact. The reason is not love 
for the former colonial rulers or the linguistic, 
geographic, or economic logic of the boundaries 
themselves; rather, in a continent poor in political 
tradition, these boundaries represent what little 
tradition there is. 

In western Europe it was mainly the monarchs 
of a prenational era who decided who their sub- 
jects were to be; in Africa and Asia that decision, 
by default and necessity, has been left to the ex- 
colonial powers. If the Latin American experience 
has any relevance, these postcolonial boundaries 
may be expected to endure well into the future. 
For whatever the periodic spells of anarchy within 
individual Latin American countries, the boun- 
daries between them have undergone few changes 
in the last century and a half. Broadly, they still 
reflect those of the territorial divisions under 
Spanish colonial rule. 

Only within a stable framework of geographic 
identity can the other elements of a modern nation- 
state be assembled. These include the development 
of a body of public servants recruited from, and 
responsive to, all major social and regional groups; 
the growth of a pattern of political organization, 
such as political parties, and functional associa- 
tions, such as trade unions and professional so- 
cieties; and a widespread habit of political par- 
ticipation through voting in elections, the reading 
of newspapers, and the like. These political ac- 
complishments, in turn, are closely linked to other 
aspects of modernization, such as growing social 
equality, social and geographic mobility, the ex- 
pansion of a secular and public system of educa- 
tion, an increase in economic productivity, and 
(under certain conditions) the formation of a na- 
tional language. [See POLITICAL CULTURE.] 


Nation building. Many of the developments 
just listed can be encouraged or accelerated by 
suitable measures of public policy. Recent writers, 
particularly in the United States (Bendix 1964; 
Deutsch & Foltz 1963; Pye 1962), have given cur- 
rency to the concept of “nation building,” which 
may be thought of as the sum total of such 
policies, It is true that there are times when the 
founders of states are highly conscious of their 
architectonic task; and our own era, when so many 
newly proclaimed states are eager to ascend to 
the foremost ranks of modernity, is one of those 
periods, But even a Washington, a Bismarck, an 
Atatiirk, and a Nehru must build on foundations 
laid long before; nor is the structure likely to settle 
within their lifetime. The founder of a common- 
wealth, as Rousseau suggested, must work in one 
century to reap his reward in another. It seems 
therefore more appropriate to speak of the growth 
of nations or of the search for nationhood than 
of nation building or of crises of nationhood; the 
organic and patient metaphors, on balance, seem 
more accurate than the mechanical, voluntaristic, 
or dramatic ones. 


The disparity between the claims to nationhood 
and the political realities in many of sixty-odd “new 
nations” is one of the most visible problems of the 
mid-twentieth century, Political instability, military 
coups, sporadic guerrilla warfare, and subversion 
from outside may be considered so many facets 
or consequences of that central problem. Viewed 
from a perspective of the domestic politics of any 
given country, the nation appears as the unifying, 
integrating ideal. But the proclamation of new 
nations has also given rise to new types of inter- 
national tension. The transition from colonialism 
to independence has in fact often merely trans- 
formed the internal violence within a dissolving 
empire into better-organized violence among new 
states, as in Israel, Kashmir, Cyprus, and Viet- 
nam, Within a perspective of world politics, there- 
fore, the nation-state may rightly be considered a 
disruptive element, Unless the growth of nations 
is accompanied by a growth of a system of inter- 
national order, the new nation-states of the 1950s 
and 1960s may prove no more viable than did the 
Baltic or Danubian states in 1939-1941. In the 
words of Léopold Senghor, the first president of 
Senegal, “Nation is the first reality of the twen- 
tieth century . . . the second reality . . . is the 
interdependence of races, continents, and nations” 


(1959, p. 45). 
Dankwart A. Rustow 
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[See also GOVERNMENT; INTERNATIONAL POLITICS; NA- 
TIONALISM; STATE. Directly related are the entries 
CommunisM, article on NATIONAL COMMUNISM; 
MODERNIZATION; PAN MOVEMENTS; POLITICAL CUL- 
TURE; TRUSTEESHIP. Other relevant material may 
be found in the articles under INTERNATIONAL RE- 
LATIONS. ] 
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A Study of History. 


NATIONAL CHARACTER 


The term “national character” is used to de- 
scribe the enduring personality characteristics and 
unique life styles found among the populations of 
particular national states. This behavior is some- 
times considered on an abstract level, that is, as 
cultural behavior without actual reference to nec- 
essarily different personality modalities. It may 
also be considered as motivated by underlying psy- 
chological mechanisms characteristic of a given 
people. 


History of the field 


Europe has had a long history of self-conscious 
awareness of national differences. In ordinary 
conversation and in essays one finds discussions 
of the differences between Danes and Swedes, be- 
tween Belgians and Dutch, between Germans and 
Italians, or even between northern and southern 
Italians, northern and southern Belgians, or north- 
ern and southern Dutch, Every national group 
develops over a period of time certain stereotypes 
of members of other national entities. Commonly 
held stereotypes may be discussed in a tone of 
objective detachment or with varying degrees of 
approval of the traits considered. While the per- 
ception of behavioral differences has led to a great 
deal of verbal expression and impressionistic writ- 


ing, only since the 1940s have serious efforts been 
made to explore systematically the validity or pre- 
cise nature of the perceived differences with re- 
spect to underlying personality configurations. 
The social or cultural anthropologist's obser- 
vations of the behavioral configurations found in 
highly divergent non-Western cultures have af- 
forded him a much wider view of manifest vari- 
ability in human behavior than is found within the 
western European tradition. The anthropologist 
has had to contend with radically different lan- 
guage structures and cognitive—perceptual patterns 
which define the natural and social environment; 
divergent patterns of causality and logic; unusual 
decision-making patterns in social groups; patterns 
of internalized or coerced responsibility and au- 
thority unknown in the West; different patterns 
of expressing, disguising, or denying feelings and 
emotions—not to mention wide variations in moral 
definitions and values. Almost every generally ac- 
cepted, unquestioned “universal” concerning the 
psychological nature of man and the basic ele- 
ments of social, economic, or political life has been 
seriously challenged by the anthropological data 
which present the full spectrum of world cultures. 
During the course of World War 11, a number of 
anthropologists developed the idea that their con- 
cern with culturally determined personality differ- 
ences had equal relevance to understanding dif- 
ferences among Western nations. They believed 
that careful evaluation of characteristics common 
to significant segments of the populations of the 
nations involved in world conflict could lead to 
more meaningful analysis of diverse sociopolit- 
ical developments occurring within these national 
states. Moreover, they contended that a systematic 
analysis of the differences in “national character” 
within the Western societies would lead to insights 
into the periodic tensions and misunderstandings 
that had arisen between individual members of 
both the then allied and enemy national groups. 
Culture and personality. Anthropologists like 
Bronislaw Malinowski had been stimulated by dy- 
namic psychosocial theories of human nature, 
principally the theory of psychoanalysis, and were 
prompted to search out test cases challenging uni- 
versal statements concerning man’s psychological 
functioning. By the time of World War 1 social 
anthropologists in the United States, as a group, 
had become increasingly concerned with cross- 
cultural studies in the problems of personality 
development. The subdiscipline of culture and 
personality became a valid specialization within 
anthropological research. The methods developed 


in culture-and-personality studies were considered 
applicable to analyses that would produce a deeper 
understanding of the behavior of nationals from 
various European countries and from Japan. In 
addition to employing the careful observations 
used in ethnographic work, culture-and-personality 
research also borrowed techniques and methods 
developed in clinical psychology and psychiatry for 
individual psychodiagnosis, Culture-and-personality 
research employed direct observation of child so- 
cialization in the family, depth interviews, detailed 
life history data, dream analysis, and projective 
psychological techniques such as the Rorschach 
and the Thematic Apperception Test. 

Culture at a distance. Since a number of the 
national-character studies were conducted in a 
period of total war and were concerned with enemy 
or occupied countries, which were therefore inac- 
cessible to direct research, substitute techniques 
had to be developed. Research of this kind came to 
be known as “the study of culture at a distance” 
(Mead & Métraux 1953). Nationals residing out- 
side their countries were interviewed for reminis- 
cences about child-rearing patterns and social 
attitudes influencing interpersonal relationships 
within their national states. In addition, since the 
societies studied were highly literate and employed 
mass means of communication, novels, cartoons, 
newspaper articles, and photographs were all 
available for systematic analysis, These materials 
were carefully examined for recurrent themes or 
other clues to customary attitudes and social per- 
ceptions that would point to possible underlying 
personality differences or at least to differences 
in the structure or hierarchy of values. 


Approaches to national-character analysis 


Studies of national character differ widely both 
in their underlying theoretical assumptions and 
in their research objectives. All of them assume, 
however, that there are elements held in common 
by members of a particular state that can be traced 
in some manner to the relatively enduring forma- 
tive influences of the cultural character of that 
state on malleable human nature. It is further as- 
sumed that common experiences have a centripetal 
effect that to some degree or in some areas out- 
weighs the centrifugal effects of idiosyncratic 
experiences. 

As an underlying objective, national-character 
studies all share a desire to make the perception 
of national differences more comprehensible and 
to order them more systematically into over-all 
patterns. However, the variables considered are 
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often neither comparable nor rigorously defined. 
There are variations in what is subsumed under 
the concept of “national character,” depending on 
the author's approach. 

Margaret Mead, a vigorous proponent of national- 
character studies, distinguished three approaches 
(1953). First, there is the analysis of relationships 
between the basic learning common to children 
within a nation or culture and later characteristics 
witnessed in the behavior of adults within the same 
society. Formative childhood experiences are the 
immediate focus of such studies, Second, there 
are societal studies of the pattern and structure 
of interpersonal relationships. There are cultural 
sanctions operating continually throughout the 
society to reinforce behavioral patterns, and thus 
there is an expected consistency in cultural con- 
figurations. Cultural constraints become fixed and 
internalized aspects of personality. Third, there are 
studies comprising simple comparative descriptions 
of those cultural configurations which distinguish 
one national unit from another; different life styles 
and ways of looking at things are defined as part 
of national character. Studies of this last variety 
remain, from a psychological standpoint at least, 
surface descriptions of what seem to be consistent 
culturally defined values, or behavior patterns, 
without reference to possible underlying motiva- 
tions or personality mechanisms. In contrast, 
studies included in the first two categories men- 
tioned by Mead seek to push beyond the descriptive 
level to trace out certain underlying structurally 
consistent aspects of personality that are mani- 
fested in the overt behavior peculiar to members 
of a given society. 

The basic personality. Kardiner, in several 
studies concerned with personality patterns opera- 
tive both in non-Western societies and in certain 
sectors of American society, developed the concept 
of basic personality (Kardiner 1939; Kardiner & 
Ovesey 1962). By means of this concept Kardiner 
attempted to define components of a common per- 
sonality integration shared by a significant number 
of individuals who have had similar cultural ex- 
periences, The concept is based on an interpreta- 
tion of psychoanalytic theory that de-emphasizes 
biological variables and focuses on culturally de- 
termined primary (i.e., family) influences on per- 
sonality development. Other social institutions as 
well as ideological and religious projective systems 
derive their particular flavor from these socializa- 
tion experiences. An important consideration in 
studying basic personality variables is the position 
or life situation of the parents within the society. 
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Changes in the economic structure of a society 
strongly influence the experiences of childhood 
and can radically alter the primary family, thus 
causing changes to occur in basic personality. 

The modal personality. Linton, subsequent to 
his collaborative work with Kardiner, developed a 
somewhat different concept, that of modal person- 
ality. In modal personality, Linton (1945) sought 
to emphasize the fact that personality patterns, 
especially those in more complex societies, are not 
invariable, When a concept of modality is used, 
no judgment need be made as to the degree, range, 
or variety of personality configurations found 
within a particular culture. Nor does the concept 
of modal personality define the number of possible 
personality types found within a particular group. 
The concept is quantitatively descriptive rather 
than based on a series of assumptions derived 
from psychoanalytic theory. (See the discussion in 
Duijker & Frijda [1960] for more elaborate differ- 
entiation of the Kardiner and Linton concepts. ) 

Child rearing. A number of studies in national 
character are concerned with correlating the cen- 
tral role of culturally prevalent child-rearing prac- 
tices with resultant personality modalities found 
in the adult. One can consider studies of class 
and ethnic differences within a particular nation 
as a form of national-character study. In this 
respect there have been attempts to describe sys- 
tematically the child-rearing practices of different 
classes and ethnic groups in America (e.g., Miller 
& Swanson 1958), In these studies, variables such 
as weaning and toilet-training practices are seen 
as diagnostic of differential formative socialization 
experiences. 

Functional prerequisites. Another type of na- 
tional-character study examines the basic person- 
ality traits that are necessary for at least a 
working minority of individuals within a society 
to keep that society functioning on its own terms. 
When Erich Fromm, the psychoanalyst, discussed 
national character (1941) he contended that in 
an industrial society with ever-increasing bureauc- 
ratization and standardization of occupations, the 
personality traits of discipline, orderliness, and 
punctuality are necessary. These traits have to be 
present in a significant portion of the population 
if a complex industrial society is to continue to 
function effectively. Robert K. Merton, the sociolo- 
gist, has also concerned himself with defining the 
types of personality structure that function best 
in bureaucratic settings (1940). He discusses how 
the settings themselves are influential in determin- 
ing personality variables. 


Sociopolitical interpretations. One of the basic 
objectives of national-character studies is to ex- 
amine the tensions underlying the political and 
social structures of modern states. Social ten- 
sions are particularly apparent in societies that 
are rapidly changing. For example, one type of 
social tension that is frequently observed results 
from the systematic attempts of an elite to estab- 
lish particular patterns of directed social change, 
in spite of the unavailability of sufficient individ- 
uals whose training and social experience equip 
them for achieving the goals set by the elite. 

Some national-character studies seek to differen- 
tiate between the patterns that are characteristic 
of the elite and those patterns of the populace that 
have been imposed upon it. In one such study, 
Bauer attempted to demonstrate the social tension 
existing between the political elite in Russian so- 
ciety and a large number of individuals who are 
not motivationally involved in the same ways as 
the members of the Communist party hierarchy 
(1948). 

If necessary personality traits are not forthcom- 
ing from a proportionate number of individuals 
within the society, the society will not function 
well in terms of newer values, whatever the elite 
controlling the society attempts to do. Even 
though institutional legal structures are consciously 
changed in accord with social planning, if charac- 
teristic changes in socialization experiences do not 
accompany these changes in such a manner as to 
facilitate the appearance of adequate motivational 
behavior, the sought-after change will not become 
stabilized and self-perpetuating. 

In another study indirectly concerned with the 
psychological processes underlying dynamics of 
social change, DeVos (1960) analyzed achieve- 
ment motivations in both rural and urban Japan. 
He related a continual preoccupation with hard 
work and socially approved accomplishment to the 
manner in which Japanese children internalize 
guilt and suggested that the prior presence of these 
and other related personality variables and social 
values tended to facilitate the rapid change in 
Japanese social structure from a feudal society to 
a modern state. 

Hagen (1962), in a comprehensive study of 
economic and social change in a number of dis- 
crete societies, cogently discussed the relationship 
of personality variables to different economic tra- 
ditions, such as colonialism or feudalism, and the 
manner in which they either facilitate or hamper 
economic development. This study is illustrative 
of the fact that considerations of national char- 


acter are having considerable influence in aug- 
menting theoretical approaches in economics and 
political science. 


Controversies over research 


There has been a great deal of criticism and 
reaction to national-character studies on the part 
of those who regard them as putting undue em- 
phasis on what is considered to be an unproven 
relation between the developmental experiences of 
childhood and national character. 

A most noteworthy example of such controversy 
is found in the numerous criticisms leveled at the 
work of Geoffrey Gorer (Gorer & Rickman 1950) 
and his attempt to assess Russian national char- 
acter. Gorer, a strong proponent of the effect of 
childhood experiences, sought to derive certain 
features of Russian adult personality, especially 
attitudes toward authority, from the fact that a 
large proportion of the Great Russians are sub- 
jected to prolonged swaddling during infancy. He 
stressed the effects of the infant’s reaction to 
swaddling—how swaddling colors later perceptions 
of constraint and authority and how it influences 
both modes of self-control and the expression of 
aggression. While some criticism of Gorer’s work 
has been made with care and restraint, other op- 
ponents have sometimes resorted to overgeneral- 
ized statements or have lifted assertions out of 
context in order to make the causal sequence he 
suggests appear extremely ludicrous. The con- 
troversy over Gorer’s attempts at delineating Rus- 
sian national character highlights some of the 
possible limitations and overextensions of culture- 
and-personality methods. 

There have been extreme examples of attempts 
to explain behavior on the basis of some omni- 
present psychological mechanism. There was, for 
example, a hypothesis advanced during World 
War 1 that Germany’s national behavior was, in 
essence, paranoid, and that it stemmed from the 
omnipresence of paranoid mechanisms within the 
German people as a whole. A number of national- 
character studies do not stand up well when sub- 
jected to careful scrutiny as to methods, care 
in comparing data, and other necessary scientific 
safeguards. In effect, a number of the studies of 
national character are simply impressionistic 
statements based on culturally and psychologically 
sophisticated perceptions of a foreign culture. 

Lindesmith and Strauss (1950) point out a 
number of the more severe criticisms it is possible 
to level at national-character or culture-and-person- 
ality studies. They are critical not only of some of 
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the general conceptual frameworks offered, but 
of conclusions reached, evidence put forward, and 
methods used. Among the failings they criticize 
are tendencies to oversimplify or to overlook or 
ignore the range and variability of behavior found 
even in isolated simple cultures. Some studies they 
cite do not adequately distinguish between data 
and interpretation. In others, there is a lack of 
precision in defining the variables considered. 
Some studies are highly limited or selective in re- 
gard to data and informants used. One of their 
chief criticisms of psychoanalytically oriented 
studies is that the determining or causal links be- 
tween sets of data depend on heavily labored ex 
post facto interpretations not subject to any scien- 
tific validation, 

Lindesmith and Strauss suggest that genuine 
advances await the application of additional and 
better-controlled psychological research on specific 
issues within cultures on which there is already 
a considerable amount of ethnographic material. 


Unresolved questions 

Although very promising, national-character 
studies must be considered a relatively new ap- 
proach in social science. At this stage of develop- 
ment one can only guess the further direction of 
progress. A principal aim of national-character 
studies is to relate particular forms of observable 
behavior characteristic of a given population to 
the relative distribution of structural personality 
components. It is assumed that this relationship 
is a partial cause of behavioral differences or simi- 
larities between groups. To accomplish this goal 
adequately one must distinguish between observ- 
able behavior that is related to surface social pat- 
terns and behavior that is related to underlying psy- 
chological structures or personality components. 
The purpose of national-character studies, therefore, 
is to discover the distribution of underlying psy- 
chological structures in a given population and to 
determine the nature of their relationship to be- 
havioral.phenomena. 

Can it be that the distribution of underlying 
personality modalities in the form of motivational 
patterns within the population is actually not very 
different from one Western society to the next? 
Can it be that with changes in economic distri- 
bution or in the structure of political allocation 
of power observed changes in behavior may occur 
without any over-all shift in the relative presence 
of motivational patterns in the society itself? Do 
power shifts simply redistribute the roles played 
by personality variants present within a modern 
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state that, given the proper social ambience, be- 
come more visible? Consider, for example, whether 
people in West Germany today are any different 
in underlying motivational structure from what 
they were under Hitler. The greater equality of 
behavior between the sexes in Japan today does 
not necessarily reflect actual structural changes in 
the personality of many Japanese men and women 
but may represent the range of expression in overt 
behavior patterns that is now possible under a more 
egalitarian legal system. 

It is most probable that what we call the “person- 
ality” of an individual in actuality has a far wider 
behavioral potential than the patterns allowed for 
within any particular culture. Observed consist- 
encies of behavior within a given culture may 
therefore be the result of cultural limitations and 
selection. From this point of view one does not 
necessarily have to presume commonly held per- 
sonality traits for given populations. 

From the standpoint of national character, we 
can examine, for example, the present rapid and 
almost world-wide occurrence of certain commonly 
recognizable types of adolescent behavior that 
are considered, in some cases, “delinquent” or 
antisocial. The behavior expressed by a small per- 
centage of modern youth may attest more to simi- 
larities in anomic social conditions for youth in 
industrial societies than to shifts or changes in 
behavioral controls related to personality. 

From the standpoint of national character one 
has to evaluate to what extent the rapid spread of 
patterns of behavior that have become available 
through culture contact (e.g., mass media) is de- 
pendent on the presence of latent personality char- 
acteristics not readily expressible within older cul- 
ture forms. These examples can be interpreted as 
changes in behavior related to social change with- 
out necessarily invoking explanations related to 
personality patterns. There are, however, instances 
where some attention to personality structure ap- 
pears more relevant. Explanations at a psycholog- 
ical level may be more relevant when anticipated 
behavioral change fails to materialize in spite of 
social, legal, or economic inducements toward 
change. 

Rapid shifts may occur only when the newly de- 
fined behavior is peripherally related to complex 
types of personality functioning. Some patterns of 
behavior demand less restructuring of deeper as- 
pects of personality. One may cite, for example, 
the observed difficulty encountered when planned 
change demands the presence of long-range goals 
or entrepreneurial attitudes in cultures previously 


generally lacking such orientations in behavior. 
There is no easy diffusion of the integrated be- 
havioral patterns necessary to carry the burden of 
planning activity even though culture contact takes 
place. 

Allowing the assumption that one can demon- 
strate statistically ascertainable underlying modal 
personality types within relatively discrete societies 
with unique cultural histories, would one not also 
find similarities or differences of long standing 
within these societies that are based on rural-urban 
or class—occupational living patterns? Would these 
class differences not cut across national boundaries? 

Viewed historically, it may be that structural 
personality distributions do differ in various na- 
tional states, but that, given the establishment of 
similar social objectives such as industrialization, 
observed behavioral patterns in at least some seg- 
ments of the population come to bear increasingly 
close superficial resemblances without any im- 
mediate change in the actual distribution of under- 
lying personality differences. However, change in 
patterns of socialization, including formal educa- 
tion, may subsequently come to affect the newer 
generation. With changes in the patterns of social- 
ization, shifts within personality patterns of the 
population may then materialize, depending on the 
nature of the original patterns affected. For exam- 
ple, among the emerging national states there have 
been varying degrees of success in achieving simi- 
lar values. The acceptance of these values may be 
highly dependent upon the presence and persistence 
of underlying psychological structures that are 
differentially distributed among the various pop- 
ulations. 

A most important question for many of the 
emerging nations in the world today is: What length 
of time is necessary for changing motivational pat- 
terns within sufficiently large segments of a society 
so that new economic or occupational roles can be 
performed? It is obvious that some “psychological” 
lag occurs between legal and planned economic 
changes and the point at which the prevailing 
formative influences on the child have been changed 
so that he is motivated and able, by the time he 
becomes an adult, to participate effectively in the 
new society being formed. 

GEORGE A. DEVOS 


[See also CULTURE AND PERSONALITY; LIFE CYCLE; 
PERSONALITY; SOCIALIZATION. ] 
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NATIONAL INCOME AND 
PRODUCT ACCOUNTS 


National income and product accounts form a 
systematic interlocking framework containing con- 
sistent estimates of significant aggregates and 
components for the economy and for major eco- 
nomic “sectors,” or groupings of like decision- 
making units. The sector tables are built on 
accounting principles, showing the transactions 
among sectors and distinguishing the main types 
of economic activity: production, consumption, and 
capital formation and its financing, The capital 
account may be elaborated in the direction of 
showing detailed flows-of-funds among sectors, and 
related to balance sheets and wealth statements for 
successive points in time. 

Economic accounting systems may thus provide 
comprehensive numerical estimates of economic 
stocks and flows within national or regional econ- 
omies. The accounts may be described as “general 
purpose,” for they are intended to serve various 
analytical uses: to increase economic understand- 
ing, to serve as a background for economic ap- 
praisals and projections, and to make possible more 
effective public and private decision making. 

This article summarizes the historical develop- 
ment of national income concepts and estimates, 
and discusses the basic structure of economic ac- 
counts: the production account, in aggregate and 
by industry, with some reference to conceptual 
problems; appropriation accounts and their relation 
to the production account; and the capital account, 
and its relationship to flow-of-funds accounts and 
national balance sheets and wealth statements [see 
NATIONAL WEALTH]. Finally, some mention will be 
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made of the chief types of uses made of economic 
accounts. 


Historical development 

Until World War 1, national income estimates 
were developed largely by individual investigators 
in a dozen or so countries. The basic income and 
product concepts were clarified, and estimating 
methodology was developed. Between World War 1 
and World War n, estimating work was increas- 
ingly taken over by government statistical organi- 
zations and put on a regular, recurring basis. 
National income accounting systems, developed 
during World War 11, spread rapidly to most coun- 
tries of the world after the war. The focus is now 
on refinement and elaboration of the accounts. The 
following historical review relies heavily on Studen- 
ski (1958, part 1). 

Developments up to World War 1. The first 
estimates of national income were made in Eng- 
land by Sir William Petty in the essay “Verbum 
sapienti” in 1665, and in “Political Arithmetick” 11 
years later, although the essays were not published 
for some years. Petty adopted a comprehensive 
income concept, defining the “Income of the Peo- 
ple” as the sum of the “Annual Value of the Labour 
of the People” and the “Annual Proceed of the 
Stock or Wealth of the Nation” (rent, interest, and 
profit). On the other side of what was, in effect, 
a double-entry account, he included the “Annual 
Expense of the People,” comprising consumption 
outlays and, in principle, the surplus remaining 
after current consumption. 

In 1696, more systematic estimates were in- 
cluded by Gregory King in a manuscript entitled 
“Natural and Political Observations and Conclusions 
Upon the State and Condition of England,” excerpts 
from which were published in 1698. Adopting Pet- 
tys conceptual framework, King estimated “the 
Annual Income and Expense of the Nation as it 
stood in 1688” and the excess of income over ex- 
pense, or the “yearly increase of wealth.” The 
method used was to estimate for more than two 
dozen socioeconomic groups the number of families 
and the average income, expense, and saving per 
family. He constructed a ten-year time series and 
provided comparisons with France and Holland. 

In France, Pierre le Pesant de Boisguillebert was 
the first economist to develop the notion of national 
income and to make some crude estimates of its 
magnitude, which appeared in Détail de la France, 
written in the early 1690s and published in 1697, 
and in Factum de la France, published in 1707. 
Boisguillebert identified income with consumption, 
and on the income side he included income from 


property and from labor. More careful estimates 
were made by Marshall Vauban in his book Dime 
royale in 1707. Like Boisguillebert, Vauban was 
interested in tax reform and in income estimates 
as a basis for estimating yields from a gross in- 
come tax. He defined income broadly to consist of 
income generated in all branches of production, 
although available data allowed him to estimate 
only portions of the total. 

The early estimators, and their followers during 
the first century of income work, followed a com- 
prehensive production concept. This eventually be- 
came the accepted approach in Western countries, 
but only after some detours. In the later eighteenth 
century, the French physiocrats, led by François 
Quesnay, identified national income with agricul- 
tural product, believing that only agriculture pro- 
duced a rent, or produit net over and above costs. 
While not accepted generally even in France, this 
approach influenced Lavoisier, who made the most 
detailed estimates of French income prior to 1800. 
Although excessively restricted, the physiocratic 
concept had the useful by-product of leading its 
authors to show the connections between agricul- 
ture and the so-called nonproductive sectors of the 
economy in a tableau économique. This was a fore- 
runner of later sector accounts and input-output 
tables. [See ECONOMIC THOUGHT, article on PHYSIO- 
CRATIC THOUGHT.] 

In The Wealth of Nations (published in 1776) 
Adam Smith pointed out that all commodity pro- 
duction and distribution could be productive, re- 
turning a net income to producers in the form of 
profit. Since Smith was concerned with the creation 
of material wealth as a source of economic growth, 
he emphasized activity that “fixes itself” in com- 
modities rather than services that are consumed 
directly, defining the latter as unproductive. Thus, 
while Smith’s production concept was broader than 
that of the physiocrats, it was also restricted. 

Smith’s restricted concept was opposed by such 
economists as Lauderdale, Say, Senior, and Walras, 
but it stayed in the mainstream of economic thought 
for almost a century, perpetuated by the writings 
of David Ricardo and John Stuart Mill, in particu- 
lar. It was the basis of the chief national income 
estimates made in England and France between 
Smith’s time and the mid-nineteenth century, which 
placed main reliance on commodity production 
data. 

This restricted concept was also adopted by Karl 
Marx in Das Kapital, since by associating produc- 
tive labor with the creation of material goods alone, 
he could more conveniently develop his theories of 
the transformation of surplus value into capital. 
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The Marxian approach later became the basis of 
the “material product system” of income accounts 
in the Soviet Union and other communist countries. 

In Western thought, Alfred Marshall gave the 
coup de grâce to Smith’s restricted concept. In 
Economics of Industry in 1879 and in the Princi- 
ples in 1890, he firmly identified production with 
the creation of utility, thus clearly encompassing 
services as well as material commodities. He also 
clearly distinguished between gross and net prod- 
uct (or the “national dividend”), and pointed out 
that the net concept excludes the consumption of 
capital as well as of intermediate products. 

In the latter part of the nineteenth century, both 
English and French estimators returned to the 
comprehensive production concept, using income 
tax data and other sources in addition to production 
censuses and surveys. Perhaps the most sophisti- 
cated estimates made prior to World War 1 were 
those of A. L. Bowley. Recognizing the need for 
continuing estimates, he called for a central statis- 
tical agency to coordinate the basic data and pre- 
pare estimates on a regular basis. 

In the meantime, national income estimation 
was spreading to other countries. In Russia, three 
partial estimates of national income and consumer 
expenditures for commodities were made in the 
1790s by B. F. G. Hermann, A. N. Radishchev, and 
the anonymous authors of New World Geography. 
The latter two sets of estimates, which were in- 
cluded in reformist tracts, were suppressed, and 
apparently no further estimates were made until 
1897. S. N. Prokopovitch, a Marxian reformer, pre- 
pared new estimates in 1906, using the material 
production concept. These were updated in 1917 
under the Kerensky government as a basis for war- 
time planning. 

The United States was the fourth country in 
which national income estimates appeared. In 1843 
George Tucker of the University of Virginia pub- 
lished his Progress of the United States in Popula- 
tion and Wealth in Fifty Years, based on the ex- 
panded 1840 decennial census and the five previous 
censuses. In 1855, He added new chapters based on 
data from the census of 1850. One was devoted 
to estimates of annual income and product, by 
commodity-producing industry and by state, and 
another to national wealth estimates. In the 1890s, 
Charles B. Spahr published An Essay on the Pres- 
ent Distribution of Wealth in the United States, 
with estimates of national income and wealth for 
1880 and 1890, In 1915, Wilford I. King in effect 
extended the Spahr estimates to 1910 in his volume 
The Wealth and Income of the People of the United 
States, 
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additional countries before fe end of World War 1. 
These were Austria (in SE Dy Australia (in 1890), 
Norway (in 1893), Germany (in 1899—although 
estimates had been prepared earlier for individual 
German states), Japan (in 1902). , Switzerland (im 
1902), the Netherlands (in 1910 Jay Gn 191), 
and Bulgaria (in 1915). It is noteworth y that the 
Australian estimates, by Timothy Coghlan, were 
prepared officially and continued annually until his 
resignation as Commonwealth statistician in 1904, 
Coghlan used all three approaches to estimation: 
industry value added, factor cost, and final expen- 
ditures. His work foreshadowed the era of official 
estimation that began in the 1920s. 

Developments since World War 1. Between the 
two world wars, national income estimation work 
spread rapidly, the number of countries with esti- 
mates increasing from 13 in 1919 to 33 by 1939. 
What is more important, an increasing number of 
central governments took over national income esti- 
mation and put it on a regular basis, beginning 
with Canada and the Soviet Union in 1925 and 
Germany in 1929. By 1939, official estimates were 
prepared in nine countries. In the 1939 World Eco- 
nomic Survey, the League of Nations published 
comparative income estimates for 26 countries 
covering all or part of the period 1929-1938. In the 
same year, the League’s Committee on Statistics 
first took up the problem of international compara- 
bility of income estimates. World War 11 tempo- 
rarily slowed down the spread of national income 
work, but it stimulated some important qualitative 
developments. 

In his General Theory of Employment, Interest 
and Money (published in 1936), J. M. Keynes made 
national product and the expenditures for final 
products by the chief economic sectors central to 
his theory of income determination. In the fall of 
1939, the preparation of official national income 
estimates was authorized in Great Britain. Under 
Keynes's guidance, Richard Stone and James Meade 
in the Central Statistical Office completed a set of 
income and expenditure estimates—published as 
background for the 1941 budget. The initial esti- 
mates were later elaborated into appropriation ac- 
counts for the major sectors, linked to the consoli- 
dated production account by appropriate debit and 
credit entries, and published in a Treasury White 
Paper in 1946. The economic accounting approach 
was independently developed in the Netherlands 
and Norway during the war, and the estimates 
were published soon after the occupations were 
ended. 

After World War m, the evolution and adoption 
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of economic accounting spread rapidly. The devel- 
opment of macroeconomic theory and the possi- 
bility of using income accounts as background for 
full-employment and development policies were 
important factors. The United Nations based its 
dues assessments on a formula involving national 
income and provided technical assistance to na- 
tions instituting national income estimates. Meas- 
urement of National Income and the Construction 
of Social Accounts, published in 1947 by the 
League of Nations Committee of Statistical Experts, 
was helpful to countries undertaking estimating 
work, In 1953, the publication by the UN Statistical 
Office of A System of National Accounts and Sup- 
porting Tables (hereafter called the SNA) further 
promoted consistency in national accounting esti- 
mates. Interest in the field also was stimulated by 
the formation of the International Association for 
Research in Income and Wealth, which has held 
biennial meetings since 1947. 

During the first postwar decade, the number of 
countries with national income estimates grew 
from 39 to 93, During the following decade, the 
preparation of such estimates became almost uni- 
versal. The thrust of current work is toward im- 
proving the quality of the basic data and derived 
estimates and toward elaborating the basic sector 
accounts. In 1965, the Statistical Office of the 
United Nations began circulating a proposed revi- 
sion of the SNA which would integrate national 
income accounts with financial (flow-of-funds) 
accounts and balance sheets, as well as input- 
output tables. The United Nations plans to com- 
plete the revision by 1968 and to use the new inte- 
grated system both as a guide for member countries 
and as a consistent format for international report- 
ing and comparisons. 


The economic accounting structure 


Accounts provide a systematic and informative 
approach to organizing data about economic trans- 
actions. The balance of the accounts reflects eco- 
nomic and accounting truisms. In the case of pro- 
duction, the value of production (receipts from 
sales, plus the value of inventory change) is equal 
to the Costs of producing the output, including a 
residual “operating surplus,” or profit. In appropria- 
tion accounts, a disposition is made of all incomes 
—by expenditure, transfer payments, or saving. In 
capital accounts, additions to capital must be fi- 
nanced by saving, net capital transfers, or net bor- 
rowing. In balance sheets, the asset position of any 
unit or sector is balanced by liabilities plus a resid- 
ual net worth, The use of balancing accounts is 
Statistically advantageous in providing checks on 


estimates or in permitting the estimation of impor- 
tant variables as residuals. 

The accounting rule that for each credit there 
must be a corresponding debit reflects the two- 
sided nature of transactions. For each sale there 
must be a purchaser; for each payment, a receipt; 
and for each acquisition of an asset, an equal lia- 
bility must be incurred. Statistically, this means 
that all transactions are kept sight of, and none is 
lost in the myriad real and financial transactions 
of an economy. 

The structure of accounts, devised so as to pro- 
vide a useful summary picture of economic activi- 
ties, has several main dimensions: (1) the sectors 
or groupings of transactors to be distinguished; 
(2) the chief forms of economic activity for which 
separate accounts (or “subaccounts”) are to be set 
up; and (3) the chief types of items, or transaction 
classes, which should be distinguished within the 
accounts or in more detailed supporting tables. 

Before examining these dimensions, it should be 
noted that the structure of economic accounts con- 
ditions the form of data collection, and vice versa. 
Greater detail and complexity in the accounts is 
attained, not only at a greater financial cost but 
also at the possible cost of some loss in clarity. 
Once a structure has been adopted, there is a cost 
in loss of continuity when it is changed or basically 
revised. 

Sectors of the economy. Sectors are groupings 
of transactors similar in economic motivation and 
behavior, or at least more like each other than they 
are like the transactors grouped into a different 
sector. The United Nations SNA distinguishes three 
chief sectors, which are basic to most systems of 
economic accounts, although each may be further 
subdivided. 

(1) The enterprise sector includes all business 
firms and other organizations that produce goods 
and services for sale at prices generally intended 
to cover costs. This sector includes private corpora- 
tions and cooperatives, other than nonprofit insti- 
tutions serving households; unincorporated private 
enterprises, including self-employed craftsmen and 
professional men; nonprofit institutions serving 
other enterprises; all households and institutions in 
their capacity as landlords; and all public enter- 
prises. 

For flow-of-funds accounts, it is necessary to 
distinguish between financial enterprises, by major 
classes, and others. The nonfinancial subsector 
may be further subdivided by industry classes. 
These industries may be made up either of estab- 
lishments or of firms. The former is preferable for 
production accounts; the latter, for financing ac- 
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counts. Due to the greater heterogeneity of multi- 
establishment firms, meaningful disaggregation 
cannot be carried as far as it can with industries of 
establishments, although the latter are also seldom 
fully homogeneous, producing only the primary 
products in terms of which these industries are 
defined. 

(2) The household sector includes households 
and private nonprofit institutions rendering services 
chiefly to households. The units of this sector are 
the chief recipients of income from production and 
spend the bulk of income to satisfy wants. House- 
holds comprise single individuals, families, and 
persons living together in institutions. The non- 
profit institutions included in the sector render 
services to household members or to the public at 
large, and are supported primarily through dues, 
fees, grants, and miscellaneous sales. Pension trust 
funds also are included in this sector. 

When data permit, it is useful to separate house- 
holds, nonprofit institutions, and private trust 
funds, Other subsectors have been proposed, as for 
farm and nonfarm households and for major in- 
come size classes. The SNA provides a separate 
supporting table for a “rural sector,” of particular 
importance in less developed countries. 

(3) The general government sector comprises 
agencies of central, state, and local governments 
that perform common services, supported by gen- 
eral tax revenue, which cannot as conveniently be 
provided by enterprises. Social security and other 
public trust fund arrangements are generally in- 
cluded. Financial agencies and funds may be 
segregated, and subsectors may be provided for 
central, state, and local governments separately, 
with possible further groupings of agencies accord- 
ing to major functions. 

Forms of economic activity. The SNA distin- 
guishes three major forms of economic activity in 
terms of which the sector accounts may be sub- 
divided: production, appropriation and disposition 
of income, and capital formation. To these may be 
added a fourth, when the capital account is con- 
fined to real capital formation, namely, the financ- 
ing of capital formation. Finally, the net result of 
capital transactions and revaluations may be shown 
in balance sheets as of successive points in time. 
In later sections, each of these forms of accounts 
will be discussed. 

Types of transactions. The economic accounts 
themselves contain major classifications of flows 
by significant characteristics. They distinguish be- 
tween goods and services, by major categories; 
transfer or tax payments; saving or purchases of 
financial claims; factor income, by major legal or- 


ganization form and by the chief nonfactor charges 
against product; and so on. Much more extensive 
detail is generally provided in supporting tables, 
These show time series for the key variables of the 
accounting presentation, which generally relates 
only to the most recent period. The groupings and 
degree of detail depend on what is regarded as 
analytically useful. The economic analyst will more 
frequently use the detail in supporting tables than 
he will the sector accounts, but the accounts are 
indispensable as an organizing framework to dis- 
play the interrelationships among the many com- 
ponent variables. 

Sources of data. Data are seldom collected pri- 
marily for use in the economic accounts. Rather, 
they are collected for other purposes but are also 
used as the basis for estimates according to na- 
tional income concepts and framework. Thus, na- 
tional income estimates are eclectic, being pieced 
together from a variety of sources and by various 
methods. 

The chief sources are occasional industrial and 
other economic censuses, sample surveys of busi- 
nesses and households, tax returns, reports on gov- 
ernment expenditures, foreign trade and payments 
statistics, employment and payroll tabulations from 
social insurance records, reports of regulatory and 
supervisory agencies, company reports, and trade 
association and other private estimates. 

The reported data used by national income 
estimators frequently must be processed extensively. 
For example, corporate profits data are audited and 
valuation adjustments are computed to eliminate 
gains or losses due to methods of costing inven- 
tories or of estimating depreciation allowances. In 
the case of expenditures for commodities, an al- 
ternative to the use of data on retail sales, or other 
final sales (which are not usually classified by 
commodity), is the commodity-flow method. This 
involves taking the data on manufacturers’ sales or 
shipments, by type, adjusting for exports and im- 
ports, adding distributive markups and excise taxes, 
and adjusting for changes in distributors’ inven- 
tories. 

Some check on reliability is furnished by the ac- 
counting identities. For example, the statistical 
discrepancies among the estimates of national prod- 
uct made by the expenditure, income, and sector 
value-added approaches are revealing and may pro- 
vide clues to possible errors in the estimates. 

As the national economic accounts become more 
widely used, the requirements of the estimators 
become more influential in the design of census 
questionnaires and other statistical surveys. As 
basic data collection systems are improved and di- 
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rected more toward their use in national income 
estimation, the economic accounts become more 
accurate. 


The production account 


National income and product may be viewed as 
the sums of the debit and credit sides of a con- 
solidated product account. National income is gen- 
erally defined as the income accruing to the resi- 
dents of a nation from the use of their labor and 
property in production, while national product is 
the market value of the resulting final goods and 
services, The chief components of income and 
product are shown in Table 2 and SNA accounts 1 
and 2 are shown in Table 3; these tables will be 
discussed later. In the next section, we view income 
and product as the consolidation of production ac- 
counts for the various producing industries. But 
first it is necessary to clarify the definition of the 
income and product aggregate to which value is 
added by the various industries. 

Final and intermediate products. Final product 
must be related to the goals of a people. Both in- 
dividual and social goals usually involve satisfac- 
tion of wants (creation of utility) and security. The 
material wants of human beings, present and 
future, are satisfied by production of consumer 
goods and services and by additions to capital stock 
(real net investment), which enhance future pro- 
ductive capacity, Some theorists, notably Irving 
Fisher, would confine final output to consumption 
goods, But in that case, output would not be in- 
variant under changes in the proportion of income 
saved. Moreover, saving represents potential cur- 
rent consumption, as well as additions to capacity 
for future production, which in itself is a social 
goal, 

The goal of security involves national defense 
and maintenance of the internal social fabric. It 
can be argued that these goals are instrumental; 
but if they are admitted to a position of coordinate 
importance with the goal of creation of positive 
present and future utility, then most of the costs of 
general government would qualify as final. The 
communist countries, which follow the material 
product system, exclude both public and private 
services. Payments for these services are counted 
as transfers of income. 

Once final products are defined, it is then pos- 
sible to exclude “intermediate products” from 
national product in order to avoid double counting. 
That is, intermediate products, such as raw ma- 
terials and semifabricated goods, are those con- 
sumed in the production process. Since their value 
is included in the value of final products, only the 


latter should be counted if duplications are to be 
avoided. Final product is sometimes called net out- 
put as distinguished from a gross, duplicative total 
of outputs at all stages of production. 

Statistical agencies, which have a preference for 
objective procedures, have defined final products 
as those which are not resold during the accounting 
period. To implement this approach, they divide the 
economy into producing and consuming sectors, 
The producing sector is primarily composed of 
enterprise operations on current account. When 
enterprise accounts are consolidated, their intra- 
sector purchase and sales transactions cancel, and 
only their sales to final purchasers remain. 

The major final purchasing sectors are generally 
taken to be households as consumers, business on 
capital account, general government, and the rest- 
of-the-world. Exports of goods and services to other 
countries are final sales; however, the value of im- 
ports is usually netted against exports, since im- 
ports are intermediate products whose value is 
included in final sales. The purchases of business 
on capital account are new construction, improve- 
ments to land, durable equipment, and additions 
to inventories. These items are not sold during the 
current period, although they are gradually amor- 
tized or liquidated. In each period, depreciation 
allowances are deducted from gross fixed invest- 
ment in order to obtain net investment. 

Blanket application of the consuming sector ap- 
proach does not always produce clear-cut results. 
Some household consumption expenditures, such as 
purchases of uniforms worn on the job or trans- 
portation to and from work, might fairly be classed 
as intermediate products. Conversely, some outlays 
of the enterprise sector which are charged to cur- 
rent expense might well be classed as final pur- 
chases, Examples are small tools, certain intangible 
investments such as research and development 
costs, and certain welfare services for employees 
which could be reclassified as consumption. Finally, 
many economists have argued that certain activities 
of general governments are intermediate in that 
they promote production by enterprises. Some 
would go so far as to count all outlays on national 
and internal security as intermediate, including as 
final only direct collective consumption and addi- 
tions to public wealth. 

These issues have not been wholly resolved in 
the literature. Obviously, the movement and com- 
position of national product will differ somewhat, 
depending on the concepts and estimating conven- 
tions adopted. 

Economic and noneconomic activity. In line 
with their predilection for objective rules, official 
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national income estimators have tended to delimit 
income and product in terms of those transactions 
which pass through organized markets, plus value 
“Imputations” for activities that have major market 
counterparts, This approach reflects the posi- 
tion of Pigou ([1920] 1960, p. 31) that “just as 
economic welfare is that part of total welfare which 
can be brought directly or indirectly into relation 
with the money measure, so the national dividend 
is that part of the objective income of the com- 
munity, including, of course, income derived from 
abroad which can be measured in money.” 

Imputations generally made for nonmarket ac- 
tivities include those for payments in kind, the 
rental value of owner-occupied houses, certain 
financial services for which only indirect payment 
is made (such as interest forgone in return for the 
checking services of commercial banks), and food 
produced and consumed on farms. In an effort to 
expand somewhat the area of imputations, the 
United Nations SNA recommended imputations at 
sales value for all primary output (from hunting, 
fishing, forestry, and mineral extraction, as well as 
from farming), whether sold or used by the pro- 
ducers, and for products of their own trades con- 
sumed by the producers. 

Even with the somewhat expanded range of im- 
putations suggested by the SNA, it is apparent that 
national income and product, as generally meas- 
ured, fall considerably short of a measure of all 
activity contributing to economic welfare. It is 
difficult to draw a precise line between economic 
and other activity, but the former may be thought 
of as activity engaged in primarily for the sake of 
the income or benefit produced, while noneconomie 
activity is engaged in primarily for its own sake. 
Prominent types of unpaid economic activity for 
which a value might be imputed, given adequate 
data, are the services of housewives and other un- 
paid household labor, volunteer labor, schoolwork, 
and the rental value of durable goods (in addition 
to houses) owned by households, nonprofit institu- 
tions, and governments. 

Because of the greater importance of household 
production in less developed countries, it is ap- 
parent that income and product estimates tied 
closely to market criteria can give distorted results 
in temporal or international comparisons. The chief 
requirement for expanding imputations is adequate 
data on the volume of unpaid resources (or their 
product ) and the corresponding values. Exploratory 
studies may reveal the possibility of more extensive 
imputations than are now deemed feasible. 

Most national income statistics are restricted to 
legal economic activity. The SNA suggests the 


somewhat looser term “voluntary,” which would 
exclude extortion but would include black-market 
transactions. Serious statistical problems arise when 
the results of any forms of illegal activity are 
admitted to the accounts, although comparability 
among nations may thereby be increased. 

Gross and net product. As explained above, na- 
tional income and product are net measures in that 
double counting of intermediate products is avoided. 
Frequently, however, the measures are presented 
gross of capital consumption allowances, That is, 
investment estimates are presented gross of the 
fixed capital outlays required to offset the decline 
in value of the existing stock of fixed capital goods 
as they age—a decline due to wear, tear, accidental 
damage, and obsolescence, 

Conceptually, net investment and net national 
product measures are preferable to gross measures 
in that they indicate net additions to capital stocks 
and the potential consumable income of the com- 
munity, respectively. Gross investment and gross 
national product measures are statistically more 
reliable, however, because of the difficulties in 
measuring depreciation in current market values. 
“Gross” measures are also of intrinsic interest, since 
capital is consumed when gross investment falls 
short of capital consumption allowances. 

National and domestic product. Domestic in- 
come and product relate to the productive activity 
carried on within the geographical boundaries of a 
nation, regardless of the residence of the owners 
of the resources used, In estimating national in- 
come, income accruing to residents of a country 
from their property or labor used abroad is added, 
while income paid to foreign residents is sub- 
tracted, 

In SNA account 1, gross product is presented 
on a domestic basis, In account 2, national income 
is presented, and to domestic product at factor cost 
is added “net factor income payments from the rest 
of the world.” This dual treatment emphasizes that 
product is best studied on a domestic basis, so 
that it need not be split according to residency of 
owners; whereas income is more meaningfully 
related to the command over resources, domestic 
or foreign, exercised by the residents of a nation. 

Valuation. The national product is generally 
valued at market prices, while national income 
represents the factor cost of the same collection of 
final goods and services, In addition to time series 
of the national income and product at current 
market prices or factor costs, the final products and 
the factor services may be valued in constant 
dollars (as of some base period) to eliminate the 
effect of price changes. 
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Market prices have the advantage of providing 
an available, objective valuation standard, per- 
mitting the aggregates to be readily broken down 
by types of goods and services purchased, by sector. 
Market valuation is appropriate for welfare analy- 
sis, since rational consumers push outlays for dif- 
ferent products to the point where the ratios of 
their marginal utilities are equal to the ratios of the 
prices paid for them. 

Relative market prices are not equal to relative 
factor costs per unit, however, because of the in- 
fluence of indirect business taxes and subsidies 
(which may be regarded as negative business 
taxes). National income and product would be 
equal only if there were no indirect business taxes 
and subsidies or if they were offsetting. Relative 
unit costs and prices would be proportionate only 
if indirect business taxes less subsidies had the 
same relative incidence on all products. 

For purposes of production analysis, national 
product at factor cost is usually considered the 
preferable basis of valuation, in order to reveal the 
relative volume of factor services absorbed by vari- 
ous products and/or industries. However, relative 
factor costs per unit still tend to be distorted by the 
effects of market imperfections on factor prices. 
Factor cost estimates are not usually available by 
product, but only by industry, which is the usual 
basis for production and productivity studies. 

The national product may be adjusted for price 
changes in order to reveal the movements of con- 
stant-price-weighted physical volumes, or “real prod- 
uct,” through time. Instead of weighting physical 
units of the various final products by base-period 
prices, real product is generally obtained by divid- 
ing expenditures at current prices, grouped by types 
of goods and services, by appropriate price indexes 
with the base-period value of 100 per cent. When 
the constant-price expenditure series are aggre- 
gated, and the series valued at current prices is 
divided by this aggregate, the resulting “implicit 
deflator” is in effect a variable-weighted price index 
with respect to the component product groupings. 

The real product estimates are necessarily af- 
fected by any defects of the price deflators. Thus, 
they may not adequately reflect changes in quality 
of goods, and cyclical movements may be distorted 
if quoted prices fail to catch changes in collateral 
terms of sale. Special problems are posed by non- 
market products, particularly services of govern- 
ment and nonprofit institutions, although some 
progress has been made in measuring work units 
of various types and weighting them by unit costs. 
There are also difficulties in deflating expenditures 


for nonstandard products, such as new construction 
and custom-built equipment. 

Factor cost, by type, may be deflated by ap- 
propriate factor price indexes. Alternatively, units 
of factor services, such as man-hours worked in the 
case of labor, may be weighted by base-period factor 
prices, such as average hourly compensation. The 
result is a measure of real factor cost, or “input.” 
By comparing such measures with real product, or 
“output,” productivity measures can be derived. In 
a progressive economy, output typically rises by more 
than input, resulting in a “productivity increment,” 
or an increase in the ratio of output to input. The 
productivity ratio may also be obtained as the ratio 
of input prices to output prices [see PRopucTIVITY]. 


Production by sector and industry 


Production is primarily a function of the enter- 
prise sector, but households, nonprofit institutions, 
and general governments also hire factor services 
and contribute to the total value of product. 

Gross and net output of enterprises. Production 
accounts for enterprises represent rearrangements 
of their income (profit and loss) statements. The 
income statement shows receipts from sales and 
other sources, costs of production, and the residual 
profit, or “operating surplus.” Receipts from pro- 
ductive activity may be put on the credit side of 
a double-entry account; costs and net income are 
placed on the debit side. Production accounts are 
much the same as income statements, except that 
they also reflect inventory change. When the pro- 
duction accounts of the enterprises within an in- 
dustry, or the sector, are consolidated, the trans- 
actions among the enterprises within the industry 
(sector)—that is, intermediate product purchases 
and sales—cancel out, and the net output, or 
“value added,” of the sector is left. If net production 
accounts are set up for enterprises and industries 
by subtracting intermediate purchases from gross 
output, as illustrated in the hypothetical accounts 
of Table 1, then net output (gross product) and 
value added (income and nonfactor charges) can 
be summed to obtain sector and national totals. 

Corporate enterprises show profit as distinct 
from other types of factor income; accounts of un- 
incorporated enterprises show net entrepreneurial 
income as the residual. This consists of labor as 
well as of property income, and the total before 
tax is credited to the household sector (instead of 
just the dividends, as for corporations). 

Special treatment is required for financial in- 
termediaries. In the case of commercial banks, for 
example, interest received on assets represented 


g= 
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Table 1 — The XYZ Manufacturing Corporation value added statement 


for the year ending December 31, 


INCOME (USES) 
Factor costs 


Employee compensation 2,165 
Net interest 25 
Corporate profits before tax 390 
Tax liability 185 
Dividends 165 
Retained earnings 40 
2,580 
Nonfactor charges 
Depreciation 210 
Indirect business taxes 120 
Transfers 100 
430 
Less subsidies received —10 
Gross value added 3,000 


by deposits, but not paid to depositors, is taken as 
a measure of the services provided depositors. 
Value added, or factor and nonfactor costs, is equal 
to actual and imputed service charges, less pur- 
chases from other enterprises. 

Value added by nonenterprise sectors. Since, by 
definition, the nonenterprise sectors do not sell their 
outputs, the net output approach is not available 
for measuring the product of these sectors. But 
income originating can be estimated in terms of 
the factor costs and nonfactor charges against 
product. Indirect taxes and subsidies are not rele- 
vant to the government sector. Some countries also 
do not impute interest and depreciation charges to 
capital owned by governments, but calculate the 
income and product of governments only in terms 
of employee compensation. The compensation of 
domestic employees is counted as originating in 
the household sector, but usually no return is im- 
puted to the durable equipment owned by house- 
holds. A return is imputed to owner-occupied 
dwellings, but homeowners are counted for this 
purpose as being in the enterprise sector. 

Deflation of industry product. Deflation of in- 
dustry product to eliminate the effects of price 
changes is not performed directly. Since value 
added is the difference between two flows, gross 
production and purchases of intermediate products, 
each flow is deflated separately by an appropriate 
price index, and real industry product is taken as 
the difference. This method is called “double de- 
flation.” The aggregate of real product for all in- 
dustries should be identical with real final product, 
if consistent price deflators are used. 

Relationship to input-output tables. The in- 
put-output, or interindustry relationships, matrix 


(thousands of dollars) 


PRODUCT (SOURCES) 


Sales 4,620 

Net inventory change 300 

Value of gross production 4,920 
Less purchased goods and services 

consumed in production process —1,920 

3,000 

Gross product (value added) 3,000 


is, in effect, a disaggregation of the industry pro- 
duction accounts. The sales of each industry are 
shown horizontally on a “to-whom” basis for indus- 
tries and final demand sectors, The purchases of 
each are shown vertically on a “from-whom” basis 
in terms of industries and suppliers of factor serv- 
ices. [See INPUT-OUTPUT ANALYSIS.] 


The appropriation accounts 


Separate appropriation accounts show the receipt 
and disposition of income by sector, The receipt of 
income by a sector differs from its generation 
of “primary” income from production; disposition of 
income includes not only current final purchases 
credited to the production account but also transfer 
payments (including taxes) and saving. The rela- 
tionship of the sector accounts to the production 
account and to each other is shown in Table 2. 
Table 2 is based on the U.S. accounts, which il- 
lustrate the basic approach more simply than do 
the UN accounts, discussed later. 

In addition to the consolidated production ac- 
count and the three sector appropriation accounts, 
there are a rest-of-the-world account, which shows 
consolidated foreign transactions of the nation, and 
a saving-investment account. Actually, the U.S. ac- 
counts do not provide for a business sector account, 
but it is implicit in the accounting framework. 

The appropriation account entries consist of 
four major types of items. First, there are the 
primary income flows, consisting of factor costs 
and the nonfactor charges against product, each 
of which is a debit to the consolidated production 
account and, in most cases, a credit to one of 
the several sectoral appropriation accounts. Next, 
there are the secondary income flows, consisting of 
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Table 3 — United Nations system of national accounts 


ACCOUNT 1. DOMESTIC PRODUCT 


i 1.5 Private consumption expenditure (4.1) 

1.1 Net domestic product at factor cost (2.9) sf 
i i 14 + 5.17) 16 Generol government consumption expenditure (5.1) 

ef aplpr fon domeahe fad croot open eon aes - 17 Gross domestic fixed capital formation (3.1) 

: aes 1.8 Increase in stocks (3.2) 

IAT Creaa 1.9 Exports of goods and services (6.1) 

1.10 Less imports of goods ond services ~(6.4) 


Gross domestic product at market prices Expenditure on gross domestic product 


ACCOUNT 2, NATIONAL INCOME 


21 fompensation of employees (4.7) 2.9 Net domestic product at foctor cost (1.1) 
22 one from nener enterprises (4.8) 2.10 Net factor income from the rest-o! ‘world (6.2) 
23 Inc from property (4.9) 
2,4 Saving of corporations (3.3) 
2.5 Direct taxes on corporations (5.9) 
26 neral government income from property and entrepreneurship (5.6) 
2.7 Less interest on the public debt (5.7) 
2.8 Less interest on consumers’ debt ~(4.2) 
National income Net national product at factor cost 
ACCOUNT 3. DOMESTIC CAPITAL FORMATION 
31 tic fixed capital formation (1.7) 3.3 Saving of corporation (2.4) 


3.2 Increase in stocks (1.8) 3.4 Provisions for fixed capital consumption In corporations (1.2*) 
3.5 Net capital transfers to corporations (5.14 + 6,8 — 4.15) 
3.6 Net borrowing of corporations ~(4.18 + 5.19 + 6.11) 


Finance of gross capital formation in corporations 


3.7 Finance of gross capital formation in noncorporate private sector (4.12) 
3.8 Finance of gross capital formation in noncorporate public sector (5.13) 


Gross domestic capital formation Finance of gross domestic capital formation 


ACCOUNT 4. HOUSEHOLDS AND PRIVATE NONPROFIT INSTITUTIONS 
CURRENT ACCOUNT 


4.1 Consumption expenditure (1.5) 47 Compensation of employees (2.1) 

4.2 Interest on consumers’ debt —(2.8) 4.8 Income from unincorporated enterprises (2.2) 
43 Direct taxes (5.10) 49 Income from property (2.3) 

4.4 Other current transfers to general government (5.11) 4.10 Current transfers from general government (5.3) 
4.5 Current transfers to rest-of-the-world (6.5) 4.11 Current transfers from ft -the-world (6.3*) 


4.6 Saving (4.13) 


Disposal of income Income of households and private nonprofit institutions 


CAPITAL RECONCILIATION ACCOUNT 


4.12 Finance of gross capital formation in noncorporate private sector (3.7) 4.13 Saving (4.6) 
4.14 Provisions for fixed capital consumption (1.2*) 
4.15 Net capital transfers from corporations (3.5*) 
416 | transfers from general government (5.15) 
4.17 Net capital transfers from rest-of-the-world (6.9) 
4.18 Net borrowing —(3.6 + 5,19 + 6.11) 


Disbursements Receipts 
ACCOUNT 5, GENERAL GOVERNMENT 
CURRENT ACCOUNT 

5.1 Consumption expenditure (1.6) 5.6 Income from property and entrepreneurship (2.6) 
5.2 Subsidies —(1,4) 5.7 Less interest on the public deb? (2.7) 
5.3 Current transfers to households (4.10) 5.8 indirect toxes (1.3) 
5.4 Current transfers to rest-of-the-world (6,5) 5.9 Direct taxes on corporations (2.5) 
5.5 Saving (5.16) 5.10 Direct taxes on households (4.3) 


5.11 Other current transfers from households (4.4) 
5.12 Current trans! from rest-of-the-world (6.3*) 


Disposal of current revenue Current revenue 


CAPITAL RECONCILIATION ACCOUNT 


5.13 Finance of gross capital formation in noncorporate public sector (3.8) 5.16 Seving (5.5) 
5.14 Net capital transfers to corporations (3,5*) 5.17 Provisions for fixed capital consumption (1.2") 
5.15 Net capital transfers to noncorporate Private sector (4.16) 5.18 Net capital transfers from rest-of the-world (6.10) 


5.19 Net borrowing ~(3.6 + 4.18 + 6.11) 


Disbursements Receipts 


ACCOUNT 6. EXTERNAL TRANSACTIONS (REST-OF-THE-WORLD ACCOUNT) 


‘CURRENT ACCOUNT 
6.1 Exports of goods and services (1.9) 


6.4 Imports of goods and services ~(1.10) 
6.2 Not factor income from rest-of-the-world (2.10) 6 -of the 
6.3 Current transfers from rest-of-the-world (4.11 + 5.12) Aa E ee ee eos AERA) 


Surplus of nation on current account (6.7) 
Disposol of current receipts 


CAPITAL RECONCILIATION ACCOUNT 
6.11 Net lending to rest-of the-world —(3.6 + 4.18 + 5.19) 


Current receipts 


6.7 Surplus of nation on current ‘account (6.6) 

6.8 Net capital transfers from Fest-of-the-world to corporations (3.5*) 

69 Net capital transfers from fest-of-the-world to households (4.17) 

6.10 capital transfers from rest-of-the-world to general government (5.18) 
Receipts Disbursements 

* Denotes “part of” item listed, 


Source: United Nations, Statistical Office (1953) 1964, pp. 18-19. 
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total sources and uses of funds must balance, al- 
though in practice a residual statistical discrep- 
ancy entry must be made to offset errors of various 
types in the component estimates. 

The U.S, Federal Reserve Board (FRB) uses a 
somewhat different format for its flow-of-funds 
summary table, as shown in Table 4. In effect, the 
FRB combines the saving—investment and financial 
accounts for the various nonfinancial and financial 
sectors, and the total economy. The table starts 
with gross saving (line 1) and gross investment 
(line 4), Gross investment is defined as the sum 
of gross (private) tangible capital expenditures 
(line 5) and “net financial investment.” The latter 
item is obtained as the difference between direct 
estimates of financial uses (net acquisition of as- 
sets, line 11) and financial sources of funds (in- 
creases in liabilities, line 12). The bulk of the table 
consists of an itemization of more than two dozen 
types of financial claims which provide the specific 
sources and/or uses of funds (lines 13 through 
43), A further breakdown of types of claims would 
be possible if its usefulness warranted the addi- 
tional cost. 

Note that in principle, gross saving and gross 
investment are equal for each sector, as well as for 
the total economy, when gross investment includes 
net financial investment as well as tangible capital 
expenditures. In practice, there are sector discrep- 
ancies (line 44) resulting from statistical errors. It 
should also be observed that financial flows for 
each sector are generally recorded on a net trans- 
action basis for each category. That is to say, the 
asset entry represents funds used to acquire assets 
of that type, less funds realized for disposition of 
assets of the same type during the accounting 
period; the liability entry represents funds raised 
by borrowing, less repayments of the same type of 
claims in the stated accounting period. It has been 
suggested that it would be more informative if data 
could be provided showing gross financial flows for 
the various categories. 

For the economy as a whole, saving and invest- 
ment for all the sectors are summed. The inter- 
sectoral financial flows cancel out within the do- 
mestic economy, as do transactions in previously 
existing tangible assets. National investment is 
seen (in the last column of Table 4) to consist of 
gross tangible new investment, plus a financial 
component consisting of the increase in net finan- 
cial claims on foreigners. Gross saving equals gross 
investment so defined, except for the statistical dis- 
crepancy. 

The FRB flow-of-funds system is not fully co- 
ordinate with the U.S. Commerce Department's 


national income sector accounts. The division of 
the economy into sectors is somewhat different; in 
particular, the income accounts do not segregate 
a financial sector, which is essential to a flow-of- 
funds system. An integrated system requires uni- 
form sectors which accommodate the needs of 
financial accounts as well as of appropriation and 
saving—investment accounts. Further, the defini- 
tions and estimates of common items now differ. 
For example, the FRB defines household purchases 
of durable goods as investment and estimates the 
corresponding capital consumption allowances as 
part of gross saving of households, while the Com- 
merce Department counts gross outlays for durable 
goods as current consumption expenditure. Obvi- 
ously, a common set of definitions and estimates is 
requisite to an integrated system. 

In Canada and a few other countries, financing 
accounts are integrated with the basic income ac- 
counts, by sector. In determining the structure of 
integrated flow accounts with respect to sectors 
and transaction categories, thought must also be 
given to the desired structure of sector and national 
balance sheets, since balance sheets are uniquely 
related to the capital and financing accounts. 

Relation to balance sheets and wealth statements. 
Assets, liabilities, and net worth on sector and com- 
bined balance sheets change as a result of invest- 
ments and saving, lending and borrowing, capital 
transfers, and capital gains and losses from re- 
valuation of claims due to changes in market 
prices. Thus, if the capital reconciliation and fi- 
nancing accounts are supplemented by revaluation 
accounts, balance sheets at successive dates can 
be prepared by cumulatively adding to an initial 
balance sheet the net changes shown in the capital 
and financing accounts. 

When balance sheets for all sectors are con- 
solidated, the claims of one domestic sector against 
another and the corresponding liabilities cancel out, 
and national net worth (wealth), equal to tangible 
domestic assets plus net foreign assets, is ‘left. The 
change in national wealth or net worth is equal to 
net investment or net saving, as carried in the 
capital account, plus revaluations [see NATIONAL 
WEALTH], 


Uses of the economic accounts 


Integrated systems of economic accounts of the 
sort sketched in this article obviously have many 
uses, actual or potential. They may be discussed 
briefly under six headings. 

(1) Appraisal—public and private organizations 
use the economic accounts in periodic reviews of 
the economic situation at the international, na- 


32 NATIONAL INCOME AND PRODUCT ACCOUNTS 


“BOL “A ‘S961 sequenony ‘uyang sAsesey josapay :e2:n0S 


4N3W 
-IS3ANI 
ONY 
ONIAYS 
TVNOLLYN 


-d3X4 2S1] 


"50" UOY) 5507, 


ri- 6- . 6- sı ye se- (p — 1) sapuodassip oos pp 
oP fet. ci ety k se zi |e- ERa A A A suoyposun4; jopuouy snoaunjjasiw 
rr- rr- ssəujsnq əjpaodsoəuou ut Ajinbg 
z= T r- v sr 68 v pas apoa 
v © r . r ajqp4od soxoy 
s- e r- t= ssoyio of 
e a T E-F t= s19|D9p puo s494049 04 
. þr s- s b a r- r- Hipon Arunas 
s s r r supoj josapag 
Si r- r- si ri seded joxs0u uado 
oz rz r- 0% s % vt Cs suooj 3940 
s 7. € 9% pəyissoj> asaymosja jou supoj yuog 
sz re 69 pa» sowinsuoy, 
rL ré ot səbo6uow 33410 
e- szi t- T- oa r sowoy Ajjuoj-p 04 -1 'soBoBpow 
SNA DEE Ea et ri g- s9p945 91010410 
sh os E yz se or t- spuog u6}0104 puo ajosodio) 
T ve os g- 9% 's401J0B1q0 1020] pun DIS 
et s= st r9 r s'i- vz Sa1I1sND05 jUOUIUIOAB ‘S'N 
6s sse yo sz re $9 609/49 oo (£0) (te tt race siuawnsysuy joyau 41p9 
i sD aS swai 440q payopyjosuo> 
69 69 el se vu s9A19591 puns Uo}suag 
er er . rr soAsosas o2u0snsul 3j 
t sar “st suopmsus sBuans 4y 
ri v sm r “t ve ze syuoq jopsawwo> iy 
= roer eez sjunos9p s6ujans pup awy 
k F e 
r ta 4uawusoaob “S'N 
7 rz 9 v £ sz- r ausowop apand 
9% rt Aauasn pun sisodep puowag 
t- w]e kauan Aansoasy 
. . a6buoy>xə u619104 'S'N Opyo puo Pjog 
use pata re rop LL 86 ost va soounos jopuourg 
se 69 vez re se9 zy rst co 494 “sasn jopuoutg 
tr- si s fh re se- toi- 66l (Z1 — U1) tuowysenuy jopuouy JaN 
8'r abuoyə Kojuaauj 
s r 6 T9 juawdynbe puo jupid 
s4 uonsnysuos joyuapisay 
qosnp sownsue> 
s r 6 s89 490 ‘ainj1puadxe jojido> ajoatid 
tr- ez oz p er ve res (01 +5) sewysaauy s5039 
si v 9% o's- rz- ort (Z—1) Suraos 4N 
Yr 2 ow uonduinsuos jojdog 
6L re rz- 819 Burans 5509 
= oe 
3 A i g Ş £ = g = ANODILVI NOMDYSNVYL 
i È È È 
aom | auouy syu 0; sjuor sous 
s40133S HELO | ywoquey “Jor' -oyin HE Sera ii 
ny “4538 -wwo Aojauoyy [920] puo 
5s 
S40193S IVIDNVNIS SY¥O193S TWIDNVNIJNON DILS3WOG ILYAlðd 


(S4n]JOp 40 suorysiq U!) 961 ‘sjuN022p spunj-jo-moy SƏD pa, 


In jo Arpuiuings — p ajqoy 


NATIONAL INCOME AND PRODUCT ACCOUNTS 33 


tional, and regional levels. The reviews often serve 
as background for projections, for policy recom- 
mendations to correct unsatisfactory developments, 
or for planning adaptive actions. 

(2) Analysis—the economic accounts provide a 
major body of statistics for quantitative analyses 
of changes or differences in economic aggregates 
and structure. Production and demand functions 
are of particular interest. Statistical analyses are 
often directed toward the testing and refinement of 
economic theory; they enhance the ability of econ- 
omists to predict and, thus, to advise on policy 
issues. 

(3) Projections—the economic accounts serve 
as a background for short-term projections of ag- 
gregates and components, based mainly on demand 
models, and for long-range projections, based 
mainly on supply models or production functions. 
The models require that certain exogenous vari- 
ables, such as government expenditures or the labor 
force, be estimated from outside the model; how- 
ever, the estimates of the values of endogenous 
variables are based on the structural relationships 
contained within the model. Projections serve as 
background for adaptive measures by firms and for 
policy formulation by government agencies. 

(4) Goal setting—regional, national, and supra- 
national agencies may set goals in terms of desired 
and feasible levels or rates of growth of national 
income and product and their components, such as 
domestic and foreign investment. Since World War 
m, many countries have enacted legislation re- 
quiring the central government to take the neces- 
sary measures to maintain high levels of employ- 
ment and production relative to capacity. These 
goals are meaningful only when they can be quanti- 
fied in terms of the economic accounts and related 
statistics, and the current situation can be ap- 
praised with reference to the goals. 

(5) Policy formulation—current evaluations and 
projections within the economic accounting frame- 
work provide the background needed to determine 
whether policy actions are required to correct pos- 
sible discrepancies between actual or projected de- 
velopments and desired objectives. Just as im- 
portant, the economic relationships revealed by 
analyses based on the income and product accounts 
make it possible to predict the consequences of 
given or alternative policy actions, and the relation- 
ships revealed assist policy makers in choosing 
among policies. If the objective is given in numer- 
ical terms, analysis can help policy makers to find 
the economic measures suitable to the objective. 

(6) Planning—knowledge of the current and 
prospective general economic situations and po- 


tentials, as revealed through the economic accounts, 
is of obvious value for planning to firms, other 
private organizations, and governments. Firms 
project the markets for their goods on the basis of 
the national market projections. On the basis of 
their sales projections, they can plan their invest- 
ment, personnel, and other internal policies. Gov- 
ernment can project the probable public service, 
investment, and revenue requirements of the na- 
tion and can also plan the new construction, 
training, and other programs necessary to meet 
these needs. In the semisocialist or wholly socialist 
countries, the economic accounts must be supple- 
mented by much more detailed data for firms as 
well as for industries. 

It is clear that all the activities just mentioned 
are facilitated by comprehensive and consistent 
systems of economic accounts. As their economies 
grow more complex, most countries are devoting 
more resources to the development of national in- 
come accounts and to the collection of the primary 
data on which they are based. 


Jonn W. KENDRICK 
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NATIONAL INTEREST 


The concept of the national interest is used in 
both political analysis and political action. As an 
analytic tool, it is employed to describe, explain, 
or evaluate the sources or the adequacy of a na- 
tion’s foreign policy. As an instrument of political 


action, it serves as a means of justifying, de- 
nouncing, or proposing policies. Both usages, in 
other words, refer to what is best for a national 
society. They also share a tendency to confine the 
intended meaning to what is best for a nation in 
foreign affairs. 

Beyond these general considerations, however, 
the two uses of the concept have little in common. 
In its action usage the concept lacks structure and 
content but, nevertheless, serves its users, political 
actors, well. As an analytic tool the concept is 
more precise and elaborate but, nevertheless, con- 
founds the efforts of its users, political analysts. 
These differences arise out of the fact that the 
national interest is rooted in values (“what is 
best”). While analysts have discovered that the 
value-laden character of the concept makes it 
difficult to employ as a tool of rigorous investiga- 
tion, actors have found that this very same char- 
acteristic renders the concept useful both as a 
way of thinking about their goals and as a means 
of mobilizing support for them. That is, not only 
do political actors tend to perceive and discuss 
their goals in terms of the national interest, but 
they are also inclined to claim that their goals are 
the national interest, a claim that often arouses 
the support necessary to move toward a realiza- 
tion of the goals. Consequently, even though it 
has lost some of its early appeal as an analytic 
tool, the national interest enjoys considerable 
favor as a basis for action and has won a promi- 
nent place in the dialogue of public affairs. 


History of the concept 


The national interest has a much longer history 
as an instrument of action than as a tool of 
analysis. According to a historian who traced past 
uses of the term, political actors made claims on 
behalf of the national interest as early as the 
sixteenth century in Italy and the seventeenth 
century in England (Beard 1934, pp, 22-24). At 
that time claims made in the name of “the will 
of the prince,” “dynastic interests,” raison d'état, 
and other older catchwords began to lose their 
effectiveness as a new form of political organiza- 
tion, the nation-state, came into being and served 
as the political unit to which men owed their 
allegiance. Thus, the old terms were gradually 
replaced by new ones that reflected the new 
loyalties. The national interest was one of these, 
as was “national honor,” “the public interest,” and 
“the general will.” Beard also found that “the 
term, national interest, has been extensively em- 
ployed by American statesmen since the establish- 
ment of the Constitution” (1934, p. 26). 


Many decades elapsed, however, before the na- 
tional interest attracted attention as a tool of 
analysis. Not until the twentieth century, when 
two world wars made it clear that mass publics 
had both a vital stake in foreign affairs and played 
a vital role in them, did analysts focus on the 
national interest as a concept which could be used 
to describe, explain, and assess the foreign policies 
of nations. Beard was himself one of the first to 
develop the concept for this purpose and to dis- 
tinguish it from the “public interest,” which 
through convention has come to be used in refer- 
ence to the domestic policies of nations. 

Initially, the national interest appealed to an- 
alysts whose main concern was to evaluate the 
foreign policies which led to World War u. Im- 
pressed with the thought that the global conflict 
might have been avoided if the British and the 
French had not acquiesced to Hitler at Munich in 
1938 and if the United States had not adopted 
isolationist policies throughout the 1930s, a num- 
ber of analysts turned to the national interest as 
a way of determining the adequacy and effective- 
ness of past, present, or future policies. They rea- 
soned in retrospect that the advent of World War 
m made it clear that the prewar policies of the 
three nations were ill-advised and that the policies 
proved to be contrary to the best interests of Eng- 
land, France, and the United States. To these 
analysts it thus seemed obvious that the best 
interest of a nation is a matter of objective reality 
and that by describing this reality one is able to 
use the concept of the national interest as a basis 
for evaluating the appropriateness of the policies 
which a nation pursues. Because of their underly- 
ing assumption that the national interest can be 
objectively determined, we shall call these analysts 
“objectivists.” 

It should be noted that most objectivists do not 
have an explicit and elaborate rationale for their 
approach to the national interest, Interested pri- 
marily in analyzing the contents of foreign policy, 
the objectivists are not particularly concerned 
about the methodological and philosophical foun- 
dations of their inquiries. They make no special 
effort to explain how and why their descriptions 
of the national interest are in accord with reality 
because, for them, the correspondence between 
their descriptions and the objective situation is 
self-evident. Objectivists thus leave to their read- 
ers the task of inferring their conceptualization 
of the national interest from substantive observa- 
tions which are as variable as the situations which 
they describe. 

It is possible, however, to derive some insight 
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into the underlying rationale of the objectivists 
from the writings of one analyst who did under- 
take to develop an explicit framework for explain- 
ing why his substantive interpretations of the 
national interest reflect objective reality. The an- 
alyst is Hans Morgenthau, whose works advance “a 
realist theory of international politics” founded on 
the concept of national interest. “Interest is the 
perennial standard by which political action must 
be judged and directed,” Morgenthau wrote ( [1948] 
1954, p. 9), emphasizing that, therefore, the “ob- 
jectives of a foreign policy must be defined in 
terms of the national interest” (p. 528). And 
exactly what constitutes the interest of a nation? 
Morgenthau recognized that “the kind of interest 
determining political action in a particular period 
of history depends upon the political and cultural 
context within which foreign policy is formulated” 
(p. 8), but he envisioned accounting for these 
contextual factors by defining interest in terms of 
power (p. 5). For Morgenthau the power at a 
nation’s command relative to that of other nations 
is, at any moment in time, an objective reality for 
that nation and thus serves to determine what its 
true interest is and should be. As will be seen, how- 
ever, the difficulty with Morgenthau’s formulation 
is the lack of a method for determining what a 
nation’s relative power is. That is, he does not 
indicate how use of the criterion of power will 
enable nations to “follow . . . but one guiding star, 
one standard for thought, one rule for action: the 
national interest” (1951, p. 242). 

As the discipline of political science gave in- 
creasing emphasis to scientific explanation, an- 
other group of analysts joined the objectivists in 
converging upon the national interest as an an- 
alytic concept. Concerned less with evaluating the 
worth of foreign policies and more with explaining 
why nations do what they do when they engage 
in international action, this group found the na- 
tional interest attractive as a possible means of 
probing the sources of foreign policy. They rea- 
soned that nations do what they do in order to 
satisfy their best interests and that by describing 
these needs and wants the analyst would be in a 
position to use the concept of the national interest 
as a tool for explanation. These analysts, in other 
words, deny the existence of an objective reality 
which is discoverable through systematic inquiry. 
For them the national interest is not a singular 
objective truth that prevails whether or not it is 
perceived by the members of a nation, but it is, 
rather, a pluralistic set of subjective preferences 
that change whenever the requirements and 
aspirations of the nation’s members change. For 
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want of a better term, hereafter we shall call those 
who approach the national interest in this way 
the “subjectivists.” 

The advent of the decision-making approach to 
international politics (Snyder et al. 1954) provided 
the subjectivists with an additional rationale for 
their approach to the national interest. Partly as 
a reaction to the objectivists and partly out of a 
concern to render concepts usable by linking them 
to observable behavior, students of decision mak- 
ing contend that the national interest, being com- 
posed of values (what people want), is not sus- 
ceptible of objective measurement even if defined 
in terms of power and that, accordingly, the only 
way to uncover what people need and want is to 
assume that their requirements and aspirations are 
reflected in the actions of a nation’s policy makers. 
For these analysts, in other words, the national 
interest is whatever the officials of a nation seek 
to preserve and enhance. As two leading spokes- 
men for this approach put it, “The national in- 
terest is what the nation, i.e., the decision-maker, 
decides it is” (Furniss & Snyder 1955, p. 17). 

It is worthy of emphasis that although the ob- 
jectivists and subjectivists differed profoundly in 
their premises and conclusions, both came to ac- 
cept the appropriateness of analyzing foreign 
policy and international politics in terms of the 
national interest. To be sure, the two groups 
focused on different phenomena when they in- 
vestigated the national interest, but they both em- 
phasized that its relevance to the external actions 
of nations was considerable. 


Limitations of the concept 


Despite the claims made for the concept and 
notwithstanding its apparent utility, the national 
interest has never fulfilled its early promise as an 
analytic tool. Attempts by both objectivists and sub- 
Jectivists to use and apply it have proven fruitless 
or misleading, with the result that, while textbooks 
on international politics continue to assert that 
nations act to protect and realize their national 
interests, the research literature of the field has 
not been increased and enriched by monographs 
which give central prominence to the concept. 

The reasons for this failure of the concept as 
an analytic tool are numerous. One is the am- 
biguous nature of the nation and the difficulty of 
specifying whose interests it encompasses. A sec- 
ond is the elusiveness of criteria for determining 
the existence of interests and for tracing their 
presence in substantive policies. Still another con- 
founding factor is the absence of procedures for 
cumulating the interests once they have been 


identified. This is in turn complicated by uncer- 
tainty as to whether the national interest has been 
fully identified once all the specific interests have 
been cumulated or whether there are not other, 
more generalized values which render the national 
interest greater than the sum of its parts. 

These limitations are readily discernible in the 
premises and writings of the objectivists. What is 
best for a nation in foreign affairs is never self- 
evident. More important, it is not even potentially 
knowable as a singular objective truth. Men are 
bound to differ on what constitute the most ap- 
propriate goals for a nation. For, to repeat, goals 
and interests are value-laden. They involve sub- 
jective preferences, and thus the cumulation of 
national interests into a single complex of values 
is bound to be as variable as the number of ob- 
servers who use different value frameworks. Yet 
the objectivists proceed on the assumption that 
some values are preferable to others (for example, 
that it is better for the nation to survive than not 
to survive) and that therefore it is possible to 
discover, cumulate, and objectify a single national 
interest. Consequently, the objectivists find it pos- 
sible to characterize every foreign policy as either 
reflecting or opposing the national interest. In- 
deed, it was precisely this line of reasoning that 
enabled them to posit an objective reality about 
the conditions of prewar Europe and to conclude 
that Great Britain and France did not follow their 
national interests when they ignored this reality 
and acquiesced to Hitler at Munich, 

However, such reasoning breaks down as soon 
as it is recalled that the national interest is rooted 
in values. If every member of a nation wishes to 
have it go out of existence by joining a larger 
world federation, who is to say that the goal or 
act of federation is contrary to that nation’s in- 
terest? If the British and the French believed they 
were satisfying their wants and needs when they 
compromised at Munich, who is to say they were 
wrong and acted in violation of their national 
interests? The analyst who does make such an ob- 
servation is merely enjoying the benefits of hind- 
sight to justify the superiority of his own values 
over those of the British and French policy makers 
who decided to acquiesce to Hitler (obviously, the 
policy makers would have acted differently if they 
could have foreseen the consequences of acquies- 
cence). Since values are not susceptible of sci- 
entific proof, the objectivists have never been able 
to demonstrate the validity of their assessments 
of the extent to which foreign-policy actions reflect 
a nation’s interest. To explain that a certain policy 
is in the national interest, or to criticize it for being 


contrary to the national interest, is to give an im- 
posing label to one’s own conception of what is a 
desirable or undesirable course of action. 

The objectivists do not consider that their own 
values serve as criteria for determining the sub- 
stantive content of the national interest. Rather, as 
has been noted, they conceive of a nation’s power 
as the source of what is best for it and as the 
basis for ranking some values as preferable to 
others. From their point of view, a nation’s power 
has an objective existence and thus values need 
not enter into a determination of its national in- 
terest. The objectivists concede that the national 
interest may be defended, criticized, or explained 
in value, rather than power, terms. Such formula- 
tions, however, are regarded as merely ideological 
justifications and rationalizations which conceal 
the true nature of the policy under consideration. 
Policy makers may claim that moral principles 
serve as the basis for their actions, but the objec- 
tivists assume that in fact “statesmen think and 
act in terms of interest defined as power” (Mor- 
genthau [1948] 1954, p. 5), 

This reasoning of the objectivists is essentially 
erroneous. The dictates of power are never clearly 
manifest. Power is as elusive and ambiguous a 
concept as is interest. Its components are a matter 
of dispute. Furthermore, many power components 
consist of intangibles, such as morale, which are 
difficult to measure. Even more difficult, if not 
impossible, is the task of cumulating the tangible 
and intangible components into a single entity 
called “the power of a nation.” For not only does 
the cumulation of unlike factors constitute a dif- 
ficult problem in itself, but it also necessitates the 
introduction of values. To cumulate the compo- 
nents of power one must assess the relative impor- 
tance of each component, and such an assessment 
can only be made by referring to the goals which 
the power is designed to serve. Hence, whether he 
wishes to or not, the analyst must inevitably fall 
back on a value framework—the one from which 
goals are derived—if he is to define the national 
interest in terms of power. It follows that there is 
no reason to assume that different analysts will 
necessarily arrive at similar, much less identical, 
interpretations of what a nation’s power dictates 
its national interest to be. [See PowER.] 

In short, there may be an “objective reality” 
about the situations in which nations find them- 
selves at various periods of history, but neither 
predictively nor retrospectively can its contents be 
clearly demonstrated. A description of the national 
interest can never be more than a set of conclu- 
sions derived from the analytic and evaluative 
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framework of the describer. The objectivists un- 
knowingly concede this point whenever they criti- 
cize foreign policies for not being in a nation’s 
interest. For such a criticism contradicts their view 
that the national interest determines the contents 
of what nations do abroad. For example, Morgen- 
thau goes to great lengths in his writings to show 
how “a foreign policy guided by moral abstractions, 
without consideration of the national interest, is 
bound to fail” (1951, pp. 33-34), an observation 
which undermines his assumption that policy 
makers always think and act in terms of interest 
defined as power. 

Nor have the various subjectivist approaches to 
the national interest been conspicuously success- 
ful. Although subjectivists carefully and explicitly 
avoid the premise that the national interest can 
be objectified, their formulations and uses of the 
concept are far from free of its inherent limita- 
tions. The recognition that many groups in a na- 
tion have different and often conflicting concepts 
of what external actions and policies are best for 
it—and that consequently the national interest 
is a reflection of these preferences rather than of 
objective circumstances—gives rise to as many 
conceptual and methodological difficulties as it 
avoids, First there is the problem of which groups 
constitute a nation. Should the boundaries of a 
nation be equated with those of national societies 
or does a nation consist of persons with a common 
background who speak the same language? If the 
latter, more traditional formulation is used, then 
analysis is complicated by the fact that some na- 
tions exist within or extend beyond the boundaries 
of national societies. Such nations may have 
neither governments nor foreign policies, so that 
inquiries into their interests would be far removed 
from the concerns which attract attention to the 
concept of national interest. For this reason most 
subjectivists equate nations with national societies 
and employ the terms interchangeably. In other 
words, the national interest usually means the 
societal interest. [See NATION.] 

Defining the nation in this way, however, is a 
relatively minor problem and only proves trouble- 
some for the terminological purist. Much more 
difficult is the problem which follows immediately 
from the assumption that nations are heter- 
ogeneous units encompassing a multitude of 
ethnic, cultural, social, and other types of groups: 
namely, the problem of identifying and classifying 
all the diverse and conflicting interests which 
clamor for satisfaction in a national society. Stated 
simply, how does the observer determine which 
groups have an interest in a particular foreign 
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policy and how does he specify what the substan- 
tive contents of the interests of each group are? 
The subjectivist’s obvious reply is that interests 
are equivalent to the demands and recommenda- 
tions which group spokesmen in the society artic- 
ulate and press. But this answer is of little help 
to the researcher. The groups with specialized in- 
terests in modern industrial societies are so nu- 
merous, and the types of policies in which they have 
a stake are so varied, that the quantitative dimen- 
sions of such a procedure render it virtually un- 
usable. Furthermore, even if this quantitative 
problem could be overcome, there would remain 
the qualitative task of accounting for the groups 
whose interests lack a spokesman (for example, 
future generations, consumers, and repressed 
minorities). Presumably this can only be accom- 
plished through estimates of what is best for such 
groups, a procedure which brings the subjectivist 
perilously close to the objectivist’s practice of 
ascribing his own values to others. 

Nor does the subjectivist’s dilemma end here. 
Assuming that he is somehow able to identify all 
the expressed and unexpressed interests of a so- 
ciety, he must then combine the multiplicity of 
values into a meaningful whole. Not to do so 
would be to treat the national interest as a mass 
of contradictory needs and wants, a procedure 
which is hardly suitable to the description, ex- 
planation, and evaluation of foreign-policy goals. 
But in order to aggregate many contradictory 
values into an over-all formulation of the national 
interest, one must face the probability that some 
of the specific interests carry greater weight than 
others, and it is this probability that perpetuates 
the dilemma. For it raises the question of how the 
relative weight of the conflicting interests is to be 
determined. The most tempting solution is to at- 
tach weights on the basis of one’s own assess- 
ments, but this procedure would again lead the 
subjectivist down the misleading path followed by 
the objectivist. 

Most subjectivists avoid these complex, seem- 
ingly insurmountable problems by relying on a 
procedural rather than a substantive definition of 
the national interest. Rather than attempt to sum 
the values that prevail in a society, they rely on 
the society's political processes to do it for them. 
That is, they fall back on a decision-making con- 
ception in which the foreign-policy goals that a 
society sets for itself are considered to result from 
bargaining among the various groups claiming 
satisfaction of their needs and wants. If some in- 
terests carry greater weight than others, it is as- 
sumed that the differences will be recognized and 


accounted for in the policy-making process. In 
other words, regardless of whether democratic or 
authoritarian procedures are employed, the needs 
and wants of groups within national societies are 
assumed to constitute demands that policy makers 
must sort out and obey, an assumption which re- 
lieves the observer from having to resort to his 
own values to determine which interests are 
weightier. Operationally, the substantive content 
of the national interest thus becomes whatever a 
society's officials decide it to be, and the main 
determinant of content is the procedure by which 
such decisions are made. 

This approach also allows for the operation of 
generalized values which render the national in- 
terest greater than the sum of its parts. If the 
over-all perspective of their high offices leads 
policy makers to conclude that the demands made 
upon them are, taken together, insufficient to 
serve the welfare of the nation, and if they there- 
fore superimpose their own values on the decision- 
making process, then clearly the analyst can treat 
the national interest as more than the cumulated 
total of subnational interests without falling back 
on his own view of what is best for the society he 
is examining. 

Yet the decision-making approach to the na- 
tional interest also suffers from limitations. The 
main weakness is that it is not always possible to 
ascertain when a policy has been officially decided 
upon, since most policies undergo a continuous 
process of evolution and revision as external con- 
ditions change and internal demands shift. This 
difficulty cannot be circumvented by focusing on 
the values which officials espouse at any point in 
the decision-making process. For the various of- 
ficials of a society often hold and assert different 
conceptions of what the goals of foreign policy 
ought to be. Under these circumstances, which 
usually prevail in open democratic societies, the 
analyst gets little guidance from the formulation 
that the national interest of a society is what its 
duly constituted policy makers claim it to be. The 
United States offers a good example of this di- 
lemma. Not infrequently does it happen that the 
president, members of his cabinet, other executive- 
branch officials, and the leaders, committees, and 
individual members of Congress pursue values 
and policy goals which are in direct conflict and 
which they all contend are best for the nation. Who 
is correct? Operationally, they all are, as they all 
have some official responsibility for formulating 
and executing foreign policy. What, then, is the 
national interest? The question seems unanswer- 
able unless the analyst is willing to fall back on 


his own values to decide which officials express 
the soundest and most representative views. 

A second difficulty with the decision-making 
approach concerns closed authoritarian societies. 
Many groups in such societies have no oppor- 
tunity to articulate their needs and wants, thereby 
undermining the assumption that the various in- 
terests of a society are sorted and summed in its 
political processes, It hardly seems plausible, for 
example, to equate the national interest of Ger- 
many during the Nazi period with the actions, 
pronouncements, and aspirations of Hitler. Such 
a formulation runs counter to the concerns which 
make the national interest attractive as an analytic 
concept. Yet, under the decision-making approach 
the analyst has no choice but to view Hitler's 
purging of the Jews and his launching of World 
War n as in the German national interest. To do 
otherwise is either to fall back on an objectivist 
view that there is a “true” national interest of 
Germany which Hitler violated or to be confronted 
with the insuperable problem of identifying and 
aggregating the unarticulated interests of the vari- 
ous groups then existent in Germany. 


Future of the concept 


There can be little wonder, then, that the na- 
tional interest has not sparked research or other- 
wise lived up to its early promise as an analytic 
tool. All the approaches to it suffer from difficulties 
which defy resolution and which confound rather 
than clarify analysis. As political inquiry becomes 
more systematic and explicit, the concept is there- 
fore likely to be used less and less. Serious doubts 
about its analytic utility have already been ex- 
pressed (e.g., Modelski 1962, pp. 70-72), and it 
seems probable that objectivists and subjectivists 
alike will find that they can evaluate and explain 
foreign-policy phenomena adequately without hav- 
ing to resort to the national interest as an over-all 
explanation or characterization. 

The trend toward more systematic inquiry is not 
the only reason why the abandonment of the con- 
cept is likely to be hastened in the future. The 
ever greater interdependence of nations and the 
emergence of increasing numbers of supranational 
actors is also bound to diminish reliance on the 
concept. Increasingly, decision makers act on be- 
half of clusters of nations as well as their own. 
They identify their own interests as inextricably 
tied to the welfare of their region, their continent, 
or their way of life. Many U.S. officials, for ex- 
ample, now argue that the public must make 
sacrifices for Latin America because a higher 
standard of living throughout the Western Hemi- 
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sphere is in the best interests of the United States. 
Similarly, other statesmen make decisions with a 
view to enhancing unformalized supranational 
communities, such as “the West,” “the Arab world,” 
and “the communist bloc,” or formalized eco- 
nomic and political unions, such as the Common 
Market, Malaysia, and the West Indies Federation. 
Clearly such global tendencies further reduce the 
utility of an attempt to explain international be- 
havior in terms of the national interest. 

Yet, the national interest cannot be entirely 
abandoned. Even though the nation is declining 
in its importance as a political unit to which al- 
legiances are attached, the process of decline is 
many decades—perhaps even centuries—away 
from an end. Political actors will no doubt con- 
tinue to make extensive use of the national interest 
in their thinking about foreign-policy goals and in 
their efforts to mobilize support for them, And, 
to the extent that they do, political observers must 
take cognizance of the national interest. In other 
words, while the national interest has little future 
as an analytic concept, its use in politics will long 
continue to be a datum requiring analysis. 


James N. RosENAU 


[See also FOREIGN POLICY; INTERNATIONAL POLITICS; 
NATIONAL SECURITY. Other relevant material may 
be found in INTERNATIONAL RELATIONS; PUBLIC IN- 
TEREST. ] 
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NATIONAL POWER 
See MILITARY POWER POTENTIAL; POWER. 


NATIONAL SECURITY 


The term “national security” has long been used 
by politicians as a rhetorical phrase and by mili- 
tary leaders to describe a policy objective. More 
recently, however, it has been adopted by social 
scientists, to refer to both an analytical con- 
cept and a field of study. When modern social 
scientists talk of the concept, they generally mean 
the ability of a nation to protect its internal values 
from external threats. The field of study, therefore, 
encompasses attempts to analyze the manner in 
which nations plan, make, and evaluate the de- 
cisions and policies designed to increase this ability. 


Development of the concept and the study 


The origins of the concept of national security, 
at least in the United States, can be found in the 
different historical formulations of the concept of 
national interest [see NATIONAL INTEREST]. Al- 
though certain elements of the modern concept 
can be traced back to the thought of James 
Madison and appeared more recently in that of 
Charles Beard and Hans Morgenthau (see Bock & 
Berkowitz 1966), it is Walter Lippmann who first 
defined national security explicitly. “A nation has 
security,” he wrote, “when it does not have to sacri- 
fice its legitimate interests to avoid war, and is 


able, if challenged, to maintain them by war 
(1943, p. 5). However, as Arnold Wolfers pointed 
out (1952), a simple translation of “the national 
interest” into “the national security interest,” 
coupled with a normative admonition that nations 
should pursue such an interest, does not lead to 
any greater conceptual clarity than the earlier 
formulations of “national interest.” 

Only with the postwar trends of behavioralism 
and systems analysis did attempts at conceptual 
clarification and specification emerge. Morton 
Kaplan’s work (1957) reflects the trend to study 
all aspects of societal behavior as part of the total 
pattern which constitutes a behavior system. Na- 
tional interest and the national security interest, 
therefore, are treated as simply one aspect, albeit 
an important one, of the problem of system main- 
tenance. Furthermore, the security of the national 
system is closely linked to the security of the sub- 
systems which make up the national system. [See 
SYSTEMS ANALYSIS.] 

The development of national security as a dis- 
tinct academic field of interest owes much to po- 
litical and administrative changes in the United 
States immediately following World War 11. During 
that war, glaring deficiencies were discovered in 
the administrative machinery which was supposed 
to coordinate the war effort and to provide long- 
range plans for the postwar period. Technological 
advances, especially the advent of atomic weapons, 
demonstrated even more clearly that military 
matters could no longer be dealt with in a vacuum 
but had to be closely tied to political and eco- 
nomic considerations, both foreign and domestic 
in nature. 

In 1947 the U.S. Congress passed the National 
Security Act, whose intent it was “to provide for 
the establishment of integrated policies and pro- 
cedures for the departments, agencies, and func- 
tions of the Government relating to the national 
security” (sec, 2). The act established the National 
Security Council (NSC) “to advise the President 
with respect to the integration of domestic, foreign 
and military policies relating to the national se- 
curity” (sec. 101a). At the same time, the office 
of special assistant to the president for national 
security affairs was established. Both institutions 
have grown in importance since 1947; the NSC 
now rivals the Cabinet as a policy-making body 
and the special assistant for national security 
affairs has become in some ways the most impor- 
tant member of the presidential staff. 

The increase in governmental machinery was 
accompanied by an increased use of advisers from 
the academic and business communities. “Citizens’ 


committees” were set up to consider long-term ob- 
jectives and recommend policy guidelines. Aca- 
demic specialists (some of whom had already been 
used during World War 11) were asked to present 
papers and studies on specific aspects of national 
security to various government agencies; some 
were used as regular advisers by these agencies 
and occasionally joined agencies as full-time em- 
ployees. (Also, of course, government employees 
occasionally joined universities.) In addition, spe- 
cial organizations were established, under the 
auspices of the Air Force, Army, Navy, and De- 
partment of Defense, to conduct research into na- 
tional security affairs. (Foremost among these is 
the RAND Corporation.) All this exthange of 
personnel and information led to intensified aca- 
demic activity (for full details, see Lyons & Morton 
1965). 

Although much of this academic work was a 
continuation of earlier scholarly interests, it was 
only after the establishment of the government 
institutions that a consolidated field of study be- 
came discernible. Most of the post-1945 academic 
work can be classified under two themes: conflict 
and cooperation. Studies falling into the first 
category generally reflect the view that national 
security is concerned primarily with maximizing 
national power in conflict situations—in other 
words, national security policy making should be 
based on the assumption that a nation can be 
secure only if it increases its own power at the 
expense of another nation or other nations. Studies 
in the second category, by contrast, emphasize the 
minimizing of national power and stress conscious 
efforts to increase international cooperation. This 
view suggests that national security policies be 
based on the assumption that the security of a 
nation can increase if the security of all nations 
increases. Around each of these poles are clustered 
many studies, which can be further classified ac- 
cording to their substantive concerns. 


National security and international conflict 


The writings which assume that each nation’s 
security objective is the improvement of its rela- 
tive power position can be subdivided into three 
categories, military, economic, and political, ac- 
cording to the ingredient of national security the 
authors focus on. 

The earliest studies of national security tended 
to concentrate on military problems, a reflection 
of the early conception of national security as 
being rather narrowly concerned with military 
matters, Although scholarly work in the 1950s and 
1960s has broadened the scope of the field con- 
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siderably beyond purely military affairs, problems 
of strategy, tactics, wéapons, and logistics still 
concern many scholars. The most significant work 
in this area obviously concerns nuclear weapons 
and technology (e.g., Kahn 1960; Kissinger 1957) 
and deals with such problems as developing an 
optimum strategy for future total wars (Rowen 
1960), the best methods for deterring nuclear 
attacks (Brodie 1959), the alternatives to all-out 
nuclear war, i.e., limited war (R. E. Osgood 1957), 
and unconventional warfare of different kinds 
(Osanka 1962). [See LIMITED WAR; NUCLEAR WAR; 
STRATEGY.] 

Two world wars have clearly demonstrated that 
the industrial potential and economic resources of 
a nation are at the heart of its ability to wage 
war, and national security experts have increas- 
ingly turned to economic considerations and eco- 
nomic methods in their discussions. They have 
attempted to define and assess scientifically all the 
elements constituting a nation’s economic poten- 
tial for war (Knorr 1957). Similarly, they have 
attempted to create economic criteria by which al- 
ternative weapon systems can be selected (Hitch & 
McKean 1960). This approach has found practical 
applications in the U.S. Department of Defense, 
where strategic programs are often considered in 
this way (Enthoven 1963) and where economists 
occupy important decision-making positions. In a 
broader sense, both academic and government 
economists, in the United States and abroad, have 
devoted much time to the problems of allocating 
national resources and finding the proper criteria 
to use when settling the competing claims of the 
military and nonmilitary sectors of the economy. 
Here is perhaps the clearest illustration of our defi- 
nition of national security and the best instance 
of its application to an operational issue: what 
proportion of resources is needed to protect internal 
values from external threats? (Barnett 1958). [See 
ECONOMICS OF DEFENSE; MILITARY POWER PO- 
TENTIAL] 

Internal values are, of course, not merely eco- 
nomic, and national security makes serious de- 
mands upon political institutions as well. Although 
interest in this aspect of national security devel- 
oped somewhat later, it is now one of the major 
areas of academic speculation. Here we find dis- 
cussions of the relative importance of military and 
strategic interests, as opposed to political ones, 
in the policy-making process (Huntington 1961) 
and of the proper role of the bureaucracy in mak- 
ing national security decisions. More concretely, 
scholars discuss the best organizational structures 
for coordinating the complicated, overlapping con- 
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cerns of national security (Kennan 1958) and the 
ways of recruiting the many kinds of specialists 
needed to deal with nuclear technology and econ- 
omy. Once the scientists have been recruited, the 
problem of their proper use has by no means been 
resolved [Rapoport 1964; see also ClvVIL—MILITARY 
RELATIONS; MILITARY POLICY]. 

Going beyond the government, concern has been 
expressed about the role of the public in formulat- 
ing security policy (Almond 1956). The twin ob- 
stacles of technical complexity and the need for 
secrecy make it impossible for the average citizen 
to “have a say,” and yet the danger exists that 
more and more of the national budget and of 
national policy making will be subsumed under the 
heading “national security.” The result may well 
be a “garrison state,” in which politics will be 
dominated by “specialists in violence” (Lasswell 
1962). In international affairs concern is just as 
great and has brought about serious reconsidera- 
tion of our traditional concepts of alliances (Liska 
1962). To what extent does national security in 
the nuclear age require or even permit alliances 
based on traditional ties of friendship? The answer 
is far from clear. [See ALLIANCES.] 


National security and international cooperation 


The writings that assume, explicitly or implic- 
itly, that national security depends on encouraging 
international cooperation or even international 
integration are fewer in number and generally 
less systematic in treatment than those discussed 
above, Nevertheless, there is a growing realization 
that in the modern world any increase in the secu- 
rity of one nation may depend on an increase in the 
security of other nations and that the concept of 
international security may become as meaningful 
as that of national security. A growing number of 
Studies are therefore being devoted to the various 
areas where international cooperation is consid- 
ered desirable and attainable. 

Arms control and disarmament come immedi- 
ately to mind. Despite all the diplomatic negotia- 
tions and academic discussions, the goals of dis- 
armament have remained as distant as ever. 
Studies have analyzed the technical problems at 
Successive stages in the disarmament process 
(Blackett 1962a), the role that deterrence can play 
(Schelling 1962), and the implications of the 
spread of nuclear weapons (Rosecrance 1964). 
Furthermore, some attempts have been made to 
predict the social and economic effects of disarma- 
ment on different national societies (Benoit & 
Boulding 1963). [See DisARMAMENT.] 


Considerably less is known about other aspects 
of international cooperation, but the outlines of 
some areas have at least been sketched: regional 
integration (Haas 1961; Lindberg 1963; Yalem 
1965), international cooperation in limited eco- 
nomic and technical areas (Jessup & Taubenfeld 
1959), and long-term fundamental trends toward 
international economic integration (Myrdal 1956), 
[See INTERNATIONAL INTEGRATION.] In this sphere 
of limited cooperation one could also include the 
schemes advanced for unilateral peaceful actions 
by great powers, designed to evoke similar re- 
sponses from other nations and thus reduce the 
level of international tension. [See INTERNATIONAL 
RELATIONS, article ọn PSYCHOLOGICAL ASPECTS; 
see also C, E. Osgood 1959 and Kelman 1965.] 

Finally, some attention has been devoted to the 
permanent institutions and processes of interna- 
tional cooperation and integration. In this category 
we find many of the works dealing with interna- 
tional organization and international law. However, 
since few of them concern themselves directly with 
the contributions of these institutions and processes 
to national security, they need not be discussed at 
length here. The works of Claude (1956; 1961) 
and Bloomfield (1960) on the role of the United 
Nations in the security of the United States and 
the works of Henkin (1961) and Clark and Sohn 
(1958) on the role of international law in promot- 
ing a true sense of international community and 
laying the foundations for a lasting peace are im- 
portant exceptions. [See INTERNATIONAL LAW; 
INTERNATIONAL ORGANIZATION; PEACE.] 


Problems and prospects 


From the foregoing discussion it is clear that 
the substantive concerns of the field of national 
security have much in common with those of the 
traditional fields of international relations and for- 
eign policy. It is therefore fair to ask whether the 
concept of national security offers a more fruitful 
method of organizing the growing mass of empir- 
ical data than do the traditional approaches, and, 
in fact, whether it is distinguishable in any signif- 
icant way from these approaches. 

National security and international relations. 
The links between the fields of national security 
and international relations are obvious. We can 
view international relations as being concerned 
basically with two areas of inquiry: the structure 
and process of the international system, and the 
behavior of different actors in the international 
system and the process by which they reach the 
decisions that guide this behavior (see Rosecrance 


1963). The field of national security obviously 
pre-empts much of the material in the latter area, 
since, as we have said, it is the study of how nations 
make the decisions and policies designed to maxi- 
mize the protection of their internal values from 
external threats. Traditionally, most of this mate- 
rial was classified under the heading “foreign 
policy studies.” Most studies of foreign policy, how- 
ever, rested heavily on the concept of national in- 
terest, and our concept of national security has, in 
fact, emerged out of dissatisfaction with the con- 
cept of national interest (Bock & Berkowitz 1966, 
pp. 131-133). 

As an organizing concept, national security has 
several advantages over foreign policy. First, it 
focuses on common elements and uniformities in 
the external policies of all national actors. It pro- 
vides a convenient frame of reference which opens 
up possibilities of comparison between matters 
superficially disparate and which can help organ- 
ize the welter of comparative data produced by the 
historical foreign policy approach. The national 
security orientation also adds a dimension to the 
concern of traditional foreign policy with conflict 
situations. It makes room, conceptually, for the 
consideration of common international interests, 
which could result in a simultaneous increase of 
security for all the actors in the international sys- 
tem. The maintenance of the international system 
thus becomes a legitimate goal of national security 
policy. Finally, national security focuses on the 
underlying unity of the internal and external ac- 
tivities of states by explicitly recognizing that ex- 
ternal behavior is an integral part of the total 
behavior pattern of the national system. It there- 
fore avoids the misleading dichotomy between do- 
mestic and foreign policy, which has typified the 
traditional study of foreign policy, and joins the 
two by envisaging both as designed to protect the 
same set of values and, ultimately, to maintain the 
national and even the international system. In util- 
izing a systems approach, national security facili- 
tates the application of the techniques and findings 
of all the behavioral sciences to this particular seg- 
ment of political behavior. 

Even at this early stage in its development, the 
field of national security is characterized by a rich 
variety of work by scholars from many disciplines. 
It has shown a deep concern for the development 
of precise methodological tools. In many recent 
studies such modern social science techniques as 
game theory, simulation, information theory, in- 
put-output analysis and systems analysis have 
been applied to national security problems. Further- 
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more, concepts from economics, psychology, sociol- 
ogy, political science, and anthropology have been 
used to clarify and define issues of national security 
(see Berkowitz & Bock 1965, pp. 325-416). 

Progress toward comparative analysis. It must 
be pointed out that the further development of 
national security studies as a field is to some extent 
contingent upon parallel developments abroad. If 
the field is to have a truly comparative character, 
it must be enriched by the work of researchers in 
other countries who are working on similar prob- 
lems. Such a development would help make com- 
parative analysis possible and would also serve to 
prevent the discipline from becoming too rigidly 
and narrowly tied to immediate American defense 
needs. 

Some steps in this direction have already been 
taken. Indeed, the model for many of the recent 
developments in the United States with respect to 
establishing institutional links between the private 
intellectual community and the government was 
provided by Great Britain in World War 1. The 
urgency of the military situation in England at 
that time led to a sizable recruitment of scientists 
into the government, in order that they might be 
able to apply their specialized skills to what until 
then had been considered strictly military prob- 
lems. New methods of analysis were developed 
that were crucial in such fields as radar systems 
and strategic bombing techniques. The institutional 
and intellectual techniques that emerged formed 
the beginnings of a body of knowledge called opera- 
tions analysis (see Quade 1964), The patterns of 
cooperation instituted here provided direction for 
postwar developments in both the United States 
and Great Britain. More recently, the founding of 
the Institute for Strategic Studies in England has 
paralleled the emergence in the United States of 
such institutions as the RAND Corporation, the 
Hudson Institute, and the Institute for Defense 
Analysis. Specific examples of recent British con- 
tributions include the work of Beaton and Maddox 
(1962) on the spread of nuclear weapons, the 
studies of war and disarmament by Blackett 
(1962b) and Noel-Baker (1958), and the study by 
Liddel Hart (1960) of problems of deterrence. 

In other countries there have also been encour- 
aging developments, adding to the international 
dimension of the national security field. The fol- 
lowing works illustrate this trend: in France, 
Gallois’s study of the concepts of deterrence strat- 
egy (1960) and Aron’s work on war and related 
problems (1962); in the Soviet Union, Sokolov- 
skii’s writings on the nature of general and limited 
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warfare (1962) and Pokrovskii’s work on the tech- 
nology of nuclear war (1956); and in Australia, 
Modelski’s studies of decision making in foreign 
policy (1962) and of the international aspects of 
civil strife (1964) and Burns's efforts (e.g., 1957) 
to apply rigorous mathematical thinking to stra- 
tegic concepts. The Australian National University 
provides an academic setting for national security 
studies similar to that in many American univer- 
sities. 

National security as a policy science. As we 
have shown, national security combines unique 
possibilities not only for the study but also for the 
recommendation of important policy decisions. It 
is therefore rapidly developing into what Harold 
Lasswell has termed a policy science [see PoLicy 
SCIENCES]. 

This development contains potential disadvan- 
tages, as well as advantages. One of the chief 
dangers is that links between the intellectual and 
the policy maker may constrain the academic at- 
mosphere of free inquiry. The growing dependence 
of university research upon government financing 
and sponsorship may well result in the erosion of 
academic independence. And, of course, similar 
pressures exist when the work is done under cor- 
porate auspices. There are signs that such pres- 
sures have already biased some studies. 

Perhaps an even greater, if less obvious, danger 
lies in the complex differences between the needs 
of policy making and those of pure research. The 
challenge-response character of much of the work 
in the national security field may have weakened 
investigators’ commitment to pure inquiry, The de- 
mand that the intellectual focus his talents on 
finding solutions to urgent and immediate prob- 
lems of policy must inevitably turn his attention 
away from work on the longer-range, more funda- 
mental questions which provide the basis for future 
progress. 

On the positive side, it is undeniable that the 
field of national security has in many ways pro- 
vided a stimulating environment for research, 
precisely because of the strong links between the 
policy maker and the intellectual. Personnel from 
government, corporations, and universities have 
cooperated on research projects, and much success- 
ful cross-fertilization has occurred between a wide 
range of disciplines and professions. 

In the area of military strategy, for instance, 
the traditional insulation of the military from 
broad, academic inquiry has been broken by the 
new type of military man, trained in the national 
security centers. The recent published works of 
several American and European generals, clearly 


enriched by this experience, show considerable 
awareness of the broad political, social, and eco- 
nomic aspects of military matters. Civilian strate- 
gists, most of them from academic life, have also 
benefited from closer contacts with military men 
and from the direct experience with operational 
strategic problems that they have acquired. Their 
work displays a greater precision and a better for- 
mulation of basic military issues than it did before 
the emergence of national security institutions. 
Other areas of the national security field have 
also gained. One has only to compare the pre-World 
War 11 literature on disarmament, conflict manage- 
ment, and international organization with the 
works cited above to realize the progress made. 
Instead of utopian images of a “world state,” ideal- 
istic peace plans, and prescriptions for “the only 
way” to achieve total disarmament, we now find 
attempts at accurate description and realistic 
analysis of policy alternatives, using the latest sci- 
entific concepts and methods. It is this combina- 
tion of science and policy, with all its inherent 
dangers and promises, that seems to assure the 
future growth of the field of national security. 


MorTON BERKOWITZ AND P, G, BOCK 


[See also FOREIGN POLICY and INTERNATIONAL POLI- 
tics. Other relevant material may be found under 
INTERNATIONAL RELATIONS; MILITARY; WAR.] 
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NATIONAL SOCIALISM 


National Socialism started as a political move- 
ment in Germany in 1919. Its official name was 
the “Nazionalsozialistische Deutsche Arbeiterpartei” 
(National Socialist German Workers’ Party); it 
soon became popularly known as the Nazi party, 
and its followers were called Nazis. When Adolf 
Hitler joined the party, Nazism consisted of a little 
group of unimportant malcontents in Munich. Yet 
within fourteen years it became the greatest mass 
movement in German history, including in its 
ranks members of all groups of German society, 
from unemployed workers of the Lumpenprole- 
tariat to members of the imperial family of the 
Hohenzollerns and of several of the royal houses 
of the German states. By 1932 the Nazi vote had 
mounted to fourteen million; in the March 1933 
election, the last in which opposing parties par- 
ticipated, seventeen million Germans (or 44 per 
cent of the electorate) freely voted for the Nazi 
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party, not to speak of several more millions who 
voted for nationalist and militarist policies that 
were barely distinguishable from Nazi objectives. 
Thus well over half the German electorate voted 
for an antidemocratic, totalitarian, imperialistic 
program, After the elections, only the Social Demo- 
crats attempted to resist Nazism in the Reichstag 
(the Communists had not been allowed to take 
their seats in the Reichstag). Even the Roman 
Catholic (and generally democratic) Center party 
gave Hitler the dictatorial powers he asked for in 
the Reichstag on March 23, 1933. This was the 
only case of a modern totalitarian regime that was 
set up by a majority of the electorate and approved 
by the parliamentary body of the nation. 

Once in power, the Nazi regime lived up to its 
promises, First, concentration camps were set up 
for political opponents. Very soon the political 
offenders were a small minority in the concentra- 
tion camps; the large majority consisted not of per- 
sons who had committed a wrong but who (like 
the Jews) belonged to the wrong group. Later, dur- 
ing World War n, large numbers of civilians in the 
occupied countries were put into concentration 
camps, because they too belonged to a “wrong” 
social or political group. 

Politically, the Nazis quickly effected complete 
uniformity (Gleichschaltung). All other parties, 
including the ultraconservatives, were liquidated 
within a few months of the Nazi seizure of power, 
Newspapers were either Nazified or, when they had 
an established liberal-democratic reputation, were 
abolished (as, for example, the Vossiche Zeitung 
and the Berliner Tageblatt). Education, from 
Kindergarten to university, was put under strict 
party and government control, and the state- 
Sponsored Hitler Youth replaced all existing youth 
organizations. All labor unions, whatever their po- 
litical sympathies, were outlawed and replaced by 
the government-sponsored Labor Front, incorpo- 
rating both labor and management in one organ- 
ization, The Christian churches were persecuted 
if they dared to resist the anti-Christian, racist 
policies of Nazi mass murders. Christianity was 
attacked as a Jewish contrivance to weaken the 
military spirit of the Germans, and attempts were 
made to substitute a new religion, “German Faith,” 
for Christianity. More extreme Nazis even went so 
far as to re-establish old Germanic, pre-Christian 
Paganism as the only religion fit for the new Nazi 
Germany. Finally, even the traditional structure of 
the family was attacked, Children were encouraged 
to inform on their parents and unmarried women 
to breed a new Herrenrasse (master race) out of 
wedlock. 

The Nazi regime introduced military conscrip- 


tion in 1935, militarized the Rhineland in 1936 in 
violation of treaty provisions, annexed Austria in 
1938 and Czechoslovakia in 1939, and started 
World War 11 by the invasion of Poland on Sep- 
tember 1, 1939. In the summer of 1940, France 
was vanquished, and Great Britain alone resisted 
the weight of Nazi power. In possession of virtually 
the whole European continent, Nazi Germany 
attacked the Soviet Union in June 1941 and de- 
clared war on the United States in December 1941. 

Nazi Germany lost the war and surrendered un- 
conditionally in 1945. Yet before going to defeat, 
the Nazis had accomplished one major objective: 
over six million European Jews—men, women, and 
children—were murdered in the gas chambers of 
Auschwitz and other extermination camps spe- 
cially set up for wholesale killing. This extermina- 
tion of the Jews in occupied Europe was called, in 
official Nazi language, “the final solution of the 
Jewish question.” 


Sources of support 


Of all the social classes in Germany before 1933, 
the urban working class was proportionately least 
affected by the appeal of Nazism, Membership sta- 
tistics of 1933 indicate that manual workers were 
substantially underrepresented in the Nazi party, 
whereas white-collar workers and middle-class 
persons were greatly overrepresented in relation 
to the total German population, Urban workers 
largely followed the Social Democratic party. Nei- 
ther Communist nor Nazi attempts to win the 
allegiance of German urban workers for totali- 
tarian programs succeeded before 1933, Yet, while 
the German urban workers did not want Nazism, 
they did little to resist its coming into power or its 
operations once it was in power. The deeply in- 
grained respect for authority in most Germans 
made resistance difficult. Moreover, the Nazis man- 
aged to abolish unemployment by embarking on a 
war economy from the outset, as a result of which 
unemployment turned into full employment and 
even a shortage of labor. Many workers were will- 
ing to trade the loss of individual liberty and free 
labor unions for the gain of full employment and 
social security. As a result, the mass of the Ger- 
man workers acquiesced in the other Nazi policies, 
including the policies of imperialist expansion 
through aggressive war. The urban workers (un- 
like those of other countries under Fascist rule, such 
as Italy) played only a very minor part in what- 
ever resistance groups existed in Nazi Germany, 

The lower middle class—particularly the salariat 
—supplied the numerically strongest element of 
popular support for Nazism. Many persons in this 
class dreaded the prospect of joining the prole- 


tariat and looked to the Nazi movement for the 
saving of their traditional status and prestige. The 
salaried employee is jealous of Big Business, into 
whose higher echelons he would like to rise via 
the ladder of management, and he also fears Big 
Labor, into whose proletarian world he disdains to 
sink. Nazism very astutely played on these fears 
and anxieties by attacking both the “interest slav- 
ery of finance capitalism" and the “un-German” 
character of “Marxist Bolshevism.” Logically, prop- 
aganda directed against both capital and labor may 
seem self-contradictory, but its very inconsistency 
both reflected and appealed to the political con- 
fusion of the salaried class, Furthermore, Nazism 
promised them the identification with the “su- 
perior” Nordic master race. This racialism had a 
most impressive appeal to those groups of salaried 
persons—teachers and government employees— 
who were traditionally permeated with nationalist 
and racist ideas even before Nazism appeared, 

As to the numerically less significant, but social- 
ly and economically important upper class of in- 
dustrialists and big landowners, the support re- 
ceived from this group by the Nazi party even 
before 1933 was of great impact. On January 27, 
1932, Hitler addressed the Industry Club in Düs- 
seldorf, the center of Germany's heavy industry; 
his success in winning over the leaders of heavy 
industry was impressive, Most notable among 
active supporters of Nazism before 1933 were such 
world-famous German industrial figures as Fritz 
Thyssen and the Krupp family, While looking down 
upon the Nazi leadership as a group of plebeian 
upstarts without the breeding and background of 
gentlemen, German industrialists and big land- 
owners supported Nazism for two main reasons: 
first, the Nazis promised the abolition of free labor 
unions, and second, the industrialists understood 
that the remilitarization of Germany coupled with 
an aggressive foreign policy would be profitable for 
business. The support of the steel industry was 
particularly significant. Already during the Second 
Reich, the friendship between the Kaiser and the 
Krupp family pointed to the intimate ties between 
German heavy industry and militarism. The alli- 
ance between the steel industry and Nazism be- 
fore 1933 was but a renewal of these historical ties 
between industry and a German government with 
an antisocialist, antidemocratic, and imperialistic 
policy. During World War n, German heavy indus- 
try profited from its ties with the Nazi regime, 
since it was the main beneficiary of the labor of 
millions of foreign workers deported to Germany. 

Another group that was crucial in the rise of 
Nazism was the military, traditionally of great 
social importance in the fabric of German society 
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and government, Even in strong and well-estab- 
lished democratic states, the professional military 
class tends to overestimate the virtues of discipline 
and national unity, Where democracy is weak, as 
it was in Germany during the Weimar Republic, 
the professional bias of the military class becomes 
a political menace, The top military leaders of 
Germany knew, before and after 1933, that a high 
percentage of Nazi leaders were criminals or psy- 
chopaths, yet they supported the Nazi movement 
as a step toward the desired militarization of Ger- 
many. Of the two greatest German military leaders 
of World War 1, General Ludendorff and Field 
Marshal von Hindenburg, the former embraced 
Nazism in the early 1920s and the latter collabo- 
rated with it until his death in 1934. Yet it should 
be pointed out that, toward the end of World 
War 1, high military leaders played an important 
role in attempts to overthrow the Nazi regime. 
These plots culminated in the attempt against 
Hitler's life on July 20, 1944, an attempt that 
failed. It is noteworthy that the German generals 
did not hatch any resistance plans against Nazism 
when the war went well for Germany; only when 
defeat became a certainty did they try to save 
what could be saved of German power by over- 
throwing the Nazi regime, 

In analyzing the sources of support for Nazism 
among the German people, the most important les- 
son is not which particular social group proved 
itself most vulnerable to the Nazi virus—although 
this is an important lesson and has broad political 
implications outside Germany as well. A phenome- 
non of even more general consequence is demon- 
strated by the success of Nazism before coming 
to power and its popularity among the German 
people; an antidemocratic, totalitarian movement 
can be based on mass support. 

From the late seventeenth to the early twentieth 
century, the conventional wisdom of enlightened 
and liberal political thought automatically assumed 
that political oppression was due solely to the ma- 
levolence of a small minority of political oligarchs 
lording it over the mass of the “good” people, The 
assumption, which was hardly ever challenged, 
was that the mass of the people naturally desired 
freedom above everything else; once the obstacles 
to this natural desire—kings, aristocrats, men of 
privilege—were removed, a reign of liberty and 
democracy would inevitably follow. The experi- 
ence of Nazism, both before and after 1933, demol- 
ished this illusion once and for all. The main 
reason why conventional political analysis failed to 
come to grips with the paradoxical phenomenon 
of the mass basis of modern totalitarianism lay in 
its exclusive concern with totalitarian leaders rather 
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than totalitarian followers, the latter being seen 
merely as innocent victims of their evil leaders. In 
the light of the knowledge gained by modern psy- 
chology and psychoanalysis, Erich Fromm (1941) 
has shown the psychodynamic and sociological 
factors that underlie the “totalitarian flight from 
freedom” and that have made modern man feel 
isolated, powerless, and irrational. These forces are 
potentially operative everywhere, but in Germany 
their potential was most fully and most disastrously 
realized. 


Causes of Nazism 


Many interpretations of the nature of Nazism 
have either gone back too far into ancient history 
or have confined themselves too much to the im- 
mediate past. Whatever characteristics the Ger- 
manic peoples may have possessed in the days of 
Tacitus, there have been too many historical 
changes since then to deduce Nazism from Ger- 
man antiquity. Similarly, a movement of such far- 
reaching impact on the whole world can hardly be 
adequately explained by such specific recent events 
as the Versailles Treaty of 1919 or the economic 
depression of 1929-1932. Defeat in war does not 
necessarily end in a totalitarian nihilism of the 
Nazi type, as is evidenced by Germany’s own 
defeat in World War 1m, which did not again pro- 
duce Nazism. Similarly, the impact of the depres- 
sion has been exaggerated, if it is to serve as the 
main cause of explaining the rise and triumph of 
Nazism. There is no doubt that the inflation of the 
early 1920s and the depression that began in 1929 
had a deleterious effect on German democratic in- 
stitutions. But economic depression is, in itself, no 
necessary general cause of fascist totalitarianism. 
There is a relation between economic depression 
and accelerated social change, but it is of a differ- 
ent kind: like war, economic depressions do not 
create new major social and political trends, but 
tend to accelerate the rate of development of exist- 
ing trends. In fundamentally democratic nations 
(like the United States, Australia, New Zealand, or 
the countries of northwestern Europe), the depres- 
sion of 1929 produced neither fascism nor com- 
munism but advanced the cause of democracy on 
the economic, social, political, and cultural fronts. 
Conversely, where the roots of democracy are frail 
and where the dominant social attitude is strongly 
suffused with authoritarian elements, a depression 
may easily accelerate such authoritarian trends, as 
happened not only in Germany in the 1930s but 
also in Japan, Brazil, Poland, and other nations. 

Closely related to the depression theory as the 
major cause of Nazism is the essentially Marxian 
interpretation of Nazism as the logical outgrowth 


of monopoly capitalism. While it cannot be denied 
that monopolistic capitalism was a major force in 
German life from the time of the establishment of 
the Second Reich in 1870 and that on the whole its 
political influence was harmful to the development 
of a liberal society and a democratic government, 
this theory cannot explain why monopoly capital- 
ism produced Nazism in Germany and not in 
Britain and the United States. In purely economic 
terms, the depression in these major citadels of 
world capitalism in the early 1930s was not sub- 
stantially different from that in Germany. The dif- 
ferentiating factor was not the relative degree of 
the severity of economic crisis but the difference 
in political ideas and institutions that circumscribed 
the behavior of political decision makers. 

If Nazism was more than a reaction to the Ger- 
man defeat in World War 1 or to the depression, 
it can be explained only by the persistence of a 
powerful antiliberal tradition—perhaps the domi- 
nant German tradition—in the nineteenth and 
early twentieth centuries. Hegel, Adam Müller, 
Paul de Lagarde, Julius Langbehn, Heinrich von 
Treitschke, and Moeller van den Bruck are but a 
few of the more important figures in the develop- 
ment of a social philosophy that opposed the con- 
cepts of power, authoritarianism, nationalism, 
racism, and imperialism to the ideas of natural law, 
liberty, universalism, equality, and peace. Roman- 
ticism was, from the beginning of the nineteenth 
century onward, perhaps the single strongest move- 
ment in German thought. Whereas in other coun- 
tries (like France and England) romanticism was 
largely confined to the literary imagination as a 
protest against the limiting tradition of the meas- 
ure and orderliness of classicism, in Germany it 
became a systematic philosophy with elaborate and 
coherent views on man, society, law, and the state. 

The German Romantics, in their theory of the 
state, put forward an organistic conception based 
on blood and community, in which the individual 
occupied a relatively minor role; and they rejected 
the Western liberal theory of the state based on a 
social contract, in which the individual had natural 
rights preceding the state. In economics, the Ger- 
man Romantics assailed the free-market economy 
of capitalism as “soulless egotism” and urged the 
revival of the medieval closed economy regulated 
in every detail by the community. The most typical 
German Romantics, like Adam Miiller, did not 
attack this or that particular point of the Western 
tradition in ethics, politics, and economics, They 
fought, instead, against the humanistic and rational 
Western tradition as a whole. 

There is not a single element in the Nazi doc- 
trine as developed by its leaders and apologists 


that does not have a long—and frequently dom- 
inant—tradition in the century and a half preced- 
ing the rise of Nazism. It is true that such ideas 
were not expressed only in Germany. Count de 
Gobineau expressed racist theories in France, 
around the middle of the nineteenth century, and 
Carlyle expressed antiliberal and racist doctrines 
in England in the second half of the same century. 
Yet the important thing is that such prophets of 
authoritarianism and racialism did not obtain sig- 
nificant followings in their native countries, whereas 
their writings became enormously popular in Ger- 
many. The case of Houston Stewart Chamberlain 
is even more indicative of this phenomenon. Born 
an Englishman, he settled in Germany and became 
a German citizen. His Foundations of the Nine- 
teenth Century (1899) became one of the most 
popular books in pre-Nazi Germany, yet made no 
impact on his native country. From anti-Semitism 
to imperialism, there is little in the Nazi doctrine 
that cannot be found in Chamberlain’s writings. 

Conversely, there was also, before the rise of 
Nazism, a liberal and humanistic tradition in Ger- 
many, characterized by such lofty figures as Les- 
sing, Kant, Humboldt, and Goethe. Yet this tradition 
never became dominant and was more influential 
in the academy than in the councils of policy 
makers. In 1848 and in 1918, the liberal elements 
of German society started a new orientation toward 
Western ideals in government and society, but in 
both cases the authoritarian and militaristic ele- 
ments in German life squelched such attempts 
through violence and terror. 


Nazi doctrine and policies 


Nazi doctrine and policy were, however, more 
than a mere revival of traditional antiliberal ideas 
and institutions in German life. In Nazism, these 
antiliberal attitudes and institutions were carried 
to their extreme. Whereas philosophical and politi- 
cal romantic thought in Germany had reacted 
against the excesses of rationalism, Nazi ideolo- 
gists, like Alfred Rosenberg, rejected the principle 
of Western rationalism itself, charging, for exam- 
ple, that Socrates was the first “Social Democrat” 
in Europe and the originator of the disease of 
rationalism, because he established the principle 
of trying to settle vital issues through argument 
and debate. Similarly, whereas in the pre-Nazi 
German intellectual tradition particular points of 
Christianity were criticized or assailed, official Nazi 
ideology rejected Christianity in toto as a devilish 
Jewish plot to weaken Germanic vigor and military 
manliness. In addition, Nazism had the dynamic 
of a popular mass movement, whereas antidemo- 
cratic and antiliberal ideas and policies in the 
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eighteenth and nineteenth centuries still recognized 
some restraints of traditional religion and morality. 
The tragedy of pre-Nazi German politics lay in the 
fact that the masses were sufficiently drawn into 
politics to become highly conscious of political pro- 
grams and movements, but not sufficiently involved 
to build up a sustained democratic experience. In 
this sense, Nazism was the response of a politically 
“semiliterate” people: not illiterate enough to stay 
out of politics and not literate enough to have 
learned the important lessons of politics through 
self-government. 

The potentially dangerous tendencies in pre-Nazi 
German ideas and policies were carried forward to 
the most extreme point of nihilism, rejecting all 
traditional Western moral and religious concepts 
about the nature of man and his inalienable dig- 
nity as a human personality. This nihilism came 
out most clearly in the use of terror and murder 
as an official state policy of extreme totalitarian- 
ism. Nazi concentration camps and gas chambers 
were more than incidental phenomena in the total 
process of Nazism. They were of its very essence, 
for it was in those camps that man was destroyed 
as a moral being and reduced to a mere number, 
tattooed on his body. Such camps were not set up 
primarily to punish ordinary or political criminals. 
Most of its victims, such as the Jews, were not even 
accused of having done anything wrong. The pur- 
pose of the concentration and extermination camps 
was to show to the entire population under Nazi 
control that every person was potentially an in- 
mate, since personal guilt had little or nothing to 
do with such a punishment. The ultimate purpose 
of such camps was to demonstrate that man’s soul, 
his dignity, and his self-respect can be reduced to 
dirt and ashes, and that no one was exempt from 
such fate if it so pleased the Nazi rulers. If killing 
had been the main objective of the camps, such 
killing could have been accomplished with more 
efficiency and without the suffering and the deg- 
radation that accompanied it. In the scheme of 
Nazi totalitarian nihilism, the degradation of man 
was not the incidental by-product of murder, but 
murder was the by-product of the systematic proc- 
ess of degradation. The aim of Nazi nihilism was 
to transform a human into a nonhuman and to 
restrict the quality of being human to those who 
were acceptable to the Nazi rulers. 

This policy was also carried out in foreign af- 
fairs. Thus, when Czechoslovakia was taken over 
in 1939, Nazi legislation referred to its population 
as Germans and “other inhabitants.” In the eyes of 
the Nazis, the Czechs were not merely defeated by 
superior German arms but had ceased to exist as 
a nation, just as the inmate of a concentration or 
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extermination camp was nothing more than a num- 
ber in the files, without any human personality or 
individuality. Nazi plans for the Poles and Russians 
were the same: not only to conquer them militarily 
but to transform both nations into “nonnations,” 
slaves of the higher German Kultur. Eventually, a 
similar fate was also foreseen for the other nations 
to be subdued and then destroyed as national 
entities. 


Historically, Nazism may have left two impor- 
tant legacies. First, it is conceivable that the ex- 
perience of Nazism has irretrievably destroyed the 
authoritarian, antidemocratic, antiliberal, and mili- 
taristic tradition in German society, because Nazism 
demonstrated to what extent the potential of that 
tradition could be realized in destroying the very 
foundations of civilization. Second, Nazism has left 
a broader legacy for all mankind. Whatever psycho- 
logical malformation of behavior occurs in one 
human being may potentially occur in any other. 
The same applies to whole nations. The lesson of 
Nazism is not only how low Germans could fall, 
but how far any nation can fall once critical ra- 
tionalism, moral restraints, and constitutional gov- 
ernment have been substantially weakened or 
destroyed. 

WILLIAM EBENSTEIN 


[Directly related are the entries ANTI-SEMITISM; Dic- 
TATORSHIP; FASCISM; TOTALITARIANISM. Other rele- 
vant material may be found in MILITARISM; NATION- 
ALISM; PERSONALITY, POLITICAL; RADICALISM; and 
in the biographies of Scumrrr and TREITSCHKE.] 
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1 
ESTIMATION 


Most of the conceptual difficulties encountered 
in the measurement and analysis of national wealth 
can be avoided if national wealth and the related 
concepts of national assets and national net worth 
are treated within a system of national accounts. 
The assets, wealth, and net worth of individual 
economic units and of groups of them should also 
be treated within this system. 


National wealth in a system of 
national accounts 


A system of national accounts, like a system of 
business accounts, has two main components. The 
first is a statement of flows of economic objects 
(commodities, services, money, and other financial 
assets) during a period of time, usually a year. 
The second is a statement of the stocks of eco- 
nomic objects, physical and financial, existing at 
a point in time, These stocks are recorded in the 
national balance sheet and in the balance sheets 
of less comprehensive groups. The groups may be 
broad sectors, such as all households, all business 
enterprises, or all government organizations within 
a nation; or smaller subsectors, such as households 
of a given type or individual industries. 

In this approach it is necessary to distinguish, 
in accordance with business accounting practices, 
between combined and consolidated balance sheets. 
The combined balance sheet of a group of units or 
a sector is the result of adding items of assets and 
liabilities as they are found in the balance sheet of 


each constituent unit. The consolidated balance 
sheet cancels creditor—debtor and holder—issuer re- 
lationships among units both of which belong to 
the group or sector. In the consolidated balance 
sheet of the American banking system, for instance, 
the deposits that commercial banks keep with Fed- 
eral Reserve banks or with other commercial banks 
disappear. Similarly, in a national balance sheet 
all claims of one national against another and all 
holdings by a national of equity in domestic busi- 
ness enterprises are eliminated. Hence, in a national 
balance sheet the only assets that remain are 
tangible assets and net claims against foreigners. 

National wealth thus is equal to the sum of 
domestic tangible assets and the net foreign bal- 
ance, The net worth that appears opposite national 
wealth and with the same amount is necessarily 
equal to the combined (or the consolidated) net 
worth of all ultimate domestic units, that is, of 
households and governments. Similarly, the wealth 


COMBINED BALANCE SHEET OF 
DOMESTIC ULTIMATE UNITS 


(Households and governments) 


Tangible domestic assets 


Net worth of domestic ultimate units 


Net foreign assets 


HAN 


COMBINED BALANCE SHEET OF 
DOMESTIC INTERMEDIATE UNITS 


(Business enterprises and nonprofit organizations) 
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of a sector or of a single economic unit is equal to 
the sum of its tangible assets and its net “foreign” 
balance, the latter referring to all its claims against 
and all its liabilities to other sectors or units (in- 
cluding equity securities). The wealth of a sector 
or unit thus equals the sum of its tangible and 
financial assets less its liabilities, This difference 
is called its net worth. 

For some purposes, particularly the study of 
financial structure and development, what is needed 
is not the consolidated but the combined balance 
sheets of groups, sectors, or nations. In these bal- 
ance sheets the sum of tangible and financial assets 
is equal to the sum of liabilities and net worth, no 
distinction being made between intragroup and 
intergroup claims, liabilities, and equities. The 
footing of a combined national balance sheet may 
be called national assets. 

These basic relations are illustrated in Figure 1, 
in which it is assumed that foreign assets exceed 


CONSOLIDATED BALANCE 
SHEET FOR THE NATION 


(4 minus 5) 


1. Claims against domestic ultimate units 
a. by other ultimate units 
b, by intermediate units 
2. Claims against domestic intermediate units 
a. by ultimate units 
b. by other intermediate units 
3. Equity securities of domestic intermediate units 
4. Foreign assets 
5. Foreign liabilities 


Figure 1 — Sectoral and national balance sheets 
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foreign liabilities, In each of the three sections of 
the chart, assets are shown in the lefthand bar 
and liabilities and net worth in the righthand bar. 
Hence, net worth exceeds domestic tangible assets. 
If the same asset is carried at identical values in 
the balance sheet of creditor and debtor (or the 
holder and issuer of equity securities), domestic 
financial assets must equal domestic financial 
liabilities, 

‘The scope of national wealth. According to the 
basic definition of national wealth, it consists of 
the assets remaining in the consolidation of the 
balance sheets of all economic units within the 
national boundaries. The concept therefore in- 
cludes nonreproducible tangible assets, such as 
land and subsoil assets; reproducible fixed and 
movable tangible assets, such as buildings and 
other structures, machinery and equipment, ve- 
hicles, and consumer durables; inventories of mon- 
etary metals, raw materials, work in process and 
finished goods; and the excess of foreign assets 
over foreigners’ holdings of domestic claims, equi- 
ties, and tangible assets. There remain a few prob- 
lems of whether to include or omit specific types 
of assets which are listed below, The decision will 
usually rest on the uses to be made of the estimates 
and the availability and reliability of data. 

(1) Military assets. There is no reason to omit 
these if a comprehensive picture of national wealth 
is wanted, but they should be separated from non- 
military assets for purposes of analysis. 

(2) Works of art and collectors’ items. In prin- 
ciple these should be included at market value, 
which is fairly well defined for several assets of 
this type, In practice they are usually omitted be- 
cause of lack of information on a sufficiently 
comprehensive basis, 

(3) Natural resources. These are excluded in- 
sofar as they cannot be separately appropriated or 
sold, as is the case with sunshine and precipita- 
tion. Otherwise they are included, directly or in- 
directly, as part of the market value of land. 

(4) Human resources. These are omitted be- 
cause human beings are not considered part of 
national wealth unless they can be appropriated. 
Where slavery exists, the market value of slaves, 
which in part reflects their training, constitutes a 
separate category of national wealth. However, cur- 
rent and cumulated expenditures on education and 
training are so important in comparison with the 
usual forms of capital expenditures on structures 
and equipment that they must be considered and 
compared with conn wealth in the analysis of 
economic growth and many other lems involv- 
ing productivity and income enn [see Cart- 
TAL, HUMAN]. 


(5) Rights, such as patents, copyrights, good 
will, and monopoly profits. Any value that may 
be put on such rights by the owner disappears in 
the consolidation of individuals’ balance sheets, 
since it is reflected in the market value of equity 
or offset by corresponding imputed liabilities in 
the balance sheets of buyers or users. For similar 
reasons the taxing power of governments is not 
reflected either in national wealth as an asset of 
the government or in national net worth as a de- 
duction from the net worth of the taxpayers. 

Valuatién. The numerous individual objects in- 
cluded in national wealth, or in the balance sheets 
of economic units or sectors, can be combined or 
consolidated only if each is given a value that is 
additive substantively as well as formally, Formal 
additivity requires valuation in the same unit of 
account, usually the legal tender of the time and 
place. Substantive additivity calls also for the use 
of methods of valuation that are economically uni- 
form for all types of assets. This requires, first, 
that physically or economically identical assets 
enter the balance sheet at identical values and, 
second, that the values for assets of different type, 
age, physical condition, or marketability be deter- 
mined by methods that are similar in economic 
character, although they may differ in sources and 
technique. 

Use of current market prices. Valuation at cur- 
rent market prices, or the nearest feasible approx- 
imation to them, is the only method that meets 
the requirements of both formal and substantive 
additivity, The use of book values, for instance, 
would combine different values for economically 
identical assets, It would also generally produce 
noncomparable values for different types of assets 
and for the wealth of different sectors, depending 
on the time of acquisition of the assets and on their 
price movements between the time of acquisition 
and the balance sheet dates. 

Valuation of net worth. The problem of valu- 
ation arises in connection with the individual phys- 
ical or financial assets and the liabilities that make 
up national wealth or appear in the balance sheets 
of economic units or sectors, and in connection 
with the net worth of intermediate economic units, 
primarily corporations and other business enter- 
prises. It does not arise for the net worth of ulti- 
mate economic units. This can be determined only 
as the difference between the sum of their sep- 
arately valued assets and their similarly valued 
liabilities, since by definition the net worth of ulti- 
mate economic units cannot be the object of a 
market transaction. 

The net worth of intermediate units can often 
be determined both by the residual method (the 


direct method of using the market price of corpo- 
rate stock. These two valuations need not, and 
generally do not, coincide either for individual 


method in the balance sheets of both holders and 
issuers of corporate stock, 
Limitations of market valuation. The principle 


the total stock of any type of physical or financial 
asset is actually traded within a period clove to the 
balance sheet date, The market valuation of assets 
must, therefore, as a rule proceed on the assump- 
tion that the price established in these actual trans- 
actions is representative of all items of the same 
type in existence at the balance sheet date, This 
presumption is generally reasonable for many im- 
portant of assets, particularly securities 
and fairly standardized types of land, structures, 
and equipment. Appropriate adjustments must be 
made where actual transactions are not a random 
sample of the class of assets they are intended to 
represent, either because the transactions are of a 
. Special character (such as distress sales) or because 
the assets actually sold differ in composition, age, 
or condition from the stock of assets of the same 
class, 

Second, market prices do not exist for certain 


used. The two methods most generally applied are 
expected 


and to its future net earnings 
yield established by the market for assets of similar 
character. 


Third, the market value, although established by 
actual ordinary transactions, may not reflect the 
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evaluations of the owner or of the debtor or issuer, 
In such cases, the value used in the estimation of 
national wealth would not be the one that pro- 
sumably guides the transactor’s behavior. 

Fourth, actual market price may not coincide 
with the price that would obtain in a perfect mar- 
ket. These discrepancies are not peculiar to the 
evaluation of national wealth, As in other cases 
of market imperfection, there is usually no prac- 
tical way to eliminate them even if it were regarded 
as theoretically desirable to do so. 

Us ed and depreciated values. In prin- 
ciple there exist two values for each reproducible 
physical asset, even if the market price criterion is 
accepted: the value of the asset as it is at the 
balance sheet date, and the value the asset would 
have if it were then in the physical condition in 
which it entered the national economy. Actually, 
market prices “as is" can be ascertained for only a 
few categories of fairly standardized old (used) 
reproducible assets, such as single-family houses 
and cars. The value of all other reproducible assets 
must be approximated by reducing the current value 
of the asset as new (usually the calculated ro- 
placement cost new) by an allowance for depreci- 
ation. This allowance is necessarily somewhat 
arbitrary—precisely because of the absence of, or 
the nonrepresentative character of, prices of usod 
assets. It is determined by the assumptions made 
regarding the length of useful life of the asset and 
the distribution of total depreciation (original cost 
less scrap value) over that life. There are there- 
fore as many alternative estimates of the value of 
the stock of reproducible tangible assets as there 
are defensible methods of depreciation, and these 
estimates may differ considerably from one another, 

Deflation of wealth estimates, Because the 
prices of different types of assets show both cyclical 
fluctuations and trends, it is necessary to reduce 
estimates of national wealth to a common tem- 
poral price basis if they are to be used in the 
analysis of economic growth, Reduction to a com- 
mon local basis is also often necessary for inter- 
national comparisons, In both cases this process of 
“deflating” the original estimates expressed in cur- 
rent prices is much less important, if required at 
all, for the analysis of the structure of wealth or 
the relation of wealth to income. 

Deflation of national wealth estimates and bal- 
ance sheets is even more difficult than similar 
operations on national product estimates, both the- 
oretically and practically. The additional theoretical 
difficulties arise primarily in the case of those finan- 
cial assets which have a fixed money value. These 
assets have no obvious physical dimension that can 
be regarded as the object of the adjustment for 
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price changes, such as exists both for tangible 
assets and for most of the commodity and service 
flows of the national income and product accounts. 
The use of indexes of wholesale or consumer prices, 
or of other indicators of change in the purchasing 
power of the currency in which the wealth esti- 
mates are expressed, is a conventional but far from 
satisfactory solution. The additional practical diffi- 
culties stem from the paucity, limited reliability, 
and doubtful comparability of the information avail- 
able on the prices of physical assets. 

Methods of measuring national wealth. In prac- 
tice, national wealth can hardly ever be measured 
on the basis of existing balance sheets of sectors 
or smaller groups of units because such documents 
are available only for part of the economy (often 
only for corporate business enterprises) and be- 
cause they do not apply a consistent system of defi- 
nition and valuation of wealth. Most existing evalu- 
ations of national wealth, therefore, are built up 
from separate estimates of the components of 
national wealth. These estimates may cover the 
entire economy or, more commonly, one of several 
sectors; and they may use very different sources 
and methods of estimation. The final national 
wealth estimates are, then, synthetic, representing 
a combination of a number of estimates for indi- 
vidual components and sectors. Such synthetic es- 
timates are acceptable so long as reasonably uni- 
form methods of valuation are used for the different 
components. The main methods used in estimating 
components of wealth are summarized below. 

Exhaustive (item-by-item) enumeration and direct 
valuation (census method) is usually feasible only 
for a few components of national wealth, such as 
agricultural land and single-family houses, where 
it can be based on the declarations of owners or 
occupants. The census method is extremely expen- 
sive if applied to a large number of nonstandard- 
ized properties, Probably its most extensive use is 
represented by the inventory in the Soviet Union 
of most of the country’s fixed reproducible assets 
as of the end of 1959, on the basis of an engineer- 
ing appraisal of millions of individual structures 
and pieces of equipment (Ryabushkin 1961 ) 

Sample evaluations, usually by engineering ap- 
praisal, of a limited number of properties of a 
given type may be blown up to the national total 
for the same item. 

Property tax valuation is applied primarily to 
real estate. Since tax values commonly deviate from 
current values, an adjustment, usually based on a 
sample of properties, is required. 

Fire insurance values may be used if insured 
values are close to market values (or if the differ- 


ence can be ascertained) and if the ratio of insured 
to uninsured properties is known. 

Perpetual inventory (replacement cost) estimates 
are based on the assumption that the market value 
of reproducible assets is equal to or close to their 
replacement cost. In this case the current value 
can be estimated by adjusting the original cost of 
classes of reproducible assets (reflected in past 
capital expenditures on them as given, say, in na- 
tional income accounts) for changes in the cost of 
construction and the prices of equipment and for 
capital consumption. This method has been used 
increasingly in the last decade to estimate the value 
of reproducible fixed assets, which in most coun- 
tries contribute between half and three-fourths of 
national wealth (Goldsmith 1950). It is particu- 
larly important for types of tangible assets that 
have no market value, such as government struc- 
tures, military equipment, and large private non- 
residential structures. Most of the estimates in 
Table 4 have been derived by this method. 

Personal property tax returns can be used in the 
few cases where individuals have had to file returns 
listing their assets in sufficient detail and at suffi- 
ciently uniform valuations to permit an estimation 
of the wealth of the entire personal sector. 

Samples of personal balance sheets obtained 
from a relatively small number of households have ` 
become another source of estimation of the value 
of all or selected assets of the personal sector. This 
approach, made possible by the development of 
modern sampling and interview methods, has been 
widely used in the United States and Great Britain 
(Goldsmith 1955-1956; Lydall 1955). ’ 

Estate tax or probate returns provide a source 
which is based on the assumption that the detailed 
Statements of assets and liabilities, at prescribed 
methods of valuation requested for tax or other 
purposes, of persons within an age bracket dying 
during the year may be regarded as a sample of the 
wealth of all living persons of the same age. The 
wealth of the living having wealth equal to at least 
the lower limit to which the estate tax or probate 
tules apply can then be estimated by multiplying 
the reported figures by the inverse of the mortality 
rates applicable to the different age groups of 
decedents (Lampman 1962). 

Capitalization of income can be used when the 
gross or net income of a specific type of wealth is 
known, say, from the national income accounts, 
and where a market rate of capitalization exists. 
This method is particularly important for types of 
tangible assets to which the perpetual inventory 
estimation cannot be applied, such as land and 
subsoil assets. 


The national wealth of the United States 


Current estimates. Table 1 provides a condensed 
picture of the main components of national wealth 
and of the balance sheets of the main sectors of 
the United States economy at the end of 1958, the 
last year for which a detailed comprehensive esti- 
mate has been made. 

While the combined assets of all sectors were 
in excess of $3,700 billion (excluding nearly $90 
billion of military assets), the consolidated assets, 
that is, the national wealth, amounted to only 
$1,700 billion, consisting of domestic tangible as- 
sets of about $1,650 billion and net foreign assets 
(including monetary metals) of $50 billion. The 
difference of about $2,000 billion between com- 
bined and consolidated assets is represented by 
domestic financial assets, which disappear in con- 
solidation, where they are offset by domestic lia- 
bilities and equities. 

There are considerable differences between the 
broad definition of national wealth used in Table 1 
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and narrower definitions that are sometimes pre- 
ferred. Thus, a definition of national wealth ex- 
cluding military assets and consumer durables 
results in an estimate of not quite $1,500 billion. 
A still narrower definition, limiting national wealth 
to so-called productive assets (defined as the tan- 
gible assets of business enterprises plus net foreign 
assets, and excluding all government tangible as- 
sets, residential real estate, and consumer dur- 
ables), further reduces national wealth to about 
$850 billion. 

Approximately 18 per cent of national wealth in 
the broad definition (but excluding military assets) 
consists of nonreproducible assets, chiefly agricul- 
tural and residential land. Structures, of which 
about half are residential, account for nearly 50 
per cent. Total equipment contributes fully 20 
per cent, of which producers’ durables represent 
12 per cent and consumer durables 10 per cent. 
Inventories are equal to approximately 8 per cent 
of national wealth, while net foreign assets account 
for only 3 per cent. 


Table 1 — Combined national and sectoral balance sheet of the United States, end of 1958 (billions of dollars)* 


BUSINESS ENTERPRISES GOVERNMENT 
Nonfinancial Financial” 
NATION Unincorporated Corporate State HOUSEHOLDS 
Farm Nonfarm Foderal and local 
a) (2) (3) (4) (5) (6) 7) (8) 
|. Physical assets 1,653 182 108 490 10 57° 173 632 
Residential structures and land 472 19 20 26 1 1 6 399 
Nonresidential structures and land 672 104 44 240 9 48 161 66 
Producer durables 200 19 27 146 1 1 5 2 
Consumer durables 179 14 - - - - - 165 
Inventories (including livestock) 130 26 17 79 - 8 0 = 
Il. Financial assets 2,082 25 30 276 693 58 30 970 
Money 222 6 4 33 93 4 n 61 
Deposits 150 3 o 2 1 0 4 141 
Insurance and pension claims 200 7 - - - - - 193 
Other short-term claims* 307 o ó 107 151 21 6 ó 
Mortgages 172 0 0 - 136 8 2 27 
Bonds and debentures! 358 s 0 7 240 7 8 91 
Equities 572 0 (1 79 43 0 - 450 
Miscellaneous 101 4 ) 48 30 18 0 1 
Ill. Liabilities 1,488 21 41 257 632 298 63 176 
Short term 686 10 27 158 421 9 2 59 
Long term" 803 LAI 14 99 212 288 61 117 
IV. Net worth 2,247 187 97 508 71 —182 140 1,426' 
V. Total assets (I + I = Ill + IV) 3,735 208 138 766 704 116 203 1,602 


a. Details may not add to totals because of rounding. 
b. Includes government insurance and pension funds. 


c. Excludes military and Atomic Energy Commission assets ($21 billion structures, of which $4 billion outside United States; $52 billion producer durables; $16 billion 


inventories, according to estimates in Goldsmith 1962, p. 405). 
d. Included in column 8. 
e. Includes U.S. government short-term obligations. 
f. Excludes U.S. government short-term obligations, 
g. Only U.S. government securities; others included in column 8. 


h. Includes only bonds and debentures (other thon U.S. government short-term obligations) and mortgages. 


i. Includes net worth of unincorporated nonfarm enterprises. 


Source: Goldsmith & Lipsey 1963. 
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Table 2 — The national wealth of the United States,° selected dates, 1805-1958 (billions of dollars) 


|. Nonreproducible assets” 
Agricultural land 
Nonagricultural land 
Il. Structures 
Residential 
Private nonresidential 
Public civilian 
Ill. Equipment 
\ Producer durables 
Consumer durables 
IV. Inventories 
livestock 
Other 
V. International assets (net)* 
VI. National wealth, current prices* 
VII. National wealth, prices of 1947/1949° 
VIIL. National product (gross), current prices 
1X. Population (millions) 


1805 1850 1900 1929 1945 1958 
0.70 3.02 31.0 114 122 311 
0.60 2.62 16.1 38 47 101 
0.10 0.40 14.9 76 75 210 
0.445 2.86" 349 190 285 833 
0.20 1.10 17.4 96 152 411 
0.22° 1.64* 15.5 71 78 254 
0.02 0.12 2.0 23 55 168 
- - 12.6 80 95 379 
- - 65 38 49 200 
0.04 0.30 61 42 46 179 
0.16 1.14 9.9 39 53 130 
0.06 0.54 3.1 7 10 18 
0.10 0.60 6.8 32 43 112 
—0.06 —0.15 -0.7 17 22 50 
1.30 7.17 87.7 439 576 1,703 
3.20 29.40 314.0 779 789 1,244 
0.85 2.60 18.7 104 214 445 
6.36 23.67 76.8 122 141 176 


a. Not including military assets (negligible 1805-1929; $73 billion in 1945; $89 billion in 1958). 


b. Does not include subsoil assets. 
c. Includes producer durables. 
d. Includes monetary metals, 


e. Summary estimate for 1966 is 2,460 in current and 1,550 in 1947/1949 prices (Kendrick 1967, p. 12). 


Growth and structural changes since 1805. The 
main function of national wealth estimates is to 
permit analysis of the growth of national wealth 
and of structural changes in it. The essential data 
are shown in Table 2. The estimates for 1805 and 
1850 are only very rough approximations. The fig- 
ures for these two dates do not include the value 
of slaves, although they then constituted part of 
private and national wealth. Slaves would add 
nearly $200 million, or 30 per cent, to the national 
wealth in 1805, and about $800 million, or al- 
most 20 per cent, in 1850 (Goldsmith 1952, 
pp. 317-318). 

An analysis of the rate of growth of national 
wealth requires estimates in constant prices, which 
are shown in line VII of Table 2. Total reproduci- 
ble wealth, the aggregate most relevant for growth 
analysis, increased almost six hundred times be- 
tween 1805 and 1958, or at an average annual 
rate of slightly more than 4 per cent. The estimated 
rate of growth was considerably faster during the 
nineteenth century—5.2 per cent—than in the 
twentieth, when it declined to 2.6 per cent, The 
growth rate, however, recovered in the postwar 
period (through 1958) to nearly 4.0 per cent. More 


Sources: Lines I-Vi—1805 ond 1850 from Goldsmith 1952, pp. 307, 310, 317) 
1900-1958 from Goldsmith 1962, pp. 117-118, 
Line VII—1805 and 1850 linked to 1900 figure on basis of estimates 
in 1929 prices in Goldsmith 1952; 1900-1958 from Goldsmith 1962, 
pe. 119-120. 
Line VIII—-1805 and 1850 based on Gurley & Shaw 1957, p, 256; 
1900 from Kendrick 1961, p. l; 1929-1958 from U.S, Office of 
Business Economics 1958, pp. 118-119. 
Line IX—averages of midyear figures in U.S. Bureau of the Census 
1960, p. 7. 


significant is the estimated rate of growth of repro- 
ducible national wealth per head. This rate was 
nearly 2.0 per cent for the whole period 1805- 
1958; 2.6 per cent for the nineteenth century; 
1.2 per cent for the first 60 years of the twentieth 
century; and 2.2 per cent for the postwar period 
1945-1958. 

One of the outstanding changes in the structure 
of national wealth is the decline of nonreproducible 
assets, practically without interruption, from 55 
per cent in 1805 to 18 per cent in 1958. This re- 
flected primarily the fall in the share of agricultural 
land from almost half at the beginning of the nine- 
teenth century to 6 per cent at present. 

Within reproducible wealth, changes can be fol- 
lowed in detail only since 1900, a period in which 
they have been relatively moderate. The share of 
residential structures declined very slightly, re- 
maining close to three-tenths. That of livestock 
dropped sharply from over 5 per cent to less than 
2 per cent, and that of inventories from 12 to 8 
per cent. On the other hand, the shares of producer 
durables and consumer durables together rose from 
22 per cent to 27 per cent. ( Military equipment, not 
included in the total, was equal to 6 per cent of it 


in 1958 and to about one-fourth of civilian equip- 
ment.) Closer analysis of structural changes in 
national wealth requires detailed estimates by sec- 
tors and by types of structure and equipment, 
which are not yet available for long periods of 
time. 

Financial assets and national product. Much of 
the value of estimates of national wealth lies in 
the possibility of comparing them with estimates 
of financial assets and of national product. 

The ratio of financial assets to national wealth, 
which measures the size of the financial super- 
structure in comparison with the real (tangible) 
infrastructure, increased considerably during the 
nineteenth century (Table 3). By 1850 it was 


Table 3 — The relation of national wealth to financial 
assets and national product in the United 
States, selected dates, 1850-1958 


CAPITAL-OUTPUT RATIO” 


BYNT, Including Excluding 
INTERRELATIONS "O"ePro- nonrepro- Including 
Rios ducible  ducible military 
assets assets assets 
a) (2) (3) (4) 
1850 0.5 2.8 1.6 1.6 
1900 0.8 47 3.0 3.0 
1929 1.3 4.2 3.1 3.1 
1945 17 27 2.1 2.5 
1958 1.2 3.8 3.1 3.3 


a. Financial assets (including equity In unincorporated business) divided by 
national wealth. 
b. Not stock of wealth divided by gross national product. 


Sources: Col. 1, 1850 roughly estimated, 1900-1958 
from Goldsmith 1960. Cols. 2-4 calculated 
from Table 2. 


probably on the order of half, considerably above 
the level of the beginning of the century. In 1900 
it had advanced to approximately three-fourths. 
It continued to rise until 1929, when it reached a 
value of one and one-fourth, partly under the in- 
fluence of the sharp advance in stock prices during 
the 1920s. Since then the ratio has not shown a 
definite trend. Throughout the 1950s, it was close 
to one and one-fourth, after a sharp but temporary 
bulge attributable to the repressed inflation of 
World War 1. 

The ratio of national wealth to national product 
increased sharply during the nineteenth century; 
stayed at nearly the same level from 1900 through 
1929; declined sharply until 1945; and during the 
next 15 years regained only part of the loss. As a 
result, the 1958 ratio was still well below the levels 
of 1929 and 1900. The direction of the movement 
of the ratio is not changed if deflated values are 
used for the numerator and denominator, if mili- 
tary assets are included, or if nonreproducible 
assets are excluded. The extent of the change, 
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however, is sometimes substantially affected by 
these differences in definition. The interpretation 
of these movements in the capital—output ratio is 
not easy and requires more detailed figures than 
are now available. The variations in the ratio re- 
flect in part the shift first toward capital-intensive 
sectors (railroads and public utilities) and then 
the opposite shift toward sectors that require rel- 
atively little capital (services). The movements 
also reflect changes in production functions within 
sectors or industries, particularly the relative im- 
portance of capital-saving technology and changes 
in the degree of utilization of plant and equipment. 


International comparisons 


The basic characteristics of the structural changes 
in national wealth that have taken place in the 
United States since the beginning of the nineteenth 
century are also evident in the differences that 
appear in a comparison of the structure of the 
national wealth of 17 countries in the 1950s 
(Table 4). Unfortunately, the estimates available 
for such a comparison differ greatly in scope, detail, 
and reliability, so that conclusions must be drawn 
with great caution. 

A first characteristic of the comparison is the 
inverse relationship between the share of nonre- 
producible assets in national wealth and the level 
of economic development and industrialization. In 
India, for example, the value of nonreproducible 
assets, mostly representing agricultural land, is at 
least as large as that of all reproducible assets. In 
the advanced countries the share of land and sub- 
soil assets in total assets is usually less than one- 
fifth, and much of this represents the site value of 
urban land. 

The differences are much less pronounced or 
systematic among reproducible assets (excluding 
consumer durables). On the average, the share of 
structures is slightly above three-fifths, including 
about one-fourth of the total for dwellings alone; 
that of producer durables is close to one-fourth; 
and that of inventories is in the neighborhood of 
one-seventh. There is no clear association between 
the share of the main types of reproducible assets 
and the level of economic development. Instead, 
large deviations from the average reflect either 
climatic differences (e.g., the low share of dwell- 
ings in Japan, Colombia, and South Africa) or 
special features of economic structure (e.g., the 
high share of inventories in countries with sub- 
stantial international trade, such as Great Britain, 
Canada, and South Africa). The differences are, 
of course, largest in the share of net foreign assets 
and liabilities, Compare such creditor countries as 
Belgium and the Netherlands, for which net foreign 
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Table 4 — Structure of national wealth of selected countries in the 1950s, reproducible 
tangible assets* (columns 3+5+7) = 100 
NATIONAL REPRODUCIBLE TANGIBLE ASSETS NEF 
WEALTH ~ - 
(2+3+5+ Structures Equipment Inyentories* FOREIGN. 
YEAR 6+7+8) LAND Total Dwellings Producers Consumers ASSETS? 
q) (2) (3) (4) (5) (6) 7) (8) 

Australia 1956 127 22 67* 23 7 n 16 = 
Belgium 1950 149 19 63° 43 29 18 8 12 
Canada 1955 - - 62 23 20 19 17 =14 
Colombia 1953 - - 66 16 20 - 4 - 
France 1954 139 20' 88° 19 9 15 12 4 
Germany (West) 1955 - 29 62 29 26 7 12 - 
India 1950 205 105 55 26 4 - 32 - 
Japan 1955 - - 57 17 22 25 22 1 
Luxembourg 1950 - 20 58 27 40 - - -$ 
Mexico 1950 - - 857 32 9 - 15 - 
Netherlands 1952 160 22 604 25 26 28 13 n 
Norway 1953 - 4 65 25 24 - m -2 
South Africa 1955 =- 34 63 WV 21 - 18 - 
United Kingdom 1953 - - 81" 34 - - 19 - 
U.S.S.R. 1959 146 39 55 23 15 ó 29 2 
United States 1958 139 16 72 35 18 16 12 5 
Yugoslavia 1953 153 40 67% 25 15 14 18 =j 
©, Military assets excluded. 
b. Includes livestock. 
€. Includes monetary metals, 
d. Includes all government reproducible tangible assets; hence slightly overstated and column 5 understated to some extent. 
æ. Livestock only; hence substantially understated and columns 3-5 proportionately overstated. 
f, Includes agricultural structures, but excludes commercial land. 
g. Equipment included with structures, 
h. Includes producer durables. 


investment is equal to about 8 per cent of national 
wealth, with Canada or Australia (not to mention 
some less developed countries for which no data 
are available), where a considerable part of the 
entire national wealth is foreign-owned. 

It will be noted that the structure of national 
wealth is very similar for the United States and the 
Soviet Union, so far as the relationship of residen- 
tial and nonresidential structures, producer dur- 
ables, and livestock is concerned. The main dif- 
ferences are the presumably much higher share 
of consumer durables in the United States and the 
higher share of agricultural land in the Soviet 
Union (no official estimates are available for these 
two items for the Soviet Union). 


Raymonp W. GOLDSMITH 
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It 
DISTRIBUTION 


Wealth and income are closely related concepts 
in economics. Wealth is a stock of natural resources 
and previously produced goods in existence at a 
moment in time, while income is a flow of goods 
and services produced or received during a stated 
period of time. Wealth arises, to a considerable 
degree, out of income, and income requires wealth 
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for its generation. This article is concerned with 
the distribution of wealth by type of holder and by 
the degree of inequality of the personal distribution 
of wealth. 

Distribution by type of direct holder. The 
ownership of wealth varies by type of economy. In 
a fully communized economy all wealth would be 
in the hands of government, while in a fully 
private economy government would hold none of 
the wealth. In fact, all national economies fall 
between these extremes. In 1958 the wealth of 
the United States was six-sevenths privately owned 
(Goldsmith 1962; see Table 1). 


Table 1 — Percentage distribution by sector of net tan- 
gible wealth (excluding military assets) of 
the Unted States, selected years® 


1900 1929 1945 1958 
All sectors 100 100 100 100 
Nonfarm households 32 34 34 35 
Nonprofit organizations 2 2 2 2 
Agriculture 28 15 15 11 
Unincorporated business 8 8 7 6 
Corporations 25 28 25 30 
Net foreign assets 3 3 0 b 
Total private 92 90 82 85 
State and local governments 5 8 10 10 
Federal government 3 2 8 5 
Total government 8 10 18 15 


a. As of end of year. 
b. Not available, 
Source: Goldsmith 1962, table A-19, pp, 145-146, 
rounded to nearest whole percentage point, 
Copyright © 1962, by National Bureau of 
Economic Research. 


Over one-third of the total private and public 
wealth was held by households and a little under 
one-third by corporations; about one-tenth was in 
the hands of farmers, and an even smaller share 
was held by nonfarm, noncorporate business. Non- 
profit organizations such as churches and charitable 
foundations owned only 2 per cent of the tangible 
wealth. 

Accounting for indirect ownership. The preced- 
ing discussion of the distribution of tangible wealth 
by sector should be complemented by a discussion 
of indirect ownership through intangible assets 
that are claims of one sector on the tangible assets 
of another. On a net basis the personal sector, 
along with nonprofit organizations, holds full claim 
to the tangible assets of the business sector, and 
the tangible wealth of the governmental sector is 
substantially offset by liabilities to businesses 
and persons. 

Complex interrelationships among and within 
sectors are present in a modern private enterprise 


60 NATIONAL WEALTH: Distribution 


economy. For example, financial intermediaries 
give claims in exchange for their own assets, which 
are in turn largely intangible [see FINANCIAL IN- 
TERMEDIARIES]. To take another example, a person 
may borrow from a financial intermediary (which 
in turn holds assets deposited by other individuals) 
to buy securities issued by a corporation that owns 
other corporations. 

Easy conclusions about the significance of the 
intersectoral distribution of net assets should be 
distrusted. This is especially true of the govern- 
ment sector, since the net assets of a government 
may be a most imperfect indicator of its capacity 
to meet its obligations. Some authors have sug- 
gested that a capitalization of government's current 
authorization to collect taxes should be entered as 
an asset of the government and as a liability of the 
private sectors. 

Distribution of personal sector wealth. The 
meaningfulness of a distribution of wealth within 
a particular sector is open to some dispute. It may 
be more important to know who actually controls 
an asset than to know who owns it either on a 
gross or on a net basis. Thus, minority owners of 
a business corporation may be in a position to 
control it because of their management responsi- 
bilities or through their control of the assets of a 
financial intermediary. However, there is consider- 
able interest in knowing how wealth is distributed 
within the personal sector, even though this dis- 
tribution may give only limited information about 
the distribution of power. 

Personal sector wealth is made up of tangible 
wealth plus claims on corporate and noncorporate 
businesses in the form of stocks, bonds, mortgages, 
demand deposits, and insurance rights; and claims 
on government in the form of bonds, notes, and 
currency. The tangible wealth may be classified as 
real estate, producer durables, and consumer dur- 
ables; the intangible wealth includes, in addition 
to the types mentioned above, patents, copyrights, 
and rights to pension funds and personal trust 
funds, All of these types of wealth confer a right 
to future income, but the nature of the other rights 
conferred varies considerably among the several 
types. Thus, the rights associated with cash are 
quite different from the contingent rights associ- 
ated with insurance or pension funds, and still 
different from the restricted rights of an interest 
in a personal trust fund. Since all the rights and 
powers of wealth ownership are the subject of law 
and contract, a perceptive analysis will indicate 
by its categorization of personally owned wealth 
something of the consequences that ownership of 
each of the several types has in practical affairs. 
Business equities, though they indicate more influ- 


ence in some economies than in others, generally 
have more meaning in terms of power than do, say, 
consumer durables. 

In measuring personal wealth it is not common 
practice to capitalize the future earnings from one’s 
own personal services, but it is nonetheless true 
that a person’s balance sheet of assets and liabili- 
ties should be viewed with reference to the value 
of such earnings. In this light, investments in 
income-earning assets may be considered as alter- 
native to outlays, such as those on education or 
health, that may increase the value of one’s per- 
sonal services. [See CAPITAL, HUMAN.] 

The total value of a person's tangible and in- 
tangible assets, less his obligations to others, equals 
his net worth, or equity. The aggregate of all per- 
sons’ net worth, less certain intangibles such as 
patents, equals the value of national tangible wealth 
less the net assets of governments and nonprofit 
organizations. 

Sources of information. Knowledge about the 
distribution of wealth among persons is imperfect 
and incomplete, No full census of wealth has ever 
been undertaken in any country, and the few exist- 
ing studies are incomplete both as to the coverage 
of wealth items and as to the part of the population 
surveyed. Three basic methods have been followed: 
censuses or sample surveys, estimates from income 
or property tax data, and estimates from probate 
or estate tax data. The most direct method is the 
census or survey, which involves questioning family 
or spending units or individual wealth-holders 
about the amounts of wealth of various types that 
they hold or have held. 

A less direct method is to estimate the value of 
income-producing wealth from income tax or prop- 
erty tax data. For example, the interest income 
attributable to a person may be capitalized at the 
going rate of interest to estimate the value of his 
debt securities. This income-capitalization method 
frequently suffers from such inadequacies of data 
that only a fragmentary picture of the over-all dis- 
tribution can be developed, giving rankings of asset 
holdings of particular types by income classes. A 
related method is to work from property tax re- 
turns, which in some countries permit an estimate 
of the composite total of the assessed valuations of 
at least limited categories of wealth. 

Another indirect method of estimating wealth 
distribution is to work through records of the wealth 
held by deceased persons as provided for probate 
proceedings or for death tax purposes, From these 
records it is possible to compile, first, a distribution 
of wealth among the persons who died in a given 
year. This distribution may then be converted into 
an estimated distribution of wealth among living 


persons by use of the estate-multiplier method, 
developed by Bernard Mallett in England in 1908. 
Corrado Gini of Italy and G. H. Knibbs of Australia 
were among the first to follow Mallett’s method. 
[See the biography of G1n1.] 

This method involves multiplying the numbers 
of deceased persons and the amounts of wealth 
they held by the inverse mortality rate for their 
age, sex, and socioeconomic group. Thus, if x is 
the number of men aged 70 who died in a year, 
each holding $1 million, then x times the appro- 
priate inverse mortality rate is the number of 
70-year-old men holding $1 million who were living 
in that year. This procedure must be carried out 
for each age-sex group in order to construct a 
representation of wealth-holdings by size, by type 
of property, and by characteristics of wealth- 
holders. 

A principal difficulty with the method is the 
uneven quality and limited availability of data. 
Probate information is locally filed and not cen- 
trally recorded. Moreover, laws concerning the 
information to be recorded vary widely across juris- 
dictions. Estate tax data are less than ideal mate- 
rials for the student of wealth distribution because 
of the relatively high minimum value of estates 
for which tax returns must be filed. In the United 
States in the 1950s, the minimum of $60,000 
meant that only about 2 per cent of all decedents 
were represented on such returns. In Great Britain, 
on the other hand, a relatively low filing require- 
ment of £100 has yielded a much larger sample of 
decedents on the tax returns. Numerous other dif- 
ficulties arising out of the nature of tax and prop- 
erty law prevent an easy translation from estate tax 
returns to a representation of wealth-holdings by 
living persons. 

Socioeconomic distribution. Personal wealth is 
distributed in a pattern quite different from that of 
income. The distribution of wealth is much more 
unequal than that of income, but not all persons 
in the highest income classes are also in the high- 
est wealth classes. Similarly, the lower income 
groups may include some moderately wealthy in- 
dividuals. This is explained to some extent by the 
life cycle of income and the timing of wealth accu- 
mulation and receipt of inheritances, which con- 
centrate wealth in the hands of older persons. 
Because wives are typically younger than husbands 
and because of the greater life expectancy of women 
than of men in many countries, wealth shifts 
toward women as age increases. Proceeds of life 
insurance, in which wives frequently are named as 
beneficiaries, account for the inclusion of younger 
widows among substantial wealth-holders in some 
countries. 
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The Survey of Consumer Finances found that in 
the United States in 1953 the share of wealth held 
by the top 10 per cent of spending units, ranked 
by net worth, was about 60 per cent; their share 
of income was about 23 per cent. In the same year 
the top 10 per cent, ranked by income, received 
about 30 per cent of income, but only about half 
of top income receivers were simultaneously top 
wealth-holders. The Survey also reported that self- 
employed, managerial, and professional persons 
were heavily overrepresented in the upper reaches 
of both income and wealth distributions. 

Further information about wealth distribution in 
the United States is provided by Lampman’s study 
(1962) of top individual wealth-holders, which was 
based upon federal estate tax data. According to 
this study, the top 1.6 per cent of adults (those 
having $60,000 or more of gross estate in 1953) 
had a median age of 54 years, and there was a 
positive association between age and size of estate 
within the group. Women made up one-third of the 
group and held 40 per cent of the wealth. This top 
1.6 per cent held an estimated 30 per cent of total 
personal wealth and substantial shares of most 
specific types of wealth. Business assets were more 
highly concentrated than consumer assets; while 
the top group referred to above held over three- 
fourths of corporate stock, they held only 16 per 
cent of real estate and 13 per cent of life insurance. 
(See Table 2.) 


Table 2 — Share of the top 1.6 per cent of adults in 
personal sector wealth, United States, 1953 


SHARE OF 
TOTAL WEALTH WEALTH HELD 
IN PERSONAL BY TOP 
SECTOR WEALTH- 
(Billions HOLDERS 
TYPE OF PROPERTY of dollars) (Per cent) 
Real estate 445 16 
U.S. bonds 60 38 
State and local government bonds 16 100 
Corporate bonds 6 89 
Corporate stock 4 156 82 
Cash 160 29 
Mortgages and notes 31 36 
Life insurance reserves 78 13 
Pensions and retirement funds 64 6 
Miscellaneous property, including 
equity in noncorporate business 
and consumer durables, etc. 221 18 
Personal trust funds* (54) (70) 
Gross total 1,238 31 
Debts and mortgages 133 22 
Net total 1,105 32 


* Not included in the total because the assets of such funds are distributed 
among the preceding categories. 
Source: Lampman 1962, table 90, pp. 192-193. 
Copyright © 1962, by National Bureau 
of Economic Research. 
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The degree of concentration of wealth varies 
among countries because of differences in indus- 
trial and occupational patterns, the age-sex com- 
position of the population, and many other factors. 
Although the income distribution is less unequal in 
Great Britain than in the United States, the re- 
verse is true of the wealth distribution. The latter 
finding is explained to some extent by the greater 
importance of agriculture and of owner-occupied 
housing and consumer durables in the United 
States (Lydall & Lansing 1959). No fully compar- 
able studies have been made for other countries. 

Historical changes in wealth inequality. It ap- 
pears that in both the United States and Great 
Britain the trend has been toward diminished con- 
centration of wealth. Langley (1950-1951) reports 
that the top 1 per cent of adults owned 70 per cent 
of the wealth in England and Wales in 1911-1913, 
but only 50 per cent in 1946-1947, Lampman sim- 
ilarly finds a drop in the share of wealth of the top 
1 per cent in the United States from 32 per cent 
in 1922 to 25 per cent in 1953. However, the latter 
drop is to some extent explained by transfer of 
wealth within families; the share of wealth of top 
families did not fall so much. 

Factors influencing distribution. The distribu- 
tion of wealth at any moment is the resultant of 
many past events. Each person represented in the 
distribution has experienced some variation in 
income, saving, dissaving, borrowing, value of 
assets held, and making and receiving gifts, in- 
heritances, and transfers of other kinds. Each year 
each person will experience a series of such 
changes, and his position in the wealth-holder 
ranking may shift from year to year. Moreover, 
new persons enter the ranking each year and others 
die, Over a long period the age—sex composition of 
the population may change, the relative prices 
of assets may shift, behavioral characteristics of 
people may be altered, or institutional arrange- 
ments may change so as to restrict or enlarge indi- 
viduals’ freedom and ability to accumulate or 
transfer wealth. 

The many factors involved in the determination 
of the wealth distribution may be classified as be- 
havioral, exogenous, demographic, and institutional. 
Behavioral factors include propensities to save, to 
choose among assets and debts, and to choose 
among alternative ways to transfer wealth before 
and at the time of death. For purposes of this 
analysis, the level and distribution of income and 
the levels of asset prices may be taken as exog- 
enous. Changes in these variables will have an 
independent effect upon the wealth distribution. 

Changes in the age—sex composition of the pop- 


ulation may influence the wealth distribution even 
though the behavioral characteristics of each age- 
sex group remain the same. Institutional changes 
may also lead to a new pattern in the over-all 
wealth distribution, Examples of such changes are 
changes in the progressivity of income, estate, gift, 
or inheritance taxes and changes in laws or cus- 
toms concerning methods of holding or transferring 
wealth. Changes in the rights of women and minors 
to hold wealth; the development of insurance, pen- 
sion funds, and trust funds; and modification of 
governmental loan or subsidy programs are further 
examples of institutional change. 

Changes in the over-all distribution of wealth, 
then, may be seen as the net result of changes of 
the several kinds detailed above. In some historical 
situations price changes may be quantitatively 
most important; in other periods demographic or 
institutional changes appear to be dominant. Some 
of these factors may be identified in explaining the 
decrease in inequality of distribution of wealth 
among persons in the United States between 1922 
and 1953. In this connection it is illuminating to 
know that approximately one-third of the market 
value of wealth in the hands of persons in 1953 
may be considered as having been accumulated by 
previous generations, one-third as the result of 
rising prices over the previous thirty years, and 
only one-third as accumulated out of the incomes 
of persons then living. During the years between 
1922 and 1953 the adult population grew markedly 
older, thus influencing the distribution in the direc- 
tion of greater inequality. The changes in capital 
value also tended to accentuate inequality, but a 
lessening of income and savings inequality and the 
wider spreading of wealth among family members 
worked to offset it. 

Theory of wealth distribution change. The 
theory of the determination of wealth distribution 
is underdeveloped, as is empirical study in this 
field. The Marxian proposition that a capitalistic 
society has a tendency toward increasing inequality 
would seem to be based upon the observation that 
wealth yields income, which in turn is the source 
of wealth, and the idea that an original inequality 
may be reinforced by the institution of inheritance. 
The plausibility of this proposition should not lead 
one to ignore the self-limiting and offsetting proc- 
esses in the picture. A self-limit is apparent in the 
fact that increases in wealth reduce its average 
yield, and this in turn controls the share that prop- 
erty income is of total income. Another example 
of a self-limit is found in the relationship between 
a top group’s share of wealth and its share of 
saving. In year-to-year change the group will main- 


tain its share of wealth only if its share of saving 
is equal to its share of wealth, ceteris paribus. But 
as the share of wealth increases, it becomes in- 
creasingly difficult for the group to maintain a 
parallel change in share of total saving. Demo- 
graphic changes also tend to be self-limiting. 

An example of an offset to increasing inequality 
is the fact that rich people transfer wealth to more 
beneficiaries than do the less wealthy. The “creative 
destruction” of changing market conditions may 
also work to offset an existing pattern of inequality, 
with innovations and obsolescence combining to 
destroy an initial wealth advantage of one group 
or another. Finally, social policy may work to off- 
set a tendency toward increasing inequality. Spe- 
cific areas in which policies may influence the 
distribution of wealth include education, labor mar- 
kets, social security, taxation, land tenure, home 
ownership, credit, insurance, and the law of 
inheritance. 


Relatively little research has been done on either 
national wealth accounting or the distribution of 
personal sector wealth, More research is needed on 
methods of estimating the wealth-holding of per- 
sons, on the relationship between income and 
wealth distributions, on savings and transfers of 
wealth by estate size, on the use of personal trust 
funds and foundations, and on capital gains and 
losses. The theory of changes in wealth distribu- 
tion should be explored in various ways. One prom- 
ising method is the use of simulation techniques, 
in which a hypothetical population is assigned a 
set of behavioral characteristics and demographic, 
exogenous, and institutional changes are succes- 
sively assumed. [See SIMULATION, article on ECO- 
NOMIC PROCESSES.] 

Further comparative research is also needed to 
find how wealth distribution varies by type of econ- 
omy and by stage of development. Finally, research 
is needed into the social and political significance 
of changes in the forms of wealth claims and of 
altered sharing of wealth among sectors. 


ROBERT J. LAMPMAN 
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NATIONALISM 


Nationalism is a political creed that underlies 
the cohesion of modern societies and legitimizes 
their claim to authority. Nationalism centers the 
supreme loyalty of the overwhelming majority of 
the people upon the nation-state, either existing or 
desired. The nation-state is regarded not only as 
the ideal, “natural,” or “normal” form of political 
organization but also as the indispensable frame- 
work for all social, cultural, and economic activi- 
ties. Yet nationalism and the nation-state are com- 
paratively recent historical developments. 

Unknown before the eighteenth century, when 
it originated in northwestern Europe and northern 
America, nationalism spread with ever-growing 
rapidity over all the earth, and since the middle 
of the twentieth century it has become a universal 
idée-force of contemporary history. It expresses it- 
self in the most varied and opposite ideologies— 
in democracy, fascism, and communism—as well 
as in the search for an “ideology,” be it African 
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personality or Arab unity. The nineteenth century 
in Europe has been rightly called the age of na- 
tionalism; the twentieth century, in which history 
has shifted from a European to a global basis, may 
become known as the age of pan-nationalism. 

Although certain traits are common to all forms 
of nationalism, each form is conditioned by the 
social structure, the intellectual traditions and cul- 
tural history, and the geographic location of the 
society in which nationalism asserts itself. There- 
fore, only a comparative historical study of the 
various forms of nationalism can do justice to any 
one of them; and only an interdisciplinary ap- 
proach will be able to cover the many facets of a 
highly complex phenomenon. No major collective 
research effort has yet been undertaken in this 
field, in spite of its vital importance for an under- 
standing of the contemporary world. 


The spread of nationalism 


The spread of nationalism on a global scale is a 
result of the Europeanization and modernization 
of non-Western and premodern societies. As a phe- 
nomenon of modern European history, the rise of 
nationalism is closely linked with the origins of 
popular sovereignty; the theory of government by 
the active “consent of the governed”; the growth of 
secularism; the lessening of the older religious, 
tribal, clannish, or feudal loyalties; and the spread 
of urbanization, industrialization, and improved 
communications. 

Nationalism has from the beginning been a po- 
litically revolutionary movement; it has tried to 
transform or overthrow the “legitimate” govern- 
ments of the past whose claim to authority was 
based upon divine ordination or hereditary rights, 
It wished to establish totally new political entities: 
states coextensive with ethnic or linguistic fron- 
tiers. Lord Acton, in his famous essay “Nationality” 
(1862), drew attention to the potentially danger- 
ous implications of this identification of political 
organization with ethnic divisions. Within one hun- 
dred years (1815-1920), nationalism completely 
transformed the political map of central, central- 
eastern, and southeastern Europe; and since 1947 
it has fulfilled the same revolutionary function in 
temaking the political configuration of Asia and 
Africa, 

The twentieth century has added another revo- 
lutionary dimension to nationalism. Nationalism 
has also become a socially revolutionary move- 
ment, demanding equal economic and educational 
Opportunities for all members of the national group 
and the active promotion of the welfare of the 
socially underprivileged classes. Its aims have be- 


come the establishment of a classless, theoretically 
equalitarian national society. By the middle of the 
twentieth century, all “young” nationalist move- 
ments had also become “socialist” movements—the 
word “socialism” covering as many different mani- 
festations as the word “nationalism” —whereas the 
“young” nationalist movements of the middle of 
the nineteenth century had sharply distinguished 
between nationalism and socialism. 

Nationalism, in the second half of the nine- 
teenth century, was regarded as the political doc- 
trine of the upper classes, of the rightists in the 
political spectrum of the age. It stood in sharp and 
repressive opposition to socialism, an international 
movement that included the industrial workers 
and landless peasants, who generally felt excluded 
from the national society, and expressed their as- 
pirations. In the German empire that Bismarck 
created in 1871, the nationalists regarded the 
workers as vaterlandslose Gesellen, and Bismarck’s 
antisocialist legislation, 1878-1890, treated them 
as enemies of the nation. This Bismarckian atti- 
tude of Germany's ruling classes continued under 
the Weimar republic and was one of the main 
factors causing its overthrow. The situation in 
France, Italy, Spain, or Russia was not very dif- 
ferent. There, too, the right-wing parties and the 
upper classes identified themselves with the nation 
and the national interest, and the lower classes felt 
excluded from both a real stake in the national 
economy and an active partnership in the de- 
termination of the policy of “their” nation. 

The first major nationalist revolution that put 
equal emphasis on the socialist revolutionary aspect 
was the Mexican revolution of 1910-1917. It set 
the pattern for the development of nationalism in 
many underdeveloped countries and for their fight 
against foreign political intervention and economic 
penetration and exploitation. Such an integration 
of nationalist and socialist revolutions was not, and 
could not be, attempted in the nineteenth century. 
The Polish aspirations for independence in the 
period of 1830 to 1848 were unsuccessful largely 
because the nationalist movement was upper-class; 
the peasantry had no interest in it and even turned 
against the landowners and urban intelligentsia. 
But even after World War I the nationalist revolu- 
tions of Kemal Atatiirk in Turkey and of the Kuo- 
mintang under Chiang Kai-shek in China neglected 
the need for social transformations of their na- 
tions. As the upheavals in Turkey in 1960/1961 
and the defeat of Chiang Kai-shek in 1949 revealed, 
the economic and social positions of the present 
masses in both countries had remained backward; 
they continued to feel themselves victims of “ex- 


ploitation,” but under the impact of nationalism 
they were ceasing to accept their age-old status 
passively. Only after World War m did socialism 
become an integral part of the nationalist revolu- 
tion, as in Gamal Abdel Nasser’s Egypt, in Ben 
Bella’s Algeria, in Fidel Castro’s Cuba, and in Sékou 
Touré’s Guinea—to name only a few outstanding 
examples. This development resembles the trans- 
formation of the nineteenth-century capitalistic 
nation-state into the twentieth-century welfare 
state, except that in the underdeveloped countries 
truly radical social reforms were required. This task 
was made more difficult by the need of simultane- 
ously building a cohesive national society, a need 
that, on the whole, had been fulfilled in the ad- 
vanced countries by the time of World War 1. 


Characteristics of nationalism 


Nationalism, despite its relatively brief history, 
has undergone several transformations. It was an 
elite movement in the first century of its historic 
role; a “bourgeois” movement in the age of the 
ascent of the middle classes; it has become, in its 
second century, a mass movement in which the 
people at large demand an ever-widening partici- 
pation in the political, social, and cultural life of 
the nation. 

What remains constant in nationalism through 
all its changes is the demand of the people for a 
government of the same ethnic complexion as the 
majority. Every people awakened to nationalism 
sees political self-determination as its goal. To be 
separate, distinct, and independent from other na- 
tions, and equal to them, is the fundamental claim 
of nationalists for their people. The “individual- 
ism” and “democratic equality” of the revolutions 
of the Enlightenment expressed themselves in 
these aspirations. In the nineteenth century these 
demands were transferred from the individual to 
the collective group. Only thus, said the national- 
ists, could the people become autonomous subjects, 
an end in themselves, instead of being a means for 
the policy of others. The Piedmontese jurist and 
Italian minister Pasquale Stanislao Mancini ex- 
pressed this sentiment in his Della nazionalita come 
fondamento del diritto delle genti (1851), in a 
classical formulation: “The nationalities which do 
not possess a government issuing from their inner- 
most life (governo uscito dalle proprie viscere), 
and which are subject to laws imposed upon them 
from the outside . . . have become means for the 
purposes of others and, therefore, mere objects.” 

Cultural self-determination. Political _ self-de- 
termination is only one part of the demands in- 
herent in all nationalism; an almost equal role is 
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played by demands for cultural self-determination. 
In nationalities that are striving for the creation 
of a nation-state, the quest for cultural self-de- 
termination precedes the quest for political self- 
determination and prepares the ground for the 
latter. This was the case with most central and 
eastern European peoples in the nineteenth cen- 
tury and is the case with the Arabs in the twentieth 
century. Nationalism, from the early nineteenth 
century on, carried with it the demand for “na- 
tional” or “popular” foundations for all cultural and 
intellectual life. This process began with the de- 
cline of supranational and theoretically universal 
cultural elements, such as the founding of all 
Western education on a thorough and often ex- 
clusive training in the classical languages; the role 
of French as the language of diplomacy and inter- 
national relations; the erudition in Arabic in all 
Islamic countries; the exclusive use of classical 
Chinese as the literary language until Hu Shih’s 
language revolution in 1917. These elements were 
replaced by an entirely new emphasis on the ver- 
nacular, on the political prestige of the national 
language, on folklore and folk traditions, and on 
the accessibility of culture to the nonlearned classes. 

Such a demand for cultural nationalism, which 
in its extreme cases parallels the demand for abso- 
lute political sovereignty and self-sufficiency, could 
also be found in United States nationalism, al- 
though it conflicted there with the more cosmo- 
politan trends of the Enlightenment, the age in 
which the United States became a nation. Noah 
Webster, in his Sketches of American Policy, 
wrote: “America is an independent empire, and 
ought to assume a national character. Nothing can 
be more ridiculous, than a servile imitation of the 
manners, the language, and the vices of foreigners. 
For setting aside the infancy of our government 
and our inability to support the fashionable amuse- 
ments of Europe, nothing can betray a more des- 
picable disposition in Americans than to be the 
apes of Europeans” (1785, p. 47). 

Walt Whitman, with all his cosmopolitan em- 
braces of mankind, in the Leaves of Grass, 1855, 
could appeal in his dithyrambic way to America: 
“Strangle the singers who will not sing you loud 
and strong! . . . Call for new great masters who 
comprehend new arts, new perfections, new wants! 
Submit to the most robust bard till he remedy your 
barrenness! Then you will not need to adopt the airs 
of others; you will have true airs, begotted of your- 
self, blooded with your own blood.” Whitman spe- 
cifically referred to Johann Gottfried Herder’s belief 
that creative work can be done only in one’s “own” 
folk language, that great art has always been the 
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expression of and is determined by the “national” 
spirit (Volksgeist). 

In Whitman two fundamental and opposite 
strains of nationalism confront each other—the 
one that corresponds to the “open” society and the 
one that corresponds to the “closed” society. No 
nationalism or phase of nationalism shows one of 
these strains in purity: it is always a question of 
emphasis. The “open” nationalism represents the 
more “modern” form: it inclines toward inter- 
course, and its basis is generally a territorial or- 
ganization and a political society, constituting a 
nation of fellow citizens irrespective of race or 
ethnic descent. The “closed” nationalism stresses 
the nation’s autochthonous character, the common 
origins (race, blood) and rootedness in the ances- 
tral soil. These determine the “purity” of national 
character and preserve it from “alien” influences, 
The romantic, anti-Western, and anti-Enlighten- 
ment Germanophilism and Slavophilism of the 
nineteenth century offer examples of such a 
“closed” nationalism; the image of their ideal so- 
ciety was to be found in the tribal or premodern 
past, in emphasis on Eigenart or samobytnost. The 
“open” nationalism, on the other hand, finds its 
ideal image in a future that will build bridges over 
the separations of the past. The “open” nationalism 
Stresses the free self-determination of the indi- 
vidual; the “closed” nationalism, biological or 
historical determinism. 

One of the outstanding examples of an “open” 
nationalism is provided by the United States. The 
Americans rejected common descent as the basis 
of their nationhood. They did not establish their 
nation on a common past with its roots in an- 
tiquity or medieval times, on a common religion or 
a unique cultural tradition. They owe their nation- 
hood to the affirmation of the modern trends of 
emancipation, assimilation, mobility, and individu- 
alism. They inherited the English tradition of lim- 
ited and mild government and constitutional free- 
doms; but the historical rights of Englishmen 
became, in the climate of the eighteenth-century 
Enlightenment, universal rights, which, theoreti- 
cally, had the strength to transform men of the 
most various pasts and descents into new men, 
building a common future in a new land. 

Racial purity. Racial purity has often formed 
the theoretical basis of nationality throughout his- 
tory. At the time of the building of the second Jew- 
ish state, national regeneration implied the repudi- 
ation of the wives whom Jews had taken from 
foreign tribes and of the children whom those 
wives had borne (Ezra 10; Nehemiah 10.30). Even 
the State of Israel grants an exclusive, privileged 


status to persons of Jewish descent in matters 
immigration and citizenship. In its most ex 

form, racial exclusivism and rootedness in 

ancestral soil became the basis of nationhood 
National Socialist Germany. The German Reich 
was regarded as the true homeland and the ceni 
of loyalty of all individuals of German descent, 
gardless of the “accident” of their political citizen: 
ship or the personal self-determination of their 
individual allegiance. 

The historian Heinrich von Treitschke stressed 
this point of view as early as 1870 in the case of 
the Alsatians. The French historian Ernest Renan, 
in his lecture Qu'est-ce qu'une nation? (1882), de- 
clared the racial theory “a very great fallacy whose 
dominance would ruin European civilization. . . . 
According to this theory the Germans have the 
right to take back the scattered members of the 
German family, even if these members [do not wish 
it]. Thus one creates a primordial right analogous 
to that of the divine right of Kings. . . . Will the 
Germans, who have raised the banner of ethnog- 
raphy so high, not see one day the Slavs [follow 
their example and reclaim the lands] of their 
ancestors? It is good for all of us to know how to 
forget” (pp. 1-29, passim). 

National supremacy. The rise of the new 
nation-states in the nineteenth and twentieth cen- 
turies started bitter disputes about frontier terri- 
tories, as each nationality claimed “historical” 
rights according to its greatest historical expansion. 
Thus, independent Serbia and independent Bul- 
garia, in spite of their close affinity of language and 
religion and their common past of subjection to 
the Turks, faced each other in repeated struggles 
over Macedonia, which both claimed as having 
formed part, in long-past times, of their respective 
empires. The fate of the Slav Ukrainians was in- 
volved in the centuries-old fight of their Slav neigh- 
bors, the Poles, and the Russians, for hegemony in 
the eastern borderlands of Europe. Nationalities 
that had demanded release from oppression often 
became, after liberation, oppressors themselves, 
sometimes subjecting others to more severe oppres- 
sion than they had suffered themselves. Most of 
the new states, although ethnically mixed, regard- 
ed themselves as power instruments of the domi- 
nant, or “state-forming,” nationality among the 
several inhabiting the territory, and denied equality 
to the other nationalities in “their” state. This was 
the case in Poland, Czechoslovakia, and Yugoslavia 
after World War 1, and in Ceylon after World 
War ni. Kurds and Somalis in the 1960s sought na- 
tional union as the Poles and southern Slavs had 
done fifty years before. The potentially fruitful 


innovation of international treaties binding the 
new nation-states after 1918 to protect national 
minorities, under the League of Nations, did not 
become effective. On the other hand, the expulsion 
or shift of populations from their homelands for 
nationalistic reasons, first envisaged by the Ger- 
mans in World War I in order to annex lands 
“without people” for German settlement and strate- 
gic purposes, became widespread in the twentieth 
century. Nationalism has “solved” many tensions; 
it has at the same time created new ones, in which 
modern aspirations and age-old memories are often 
inextricably mixed. 


Ideological transformations and conflicts 


Some of the fundamental beliefs of nationalism 
go far back in history. Among them are the “chosen 
people” idea and the “promised land” concept. Both 
originated with the ancient Hebrews; both pro- 
vided a divine sanction for nationalist aspirations 
and political aims; both are found in various forms 
throughout the ages as a conscious or unconscious 
Biblical heritage. With the advent of Stoicism and 
Christianity, which became the official creeds of 
the “universal” Roman empire, the narrow and 
“closed” tribalism of older times was overcome in 
an ecumenical “open” society. This universalism 
survived in the Christian world until the Renais- 
sance and Reformation; in Islam, until the later 
nineteenth century. In the Western world, the new 
absolutist states of the post-Renaissance period, 
with their emphasis on sovereignty, centralization, 
and raison d'etat, created the political organization 
that eighteenth-century nationalism began to trans- 
form into the modern nation-state. Modern nation- 
alism first took hold in England in the seventeenth 
century and in Anglo-America in the eighteenth 
century. But this nationalism respected, and was 
based upon, the individual liberties and self-gov- 
ernment characteristic of the development of these 
nations. The rise of nationalism in the French 
Revolution was different. The absolutist and cen- 
tralized French monarchy had set the example for 
continental Europe in the seventeenth and eight- 
eenth centuries; the nationalism of the French 
people continued this form and set the model for 
the centralized European nation-state of the nine- 
teenth century. The Napoleonic wars carried the 
aggressiveness of the new nationalism to the four 
corners of Europe. 

The European revolutions of 1848/1849 and the 
defeat of their liberal aspirations marked the spread 
of nationalism to central and central-eastern Eu- 
rope, the “awakening” of the peoples. The striving 
for individual liberty was drowned in the rising 
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tide of national (collective) self-assertion and will 
to power. John Stuart Mill complained in 1849 that 
nationalism makes men indifferent to the rights 
and interests “of any portion of the human species 
save that which is called by the same name, and 
speaks the same language, as themselves” ([1849] 
1865, p. 53). He called the new exclusive nation- 
alism, with its appeal to historical rights, barbaric 
and remarked bitterly that “in the backward parts 
of Europe, and even (where better things might 
have been expected) in Germany, the sentiment of 
nationality so far outweighs the love of liberty, that 
the people are willing to abet their rulers in crush- 
ing the liberty and independence of any people not 
of their race and language” (Mill [1849] 1865, 
p. 53). After 1848 nationalism, originally a move- 
ment of emancipation and constitutional rights, 
became known as Realpolitik and Machtpolitik. 
The sacro egoismo of nationalism reached its 
climax in the fascist movements. 

The war of 1914, which was started by dynastic 
empires, partly under popular pressure, replaced 
the empires with nation-states all over central and 
east-central Europe. At the same time it helped 
the spread of nationalism to Asia. Half a century 
later, nationalism had become the dominant force 
throughout the non-Western world, and the polit- 
ical map of Asia and Africa changed between 1945 
and 1965 as completely as had the map of Europe 
between 1815 and 1920. 

In the middle of the twentieth century, nation- 
alism everywhere prevailed over supranational ide- 
ology, as it had previously. Catholic France and 
Muslim Turkey had made common cause against 
Catholic Austria. At the end of the nineteenth cen- 
tury, republican France and Czarist Russia were 
brought together, not by ideological affinity but by 
the common fear of German aggressiveness and 
overconfidence. Ideological affinity and historical 
friendship between the dynastic empires of the 
Romanovs and Hohenzollerns did not prevail 
against the rising tide of nationalism. German 
statesmen characterized the war of 1914 as a strug- 
gle between Pan-Germanism and Pan-Slavism for 
the control of “Mitteleuropa.” When National So- 
cialist Germany resumed the hegemonial war in 
1939, it attacked semifascist and anti-Semitic Po- 
land, in spite of ideological affinities and friend- 
ship, and destroyed that country in close collabora- 
tion with communist Russia. In October 1940, 
fascist Italy attacked Greece, whose dictator, Gen- 
eral John Metaxas, was an outspoken admirer of 
fascism. 

In the ideological blocs of the post-World War 11 
era, nationalist differences made themselves more 
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and more felt. The authoritarian nationalism of 
de Gaulle’s France might be a factor in the disinte- 
gration of the democratic West and is reviving the 
goal of a European third force under French 
hegemony, independent of both English-speaking 
democracies and the communist East. Within the 
communist bloc, conflicting nationalist interests 
created acute tensions among the Soviet Union, 
communist China, Yugoslavia, and Albania. The 
imperialist trends of traditional Russian and Chi- 
nese national policies reasserted themselves, modi- 
fied and rejuvenated by communist ideology. As 
early as 1948, communist Yugoslavia affirmed and 
maintained her independence from communist 
Russia. Moreover, nationalist territorial claims 
hindered friendly cooperation between Yugoslavia 
and her two communist neighbors, Albania and 
Bulgaria. In the early 1960s the monolithic char- 
acter of international communism was merely ‘an 
ideological specter, not a political reality; the com- 
munist nations were even farther from a supra- 
national federative union than were the democ- 
racies. Even within the Union of Soviet Socialist 
Republics there were centrifugal nationalist trends 
which had been especially marked during the crises 
of World War m. 

Although nationalism remained the idée-force 
after 1945, there were unmistakable trends toward 
Supranational forms of cooperation and political 
organization. Earlier experiments at integration of 
closely related nations had not been promising. In 
the nineteenth century, a strong Pan-Scandinavian 
movement existed; yet these countries jealously 
preserved their national sovereignty, policy, and 
personality, achieving separation (Norway from 
Sweden; Iceland from Denmark), not integration. 
The small Central American republics, apparently 
united by language, religion, and history, tried in 
vain to federate. After World War 1, however, the 
agitations for a union of the democracies, for 
European unity, for an Atlantic community, for 
African unity seemed more promising. Numerous 
conferences were held; organizations were created 
whose strength was greater on paper than in 
reality; and limited Progress was achieved, espe- 
cially in concrete economic and social legislation 
and in organized cultural exchange. But even the 
smaller projects, like Benelux (Belgium, Nether- 
lands, Luxembourg) or the Maghreb (Tunisia, Al- 
geria, Morocco), ran into difficulties once the dis- 
cussion of desirabilities was to be abandoned for 
the realization of concrete possibilities. Traditional 
nationalism and continuing or newly emerging 
hegemonial claims, coupled with the clashing am- 
bitions of national leaders and the staying power 


of existing governmental frameworks, streng 
ened the centrifugal trends among the natio 
highly developed and less developed, “old” 


new. 


Role and prospects 


alism in the Western and non-Western worlds, 
Significant discrepancies exist everywhere, y 
they are more specific and individual than generic. 
On the whole, the “new” nations show trends and 
problems similar to those shown by the “new” 
nations of central Europe in the nineteenth cen- 
tury and of east-central Europe in the early twen- 
tieth century. Some of these “Western” nations in 
central-eastern and southern Europe were at the 
time of their rise to nationhood, and for decade 
thereafter, economically and socially underdevel- 
oped, preserving much of their premodern “feuda 
or medieval character, The Latin American nations 
are “old” as far as the history of national statehood 
goes, but in their social backwardness many re- 
semble the “new” nations of the middle of the 
twentieth century. All these nations bear witness 
to the profound transformation that is being 
brought about by world-wide trends: the possibili- 
ties of, and desire for, rapid technological change; 
the experience of such radical and violent move- 
ments as communism and fascism; the demand for 
social equality and for the active participation of 
the masses in national life; the “population ex- 
plosion” and the growth of giant cities, In this 
transformation, which increases global uniformity, 
nationalism acts as an accelerating factor because 
the policies of the newer or less developed nation- 
States have often been guided by the desire to catch 
up with the older and more highly developed nation- 
states. Nationalism can, however, also act as a 
force preserving older forms of societal life and 
stressing the diversity within a world community 
that is based on the acceptance of the nation- 
State as the basic form of political and cultural 
organization. 

Nationalism and the nation-state form the recog- 
nized foundation of the international organizations 
of the mid-twentieth century. The United Nations 
reflects in its growth the dynamic changes brought 
about by nationalism in the years after World 
War 1. It has successfully smoothed the transition f 
of many colonies to national statehood, a transition 
that had, in the past, frequently been accompanied 
by violent civil wars and protracted unrest, The 
United Nations accepted the principle of the legal 


equality of small and great nations and provided 
each with a voice in world affairs, thereby contra- 
dicting the attitude of the nineteenth-century Con- 
cert of Great Powers and rejecting the twentieth- 
century fascist disregard for the rights of “weak” 
or “small” states. The clashing interests of nations 
found in the United Nations a forum in which, for 
the first time in history, all peoples, civilizations, 
and ideologies could meet and discuss their differ- 
ences according to the procedures developed by 
Western parliamentary traditions. The United Na- 
tions represents a hope of divesting clashing nation- 
alist aspirations of their extremist character while 
recognizing their intrinsic validity. It also helps to 
intensify the peaceful intercourse among nations 
by creating and maintaining the outward forms of 
equality of status. 

It is difficult to foresee the future of nationalism. 
It is a divisive force in a world growing more and 
more interdependent, a force capable of producing 
bitter tensions and one-sided, self-righteous judg- 
ments that threaten the rational solution of inter- 
national conflicts. On the other hand, nationalism 
is an important factor in preventing any one or two 
of the strongest powers from establishing their 
hegemony over the whole globe or over a large part 
of it. In that respect, nationalism is a form of 
resistance to imposed uniformity, a bulwark of the 
beneficial diversity, individuality, and liberty of 
collective groups. It may be that in the future an 
attitude of tolerance and coexistence will divest the 
various forms of nationalism of the aggressive po- 
litical power drive that has characterized the age 
of nationalism. The growing fear of the conse- 
quences of an armed conflict may help to bring 
about such a change of attitude. In the 1960s the 
fear of war is powerful in all European nations, 
even those which welcomed previous wars. Inci- 
dents that in the nineteenth century would have 
led to war no longer play such a decisive role. 

The beginning of a general change of public 
temper in respect to the role of nationalism and 
the nation-state in international relations has been 
noticeable. Some historians have compared this 
change with the change, brought about by the 
Enlightenment and the rise of tolerance, that re- 
placed the age of religious wars in Europe with a 
period of uneasy and distrustful but generally 
peaceful coexistence of conflicting religions. A 
long process of change, beginning in the late seven- 
teenth century and taking at least two hundred 
years, was necessary before this fundamental atti- 
tude was generally accepted in the Western world. 
With the greater acceleration characteristic of the 
twentieth century, a similar process may transform 
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the age of nationalism and of warring nation- 
states with different civilizations and ideologies 
into an age of coexistence of free and equal 


nationalities, 
Hans KOHN 


[See also COLONIALISM; NATION; NATIONAL SOCIALISM; 
PAN MOVEMENTS; SOCIALISM; STATE. Other relevant 
material may be found in INTERNATIONAL POLITICS.] 
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In most countries, at least some commercial ac- 
tivities are carried on by organizations whose 
ownership and control vest in governments or 
bodies established by governments rather than in 
private individuals or groups. The reason for this 
may be simply one of political philosophy. Thus, 
for those who accept a Marxist labor theory of 
value, private ownership of capital and the un- 
earned income flowing from it represent the essen- 
tial evils of the capitalist system, and public owner- 
ship and operation of such capital become political 
ends in themselves. But the political support for 
nationalization is not restricted to Marxists. It also 
influences the thinking of other “democratic” so- 
cialists: the British Labour party, for example, is 
still committed in principle, in 1965, to the nation- 
alization of the means of production, distribution, 
and exchange. Such attitudes are influenced, for 
example, by belief that private capitalism is im- 
moral in its appeal to competitive and acquisitive 
motives or by dislike of the power of some men 
over others that private ownership of property is 
believed to confer. 

At the same time, outside the communist coun- 
tries there is continuing disagreement among so- 
cialists as to whether nationalization is an end of 
democratic socialism in itself or simply a possible 
means to the attainment of other economic and 
social goals. Certainly, in countries in which such 
groups have enjoyed political power, nationaliza- 
tion measures have been pragmatic in character 
and have been restricted to industries considered 
especially suitable for nationalization for reasons 
of history or of economic efficiency. This article 
will be confined to nationalization in the context 
of these “mixed” economies and to the questions of 
economic efficiency, broadly defined, that affect 
nationalized industries in such conditions. This 
should not be taken to mean, of course, that I 
would believe it possible to explain the existing 
Scope and character of public enterprises in any 
country at any time solely by reference to eco- 


nomic criteria. On the one hand, public enterprises 
of economic importance exist in countries, such as 
the United States, in which nationalization as a 
political dogma is of trivial influence. On the other, 
the extent and character of the deliberate acts of 
nationalization in countries such as Britain have 
clearly been influenced by motives other than the 
search for an efficient use of productive resources, 
But whatever the historical reasons for the exist- 
ence of nationalized industries, they have to face 
a common set of economic problems, and it is 
toward these that attention will be directed. 

Equally, I shall exclude from discussion the wide 
variety of other forms of public ownership or inter- 
vention which are also to be found in most coun- 
tries. These range from municipal commercial ac- 
tivities to the share-participation of governments 
or their agents in firms or industries which remain 
in principle private. Many of these other activities 
of governments share common characteristics with 
nationalized industries, but they also differ in ways 
that make it desirable to set them aside in order 
to concentrate attention upon the nationalized en- 
terprises proper, that is, upon the quasi-independ- 
ent corporations set up by central governments to 
indulge in specified commercial activities with the 
help of physical capital which has no private 
owners. 

Efficiency in resource allocation. Let us begin 
from the admittedly oversimple notion that the sole 
objective of nationalized industries is the further- 
ance of an efficient use of community resources, 
and let us simplify our problem further by assum- 
ing that the nationalized industry is operating in 
an economy in which general conditions of compe- 
tion prevail in the private sector, If the nationalized 
industries being considered have appropriate tech- 
nical characteristics, it might seem that the postu- 
lated objective of “ideal resource allocation” could 
be achieved simply by instructing the managers of 
the producing units within nationalized enterprises 
likewise to compete one with another—and with 
private industry. But even in this greatly simplified 
world, certain interesting problems arise. 

Such a solution would require that the producing 
units of nationalized enterprises should endeavor 
to maximize their net returns, and that in doing so 
they should finance their investment programs 
either from their own resources or from revenues 
raised by competitive borrowing in the private cap- 
ital market. That is, the activities of nationalized 
undertakings would be subjected to the same mar- 
ket tests that are applied to a private corporation. 
But this carries with it the implication that the 
riskiness of investment should be unaffected by 


the nature of resource ownership, and it is not easy 
to see how this can be achieved. If public ownership 
is believed to imply an ultimate obligation for the 
government to meet the liabilities of the enterprise, 
and it is clear that such a view is usual, then this 
fact must reduce the risks of those who lend to 
that enterprise and hence must enable it to borrow 
on terms more favorable than those available to 
private producers. Equally, it is necessary that 
ventures of similar inherent riskiness should earn 
a similar rate of return, whether they are public 
or private in nature. 

A genuinely competitive situation, then, would 
seem to require both that there be no inhibitions 
upon the directions in which the activities of the 
nationalized industries might develop (since if their 
scope is regulated, the producing units may be pre- 
cluded from implementing plans they believe likely 
to be profitable), and that they be explicitly sub- 
jected to exactly the same kinds of bankruptcy 
procedures and other financial constraints as enter- 
prises of other kinds. Were such a solution to be 
adopted, assuming it to be possible, the size and 
character of the nationalized sector would effec- 
tively come to be determined by the play of market 
forces, and it is then not obvious what economic 
purpose the deliberate creation of nationalized en- 
terprises could serve. 

Departures from this idealized world arise for 
several reasons. First, those who find nationaliza- 
tion politically appealing are unlikely to agree that 
its extent and functions should be decided solely by 
the competitive criteria just set out. Second, there 
may be specific reasons why competition is con- 
sidered contrary to the public interest. For example, 
the frequent operation of civil airlines as national- 
ized undertakings is clearly related in part to stra- 
tegic considerations and to a related distrust of 
international competition. Third, there are argu- 
ments, usually imprecise, that economic power is 
properly vested in government, for example, that 
it should control the “commanding heights” of the 
economy, and that nationalization contributes to 
this. Fourth, there is a set of specifically economic 
arguments, ranging from the need to deal with 
private monopoly to questions of employer-em- 
ployee relations. 

In what follows, we shall make the realistic as- 
sumption that the idealized conditions of competi- 
tion provide a useful bench mark against which 
to judge actual policies but that once national- 
ized industries exist, they are seldom if ever 
going to be able (or permitted) to use thorough- 
going competition as a solution to their investment, 
price, and output decisions. (It does not, of course, 


NATIONALIZATION 71 


follow that they themselves must be protected 
against competition from elsewhere.) Conse- 
quently, economic policy must take account of the 
fact that nationalized industries are likely to enjoy 
varying but sometimes significant degrees of mo- 
nopoly power, and that means other than competi- 
tion have to be found to promote efficient resource 
use in the special conditions of nationalization. 
The next section deals with monopoly. Subsequent 
sections deal with other economic problems in four 
distinguishable but closely related groups: pricing 
policy, stability and growth, labor relations, and 
denationalization. 

Nationalization and monopoly. The resource 
allocation problem does not disappear if the simple 
competitive solution to it is rejected: some other 
set of behavior rules (administrative procedures) 
has to be devised to guide the output, price, and 
investment decisions of nationalized enterprises. 
Public monopoly brings its own problems of re- 
source allocation and of economic power; this re- 
mains true even if one purpose of nationalization 
is to destroy private monopoly power. Indeed, there 
are some respects in which the monopoly power 
of a nationalized industry must be greater than that 
of any private group. For private monopoly power 
is separated from and opposed to the political 
powers of government, and these latter powers pro- 
vide means of legislative and other control. The 
counterpart of the “commanding heights” argument 
earlier mentioned is the fact that nationalization 
places political and economic power in the same 
hands: there is no lack of instances, for example, 
of governments using legislative or fiscal measures 
to protect nationalized industries from developing 
competition. 

On the other hand, if public enterprises are not 
required simply to pursue profit, then it is argued 
by advocates of nationalization that the absence 
of the motive of personal gain makes it possible 
for administrators to behave “responsibly,” in a 
way that cannot be expected of a private monopo- 
list. Insofar as the policies of nationalized indus- 
tries are politically controlled, the weight given to 
this proposition will depend upon the degree to 
which it is accepted that political motivations are 
likely to be more “responsible” in some sense than 
economic ones. Also for this type of proposition 
to have operational (policy) relevance, it must be 
possible to specify the characteristics of “respon- 
sible” behavior in an unambiguous and agreed 
fashion, so that they can be embodied in adminis- 
trative rules and procedures (“guidelines”). This 
is not a trivial problem: its resolution must incor- 
porate not only a statement of principle as to how 
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a “responsible” monopolist should behave but also 
the provision of agreed answers to all the problems 
that will be discussed in subsequent sections. In fact, 
since World War 1 this notion of public respon- 
sibility has encouraged a particular interest in the 
~ control of nationalized industries among students 
of public administration. In Britain, for example, 
writers have been much concerned to relate the 
(admittedly vague) specifications of the enabling 
acts of Parliament to administrative structures and 
guiding principles. In consequence, the economic 
purposes for which the industries apparently ex- 
ist have become overshadowed by procedural de- 
bate concerned with implementation of an incom- 
pletely specified public interest. It was, indeed, 
only in 1961, with the publication of a White 
Paper—The Financial and Economic Obligations 
of the Nationalised Industries (Great Britain... 
1961) that a British government attempted to lay 
down general policy guidelines explicitly related 
to economic purpose, 

Pricing policy. The most obvious difficulty that 
arises if a “fully competitive solution” cannot be 
(or is not) adopted is that of producing satisfactory 
rules for the pricing of nationalized industry prod- 
ucts. Even if it were possible to find criteria by 
which a “proper” return on investment could be 
determined, there would remain the difficulty that 
monopoly power makes it possible for a national- 
ized enterprise to increase its earnings simply by 
raising prices or by such devices as discriminatory 
pricing—devices which would be precluded by the 
existence of competitive conditions and which can- 
not generally be expected to improve resource allo- 
cation in the community at large. To some extent, 
this is a problem that might be dealt with by gen- 
eral directives as to what types “of pricing policy 
are permissible and by subjecting nationalized in- 
dustries to the same antimonopoly policies as the 
rest of industry. But there remain some awkward 
problems of implementation. In the first place, it 
is frequently difficult to establish whether the 
charging of different Prices (for example, by use 
of multipart rate structures in such industries as 
electricity or telephone service) is evidence of 
monopolistic discrimination or of variations in the 
differential cost of supplying different consumers. 
In the second place, it is in the nature of things 
that different nationalized industries enjoy differ- 
ent degrees both of monopoly power and of poten- 
Hal profitability, and this makes it difficult to 
apply the same general rules to all of them in a 
fashion that will conduce to economic efficiency. 

Again, nationalized -industries are frequently 
multiproduct firms, and both the costs of providing 


the different products and the demands for them 
may be interrelated. This is of course a problem 
that is not unknown in private industry, nor is its 
existence incompatible in principle with efficient 
resource use. But the circumstances in which pol- 
icies such as cross-subsidization are an efficient 
means of dealing with the problem are difficult 
enough to establish in principle, much less to ex- 
press in general rules covering the whole range of 
nationalized industries. What we can be clear about 
is that rules that ignored cross-subsidization and 
rules that forbade it (assuming that practicable 
rules could be formulated) would both be contrary 
to purpose. 

Yet another aspect of the pricing problem con- 
cerns the “public utility” aspect of nationalized 
industries. If the conditions of production are such 
that only a few technically efficient producing units 
are needed to satisfy market demand, then it can 
be argued that a different criterion of pricing from 
that appropriate to competitive conditions may be 
called for. Specifically, indivisibilities in the pro- 
duction process may result in its being possible to 
increase output at an incremental cost that is less 
than the average cost of producing the industry's 
output as a whole, Thus, it is argued, a choice has 
to be made between pricing at average cost, which 
denies access to the product to some potential con- 
sumers who would be willing to pay a price which 
would cover the incremental costs of increased out- 
put, and pricing at marginal (incremental) cost, 
in which case such consumers would be supplied 
but total receipts from sales would be less than 
enough to meet all necessary costs of production, 
so that the industry would have to be subsidized. 
The complexity of the problem is increased by the 
fact that, in such conditions of indivisibility, it is 
also conceptually possible for marginal cost to be 
greater than average cost. There is a considerable 
and continuing literature concerned with this prob- 
lem (see E. H. Phelps Brown & Wiseman 1953). 

In my view, however, this aspect is of greater 
interest as an intellectual exercise than as a prac- 
tical guide to nationalized industry pricing policy, 
for two reasons. First, there are very few industries, 
if any, in which technical conditions of themselves 
preclude the existence of enough production units 
for competition to be at least a conceptual possi- 
bility. Second, the case for marginal cost pricing 
is analytically suspect. All production processes in- 
volve temporal and technical indivisibility of some 
kind: the arguments for the application of special 
tules to “public utilities” rests upon an essentially 
arbitrary decision that the opportunity costs of the 
use of some specially durable assets should be 


ignored. Also (and as a corollary of this), accept- 
ance of the need to subsidize must involve a value 
judgment that the real incomes of consumers of na- 
tionalized industry products are properly increased 
by redistribution from the taxpayers. In practice, 
these difficulties have meant that, even in those rare 
cases where a marginal cost pricing rule is said to be 
adopted (for example, the pricing of electricity in 
France), the actual procedure rests upon essen- 
tially arbitrary solutions to the problems described. 

Finally, it is commonly argued that, while na- 
tionalized industries may serve directly “commer- 
cial” purposes, they also confer other, indirect 
social benefits upon the community which private 
producers would fail to take into account (because 
such benefits produce no profit) but which can 
influence the policy decisions of a nationalized 
corporation, Recent writings have demonstrated, 
in my view conclusively, that these external bene- 
fits (“spillovers”) have effects upon resource use 
and income distribution in the community at large 
that are not to be predicted a priori, that they are 
intimately related to such matters as the nature of 
property rights, and that their existence is unlikely 
to be confined to nationalized industries. Never- 
theless, it is of interest to postulate as a simplifica- 
tion that such externalities exist solely in the case 
of nationalized industries and to ask what policy 
conclusions would follow. It is useful to distinguish 
general and specific benefits. Logically, general 
social benefits (deriving from the very existence of 
the enterprise) would justify the global subsidiza- 
tion of the nationalized industry per se. It is not 
easy, however, to think of benefits of this kind, 
much less to quantify them, Most of the illustra- 
tions commonly given turn out on examination to 
be specific in character. For example, the “strate- 
gic” argument for the subsidization of a whole rail- 
road system cannot be examined rationally without 
specification of the particular strategic objectives 
(for example, the character and timing of the rele- 
vant hostilities) to which policy has to be related, 
But once this has been done, it is at least concep- 
tually possible to identify those parts of the sys- 
tem which have strategic value, and it is to these 
alone that subsidization should be directed. In fact, 
little has yet been done by those who utilize this 
type of argument to identify and measure the ben- 
efits in question, and no country requires nation- 
alized industries to attempt to quantify them for 
such purposes as the setting of “target” rates of 
return on capital. 

Investment criteria, stability, and growth. The 
discussion so far has taken no account of the fact 
that nationalized industries, like others, must make 
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new investments and that a decision has to be 
taken as to what rate of return they should expect 
to earn on such investments. [See INVESTMENT, 
article on THE INVESTMENT DECISION.] Equally, 
the pricing problem must incorporate some notion 
of the opportunity costs of the existing physical 
assets of a nationalized industry (this is the public 
utility pricing problem again, but seen from an- 
other viewpoint), and, in cases where the industry 
has come into being by the purchase of the rights 
of private shareholders, regard also must be given 
to the financial compensation burden that should 
properly be borne by the nationalized industry. 
Ideally, the investment guidelines would have 
to take account not only of the problems discussed 
in the last section but also of differing degrees of 
investment risk in different nationalized industries. 
Additionally, new investment, existing assets, and 
financial obligations all raise different questions 
and can be argued to call for different criteria. In 
the event, no intellectually satisfying “rules” that 
are also operationally practicable have been put 
forward. The most explicit attempt to deal with 
the problems is perhaps the British one already 
referred to (Great Britain... 1961). Effectively, 
this cites the “average” return to private industrial 
investment as a conceptual bench mark. But di- 
vergences are accepted for the kinds of reasons 
that we have examined in relation to pricing policy, 
and the rate-of-return “targets” of the individual 
industries are in fact decided each in the light of 
its own circumstances and subject to the need for 
ministerial agreement to substantial price changes. 
There are manifest dangers in this type of proce- 
dure, for example, in the opportunities it provides 
for the subsidization of unprofitable ventures and 
in the scope it gives for political maneuver. But 
there are also some advantages, particularly in 
comparison with vague directives to cover costs 
“taking one year with another.” In particular, the 
procedure does encourage informed comment and 
comparison: the application of different criteria 
to British gas and electricity enterprises, for ex- 
ample, has attracted critical attention. Nor would 
it be universally agreed that the procedure must 
produce investment decisions manifestly less effi- 
cient than those taken in the private sector. Real 
world investment decisions (and particularly those 
of large corporations), it is argued, are taken in a 
fashion that fits uneasily with the notion of ideal 
resource allocation through informed competition. 
Associated problems, to which no entirely satis- 
factory solutions have been found, concern the de- 
gree to which the enterprises should be expected 
to be self-financing (a decision which has a clear 
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affinity with the pricing problems of the last sec- 
tion) and the imperfection of the capital market 
if the nationalized industries form a significant 
proportion of the total demand for new borrowing. 

The investment policies of nationalized indus- 
tries have also attracted attention from another 
viewpoint. The new theories of income and employ- 
ment developed after 1936 encouraged support for 
nationalization measures, it being argued that pub- 
lic investment programs could be varied to counter 
fluctuations in the general level of community in- 
come and employment [see INCOME AND EMPLOY- 
MENT THEORY]. Since then, increasing analytical 
sophistication has combined with increasing aware- 
ness of the practical difficulties of such a policy 
(and its potential conflict with other objectives) to 
diminish interest in the idea. On the other hand, 
there is a growing interest in the role of public 
enterprise in generating growth in less developed 
countries. While no common or agreed ideas have 
yet emerged, some interesting approaches have 
already been developed. Greece provides an exam- 
ple. A distinction is made between infrastructure 
enterprises (for example, harbors and communica- 
tions), whose technical characteristics appear to 
make public operation a practical necessity for the 
foreseeable future, and other nationalized enter- 
prises, whose existence is necessitated, for exam- 
ple, by the lack of a developed domestic capital 
market, scarcity of entrepreneurship, or similar 
barriers to private initiative, [See CAPITAL, SOCIAL 
OVERHEAD.] The two groups require different pol- 
icies, The first has to be regarded as a continuing 
public obligation, necessary for the stimulation and 
growth of other types of economic activity. The 
second can be treated as self-liquidating, in that 
the further development of the enterprises them- 
selves, and with it of the whole economy, should 
make public operation unnecessary and permit 
them to be sold off to private capitalists. It re- 
mains to be seen whether growth is in fact en- 
couraged by such policies and, if it is, whether it 
will be politically possible to shift the successful 
“commercial” enterprises from the public sector to 
the private sector. 

Labor relations. Historically, workers’ support 
for nationalization is, of course, associated with po- 
litical dogma and embraces a belief that it must 
make for better labor—employer relations. Experi- 
ence provides no particular support for this view. 
True, the trade unions do not have to negotiate 
with private capitalists. But the policy makers 
(whether administrative heads or politicians) still 
have interests to serve and directives to obey that 
are wider than, and always in potential conflict 


with, the interests of organized labor. Also, the 
very nature of nationalized industry attracts pub- 
licity for disputes, and this at least must make 
employer—employee relations no easier. 

Workers’ support also derives from a fear of 
technological unemployment: nationalization pro. 
vides means not otherwise available of preventing 
or slowing the decline of industries adversely af- 
fected by technical change (such as the British coal 
and railway industries), For a community inter- 
ested in economic growth, however, the workers’ 
direct interest may seem here to be in conflict with 
the social interest. At the same time, the workers’ 
distrust of technical change derives from a recog- 
nition of the hardships it can impose upon individ- 
ual workers. Thus the support for nationalization 
as a protection is comprehensible, though it pro- 
vides a policy solution inferior in all major respects 
to income-deficiency supplements, retraining, or 
other measures framed to deal directly with the 
personal, social, and economic consequences of 
dynamic change. [For further discussion of labor 
relations under nationalization, see LABOR UNIONS, 
article on LABOR MOVEMENTS AND COLLECTIVE 
BARGAINING IN EUROPE.] 

Denationalization. If an industry is to be de- 
nationalized, there are difficulties to be solved in 
disposing of the relevant assets. These difficulties 
vary in importance with such things as the dura- 
tion of the prior nationalization, the size of the 
industry and its constituent technical units, ex- 
pected present and future profitability, likelihood 
of renationalization, and so on. But while there are 
questions of technical interest here, they are per- 
haps less fundamental than those concerned with 
the criteria by which denationalization might be 
deemed appropriate, This is a question that clearly 
involves political attitudes of the kind discussed at 
the beginning of this article. But it has received 
remarkably little attention, even within the context 
of agreed political aims. For example, there has 
been little satisfactory discussion of the criteria by 
which it might be decided, in an economy in which 
private ownership of capital is believed preferable 
(ceteris paribus), that industries earlier national- 
ized should automatically become eligible for de- 
nationalization. 


The discussion should have sufficiently indicated 
that there are continuing differences about the po- 
litical economy of nationalization that are perhaps 
unlikely to be resolved by reason, at least in our 
time. There are also important outstanding con- 
ceptual and analytical disagreements, such as those 
concerned with the internal logic and practical 


relevance of incremental (marginal cost) pricing. 
There is also scope for further technical study, par- 
ticularly in the two important areas concerned with 
the further development and application of cross- 
subsidization ideas and with the evolution of pro- 
cedures by which social benefits and costs can be 
more adequately specified, quantified, and ex- 
plicitly incorporated into pricing and investment 
policy guidelines, 

Jack WISEMAN 


[See also Prices, article on PRICING POLICIES.] 
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NATIVISM AND REVIVALISM 


Nativism and revivalism are two forms of social 
movements. Like several other forms of social 
movements—such as millenarism, cargo cults, and 
utopian communities—revivalism and nativism 
have been considered to be aspects of the same class 
of phenomena. This larger class has been termed 
“revitalization movement” and has been defined 
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as a conscious, deliberate, organized effort on 
the part of some members of a society to create a 
more satisfying culture. In revivalism, the aim of 
the movement is to return to a former era of hap- 
piness, to restore a golden age, to revive a previous 
condition of social virtue. In nativism, the aim of 
the movement is to purge the society of unwanted 
aliens, of cultural elements of foreign origin, or of 
both. Frequently, a movement is both revivalistic 
and nativistic. 

The term nativism has also been used to refer 
not to social movements but to a widespread atti- 
tude in a society of rejection of alien persons or 
culture. In this second sense of the term, nativism 
is a form of utopian thought (Mannheim 1929- 
1931). It is thus comparable to such popular beliefs 
as faith in the existence of a land without evil, or 
in the ultimate arrival of a messiah or mahdi, or 
in the coming of a millennium or of the ancestors 
with cargo. All of these beliefs are pervasive “myth- 
dreams” (Burridge 1960), which suffuse a society 
or culture area over considerable periods of time; 
but although the code of a movement may incor- 
porate such a myth-dream, the myth-dream, as well 
as the general social policy, of nativism, millennial 
expectation, revivalistic nostalgia, etc., does not in 
itself constitute a movement. 


Natural history 

Revivalistic, nativistic, and other kinds of re- 
vitalization movements have been generally ob- 
served to go through certain processual stages. 
These stages, if effectively fulfilled, are character- 
ized by the initiation of certain functional tasks 
without which the movement cannot achieve its 
aim, the transformation of society. Manifestly, all 
movements do not complete the cycle, sometimes 
because the movement is suppressed by force, be- 
cause its millenarism or messianic hopes are dis- 
appointed, or because it cannot attract or retain a 
sufficient membership. The successful movement, 
however, passes through the following stages: 

(1) Premovement phase. 

(a) Steady state. The society is satisfied with 
itself; no major group is experiencing sufficient 
stress or is sufficiently disillusioned to be seriously 
interested in radical change. 

(b) Period of increased individual stress. As 
a consequence of one or more of many possible 
circumstances—depression, famine, conquest by 
an alien society, acculturation pressures, or what- 
ever leads to the awareness of a growing discrep- 
ancy between life as it is and life as it could be 
(and is for someone else )—growing numbers of 
people experience psychological and physical stress. 
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(c) Periods of cultural distortion. As increas- 
ing numbers of individuals, singly and in small 
groups, find their situation both intolerable and 
without hope of relief by the use of available, cul- 
turally sanctioned means, they turn to idiosyncratic 
or systematically deviant means. This period of 
anomie (Merton 1949) leads to the distortion of 
the cultural fabric by the institutionalization of such 
socially dysfunctional customs as drug and alco- 
holic addictions, organized crime, excessive cor- 
ruption of officials, mob violence, sabotage, and 
vandalism, etc. 

(2) Movement phase. 

(d) Prophetic formulation of a code. A proph- 
et formulates a code, frequently (in religious 
movements) as a result of a vision in which he is 
instructed by supernatural beings and in which he 
and his people are promised salvation if the instruc- 
tions are followed. The code defines what is wrong 
with the existing culture, delineates a goal that is 
described as better than the existing culture (if 
not utopian), and outlines a cultural transfer, by 
the use of which the people can move from the bad 
existing culture to the good future culture. 

(e) Communication. The prophet preaches 
his revelation to the people, promising salvation to 
the convert and to the society if his code is accepted. 

(f) Organization. Special disciples and then 
mass followers join the prophet. As the number of 
members in the group increases and as the com- 
plexity of the mission grows, a division of labor 
develops. Different disciples take over the respon- 
sibility for various aspects of the movemenťs 
activities, 

(g) Adaptation. The movement will encoun- 
ter resistance from vested interests. These must be 
either defeated in political or military combat or 
converted; sometimes conversion is accomplished 
by making modifications in the code that will re- 
move the fears of the reluctant. 

(h) Cultural transformation. As the whole, 
or a controlling portion, of the population comes 
to accept the new code, the system of cultural 
transfer, and perhaps even the goal culture, is 
instituted. 

(3) Postmovement phase. 

(i) Routinization. Once the cultural transfor- 
mation has been accomplished, or is well under 
way, the organizational structure is divested of 
executive control of many spheres of the culture 
and contracts, maintaining responsibility only for 
the maintenance of doctrine and for the perform- 
ance of ritual. It thus ceases to be a movement and 
becomes, in effect, a church or a political party. 

(J) Steady state. Once the cultural transfor- 
mation has been accomplished and the movement’s 


organization has been routinized in its activities, a 
new steady state may be said to exist. Even if the 
professed aim of the movement was revivalistic, 
this new state will almost certainly be different 
from the initial steady state. Now the cycle is ready 
to begin again. 


History of the concept 

Anthropological interest in revivalistic and nativ- 
istic movements can be considered to have begun 
with the work of Lewis Henry Morgan. In his book 
League of the Iroquois (1851), which is considered 
to be the first systematic ethnography of a prim- 
itive people, Morgan devoted considerable space to 
a study of the New Religion of Handsome Lake. 
This was a religious movement, beginning in 1799, 
that was only mildly nativistic and even more 
mildly revivalistic, but it did represent an effort to 
rebuild a healthy way of life among the reservation 
Iroquois of New York state. Following Morgan's 
work, the next major contribution was James 
Mooney’s study of the Ghost Dance among the 
Plains Indians (Mooney 1896). The Ghost Dance 
was both enthusiastically revivalistic and vehe- 
mently nativistic, 

Comparable phenomena were soon reported by 
anthropologists working in other parts of the world. 
In 1923 Williams published his study of the Vailala 
Madness, a cargo cult among a native people in 
New Guinea. Later workers have described other 
cargo cults in the Melanesian area and in general 
found them, ds Williams found the Vailala Mad- 
ness, to be revivalistic in native theory (in that the 
followers of the cult believe that they are restoring 
a golden age in which they will be reunited with 
their ancestors) and to be nativistic in social policy 
(in that whites are to be driven away). But inas- 
much as the “cargo” is composed principally of 
European goods, and native goods and rituals are 
abandoned, both the nativistic and revivalistic as- 
pects of cargo cults are qualified by a strong motive 
toward acculturation. 

By the 1940s, it was possible to recognize that 
what were then generally called “nativistic move- 
ments” occurred within almost all of the areas of 
primitive culture known to anthropologists, among 
North and South American Indians, African Ne- 
groes, the peoples of the Pacific, and the tribal 
peoples of Europe and Asia as well. (The only 
major culture area in which revitalization move- 
ments are not known to have occurred on a wide 
scale is the aboriginal culture area of Australia 
and Tasmania.) In 1943 Ralph Linton published a 
brief paper on nativistic movements that served to 
establish the phenomenon as a special topic in 
anthropological studies of culture change. 


Because communication among the various so- 
cial science disciplines has been fitful, anthropo- 
logical workers did not at first take full cognizance 
of studies of comparable movements by sociologists, 
historians, Biblical scholars, classicists, and classi- 
cal archeologists. Sociologists have been concerned 
with such contemporary social movements in com- 
plex societies as the Father Divine cult in urban 
areas of the United States; historians have dealt 
with millenarian movements and utopian com- 
munities of the past, with major anti-Western 
movements in Africa (e.g., the Mahdi in Sudan) 
and in Asia (e.g., the Taiping Rebellion), and with 
the origins of the great religious and political move- 
ments in both Western and Oriental traditions; 
Biblical scholars have studied the origin of Chris- 
tianity as a social movement; and classicists and 
classical archeologists have investigated the new 
religion of Ikhnaton in ancient Egypt and the pre- 
Christian Essene cult memorialized by the Dead 
Sea scrolls. These materials, together with anthro- 
pological observations, now make possible sophis- 
ticated field and historical studies as well as 
generalizations about the phenomenon of rapid 
culture change. 


Schools of thought 


Concerning the conditions under which these 
movements arise, there would seem to be four 
major schools of thought. 

The absolute deprivation theory. Perhaps the 
most common, and least sophisticated, theory is 
the view that absolute deprivation, in the sense of 
a low material standard of living, leads to dissatis- 
faction with the status quo and eventually to the 
adoption of a revolutionary ideology. This view- 
point, in political application, leads to a “bread and 
circuses” theory of social control. Mere material 
deprivation, however, does not inevitably prompt 
the deprived to revolt: on the one hand, under 
some circumstances, such as war, both civilian and 
military personnel may maintain high morale while 
hungry, cold, and uncomfortable; on the other 
hand, the response to extreme deprivation, as in 
concentration camps, may be profound apathy, de- 
pendence, and suggestibility. Furthermore, it may 
be empirically observed that social movements 
sometimes occur not in the least fortunate class 
or nation, nor in the most, but in a middle station. 

The acculturation theory. Many nativistic and 
revivalistic movements have occurred among tribal 
peoples in contact with European civilization. This 
has sometimes led to the impression that accultura- 
tion pressure leads directly to fanatical social move- 
ments. This pressure, it is implied, produces a state 
of “culture shock,” in which the tribal people ex- 
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periences a sort of collective hysterical syndrome 
characterized by ecstatic, but unrealistic, commit- 
ment to a utopian social movement. The nature of 
the trauma inflicted by the higher civilization upon 
the lower may be conceived as the imposition of a 
competitive way of life, or as the requirement of 
an unfamiliar pattern of culture, or simply as in- 
terference with tradition. The code of the move- 
ment will, it is argued, represent some sort of 
compromise between withdrawal from, and ap- 
proach to, the higher civilization, Although this 
viewpoint has merit as far as it goes, its relevance 
appears to be restricted largely to tribal populations 
in culture-contact situations; and even here it is 
only a partial explanation, since it fails to account 
for many kinds of social responses to acculturation 
pressure which are not fanatical, hysterical, or un- 
realistic at all, such as rationalistic political and 
economic movements or wholesale adoption of 
major portions of the higher culture, 

The social evolutionary theory. Scholars work- 
ing in the tradition of social criticism and analysis 
founded by Karl Marx have pointed out that in 
many revivalistic and nativistic movements it is 
possible to discern the expression of social protest 
by disadvantaged classes or groups. Furthermore, 
the historical significance in a program of social 
evolution of a given kind of movement may be 
defined by Marxian theory. Thus, millenarian move- 
ments in early modern European history and the 
Taiping Rebellion in nineteenth-century China have 
been interpreted as premature popular protests 
against oppressive social conditions, and the cargo 
cult in Melanesia as an early, naive, supernatural- 
istic effort to overturn a social order that can be 
effectively challenged only by a more rational rev- 
olution. As Worsley (1957) has indicated, the or- 
ganization and tactics of even primitive social 
movements can be usefully analyzed by applying 
the Marxist model of revolutionary procedure. The 
rigid application of a class-revolutionary model in 
an evolutionary perspective, however, can obscure 
both general sociopsychological principles and also 
particular local and temporal conditions. 

The relative deprivation theory. The most gen- 
erally acceptable theory of revivalistic, nativistic, 
and other types of revitalization movement would 
seem to be one that recognizes, on the one hand, 
the influence of local and temporal circumstances 
and, on the other, the effect of a situation of in- 
creasing discrepancy between level of aspiration 
and level of realization. This theory has been called 
the “relative deprivation theory” (Aberle 1962). 
According to this view, the content of the move- 
ment, as expressed in the code promulgated by 
the prophet or other leaders, will be determined by 
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the cultural materials locally available at the time, 
including in particular the myth-dream, the tradi- 
tional customs of the society, and the customs of 
the society that may be exerting the acculturation 
pressure, The occurrence or nonoccurrence and 
timing of the movement will be determined by the 
degree of disillusionment of a significant number 
of members of the society with the way of life now 
available to them. This disillusionment must be 
based on an awareness of extreme discrepancy be- 
tween some available image of the good life and 
the prevailing image of life as it is. The good life 
will be conceived as the life of a happier past era 
if present circumstances contrast unfavorably with 
nostalgic memory, or it will be conceived as the life 
of another group (a higher class, an acculturating 
alien society, or a foreign nation). The good life, 
however, generally is defined not only as a mate- 
tially more comfortable existence but also as a life 
with self-respect and the respect of significant 
others. The precise moment at which the movement 
crystallizes is difficult to predict, and even the con- 
tent is not easy to foretell in detail, because these 
features will be heavily determined by the knowl- 
edge, personality, and circumstances of the prophet 
and other leaders of the movement. Paradoxically, 
individual variability in society thus plays a crucial 
role in determining the nature and timing of a 
movement whose motivation derives from wide- 
spread social and cultural conditions. 


Current status of research 


There now exists a considerable literature de- 
scribing nativistic, revivalistic, and similar kinds 
of movements, and, in view of the commonness of 
the phenomenon, there will be no dearth of future 
Studies, Comparative analysis and theory, however, 
are slight in quantity and uneven in quality. Be- 
cause there is no general consensus concerning the 
classification of movements, it is difficult to relate 
the definitive characteristics of the various types of 
movements to other classes of phenomena. Thus, 
a principal research need is the establishment of 
a taxonomy of social movements, Even though 
Such a taxonomy will very likely require modifica- 
tion as experience with its use reveals inadequacies, 
it will serve to focus research problems more effec- 
tively than in the present system, which is a com- 
bination of culture area types (e.g., Melanesian 
cargo cults, South American terre sans mal mi- 
grations, Judaeo-Christian Messianic movements), 
and nonindependent general attributes (e.g., reviv- 
alism, nativism, millenarism). 

The technical problem in establishing a reason- 
able classification of revitalization movements is 
the same as in most areas of the social sciences: 


there are so many variables of interest that 
inclusion of all of them in one taxonomic mai 
generates a very large number of classes, each de- 
fined by a large number of criteria. Thus the in- 
vestigator is forced to choose between a simple 
rule-of-thumb typology convenient to his own inter- 
ests but of little value as an absolute classification, 
or a complex classification too cumbersome for 
convenient use as a typology and too demanding 
for general use as a guide to observation, Until this 
problem is solved, this area of investigation, like 
many others in the social sciences, must remain 
imprecise. 

Among the research questions of particular in- 
terest in the current state of understanding, the 
following may be mentioned. 

(1) Psychological processes. It has been often 
observed that new codes formulated by prophets 
have a paranoid quality and that in the case of 
religious movements they often occur in hallucina- 
tory visions. To what extent do “psychopathologi- 
cal” processes actually occur in prophets in the 
genesis and development of such movements? In 
particular, to what extent and under what condi- 
tions are the prophets and leaders of the movement 
necessarily dependent upon paranoid and hal- 
lucinogenic modes of thought? It has also been 
observed that the behavior of followers has a sug- 
gestible, even hysterical, quality. Is the hysterical 
conversion process a necessary feature of success- 
ful social movement? 

(2) Cultural processes. Successful revitalization 
movements are able to induce coordinated change 
in large areas of culture in a short period of time. 
To what extent does evolutionary culture change 
depend upon this rapid process, as opposed to 
slower processes of invention, diffusion, and ac- 
culturation? What is the relative importance of 
cultural syncretism and of radical innovation? 

(3) Social processes. The social structure of 
revitalization movements is often inadequately de- 
scribed. What are the relationships among the 
members of the movement, and between them and 
nonmembers (including both potential converts 
and enemies)? To what extent are these relation- 
ships determined by available cultural models, and 
to what extent are they developed in response to 
the exigencies of the movement itself? 


Examples 


Finally, two examples of revivalistic and nativ- 
istic movements may suffice to convey the quality 
of these phenomena. 

The Ghost Dance of 1890, Among the Plains 
Indians of North America during the last quarter 
of the nineteenth century, a religious movement 


emerged under the leadership of the prophet 
Wovoka (Mooney 1896). The central tenets of the 
prophet’s teaching were both revivalistic and nativ- 
istic: the existing world was soon to be destroyed 
by fire, flood, and upheaval, and in the holocaust 
the intruding whites would be annihilated, and 
along with them those Indians who followed their 
ways. Salvation, however, was assured for those 
who participated in certain prescribed dances, who 
led pure lives, and who abandoned white customs 
and returned to Indian ways. The new world, which 
was to follow the apocalypse, would see America 
returned to the surviving “Ghost Dance” followers 
and to their Indian ancestors, who would lead to- 
gether lives of virtue and happiness, The Ghost 
Dance swept across the Plains and into neighboring 
culture areas, with various modifications from place 
to place. Although the Ghost Dancers planned no 
violence, American frontier settlers and military 
authorities feared an armed uprising. These sus- 
picions led directly to the notorious massacre at 
Wounded Knee, in which U.S. troops killed a band 
of Sioux Indians who had fled their reservation 
after the “medicine man,” Sitting Bull, was shot 
while resisting arrest. Indian interest in the Ghost 
Dance subsided gradually in the face of white hos- 
tility and the failure of the millennium to arrive. 

The Paliau movement in the Admiralty Islands. 
Following World War m, the Manus, a coastal 
people residing near New Guinea in the Austral- 
ian Trust Territory, developed a new religious, 
political, and economic system under the leadership 
of a secular prophet named Paliau (Schwartz 
1962), The Paliau movement was not a cargo 
cult and did not aim at the categorical expulsion 
of all whites. It was nativistic, however, to the 
extent that the movement sought to secure for 
Melanesians a greater degree of economic and po- 
litical sovereignty than they had hitherto enjoyed 
under either the Japanese or Australian administra- 
tions. The “New Fella Fashion,” as the movement 
was termed in pidgin, dispensed with many of the 
traditional religious beliefs and observances, reor- 
ganized the settlement pattern and economy, and 
proposed new standards of family organization, 
political structure, and economic activity. In effect, 
the Paliau movement was an effort by a native 
population to bring itself into the main stream of 
world cultural development by a rationally con- 
ceived reorganization of its entire culture. Although 
Australian authorities were suspicious of Paliau, 
and his movement suffered administrative setbacks, 
and although a short-lived cargo cult temporarily 
interrupted its progress, the Paliau movement has 
survived and contributed substantially to the fur- 
therance of the general aims of the Trust Territory 


NATIVISM AND REVIVALISM 79 


itself; the development of self-respecting, self- 
supporting, and self-governing native communities 
linked to the rest of the modern world by mutually 
satisfying economic and political ties (Mead 1956). 


ANTHONY F. C. WALLACE 


[See also MILLENARISM; SECTS AND CULTS; SOCIAL 
MOVEMENTS; and the biography of MooNEY.] 
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NATURAL LAW 


Natural law, which was for many centuries the 
basis of the predominant Western political thought, 
is rejected in our time by almost all students of 
society who are not Roman Catholics. It is rejected 
chiefly on two different grounds. Each of these 
grounds corresponds to one of the two schools of 
thought which are predominant today in the West, 
positivism and historicism. According to positivism, 
genuine knowledge is scientific knowledge; scien- 
tific knowledge can never validate value judg- 
ments; and all statements asserting natural law 
are value judgments. According to historicism, 
science (i.e., modern science) is but one historical, 
contingent form of man’s understanding of the 
world; all such forms depend on a specific Weltan- 
schauung; in every Weltanschauung the “cate- 
gories” of theoretical understanding and the basic 
“values” are inseparable. Hence the separation of 
factual judgments from value judgments is in 
principle untenable; since every notion of good and 
right belongs to a specific Weltanschauung, there 
cannot be a natural law binding man as man. 
Given the preponderance of positivism and his- 
toricism, natural law is today primarily a historical 
subject. 

By “natural law” is meant a law that determines 
what is right and wrong and that has power or is 
valid by nature, inherently, hence everywhere and 
always. Natural law is a “higher law,” but not every 
higher law is natural. The famous verses in Sopho- 
cles’ Antigone (449-460) in which the heroine ap- 
peals from the man-made law to a higher law do 


not necessarily point to a natural law; they may 
point to a law established by the gods, or what in 
later parlance is called a positive divine law. The 
notion of natural law presupposes the notion of 
nature, and the notion of nature is not coeval with 
human thought; hence there is no natural law 
teaching, for instance, in the Old Testament. 
Nature was discovered by the Greeks in contradis- 
tinction to art (the knowledge guiding the making 
of artifacts) and, above all, to nomos (law, custom, 
convention, agreement, authoritative opinion). In 
the light of the original meaning of “nature,” the 
notion of “natural law” (vópos rijs bforeos ) is a con- 
tradiction in terms rather than a matter of course. 

The primary question is less concerned with 
natural law than with natural right, i.e., what is 
by nature right or just. Is all right conventional 
(of human origin), or is there some right which is 
natural (dvcet dixatov)? This question was raised 
on the assumption that there are things which are 
by nature good (health, strength, intelligence, cour- 
age, etc.). Conventionalism (the view that all 
right is conventional) derived its support in the 
first place from the variety of notions of justice, a 
variety incompatible with the supposed uniformity 
of a right that is natural. Yet the conventionalists 
could not deny that justice possesses a core that is 
universally recognized, so much so that injustice 
must have recourse to lies or to “myths” in order 
to become publicly defensible, 

The precise issue then concerned the status of 
that right which is universally recognized: is that 
right merely the condition of the living together of 
a particular society (i.e., of a society constituted 
by covenant or agreement, with that right deriving 
its validity from the preceding covenant), or is 
there a justice among men as men which does not 
derive from any human arrangement? In other 
words, is justice based only on calculation of the 
advantage of living together, or is it choiceworthy 
for its own sake and therefore “by nature”? The 
two possible answers were given prior to Socrates. 
For our knowledge of the thought of the pre- 
Socratic philosophers, however, we depend entirely 
on fragments of their writings and on reports by 
later thinkers. 


Plato 


Socrates’ disciple Plato is the first philosopher 
whose writings proper have come down to us [see 
Pato]. While Plato cannot be said to have set 
forth a teaching of natural law (cf. Gorgias 483E 
and Timaeus 83£), there can be no doubt that he 
opposed conventionalism; he asserts that there is a 
natural right, i.e., something which is by nature 


just. The naturally just or right is the “idea” of 
justice (Republic 5018; also see 500c, D; 484c, D), 
justice itself, justice pure and simple. “Justice” 
is defined as doing one’s own business or, rather, 
doing one’s own business “in a certain manner,” 
i.e., “well” (433A, B; 443D). A man (or, rather, his 
soul) or a city is just if each of its parts does its 
work well and thus the whole is healthy; a soul or 
a city is just if it is healthy or in good order (cf. 
444p, E). The soul is in good order if each of its 
three parts (reason, spiritedness, desire) has ac- 
quired its specific virtue or perfection, and as a 
consequence of this the individual is well ordered 
toward his fellow men and especially his fellow 
citizens. The individual is well ordered toward his 
fellow citizens if he assigns to each what is in- 
trinsically good for him and, hence, what is intrin- 
sically good for the city as a whole. From this it 
follows that only the wise man or the philosopher 
can be truly just. 

There is a natural order of the virtues and the 
other good things; this natural order is the stand- 
ard for legislation (Laws 6318, D). One may there- 
fore say that the natural right in Plato’s sense is in 
the first place the natural order of the virtues as the 
natural perfections of the human soul (cf. Laws 
765E-766A), as well as the natural order of the 
other things that are by nature good. But assigning 
to each what is good for him by nature is impos- 
sible in any society. Such assigning requires that 
the men who know what is by nature good for each 
and all, the philosophers, be the absolute rulers 
and that absolute communism (communism re- 
garding property, women, and children) be estab- 
lished among those citizens who give the common- 
wealth its character; it also requires equality of the 
sexes. This order is the political order according 
to nature, as distinguished from and opposed to the 
conventional order (Republic 4568, C; cf. 4285). 
Thus natural right in Plato’s sense also determines 
the best regime, in which those who are best by 
nature and training, the wise men, rule the unwise 
with absolute power, assigning to each of them 
what is by nature just, i.e., what is by nature good 
for him. The actualization of the best regime proves 
indeed to be impossible or at least extremely im- 
probable; only a diluted version of that political 
order which strictly corresponds to natural right 
can in reason be expected. 

The establishment of the best regime is ob- 
structed in the last analysis by the body, the only 
thing that is by nature private (Laws 739c; Re- 
public 464D), or wholly incapable of being com- 
mon. Accordingly, sheer bodily (brachial) force 
must be recognized as having a natural title to rule, 
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a title indeed inferior to that deriving from wis- 
dom but not destroyed by it (Laws 690A, Cc). Po- 
litical society requires the dilution of the perfect 
and exact right, of natural right proper: of the 
right in accordance with which the wise would 
assign to everyone what he deserves according to 
his virtue and therefore would assign unequal 
things to unequal people. The principle governing 
the dilution is consent, i.e., the democratic prin- 
ciple of simple equality, according to which every 
citizen possesses the same title to rule as every 
other (Laws 756£-758A). Consent requires free- 
dom under law. Freedom here means both the par- 
ticipation in political rule of those unwise men who 
are capable of acquiring common or political vir- 
tue, and their possessing private property. Law can 
never be more than an approximation to the ver- 
dicts of wisdom, yet it is sufficient to delineate the 
requirements of common or political virtue, as well 
as the rules of property, marriage, and the like. 


Aristotle 


It is in accordance with the general character of 
Aristotle's philosophy that his teaching regarding 
natural right is much closer to the ordinary under- 
standing of justice than is Plato’s [see ARISTOTLE]. 
In his Rhetoric he speaks of “the law according to 
nature” as the unchangeable law common to all 
men, but it is not entirely certain that he takes that 
law to be more than something generally admitted 
and hence useful in forensic rhetoric, At least two 
of his three examples of natural law do not agree 
with what he himself regarded as naturally right 
(Rhetoric 1373b4—18). In the Nicomachean Ethics 
(1134b18-1135a5) he speaks not, indeed, of natu- 
ral law but of natural right. Natural right is that 
right which has everywhere the same power and 
does not owe its validity to human enactment. 
Aristotle does not give a single explicit example; 
but he seems to imply that such things as helping 
fellow citizens who are victims of misfortune re- 
sulting from the performance of a civic duty, and 
worshiping the gods by sacrifices, belong to nat- 
ural right. If this interpretation is correct, natu- 
ral right is that right which must be recognized 
by any political society if it is to last and which for 
this reason is everywhere in force. Natural right 
thus understood delineates the minimum conditions 
of political life, so much so that sound positive 
right occupies a higher rank than natural right. 
Natural right in this sense is indifferent to the 
difference among regimes, whereas positive right 
is relative to the type of regime—positive right is 
democratic, oligarchic, etc. (cf. Politics 1280a8— 
22). “Yet,” Aristotle concludes his laconic state- 


82 NATURAL LAW 


ment on natural right, “one regime alone is by 
nature the best everywhere.” This regime, “the most 
divine regime,” is a certain kind of kingship, the 
only regime that does not require any positive right 
(Politics 1284a4-15; 1288a15-29). The flooring 
and the ceiling, the minimum condition and the 
maximum possibility of political society, are natu- 
ral and do not in any way depend on (positive) 
law. 

Aristotle does not explicitly link his teaching 
regarding natural right with his teaching regarding 
commutative and distributive justice, but the prin- 
ciples of commutative and distributive justice can- 
not possibly belong to merely positive right. Com- 
mutative justice is the kind of justice which obtains 
in all kinds of exchange of goods and services (it 
therefore includes such principles as the just price 
and the fair wage) as well as in punishment; dis- 
tributive justice has its place above all in the 
assignment of political honors or offices. Natural 
right understood in terms of commutative and 
distributive justice is not identical with natural 
right as delineating the minimum conditions of 
political life: the bad regimes habitually counteract 
the principles of distributive justice and last never- 
theless. Aristotle is no longer under a compulsion 
to demand the dilution of natural right. He teaches 
that all natural right is changeable; he does not 
make the distinction made by Thomas Aquinas 
between the unchangeable principles and the 
changeable conclusions, This would seem to mean 
that sometimes, in extreme or emergency situa- 
tions, it is just to deviate even from the most 
general principles of natural right. 


Stoicism 


Natural law becomes a philosophic theme for 
the first time in Stoicism. It there becomes the 
theme not primarily of moral or political philosophy 
but of physics (the science of the universe). The 
natural (or divine or eternal) law is identified with 
God, the highest god (fire, ether, or air), or his 
reason, i.e., with the ordering principle that per- 
vades and thus governs the whole by molding 
eternal matter. Rational beings can know that law 
and knowingly comply with it insofar as it applies 
to their conduct. In this application natural law 
directs man toward his perfection, the perfection 
of a rational and social animal; it is “the guide of 
life and the teacher of the duties” (Cicero, On the 
Nature of the Gods 1, 40); it is the dictate of reason 
regarding human life. Thus the virtuous life as 
choiceworthy for its own sake comes to be under- 
stood as compliance with natural law—with a law, 
and hence as a life of obedience. 


Inversely, the content of natural law is the whole 
of virtue. The virtuous life as the Stoics understood 
it is, however, not identical with the life of moral 
virtue (as distinguished from the life of contem- 
plation), for one of the four cardinal virtues is 
wisdom that is above all theoretical wisdom; the 
virtuous man is the wise man or the philosopher. 
One is tempted to say that the Stoics treat the study 
of philosophy as if it were a moral virtue, i.e., as 
something which could be demanded from most 
men. Justice, another of the four virtues, consists 
primarily in doing what is by nature right. The 
foundation of right is man’s natural inclination 
to love his fellow men, not merely his fellow citi- 
zens: there is a natural society comprising all men 
(as well as all gods). The inclination toward the 
universal society is perfectly compatible with the 
equally natural inclination toward political society, 
which is of necessity a particular society. The un- 
changeable and universally valid natural law—a 
part of which determines natural right, i.e., that 
with which justice, in contradistinction to wisdom, 
courage, and temperance, is concerned—is the 
ground of all positive law; positive laws contradict- 
ing natural law are not valid. 

It is sometimes asserted that the Stoics differ 
from Plato and Aristotle by being egalitarians. 
Differing from Aristotle (but not from Plato), they 
denied that there are slaves by nature; but this 
does not prove that according to them all men are 
by nature equal in the decisive respect, i.e., as 
regards the possibility of becoming wise or virtuous 
(Cicero, On the Ends of the Good and Bad Things 
Iv, 56). The peculiarity of the Stoics, in contradis- 
tinction to Plato and Aristotle, that explains why 
the Stoics were the first philosophers to assert 
unambiguously the existence of natural law would 
seem to be the fact that they teach in a much less 
ambiguous way than Plato, to say nothing of Aris- 
totle, the existence of a divine providence that 
supplies divine sanctions for the compliance or 
noncompliance with the requirements of virtue. 
(Cf. Cicero, Laws u, 15-17; Republic mm, 33-34.) 

The Stoic natural law teaching is the basic 
stratum of the natural law tradition. It affected 
Roman law to some extent. With important modifi- 
cations it became an ingredient of the Christian 
doctrine. 


Christian teaching 


The Christian natural law teaching reached its 
theoretical perfection in the work of Thomas 
Aquinas [see Aquinas]. It goes without saying that 
in the Christian version, Stoic corporealism (“ma- 
terialism”) is abandoned. While natural law retains 


its status as rational, it is treated within the con- 
text of Christian (revealed) theology. The precise 
context within which Thomas treats natural law 
is that of the principles of human action; these 
principles are intrinsic (the virtues or vices) or 
extrinsic; the extrinsic principle moving men to- 
ward the good is God, who instructs men by law 
and assists them by his grace. Natural law is clearly 
distinguished from the eternal law—God himself 
or the principle of his governance of all creatures 
—on the one hand, and the divine law, i.e., the 
positive law contained in the Bible, on the other. 
The eternal law is the ground of the natural law, 
and natural law must be supplemented by the 
divine law if man is to reach eternal felicity and if 
no evil is to remain unpunished. All creatures par- 
ticipate in the eternal law insofar as they possess, 
by virtue of divine providence, inclinations toward 
their proper acts and ends. Rational beings partici- 
pate in divine providence in a more excellent man- 
ner because they can exercise some providence for 
themselves; they can know the ends toward which 
they are by nature inclined as good and direct 
themselves toward them, Man is by nature inclined 
toward a variety of ends which possess a natural 
order; they ascend from self-preservation and pro- 
creation via life in society toward knowledge of 
God. Natural law directs men’s action toward those 
ends by commands and prohibitions. 

Differently stated, as a rational being man is by 
nature inclined toward acting according to reason; 
acting according to reason is acting virtuously; 
natural law prescribes, therefore, the acts of virtue. 
Man by nature possesses knowledge of the first 
principles of natural law, which are universally 
valid or unchangeable, Owing to the contingent 
character of human actions, however, those con- 
clusions from the principles which are somewhat 
remote possess neither the evidence nor the univer- 
sality of the principles themselves; this fact alone 
would require that natural law be supplemented 
by human law. A human law that disagrees with 
natural law does not have the force of law (Summa 
theologica 1, 2, 90 ff.). All moral precepts of the 
Old Testament (as distinguished from its cere- 
monial and judicial precepts) can be reduced to 
the Decalogue; they belong to the natural law. This 
is true in the strictest sense of the precepts of the 
Second Table of the Decalogue, i.e., the seven 
commandments which order men’s relations among 
themselves (Exodus 20.12-17). The precepts in 
question are intelligible as self-evident even to the 
people and are at the same time valid without 
exception; compliance with them does not require 
the habit of virtue (Swmma theologica 1, 2, 100). 
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A sufficient sanction is supplied by divine pun- 
ishment for transgressions of the natural law, but 
it is not entirely clear whether human reason can 
establish the fact of such punishment; Thomas 
surely rejects the Gnostic assertion that God does 
not punish and the assertion of certain Islamic 
Aristotelians that the only divine punishment is 
the loss of eternal felicity. He does say that sin is 
considered by the theologians chiefly insofar as it 
is an offense against God, whereas the moral phi- 
losophers consider sin chiefly insofar as it is op- 
posed to reason. These thoughts could lead to the 
view of some later writers that natural law strictly 
understood is natural reason itself, i.e., natural law 
does not command and forbid but only “indicates”; 
natural law thus understood would be possible even 
if there were no God (cf. Suárez, Tractatus de 
legibus ac de Deo legislatore 1, 6, sec. 3; Grotius, 
De jure belli ac pacis, Prolegomena, sec. 11; 
Hobbes, Leviathan, chapter 15-end; Locke, Trea- 
tises of Civil Government 11, sec. 6; Leibniz, Théo- 
dicée, sec. 183). 

Thomas treats natural right (as distinguished 
from natural law) in his discussion of justice as a 
special virtue (Summa theologica 11, 2, 57). There- 
in he is confronted with the task of reconciling 
with the Aristotelian teaching the Roman law dis- 
tinction between ius naturale and ius gentium, 
according to which natural right deals only with 
things common to all animals (like procreation 
and the raising of offspring), whereas the ius 
gentium is particularly human. The Roman law 
distinction might seem to reflect early convention- 
alist teaching (cf. Democritus, fr. 278), Thomas’ 
reconciliation apparently paved the way for the 
conception of “the state of nature” as a status 
antedating human society. (Cf. Suárez, Tractatus 
ur, 18, sec, 4.) 

The Thomistic natural law teaching, which is 
the classic form of natural law teaching, was al- 
ready contested in the Middle Ages on various 
grounds. According to Duns Scotus, only the com- 
mandment to love God—or, rather, the prohibition 
against hating God—belongs to natural law in the 
strictest sense. According to Marsilius of Padua, 
natural right as Aristotle meant it is that part of 
positive right which is recognized and observed 
everywhere (divine worship, honoring of parents, 
raising of offspring, etc.); it can only metaphori- 
cally be called natural right [see MARSILIUS OF 
Papua]. The dictates of right reason regarding the 
things to be done (i.e., natural law in the Thomis- 
tic sense), on the other hand, are not as such 
universally valid because they are not universally 
known and observed. 
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Modern developments 

Natural law acquired its greatest visible power 
in modern times: in both the American and the 
French revolutions, solemn state papers appealed 
to natural law. The change in effectiveness was 
connected with a substantive change; modern natu- 
ral law differs essentially from premodern natural 
law. Premodern natural law continued to be power- 
ful; but it was adapted to modern natural law, with 
varying degrees of awareness of what was involved 
in that adaptation. The most striking characteristics 
of modern natural law are these: (1) Natural law 
is treated independently, i.e., no longer in the con- 
text of theology or of positive law. Special chairs 
for natural law were established in some Protestant 
countries; treatises on natural law took on the 
form of codes of natural law. The independent 
treatment of natural law was made possible by the 
belief that natural law can be treated “geometri- 
cally,” i.e., that the conclusions possess the same 
certainty as the principles. (2) Natural law became 
more and more natural public law; Hobbes's doc- 
trine of sovereignty, Locke’s doctrine of “no taxa- 
tion without representation,” and Rousseau’s doc- 
trine of the general will are not simply political 
but legal doctrines. They belong to natural public 
law; they do not declare what the best political 
order is, which by its nature is not realizable except 
under very favorable conditions, but they state the 
conditions of legitimacy which obtain regardless 
of place and time. (3) Natural law by itself is 
supposed to be at home in the state of nature, i.e., 
a state antedating civil society. (4) In the modern 
development “natural law” is replaced by “the rights 
of man”; the emphasis shifts from man’s duties to 
his rights. (5) Whereas premodern natural law 
was on the whole “conservative,” modern natural 
law is essentially “revolutionary.” The radical dif- 
ference between modern and premodern natural 
law appears most clearly if one studies the still- 
remembered great modern natural law teachers 
rather than the university professors who as a rule 
rest satisfied with compromises. 

The principles informing modern natural law 
were established by two thinkers who were not 
themselves natural law teachers, Machiavelli and 
Descartes. According to Machiavelli, the traditional 
political doctrines take their bearings by how men 
should live and thus culminate in the description 
of imaginary commonwealths (“utopias”), which 
are useless in practice: one ought to start from 
how men do live. Descartes begins his revolution 
with the universal doubt, which leads to the dis- 
covery of the Ego and its “ideas” as the absolute 


basis of knowledge and to a mathematical—mechan- 
ical account of the universe as a mere object of 
man’s knowledge and exploitation. 

Modern natural law as originated by Hobbes did 
not start, as traditional natural law did, from the 
hierarchic order of man’s natural ends, but rather 
from the lowest of those ends (self-preservation) 
that could be thought to be more effective than the 
higher ends [see Hoppes]. (A civil society ulti- 
mately based on nothing but the right of self- 
preservation would not be utopian.) Man is still 


„asserted to be the rational animal, but his natural 


sociality is denied. Man is not by nature ordered 
toward society, but he orders himself toward it 
prompted by mere calculation. This view in itself 
is very old, but now it is animated by the concern 
for a natural-right basis of civil society. The desire 
for self-preservation has the character of a passion 
rather than of a natural inclination; the fact that 
it is the most powerful passion makes it the suffi- 
cient basis of all rights and duties. Natural law, 
which dictates men’s duties, is derived from the 
natural right of self-preservation. The right is abso- 
lute, while all duties are conditional. Since men are 
equal with regard to the desire for self-preservation 
as well as with regard to the power of killing 
others, all men are by nature equal. There is no 
natural hierarchy of men, so that the sovereign to 
whom all must submit for the sake of peace and 
ultimately of the self-preservation of each is under- 
stood as a “person,” i.e., as the representative or 
agent, of each; the primacy of the individual—of 
any individual—and of his natural right remain 
intact (cf. Leviathan, chapter 21). 

The doctrine of Locke may be described as the 
peak of modern natural law [see Locke]. At first 
glance it appears to be a compromise between the 
traditional and the Hobbesian doctrines. Agreeing 
with Hobbes, Locke denies that the natural law is 
imprinted in the minds of men, that it can be 
known from the consent of mankind, and that it 
can be known from men’s natural inclination. His 
deduction of natural law is generally admitted to 
be confusing—not to say confused—which does 
not prove, however, that Locke himself was con- 
fused. It seems to be safest to understand his doc- 
trine as a profound modification of the Hobbesian 
doctrine, 

It is certain that, unlike Hobbes, Locke sees the 
crucially important consequence of the natural 
right of self-preservation in the natural right of 
property, i.e., of acquiring property, a natural right 
that within civil society becomes the natural right 
of unlimited acquisition. Property is rightfully ac- 
quired primarily by labor; in civil society, however, 


labor ceases to be the title to property while remain- 
ing the source of all value. Locke’s natural law 
doctrine is the original form of capitalist theory. 

Rousseau too starts from the Hobbesian premise 
[see RoussEau]. Hobbes asserted that the natural 
right to judge the means of self-preservation is the 
necessary consequence of the right of self-preserva- 
tion itself and belongs, as does the fundamental 
right, equally to all men, wise or foolish. But 
Rousseau demands that the natural right to judge 
the means of self-preservation be preserved as an 
institution within civil society. Every person subject 
to the laws must as a natural right have a say in 
the making of the laws by being a member of the 
sovereign, i.e., of the legislative assembly. The cor- 
rective to folly is to be found above all in the char- 
acter of the laws in general, both in origin and in 
content: all subject to the laws determine what all 
must or may not do. The justice or rationality of 
the laws is thereby guaranteed in the only way 
compatible with the freedom and equality of all. 
In the society established in accordance with natu- 
ral right, there is no longer a need or a possibility 
of appealing from positive law to natural right, 
because the members or rulers of that society are 
not supposed to be just men. 

Rousseau further differed from Hobbes by real- 
izing that if man is by nature asocial, he is by 
nature arational; questioning the traditional view 
that man is the rational animal, he found the 
peculiarity of man in his perfectibility or, more 
generally stated, his malleability. This led to the 
conclusions that the human race is what we wish 
to make it and that human nature cannot supply 
us with guidance as to how man and human society 
ought to be. 

Kant drew the decisive conclusion from Rous- 
seau’s epoch-making innovations: the Ought can- 
not be derived from the Is, from human nature; 
the moral law is neither a natural law nor a deriva- 
tive of natural law [see Kant]. The criterion of the 
moral law is its form alone, the form of rationality, 
i.e., the form of universality. 

At about the same time that Kant, sympathizing 
with the French Revolution, radicalized the most 
radical form of modern natural right and thus 
transformed natural right and natural law into a 
law and a right which are rational but no longer 
natural, Burke, opposing the French Revolution 
and its theoretical basis, which is a certain version 
of modern natural right, returned to premodern 
natural law [see BuRKE]. In doing so, he made 
thematic the conservatism which was implicit to 
some extent in premodern natural law. Therewith 
he profoundly modified the premodern teaching 
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and prepared decisively the transition from the 
natural “rights of man” to the prescriptive “rights 
of Englishmen,” from natural law to “the historical 


school.” 
LEO STRAUSS 


[See also GENERAL WILL; NATURAL RIGHTS; SOCIAL 
contract. Other relevant material may be found 
under POLITICAL THEORY.] 
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NATURAL RIGHTS 


The doctrine of natural rights is properly to be 
understood as an aspect or feature of the modern 
doctrine of natural Jaw. Natural rights (plural) are 
to be carefully distinguished from that natural right 
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(singular) which is a central conception of classi- 
cal, premodern political philosophy. Both the pre- 
modern and modern teachings result in judgments 
that some things are naturally right, or right ac- 
cording to nature, and that these things are intrin- 
sically right, or right independently of opinion. 

In classical political philosophy “natural right” 
refers to the objective rightness of the right things, 
whether the virtue of a soul, the correctness of an 
action, or the excellence of a regime. Thus Aristotle 
says in Politics (1323a29-33) that no one would 
call a man happy who was completely lacking in 
courage, temperance, justice, or wisdom. A man 
who was easily frightened, unable to restrain any 
impulse toward food or drink, willing to ruin his 
friends for a trifle, and generally senseless could 
not possibly lead a good life. Even though chance 
may occasionally prevent good actions from having 
their normal consequences, so that sometimes 
cowards fare better than brave men, courage is still 
objectively better than cowardice. The virtues and 
actions that contribute to the good life, and the 
activities intrinsic to the good life, are naturally 
right. 

“Natural rights,” on the other hand, are the 
rights that all men possess, because of which they 
may be obligated to act, or to refrain from acting, 
in certain ways. According to the teaching devel- 
oped primarily by Hobbes and Locke, there are 
many natural rights, but all of them are inferences 
from one original right, the right that each man has 
to preserve his life. All other natural rights, like 
the right to liberty and the right to property, are 
necessary inferences from the right of self-preserva- 
tion, or are conceived as implicit in the exercise 
of that primary right. Similarly, the natural law 
founded upon natural rights consists of deductions 
made from the primary right and its implications. 
The sum of these deductions is the state of civil 
society. The doctrine of natural rights teaches pri- 
marily, then, that all obligation is derived from the 
right which every man has to preserve his own life. 
Conversely, it teaches that no man can be bound 
to regard as a duty whatever he regards as destruc- 
tive to the security of his life. Thus slavery is wrong 
because no one can reasonably be asked to place his 
life at the mercy of another, and not, as in classical 
natural right, only when it constitutes a wrongful 
appropriation of one man’s life and labor by an- 
other. 

From this point of view, what is intrinsically 
right is no longer what is required by, or what par- 
takes of, the good life; rather, it is what is subjec- 
tively regarded by the individual as necessary to his 
security. The individual, abstractly considered, be- 


comes the subject of rights, apart from any par- 
ticular qualities he may have. “All men are created 
equal” means, among other things, that the rights 
each individual possesses by nature are entirely 
independent of whether he is strong or weak, wise 
or foolish, virtuous or vicious. The premodern doc- 
trine of natural right, holding that men are obli- 
gated by what is required for their perfection or 
happiness, regarded the less intelligent and less 
virtuous as being naturally obligated to obey the 
more intelligent and more virtuous. This natural 
obligation was independent of the many prudent 
compromises that various circumstances might dic- 
tate—some of them very democratic compromises 
—by which the consent and loyalty of the less ex- 
cellent might be enlisted in the service of a regime. 
But classical natural right was inherently aristo- 
cratic in its tendency. The modern doctrine of 
natural rights makes every individual equally the 
source of legitimate authority. Moreover, it makes 
the people as a whole the judge of the legitimacy 
of the exercise of this authority. Thus, although the 
doctrine of natural rights may sanction other forms 
of government—including limited monarchy, as 
the Declaration of Independence indicates—it is 
inherently democratic in its tendency. Classical 
natural right is politically comprehensive, since 
there is virtually no aspect of human life which 
does not bear upon its quality. This is indicated by 
Aristotle’s saying that what the law does not com- 
mand, it forbids. The parallel modern maxim, ex- 
hibiting the far more limited scope of the modern 
state, holds that what the law does not forbid, it 
permits. 


State and polis 


The state erected upon the doctrine of natural 
rights tends in this way to be liberal or permissive. 
For the doctrine gives rise to the notion that there 
is a private sphere within which the activities of 
the individual, or at least those of his activities 
which do not affect the security of the equal rights 
of his fellow citizens, should be immune to public 
inquiry or public control. The activities of the state 
are thus directed toward providing security for life 
and for liberty—which are among the conditions 
of happiness—but not toward providing happiness 
itself. Each man is to be left free to seek this ac- 
cording to his own private opinion of what happi- 
ness is. It is for this reason that Jefferson names, 
not happiness, but the pursuit of happiness, as 
being among those rights for the sake of which man 
organizes civil society. 

Nothing better indicates the difference between 
the earlier and later doctrines than their attitudes 


toward religion. From the point of view of classical 
natural right, religion is one of the most important 
means by which men are directed toward virtue, 
and hence toward temporal no less than toward 
eternal felicity. Accordingly, religious institutions 
are among the most important political institutions. 
The point of view of the adherent of the modern 
natural rights school, on the other hand, was per- 
fectly expressed by Jefferson when he wrote, “The 
legitimate powers of government extend to such 
acts only as are injurious to others, But it does me 
no injury for my neighbor to say there are twenty 
gods, or no god. It neither picks my pocket nor 
breaks my leg.” 

The classical polis, or political community, may 
be defined as that community which includes all 
other communities but is itself included in none. 
It is the comprehensive form of human association, 
and its purposes ascend from the necessary condi- 
tions of human existence—the provision of mate- 
rial necessities and of security from all forms of 
violence—to the sufficient conditions. The latter 
include the formation of good character in the citi- 
zens, education in the liberal arts, and participation 
in politics and philosophy. These are the character- 
istic pursuits of gentlemen, and rule by gentlemen 
is the characteristic solution to the political prob- 
lem, according to classical natural right. The polis 
is a partnership in justice, but justice is essentially 
inferior to friendship. Friendship, writes Aristotle, 
seems to hold political communities together more 
than does justice, and legislators seem to care for it 
more than for justice. For when men are friends, 
they have no need of justice, but when they are 
just, they still have need of friends. This implies, 
among other things, that the polis, as distinct from 
the modern state, is a very small society. Its size is 
such that there is virtually no one among the citi- 
zens who cannot be either a friend, or a friend of a 
friend, of every other citizen. For this reason the 
ultimate sanctions for justice are not the penalties 
that can be exacted in the law courts but ostracism, 
formal or informal, from that fellowship in which 
alone the good citizen feels he can lead the good 
life. That is at least implied in Socrates’ apparent 
preference of death to exile, as expressed in Plato’s 
Crito. 

The modern state, erected upon the doctrine of 
natural rights, is in principle a large society, if not 
a mass society. The natural limits upon the size of 
the polis, within which classical natural right has 
its proper home, are determined by human ability 
to participate in a common good, by face-to-face 
relationships. The modern state, however, is founded 
upon the notion of a social contract and is held 
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together by the power of a sovereign authority to 
enforce the terms and consequences of that con- 
tract. Since the more powerful the sovereign is, the 
better he is able to perform his functions, and since 
increase in the size of the state generally adds to 
the power of the sovereign, the state thus has an 
inherent tendency to an almost indefinite expansion. 


Sovereignty 


Sovereignty, as the term has been used since 
Hobbes, differs radically in meaning from the cor- 
responding term in classical political philosophy, 
for the same reason that polis differs from “state.” 
In a polis, whoever actually governs—whether the 
people, the rich, the nobility, or a tyrant—is the 
sovereign, In the United States of America, how- 
ever, the governing officials are not the sovereign 
authority. The people of the United States is the 
sovereign, even though the people only acts through 
representatives. It is true that the logic of the notion 
of sovereignty would permit the people of the United 
States to transfer its authority to a hereditary 
monarch. Should it do so, however, the monarch 
would still represent the people, although the form 
of the representation would no longer be democratic 
or republican. 

The modern concept of sovereignty can be de- 
duced quite strictly from the proposition that all 
men are created equal. This proposition does not 
mean, as we have noted, that men are equal in 
virtue or intelligence, but that they are equal in 
certain rights. Each man has a natural right to 
preserve his life, and no man has a natural obliga- 
tion to defer to any other man, in deciding what 
does, and what does not, tend to his own preserva- 
tion, Government, accordingly, does not exist by 
nature. The state of nature is the state of men 
without government. In the state of nature, men’s 
rights are perfect, and they have no duties. The 
ground of sovereignty is the complete right that 
every man has to everything in the state of nature, 
a right which is unlimited because, every man being 
equal in authority to every other man, there is no 
one who can prescribe any limits to anyone else. 
There are limits in the state of nature to what a 
man may rightly intend to do, since he may not 
naturally or reasonably intend his own destruction. 
But these are limits implicit in the inclination to 
self-preservation, not limits upon what may be done 
from that inclination. 

For reasons sufficiently evident, life in the state 
of nature, as John Locke puts it, is full of incon- 
veniences or, in the more pungent language of 
Thomas Hobbes, it is nasty, brutish, and short. The 
remedy for the state of nature is the state of civil 
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society, and we must consider carefully how men 
as equal as those in the state of nature can thus 
transform their condition. They can do so by con- 
senting or agreeing, each with the other, that they 
will surrender the exercise of their unlimited right 
to be sole judges of what tends to their own preser- 
vation. This surrender must be equal by each, and 
it must be complete. No one in civil society can 
continue to exercise any part of the right he had in 
the state of nature to be his own master. This agree- 
ment, which is the social contract, is an agreement 
that is made by everyone with everyone. It trans- 
forms many isolated individuals into one people, a 
corporate entity. The agreement is unanimous, for 
the simple reason that whoever does not agree is 
not part of the people. Whoever stands outside the 
agreement is still in a state of nature with respect 
to the people created by the agreement. 

The consequence of the social contract is that 
henceforward the whole power of the incorporate 
people shall defend the life of each one of them, 
instead of each one having to defend himself alone. 
In order for the whole to act thus, there must be 
a part which can represent the whole and which 
can decide for and command the whole. But what 
part is this? The answer or, more precisely, the 
initial answer, to this question is “the majority.” 
The majority is the only part which can stand for 
the whole as soon as the social contract has been 
made. Unanimity is impossible except with respect 
to the contract itself. And this, we have seen, is an 
agreement to let a part stand for the whole. The 
rule of a minority is inadmissible, for this would 
imply some reservation by the ruling minority of 
some of the right each possessed in the state of 
nature but which all are supposed equally to sur- 
render by entering civil society. Any such reserva- 
tion would void their membership in the civil soci- 
ety. Hence the rule of the majority is the only rule 
which is not inconsistent with the original natural 
equality of all. 

Thus the natural right each individual possessed 
alone, the unlimited right to everything he deemed 
necessary to his preservation, is transformed into a 
legal or conventional right possessed by the whole 
people acting by the majority. However, just as the 
Surrender of the individual's right led to the right 
of the majority, so the majority may, according to 
its judgment, surrender its right to a minority. 
Many forms of government may be legitimate, ac- 
cording to the doctrine of natural rights, yet simple 
majoritarianism is the only form which is neces- 
sarily legitimate. Moreover, while legal or conven- 
tional sovereignty may devolve first to a majority, 
then to a minority, the natural right to life and 


liberty remains inalienable in the bosoms of indi- 
viduals, whose consent to be governed is always 
conditional. 


Nature and convention 


We have seen that sovereignty, as a construction 
from the unlimited right of every individual in the 
state of nature, is itself inherently unlimited. The 
government of the United States, however, is a lim- 
ited government, prohibited from doing many 
things, such as passing ex post facto laws and bills 
of attainder, granting patents of nobility, or estab- 
lishing a state church. Yet these limits are them- 
selves impositions by the sovereign people of the 
United States. The people have laid down these 
boundaries to government, and the people may take 
them away. From the point of view of the concept 
of sovereignty, the sovereign may do anything not 
naturally impossible. But the absoluteness of sov- 
ereign power is legal and hypothetical, not natural. 
For example, the American people may establish a 
state church, but they ought not to. They ought not 
to do anything inconsistent with their intention in 
forming a civil society, which intention was to 
overcome the discord of wills in the state of nature. 
Religious disestablishment is now plainly more con- 
ducive to that end than is establishment. This dis- 
tinction reproduces that of the state of nature, in 
which nothing the individual does can be unjust, 
because there is no authority which can prescribe 
to him. Yet he ought not to act in a manner con- 
trary to his self-preservation; for example, he ought 
not to be unwilling to leave the state of nature 
when others are willing to join with him in the 
agreement which produces civil society. Thus, also, 
the American people may do anything they decide 
to do, because there is no sovereign to prescribe to 
them. Yet they ought not to do anything harmful, or 
omit anything beneficial, to their self-preservation. 

The incorporation of naturally discrete individ- 
uals into one people creates an artificial person. For 
the many to regard the decision of a part as if it 
were a decision of a whole involves a second ele- 
ment of artifice or fiction: the first is that the many 
are one and the second is that the part is a whole. 
The doctrine of natural rights logically requires 
employment of this twofold fiction. And the polarity 
of this dual fiction is anchored in a twofold nature, 
a nature constituted by the undeniable concrete 
reality of the discrete individual, at the one end, 
and by the equally undeniable abstract reality of 
the human race, as a species, at the other, “All men 
are created equal” at once entails propositions 
about each individual and about the whole human 
species, of which he is a part. For this reason, the 


logic which leads individuals out of the state of 
nature suggests that sovereigns—who remain in 
the state of nature with respect to each other—can 
also emerge from this state by forming a world 
state. Thus there is also an inherent tendency in 
the doctrine of natural rights toward the world 
state, or at least toward a world society inhabited 
by a comparatively few pacific sovereigns. We may 
observe that if the whole human race were to be- 
come incorporated into one people, then the fiction 
whereby the many are declared to be one would in 
one sense coincide with a natural reality. For the 
fictitious one people would then coincide with the 
abstract one human race. However, we may also 
observe that, were it to do so, the fiction that a part 
represented a whole would thereby become that 
much more fictitious. 

Despite the necessity of the aforesaid fictions, 
individuals do not cease to be individuals in civil 
society. Their self-love, the foundation of their nat- 
ural rights, continues to animate them. A man 
assaulted in the street may use violence to defend 
himself, in the absence of legal protection. More- 
over, if the power of the sovereign should ever be 
perverted, so that it becomes the enemy of the 
people or of any part of the people, the right which 
has been “completely” surrendered may in fact be 
resumed, For the surrender was for a purpose—to 
secure the rights to life, liberty, and the pursuit 
of happiness—and whenever government becomes 
destructive of these ends, obedience may be with- 
drawn. The clear right of the people to alter or 
abolish governments is a constant incentive to good 
behavior by governments. The more a government 
convinces the people it is serving them well, the 
better they will obey it. The better they obey, the 
stronger the government, and the stronger the gov- 
ernment, the better it can serve. 


Unlimited sovereignty and limited government 


The exercise of sovereignty is intended to be lim- 
ited, moderated, and strengthened by the reason 
that makes sovereignty itself illimitable. For this 
same reason it must be indivisible. Although the 
political system of the United States embraces a 
twofold jurisdiction, of the governments of the 
states and of the government of the United States, 
this does not imply a division of sovereignty within 
the United States. John C. Calhoun remarked that 
sovereignty was like chastity, that it could not be 
surrendered in part. This acute witticism accurately 
reflects the fundamental theoretical construction 
presented here, As we have seen, equal individuals 
escape from the state of nature by equally agreeing 
to surrender to a sovereign the perfect freedom they 
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possessed in that state. But just as, in the defined 
sense, the individual must surrender all his right 
to be his own master in order to gain the protection 
of civil society, so the members of a small civil 
society cannot become members of a larger civil so- 
ciety without making a similar surrender of sov- 
ereignty. For this reason Abraham Lincoln agreed 
with Calhoun that any division of sovereignty be- 
tween states and nation was out of the question. 
But while Calhoun maintained that sovereignty had 
remained with the states, Lincoln insisted that it 
must repose in the nation, in the American people 
as a whole. 

Certainly the Declaration of Independence, from 
which we have construed much of this account of 
natural rights teaching, supports Lincoln’s position. 
For it speaks emphatically of “one people” dissolv- 
ing the political bonds which had hitherto con- 
nected them with Great Britain. That people was 
then conducting a war to preserve themselves from 
what they believed to be the anarchic violence of 
the British sovereign. It would have been incon- 
sistent with the purpose of that “one people” to 
have divided themselves into 13 peoples at the same 
moment that they united to resist oppression. To 
have done so would have meant that they had 
deliberately reproduced the dangers of the state of 
nature with respect to each other at the very 
moment when they had combined to escape those 
dangers with respect to the British crown. There- 
fore, in 1776 the states of the union can have been 
sovereign only insofar as they were united, and 
were and are sovereign only because the people of 
those states were and are parts of the one, indivis- 
ible, sovereign people of the United States. To sup- 
pose otherwise would be to suppose that the people 
of the United States had not been formed into one 
people according to the principles and logic of the 
doctrine of natural rights. Yet the Declaration of 
Independence begins with the most ringing affirma- 
tion of those principles the world has ever known. 


Harry V. JAFFA 


[See also NATURAL LAW; SOCIAL CONTRACT; SOVER- 
EIGNTY. Related material may be found in CoNsTI- 
TUTIONAL LAW; HUMAN RIGHTS; and POLITICAL 
THEORY.] 
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1. THE ISLAMIC COUNTRIES 
IL ISRAEL 


I 
THE ISLAMIC COUNTRIES 


The chief social characteristics of the Near East 
in the twentieth century have been the result of the 
accelerated tempo of modern technological ad- 
vance. This effort to increase national power and 
improve living standards began long before the 
present century in some parts of this area but is 
only now beginning in others. At first, such changes 
came about largely through European or American 
intervention or example; more recently the initia- 
tive has been taken by indigenous rulers and 
governments. The resulting social structure is a 
web of traditional and new institutions and asso- 
ciations in which the old sometimes provide the 
foundation for the new, are sometimes simply by- 
passed and allowed to disappear, or persist sig- 
nificantly alongside the new patterns and even 
help to shape them. 


Physical background 


The cultural-geographical area under discussion 
has been variously called the Near East, the Middle 
East, southwest Asia, and the Islamic world; these 
names arose in different times and from different 
points of view. For our purposes, the Near East 
comprises the region from Egypt east to Afghani- 
stan and from Turkey south to the Sudan, that is, 
the following countries: Egypt, Sudan, Jordan, 
Lebanon, Syria, Saudi Arabia and the remainder 
of the Arabian peninsula, Iraq, Turkey, Iran, and 
Afghanistan. (Israel, which shares many of the 
features of this region, is not included in this 
article. ) 


Although these countries share a common his- 


tory and even today preserve a degree of cultural 
unity, they are far from being socially or geograph- 
ically homogeneous. They contain a mixture of 
human physical types and colors—tall and short 
statures, broad and slender builds, dark and light 
skins. Their three main languages—Arabic, Per- 
sian, and Turkish—belong to different linguistic 
families. Their economy is largely agricultural, but 
there are great differences in the extent and im- 
portance of industrialization and of pastoralism, 
Their communities are chiefly rural, but there are, 
again, great differences in the intensity of urbani- 
zation. Income and education are low when com- 
pared to industrial regions, yet the range within 
the Near East is broad. Lebanon and Turkey thus 
have a per capita annual income of several hun- 
dred dollars and a literacy rate of about 50 per 
cent; there are some sections of the Arabian penin- 
sula that have perhaps a fifth that income and a 
tenth that literacy rate. In this widely disparate 
region, the single most common cultural charac- 
teristic is religion, for despite even sizable minor- 
ities here and there, Islam is the religion of nine- 
tenths of the people in the Near East and is by far 
the predominant faith in every country except 
Lebanon, where Christians are almost as numerous 
as Muslims. 

The Near East has a population of 130 million 
to 140 million and an area of nearly 4 million 
square miles. Most of this area is steppe and desert. 
Despite a general proximity to the sea, coastal 
mountain ranges prevent rainfall from reaching 
the interior, which remains arid, whereas the 
coasts receive a large amount of precipitation. 
Water is derived, in some areas, from the two large 
river complexes, the Nile and the Tigris—Euphrates. 
Known mineral resources are inadequate for heavy 
industry. Oil is abundant, but in vastly differing 
amounts; it is found mostly in the countries around 
the Persian Gulf. This uneven distribution of re- 
sources has resulted in uneven population density; 
wide areas are Virtually uninhabited, while a small 
amount of land sustains most village and city life. 


Three types of community 


The social pattern of tribal, village, and urban 
communities corresponds to the geographical—eco- 
nomic division of the region. 

Nomadic groups. The nomads and seminomads 
of desert and steppe have been important histori- 
cally in the spread of Islam; in the development 
of idealized personal traits, such as bravery, pride, 
generosity, and cunning; and in certain economic 
functions, such as stockbreeding and the policing 
of routes of trade and travel. Their mode of exist- 
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ence has required an adaptation to an environment 
so severe that few have been able to survive it. 

In social organization, nomadic communities 
(whose wanderings are not random, but regular) 
have shown considerable reliance upon clan and 
tribe to carry out the diverse functions that in 
settled societies are assigned to specialized agencies. 
Nomadic communities have traditionally evaded 
control by external governmental authority. How- 
ever, since technology has enabled the regimes in 
the capital cities to overcome distance and terrain, 
the nomads retain only a vestige of this inde- 
pendence, 

Sedentarization of nomads has been occurring 
for a long time, as a result both of their declining 
economy and of deliberate governmental policy. 
Some nomads settle on the land, others enter the 
army or special police forces, and still others have 
become workers in modern industry (especially 
Arab bedouins, who have gone to work in the oil 
fields and refineries). The percentage of nomads 
is thus smaller than ever in the Near East; prob- 
ably no more than a few million live in wandering 
communities. 

Villages. The vast majority (probably two-thirds 
to three-quarters) of people in the Near East live 
in villages that, in socioeconomic organization, lie 
somewhere between nomadic and urban cormmu- 
nities. The village has some division of labor even 
within agriculture, the chief distinction being be- 
tween owners and nonowners, and it has a few 
crafts and services that do not reach beyond the 
locality. The physical structure of the villages 
varies, ranging from those that are closely strung 
along the Nile to those that are widely scattered 
in the near-steppe and mountainous regions. 

Social structure is not greatly differentiated, al- 
though the village does have formal and steady 
contact with distant governmental agencies. Rural 
life has remained autonomous except for the eco- 
nomic and military demands placed upon it by 
central authorities. Land is the chief value and its 
tilling the chief occupation; village loyalties are 
traditional and local: to religion, family, and in- 
dividuals rather than to nation, state, and broad 
associations based on political or economic inter- 
ests, These have thus far hardly been altered by 
the growth of national power and the technological 
changes that have swept the area in the last half 
century, 

The social transformation of the village is only 
beginning, although economically it is increasingly 
dependent upon world markets. Changes in trans- 
port and communication bring new ideas to the 
village, and land reform in some countries induces 


fundamental changes by transferring land and in- 
come to peasants while the social power formerly 
exercised by the landlords is replaced by national 
political authority. 

Urban centers. Islamic towns and cities owe 
their character to considerations of religion and 
protection. Muslim values owe much to urban 
society, which has always dominated the country- 
side and the desert-steppe areas, even when the 
latter provided the human force that carried Islam 
eastward and westward from its Arabian starting 
point. Although today the Near Eastern city still 
displays an extraordinary concentration of all forms 
of power, it is itself dependent on the national gov- 
ernment. Near Eastern cities, unlike those in much 
of Europe, did not go through a period of autonomy 
on the way to incorporation within the advancing 
national states. 

Despite the predominance of agriculture and vil- 
lage life, about 20 per cent of the population lives 
in cities of 100,000 or more, a proportion found 
more often in technologically advanced societies 
than in less developed agrarian ones, As industry 
and communications develop, the larger cities grow 
at a faster rate than the smaller ones. Egyptian 
cities of 100,000 or more people have been growing 
twice as rapidly as those with 30,000 to 100,000 
people. Teheran, the capital of Iran, now has over 
2 million people, more than double the population 
of only twenty years ago. 

A large percentage of the people in these urban 
centers lives at or below a subsistence level, and 
many others scratch out a livelihood in services 
and trades that seem to absorb additional hands 
more in accordance with the supply of workers 
than with the demand of consumers. Recent mi- 
grants fill the ranks of these urban underemployed 
and unemployed. They also congregate in shanty- 
towns on the outskirts of major cities, such as 
Baghdad and Teheran. There are few governmental 
agencies that help in this vast transition from 
rural to urban life, but a large variety of voluntary 
associations and less formal groups, composed of 
migrants whose place in urban life is already some- 
what definite and satisfactory, ease the way for 
the thousands who come after them. 

Centuries of foreign influence and more recent 
modernization have altered the traditional physical 
structure of Near Eastern cities. New cities have 
been built alongside the old, and often original 
towns remain as bazaar or rundown residential 
districts. Comprehensive urban planning has only 
recently been adopted in some cities. In the past, 
planning meant the creation of impressive monu- 
ments and vistas to flatter rulers rather than to 
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meet popular needs. City planning has tended to 
be lost in the rush of interest in national economic 
planning, with the result that housing, among 
other things, has been a serious problem every- 
where. 


Agriculture 

Despite the high degree of urbanization, the 
varied influence of large metropolitan areas, and 
recent efforts at industrialization, agriculture re- 
mains the chief element in the Near East’s eco- 
nomic life. It is, however, an agriculture that is far 
from the domestically oriented subsistence type 
that prevailed until around the middle of the nine- 
teenth century, when most of the Near East be- 
came tied to world markets. 

Agriculture absorbs one-half to two-thirds of the 
labor force and contributes one-fifth to three- 
quarters of national income. Industry absorbs an 
almost negligible number of workers, employing 
less than 5 per cent of the labor force in some 
areas and about 15 per cent in some others, con- 
tributing proportionately little to the national in- 
come (omitting the profits derived from oil). Most 
Near Eastern countries, despite the low percentage 
of cultivated land and the undeveloped state of 
industry, do not yet stand in great danger of popu- 
lation outrunning resources, Compared to world 
patterns, population growth in the whole area is 
moderate, except in Egypt, which faces a serious 
problem in achieving a rate of economic growth 
commensurate with population increase. 

Since the middle of the nineteenth century agri- 
culture has gone through a number of important 
changes in land tenure, crops, and techniques. 
Around the middle of the present century another 
wave of change began, especially in land owner- 
ship. Agrarian reform in several countries (Turkey, 
Egypt, Syria, Iraq, and Iran) was carried out with 
varying degrees of vigor. The usual plan follows 
three steps: (1) The government sells small plots 
of land to peasants, on easy terms. These are taken 
from state-held lands or from lands expropriated, 
with compensation, from private owners. (2) The 
government organizes the new owners into co- 
operatives that handle all aspects of farming, from 
seed purchase to marketing. (3) The government 
establishes some safeguards (usually insufficient) 
for landless workers and tenants. These reforms 
have had the effect of raising income among the 
relatively few peasants able to become owners. The 
Political effect has been greater, for these meas- 
ures have greatly weakened the power of large 
landholders, especially in Egypt, where that class 


has been eliminated and its social power trans- 
ferred to the government. 
Oil 

Although oil is found in significant deposits only 
in those countries around the Persian Gulf, the 
industry that has grown up in this century to ex- 
tract and refine it has had very important conse- 
quences throughout the region, The leading pro- 
ducers (Bahrein, Iran, Iraq, Kuwait, Qatar, and 
Saudi Arabia) received over $9,000 million in direct 
payments from the oil companies between 1948 
and 1960, and since then such payments have been 
well over $1,000 million each year (for these and 
most of the following oil statistics, see Issawi & 
Yeganeh 1963, passim). Several countries without 
substantial oil resources receive large payments for 
the movement of oil through their waters or over 
their territory. Oil affects relations between the 
world powers, among Near Eastern countries, and 
between the Near East and the Western oil com- 
panies and governments. 

The Near East’s oil supply is being used up at a 
relatively slow rate, for in 1960 it accounted for 
only a quarter of world production. Production costs 
are low, less than 20 cents per barrel, compared 
with 80 cents in Venezuela and $1.75 in the United 
States. One of the reasons is sheer abundance, re- 
flected in the fact that production per well in the 
Near East is one hundred to five hundred times 
that in the Americas. As its production increases, 
Near Eastern oil constitutes a growing proportion 
of world trade in oil, having reached 60 per cent 
in 1959. Most of this trade is with Europe, which 
obtains about three-quarters of its oil from the 
Near East. 

Such huge operations imply commensurately 
large capital investment and returns. The govern- 
ments of the area have been able to do little other 
than to permit foreign investors and operators to 
find, develop, and market this important resource. 
The gross investment amounted to about $1,000 
million soon after World War u and to over $4,000 
million in 1960. Nearly all of this investment was 
made by Western private and government corpo- 
rations, most of it by private corporations in the 
United States. 

Investment in Near Eastern oi] has been espe 
cially profitable to the foreign companies and the 
Near Eastern governments. Dividends to share- 
holders have constituted a much larger proportion 
of gross receipts than has been the case with other 
international operations. To the Near East itself, oil 
has greatly contributed to gross national product, 
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foreign exchange, and government revenue. Direct 
payments to governments approach $2,000 million 
dollars annually, Revenue derived from oil makes 
up two-fifths to nine-tenths of the budgets and 
provides upwards of two-thirds of the foreign ex- 
change. Near Eastern governments, seeking a 
larger portion of the profits, have nationalized oil 
resources and joined with each other to strengthen 
their positions. They have also responded to chang- 
ing patterns of world demand. 

Labor conditions. Despite the vast sums of 
money involved, the oil industry has not had the 
direct economic impact upon the lower socioeco- 
nomic groups that might be expected. Wages and 
working conditions in the industry are better than 
in others, but the number of persons employed in 
the whole region is only about 100,000, and it has 
not been increasing. In the smaller places, such as 
Bahrein and Kuwait, oil workers constitute a fifth 
or more of the total employed population and much 
higher percentages of those generally employed in 
industry. In Iraq and Iran, however, they constitute 
only 1 per cent of the labor force and, respectively, 
14 and 7 per cent of all industrial workers. Native 
Near Easterners constitute two-thirds to three- 
quarters of the labor force in oil; they are a very 
small but growing part of those in the supervisory 
posts, 

Labor conditions in industries such as textiles, 
food processing, building, and mining are much 
less favorable because native Near Eastern employ- 
ers are not accustomed to the more generous poli- 
cies of Western employers. Near Eastern govern- 
ments have found it easier to persuade or require 
foreign employers to raise wages and improve 
working conditions than to impose such practices 
on native industries. Although some countries have 
legal minimums that are higher, the average weekly 
industrial wage in the more populous countries 
(excluding the oil industry) is about ten U.S. dol- 
lars. Productivity of labor is correspondingly low. 
Labor unrest is not widely manifested in traditional 
Western forms. For a long time after the rise of 
modern industry the governments regarded labor 
unions as subversive. Although they previously for- 
bade the unions’ existence, most regimes now per- 
mit or encourage them but at the same time exer- 
cise close control. Governments have also adopted 
some social legislation, although much of it has 
not been efficiently executed. 

Even where unions are tolerated, governments 
have prohibited strikes, so that labor protest is 
registered largely by rapid turnover, absenteeism, 
and individual complaints to conciliation boards. 


Generalized dissatisfaction by workers is often dis- 
played through political acts, such as open demon- 
strations or clandestine organization and propa- 
ganda. Unions include (as a very rough estimate) 
about 750,000 workers, a very small portion of all 
industrial workers; moreover, many unions are 
poorly led and virtually inactive. Near Eastern 
unions have few and weak ties to the interna- 
tional trade-union movement. Many in the Arab 
countries are affiliated with the International Con- 
federation of Arab Trade Unions, whose main role 
has been in regional political action. 

An important feature of Near Eastern economic 
life has been a division of labor along ethnic, na- 
tional, and religious lines, especially in banking, 
commerce, and the crafts. This has been especially 
true where there was an early variety of groups 
augmented by hundreds of thousands of Europeans 
who came during the nineteenth and twentieth 
centuries. Early in the twentieth century, with the 
growth of nationalism and independence, this pat- 
tern began to break up. 


Social groups 

The trend toward national homogeneity within 
most Near Eastern countries has far from over- 
shadowed group distinctions. 

Probably because religious loyalty has been the 
main unifying element among the followers of one 
creed, it has also been the main dividing force 
between creeds. Until this century Near Easterners 
knew one another chiefly by religious persuasion, 
crossing even national, racial, and linguistic bound- 
aries. Physical type and skin color are relatively 
unimportant in status differentiation, yet in recent 
decades a light skin has come to be regarded, in 
some places, as somehow more desirable. This may 
be the result of Western influence, although there 
are indications in early Islamic literature that a 
fair complexion was considered to be an element 
of beauty. In the early Islamic world differences 
arose between Arab and non-Arab Muslims, but 
these differences receded under centralized political 
authority. 

Religious groups. Since Islam did not absorb 
all the other religions as it expanded, there is a 
web of religious minority groups that cuts across 
national boundaries throughout the Near East. 
Islam is the dominant religion, but Islam itself is 
divided into two large and important segments. 
Of the approximately 120 million Muslims in the 
region, more than three-quarters are Sunnites, who 
emphasize their adherence to the path (Sunna) of 
the Prophet Muhammad, whereas the remainder 


94 NEAR EASTERN SOCIETY: The Islamic Countries 


are Shiites, or adherents of the party (Shia) of the 
Prophet's son-in-law, Ali. Of the Shiites, the vast 
majority live in Iran, where the Sunnites are a 
small minority. 

Christian and Jewish minorities have decreased 
since World War 11. There are perhaps 5 million 
Christians of many sects in the Arab world, espe- 
cially in Egypt, Lebanon, and Syria. There are 
probably no more than 25,000 Jews, representing 
a rapid and considerable decline in numbers and 
influence. In the southern Sudan there are several 
million pagans, many of whom are being absorbed 
into Islam and Christianity. 

National and linguistic differences cut across the 
religious boundaries. There is thus a great sense 
of difference between Turks and Egyptians, al- 
though both nations are largely Sunnite Muslim. 
Moreover, participation in a common language 
community does not always overcome national 
barriers. On the other hand, within certain coun- 
tries, like Iran and Turkey, religion, nationality, 
and language produce a relatively homogeneous 
culture within the respective political borders that 
is distinguishable from the rest of the population 
in the Near East. In this resulting complex of 
crisscrossing affiliations and loyalties, Turkey is the 
most homogeneous, for about 95 per cent of the 
nation are Sunnite Muslims, speak Turkish, and 
are of roughly similar physical type; the least 
homogeneous is Lebanon, where a population of 
around two million is so variously divided that it 
is difficult to identify a “majority” group. 

Traditionally, the Ottoman Empire, which ruled 
much of the Near East from the sixteenth to the 
twentieth century, allowed the non-Muslim reli- 
gious—national communities (or millets) a certain 
amount of autonomy. This separatist arrangement 
permitted a degree of cooperation across religious 
lines through formal and informal mechanisms, 
but the dominant spirit was one of accommodation 
through avoidance. The growth of nationalism in 
the nineteenth century among Ottomans and Euro- 
pean Christians and later among Arabs changed 
this pattern significantly. As the Ottoman rulers 
of the Near East saw their power decline in rela- 
tion to that of Europe, their defensiveness in- 
creased, and they tried various means to close the 
gap. One was the attempt, in the middle of the 
nineteenth century, to eradicate religious distinc- 
tions and create an Ottoman patriotism that would 
override those and other differences. 

Not only did Turks find it difficult in practice 
to accept such legal reforms, but the European 
Christian minorities themselves wanted political 
Separation from the empire rather than equality 


within it. When, toward the end of the nineteenth 
century and early in the twentieth, the Arab prov- 
inces of the Ottoman realm began to stir, national- 
ist leaders held out for independence rather than 
autonomy within the empire. Christian Arabs, how- 
ever, unlike Christian Europeans, could not com- 
bine religious with national loyalty because they 
lived among a vast majority of Muslim Arabs; 
hence, they stressed a secular nationalism removed 
from religious loyalties. Turkish nationalist despair 
and hope reached their most intense levels around 
World War 1 when the empire was further dis- 
membered. Atatiirk was then able to forge a new 
revolutionary movement that led to the expulsion 
of the foreign enemies and the suppression of the 
enemies of “progress” at home. 

National states. The breakup of the old empire 
was now complete, with national states in Chris- 
tian Europe, Arab states in the former Islamic 
Ottoman domain, and a secular national state in 
the very heart of the empire itself. The old millet 
system was gone, as each nation-state insisted upon 
one loyalty even if homogeneity of population was 
not yet achieved. A vast and complicated shuffling 
of religious, national, and language groups (a 
process that antedated World War 1) reached its 
height during and after the war. Ultimately about 
two million Turkish and other Muslims went from 
the Balkans to Turkey, and roughly the same num- 
ber of Christian Europeans, Christian Arabs, and 
Jews left the Near East for Europe and America. 

The growth of nationalism shattered religious 
unity. The new Balkan states that emerged from 
the Ottoman Empire were Christian, but that loy- 
alty did not bring them closer to one another. The 
new Arab states felt no special kinship with Turkey 
or Iran on the basis of common Islamic loyalty. 
“But,” as Hourani has remarked (1955, p. 129), 
“by what might seem almost a paradox, while the 
religious link between one nation and another 
vanished, the relationship, within each national 
community, between religion and nation still per- 
sisted.” 

This was true in Turkey, Egypt, and Iran; the 
identification of religion and nationality was clear 
also in the establishment of Israel. Within each 
nation, however, there was no longer a form of 
millet system to accommodate the various groups 
to one another. Since the national (and national- 
ist) state was supposed to be homogeneous, funda- 
mental distinctions in the population could no 
longer be admitted. The result has been that hos- 
tility and resentment are given no peaceful outlet. 
Thus, the dissatisfaction of Coptic Christians in 
Egypt may not be expressed for fear that it would 
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be interpreted as disloyalty to the nation, and in 
Iraq the Kurds, who are Muslims speaking an Indo- 
European language, are in intermittent revolt for 
autonomy or for the creation of a national state of 
their own to include also the Kurds of Syria, Iran, 
and Turkey. 


Arab nationalism and unity 


The most virulent and as yet unfulfilled nation- 
alism in the Near East is Arab nationalism. The 
sense of unity among the Arabic-speaking peoples 
overrides the still powerful nationalism in individ- 
ual countries like Egypt, Syria, Iraq, Lebanon, and 
so on. Thus far the broader form of Arab national- 
ism has led to several unsuccessful efforts to unify 
two or more Arab states. It has, however, enabled 
most of the Arab states to maintain a fairly unified 
policy on Israel and on many important interna- 
tional issues raised in the United Nations. 

Moreover, Arab nationalism, based on a sense 
of a common culture and history as well as of a 
common destiny, has awakened many Arabs to the 
possibilities of modern life. Although often shrill, 
vague, and authoritarian, it has yet encouraged a 
sense of decent pride in the past and has pointed 
to the need for resolution to improve the lives of 
ordinary people in the future. Arab nationalism, 
however, like other nationalisms, has been unable 
to make a clear choice between two paths; (1) the 
emphasis upon mere sentiment, upon unity for the 
sake of power, and upon progress in technology 
mainly to increase military strength, and (2) the 
use of common sentiment to improve the material 
conditions of life and to promote the sense of 
justice and the rule of law. 

There are a secular and a religious side to Arab 
nationalism, a fact that, taken in conjunction with 
other kinds of loyalty (to family and village, for 
example), has made it difficult for the Arab to 
arrange his affiliations and sentiments in some sort 
of reasonably stable order. The secular character 
of nationalism is clearly shown in its accommoda- 
tion of Arab Christendom as well as Arab Islam. 
In international politics it has been displayed with 
special force in the fact that several Arab Muslim 
countries, notably Egypt, have been inspired by 
and have supported India rather than Pakistan, 
the largest Muslim state in the world. The religious 
element in Arab nationalism appears in the con- 
tinuing evocation of Islamic loyalties as part of 
some nationalist appeals. 

The interweaving of both secular and religious 
elements makes Arab nationalism a more complex 
sentiment than it would be if religion were not an 
issue. The reason it becomes an issue is simple: 


historically the Islamic religion and the Arabic 
language and culture have been almost inseparable, 
yet not all Arabs are Muslims. Christian Arabs have 
had to reconcile their Arab national-—linguistic heri- 
tage with its predominantly Islamic nature, where- 
as Muslims have had to decide on the place of 
traditional religion in the establishment of a strong 
national state. 

When, before World War 1, Arab nationalism 
was directed against Ottoman control, the religious 
issue was simple because the Ottomans were 
Muslims. Both Muslim and Christian Arabs could 
respond to Arab nationalist appeals against a non- 
Arab and non-Christian power whom they regarded 
as the oppressor. In fact, Syrian Christian Arabs 
were among the leading nationalists. In Egypt, 
where Britain was the ruler, nationalism was more 
exclusively Muslim, for the Christian population 
did not view the British as quite so alien and op- 
pressive. After World War 1, however, Turkey was 
no longer a power in the Arab world, and Britain 
and France reached the height of their sway over 
it. For Muslim Arabs the issue was now quite 
plain: The Christian foreigners had to be ejected. 
For Christian Arabs, however, whose own nation- 
alism had come in part from Western sources and 
who looked to the West for help, the issue was less 
clear. 

Two developments between World War 1 and 
World War 11 convinced the Christian Arabs that 
they must join the Muslims in secular Arab na- 
tionalism. First, the decline of European power 
made it clear that the Christian minorities could 
no longer rely on it for protection. Second, the 
growth of Zionism in Palestine quickened Arab 
nationalist feelings, and on this issue the Christian 
Arabs did not need to feel any ambivalence what- 
ever; their religious, national, and economic in- 
terests coincided. The accommodation of both 
Muslims and Christians to Arab nationalism was 
facilitated by the growing secularism both of Arab 
society and of nationalist ideology itself. 

Secular Arab nationalism has not, however, sat- 
isfied certain Islamic supranational religious loyal- 
ties. To many Muslim thinkers, Arab nationalism 
is deplored on two counts, First, as an Arab senti- 
ment it divides the Islamic world, most of which 
is in fact not Arab. Second, as a secular ideology 
it weakens the Islamic loyalties of Muslim Arabs. 
It also weakens the influence of religious leaders 
upon the people and regimes, an influence that 
long characterized Islamic society. Which principle 
of association is the broader depends upon the 
point of view. Secular nationalists argue that Arab 
nationalism is broader because it unites Muslims 
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with non-Muslims, whereas the advocates of Islam- 
ic unity insist that their conception is broader 
because it encompasses Arabs and non-Arabs. 
To some extent these two conceptions have been 
expressed in the ideologies and social movements 
known as Pan-Arabism and Pan-Islam. Certainly 
the former is the stronger influence and has been 
embodied in the Arab national states, in the Arab 
League, and in several attempts, since 1958, to 
create a unitary state out of several Arab states. 


Religion 

Although secularism has been steadily gaining 
in the Near East for a century or more, religion 
remains an influential force among the people. 
Religious leaders, it is true, have little power in 
most of the region’s governments and are forbid- 
den to play an independent political role in carry- 
ing out their religious functions. Yet these leaders 
affect political affairs by conserving popular at- 
tachment to traditional Islam; to the extent that 
such leaders are successful, the “modernizing” 
secular authorities find a considerable portion of 
the population insensitive to their exhortations. 
Turkey, for example, found it expedient in the 
1950s to mitigate earlier official hostility to popular 
religion and to yield to rising demands, both in the 
cities and in rural areas, to facilitate religious 
teaching and practice. In Egypt the presence of 
Al-Azhar has been an additional force to make the 
revolutionary regime take account of religious feel- 
ing, which it has done in two ways: by injecting a 
heavy element of Islamic loyalty into its national- 
ist and other appeals to the masses, and by seeking 
to control religious teaching not only in Egypt but 
also, through Al-Azhar, elsewhere in the Islamic 
world (especially in Negro Africa). The decline of 
Pan-Islam as a social movement and its loss of a 
firm basis in some supranational (though admit- 
tedly weak) agency like the caliphate has left 
religious leaders and followers with little direction. 
Religious influence thus tends to flow into uncon- 
nected channels wherever it is not blocked; secular 
leaders, holding a monopoly of increasingly effec- 
tive power in most states, try to control this flow. 

Islam and nationalism. In Iran and Turkey the 
revolutionary changes introduced in the 1920s, by 
Riza Shah and Atatiirk, soon brought them into di- 
rect conflict with clerical power. They subdued it 
ruthlessly. Clerical power in Egypt had earlier been 
weakened in affairs of state, but a major bid for 
political power was made in 1954 by the Muslim 
Brotherhood, a fundamentalist, “cleansing” move- 
ment, only two years after the military regime had 
come to power. The government suppressed the 


brotherhood but then instituted a deliberate policy 
designed (1) to show the masses that this suppres- 
sion of the brotherhood was not to be interpreted 
as hostility to religion, (2) to substitute the gov- 
ernment-sponsored religious agencies for other, 
more independent ones as the teachers and guides 
in religious affairs, and (3) to discourage similar 
bids for power on the part of traditional or new 
religious leaders, 

Egypt’s use of the theme of Islamic unity in its 
nationalist appeals has had implications outside 
that country for several reasons. First, Egypt's 
policy on Al-Azhar affects the entire Muslim world, 
which has a special relation to the center of reli- 
gious learning. Second, Egypt uses Islamic themes 
in Negro Africa, where this approach has been 
welcomed by some leaders but has been viewed as 
intrusive by others. Finally, appeals to Islam have 
disturbed secular Arab nationalists in other coun- 
tries, including not only those where Christians are 
numerous and influential in the movement for 
Arab unity but also some that do not adhere to the 
form of Islam prevailing in Egypt. 

The Islamic appeal has been used in Egypt as 
an adjunct of both foreign and domestic policy. 
In late 1958 and early 1959, for example, President 
Gamal Abdel Nasser found it expedient to with- 
draw from his position of great dependence on the 
U.S.S.R. One of the reasons was that it appeared to 
him that the U.S.S.R. was aiming to turn the 
recent revolution in Iraq into the first real “people's 
democracy,” or Soviet satellite, in the Arab world. 
To this challenge Nasser responded by calling the 
attention of Egyptians and the entire Arab Muslim 
world to the incompatibility of Islam with the 
atheism fostered under communism, At the same 
time he attacked his Arab rivals in Iraq as tools 
of a foreign, atheist ideology, bent upon destroying 
the traditional unity of Islam; this charge was 
especially significant in some parts of the Arab 
Muslim world because a majority of Iragi Muslims 
are Shiites, a minority wing of Islam. 

For those Arab Muslim countries that could not 
be accused of Supporting atheism but whose re- 
gimes he opposed because they resisted Egypt's 
efforts to set the terms of Arab unity, President 
Nasser reserved a different approach. The “reac- 
tionary” monarchies, Saudi Arabia and Jordan, 
were criticized for being corrupt and exploitative, 
in violation of Islamic precepts requiring justice 
and equality in economic life. When Egypt turned 
more directly, in 1961, to what it called “Arab 
socialism” in the “second revolution,” the attacks 
on the monarchies (now including Morocco) spe- 
cifically identified socialism with Islam. 
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Egypt. This identification of socialism with 
Islam reflected domestic policy in Egypt. Beginning 
in the mid-1950s, President Nasser stressed that 
Egypt was aiming to become a “socialist, coopera- 
tive, democratic” society. There was much talk and 
writing about socialism. In the summer of 1961 
the regime took decisive steps toward the elimina- 
tion of private property and its transfer to the state. 
When Syria, a few months later, broke away from 
its union with Egypt (established in 1958 as the 
United Arab Republic), President Nasser went 
further in the same direction. All along the regime 
had tried to mobilize religious sentiment for its 
secular goals, but with these steps toward state 
socialism it intensified its efforts to identify Islam 
with the regime's own social and economic policies. 

The expression of socialist goals in Islamic terms 
was pointed not only toward the masses of Egyp- 
tians but also toward the religious leaders in Al- 
Azhar. They accepted the identification, although 
there were misgivings among some who apparently 
felt that this was the final step in the separation 
of religious from secular authority and in the 
complete subjection of the former to the latter. 

The religious leaders who accepted the regime's 
definition of their place in the propagation of 
Islamic socialism also participated in a reorganiza- 
tion of Al-Azhar that was consonant with the gov- 
ernment’s supremacy. In doing so, however, they 
seemed to act upon a traditional principle of the 
learned men of Islam: that it does not matter so 
much what secular power does so long as religious 
authority survives and preserves its role as teacher 
to the faithful. Religion, embodied in the influence 
of its adepts, must bend in order not to be broken. 
If the secular regime needs Islam, according to 
this point of view, the religious leaders must lend 
their help and thus continue to have access to the 
masses for religious purposes, whatever the regime's 
purposes might be. This position is likewise a tradi- 
tional one in Islam: that political rulers and reli- 
gious thinkers together can provide good govern- 
ment. The disagreement among the religious 
leaders in the early 1960s was thus not over prin- 
ciples but over whether the regime was not simply 
using them and Islam for its own secular, narrow 
purposes instead of using them to promote the 
religious and broader purposes of Islam. 

Alternating rivalry and limited cooperation be- 
tween political and religious elites continue to 
occur in the Near East because religious feeling 
continues to run high among the masses despite 
growing secularism in government and in other 
domains of life. Among the most powerful mani- 
festations of popular Islam have been the mystical 


orders; they have provided color and drama where 
conventional Islam has discouraged it. Through 
prayer accompanied by dance, music, and strict 
concentration, these groups have led people to a 
sense of union with God. They have also performed 
educational, cultural, and welfare services. Their 
leaders, who usually exercise absolute control, are 
often believed to have hidden, supernatural powers, 
and this belief has led to saint worship. These 
orders, or brotherhoods, often viewed with suspi- 
cion by political and religious leaders alike, have 
declined somewhat in recent years; yet the impulse 
behind them leads to a revival now and then and 
to other forms of lay religious association among 
the masses. 


Political institutions 

After World War 1 a large number of Near 
Eastern countries became independent or, if al- 
ready nominally so, were able to act independently 
to an increasing extent. At the same time, many 
of them also removed the superstructure of political 
forms previously imposed by or borrowed from the 
West, that is, parliaments, elections, and political 
parties. In its place these countries established 
military republics with organizations (sometimes 
called parties) having exclusive access to the popu- 
lation and seeking not so much approval of the 
masses as their mobilization to achieve the “mod- 
ernization” so earnestly sought by the military rulers 
and their followers in the upper civilian bureauc- 
racy. Near Eastern military leaders have found free 
political organization a threat to their programs of 
economic and social change. They fear a resur- 
gence of influence of the propertied classes if open 
competition for power were allowed. Older forms 
of absolutism were suspicious of political parties 
but permitted them a certain amount of freedom 
to compete against one another so long as they did 
not seriously challenge the locus of sovereign 
power. The newer absolutism has regarded all polit- 
ical parties as divisive and without any social ra- 
tionale, since the nation is presumed to be homo- 
geneous and united behind whatever national 
regime happens to hold power. 

Aspects of modern political ideology and practice 
in the Near East derive from and still reflect the 
influence of certain time-honored institutions and 
attitudes regarding the nature of government. 

In early Islamic concept, the purpose of govern- 
ment was not merely to ensure public order and 
to promote welfare but to protect and advance the 
religion of Allah. Secular and religious authority 
were not separated in either ideal or practice. When, 
after the expansion of Islam in the seventh and 
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eighth centuries, the two kinds of authority sepa- 
rated, religious leaders began to adopt a view of 
government that still pervades the Near East. This 
view held that government, no longer based upon 
religion, was evil and to be avoided by the pious. 
Oppressive regimes also alienated the populace. The 
result has been a profound suspicion of govern- 
ment, leading to the desire to control it or to avoid 
it but not to exercise established, regular responsi- 
bility toward it. 

Governments, in their turn, made large jurisdic- 
tional claims but were unable to exercise pervasive 
powers because of internal weakness, public indif- 
ference, and the creation of intermediary agencies 
between ruled and rulers, such as the religious or- 
ganizations, the guilds, and the millets. A con- 
comitant of this popular sense of hostility or 
indifference toward government has been the atti- 
tude that political authority is something at once 
sq remote that the ordinary man has no access to 
it and so close that it is constantly making demands 
upon him. The popular response alternates between 
avoiding government and making demands upon 
it, Moreover, it is still widely felt in the Near East 
that government should be responsible for many 
of the things accomplished elsewhere by voluntary 
associations, The implicit attitude seems to be that 
if a government makes so many demands upon 
the people, the people have the right to expect the 
government to do anything that needs to be done. 

Hostility and indifference to government are 
traditional means of protection against its sway. 
In recent decades, regimes seeking to raise the 
material level of living have had to struggle against 
this powerful legacy. The reciprocal response to 
mass apathy has been extremism among the artic- 
ulate, educated elite in and out of power. This elite 
has sought to bring the masses into political action. 
Leaders exhort the masses to work, learn, and per- 
form military service and to protect the new 
regimes against the “reactionary forces” lurking 
about or foreign enemies close by. Still suspicious 
of government and not fully comprehending the 
new goals, the masses do not respond easily to 
these appeals, and the leaders are driven to greater 
and greater efforts to produce an effect, In addition, 
the masses have been made important by the fact 
that leaders in and out of power seek their support. 
_ But the recognized importance of public opinion 
is growing faster than the public’s knowledge and 
experience of politics. This gap makes the masses 
easy to manipulate. They swing from apathy to 
momentary involvement in politics in response to 
extremist appeals that lead to demonstrations and 


flashes of violence rather than to sustained, in- 
formed political activity. As long as many Near 
Eastern regimes retain.an authoritarian character 
they cannot allow the peaceful political contests 
and fundamental ideological discussion that would 
produce steady political participation. The result 
is often that support for a régime means intoler- 
ance of the slightest criticism; meanwhile, opposi- 
tion is secret, and when it is open it becomes vio- 
lent. Thus government and opposition alike do not 
grant each other the right to play an established 
role peacefully, or even the right to exist at all. 
Political extremism in the Near East is obviously 
not a racial trait but the consequence of certain 
social conditions: poverty and lack of education, 
lack of democracy, and resentment against West- 
ern domination in the past. Such conditions en- 
courage political hyperbole to arouse people while 
enabling leaders also to act without obtaining the 
approval of powerful social classes or organized, 
articulate groups, Resentment against Europe, in 
certain Near Eastern countries, enables those in 
power (and those who aspire to it) to appeal to 
nationalism without having to offer much to sup- 
plement it or to supplant it when its force is spent. 


Recent social change 


Social change in the Near East has been intro- 
duced by a combination of Western and domestic 
influences in which the latter have now become 
more pervasive and more radical. In the nineteenth 
and early twentieth centuries the West induced 
change directly by introducing modern machinery, 
systems of finance, administration of public and 
private enterprise, public health measures, agricul- 
tural techniques, and methods of education. In 
response to Western power, Near Eastern leaders 
acted in various ways. They embraced some of the 
innovations in a wholehearted belief in material 
Progress. While adopting foreign techniques, they 
also tried to fortify themselves against Western 
control by affirming their own values. 

Such change is not necessarily the direct result 
of acculturation, In Turkey, during the 1920s, men 
and groups came to power seeking to impose di- 
rectly upon their people changes far more pro- 
found than any the Westerners had dared to sug- 
gest. Such changes have touched the very core of 
people’s habits of thought—their way of working, 
their religious behavior, relations between the gen- 
erations and the sexes, even their clothing. Change, 
whatever the mixture of foreign and domestic in- 
fluences, is of course uneven throughout the region. 
The main impact has been in the growth of modern 
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industry and agriculture and in the advance of 
secularization. Such changes have gone farthest in 


Israel, Turkey, and Lebanon, have been consider- _ 


able in the United Arab Republic, Syria, Iraq, and 
Iran, have had important effects in the Sudan, 
Afghanistan, and Jordan, and have only recently 
begun in Saudi Arabia and on the rest of the 
peninsula. 

Irrespective of the source of change, foreign or 
local, the state apparatus has given it direction. 
This has been especially true in recent years, when 
native Near Eastern leaders began to assume con- 
trol over the national economies. The first state to 
achieve this position fully was Turkey. In 1923 it 
recovered full sovereignty with the Treaty of Lau- 
sanne and the end of the Capitulations, under 
which Europeans enjoyed extraterritorial rights. It 
was more than a quarter century later that the 
Capitulations were abolished in Egypt. As they 
gained or regained control from the West, Near 
Eastern regimes nationalized property in two ways. 
They encouraged and forced the transfer of private 
capital from foreigners to natives, and they expro- 
priated private capital for the state, usually with 
compensation. In the desire to raise the level of 
living and to increase national power, as well as to 
establish prestige in the international community, 
they adopted several programs of development. 

One was educational. Foreign schools were put 
under national control. All curricula were adjusted 
to the national goals and in some cases to domi- 
nant religious or secular values. Expenditure on 
education was vastly increased, at first dispropor- 
tionately for higher education, but beginning in the 
1950s there was a better appreciation of the im- 
portance of primary and adult education in the 
elimination of illiteracy. 

A second emphasis was upon economic develop- 
ment through planning. This usually took the form 
of the encouragement of industrialization or the 
actual creation of new industries in the effort to 
provide employment for a labor surplus in agricul- 
ture, to reduce the expenditure of foreign exchange 
except for needed capital goods, and to produce 
for export. Turkey, Iran, and Egypt built imposing 
planning apparatuses; the practical results, al- 
though considerable in some respects, have usually 
fallen far short of the plans or hopes. 

A third emphasis in the national resurgence in 
the Near East has been upon military power. 
Virtually all of the states have, with or without 
reason, been apprehensive of their neighbors as 
well as of the great powers in both the Atlantic 
and the Soviet blocs. They sought, therefore, first 


to nationalize their armed forces and then to mod- 
ernize them in equipment and organization. This 
route of modernization, through a military elite or 
out of military considerations, has often been taken 
in the Near East, notably by Turkey in the late 
eighteenth century and by Egypt in the early nine- 
teenth. 

After World War 1 the Near East resorted in- 
creasingly to military regimes. After Atatiirk’s death 
in 1938 civilian regimes gradually assumed power ` 
in Turkey until officers took control again in 1960. 
Riza Shah established military rule in Iran in the 
early 1920s and then established a dynasty, in 
which his heir has ruled with the close collabora- 
tion of the leaders of the armed forces. Iraq became 
independent in 1932; soon afterward, officers be- 
came very powerful, and in 1958 the military took 
full power. Syria, which became independent in 
World War u, was ruled by army officers in 1949. 
Egypt assumed effective independence after World 
War 1 and was taken over by the military in 1952. 
The Sudan became independent in 1955 and three 
years later was under military rule. 

There have been important differences between 
these regimes, chiefly in their types of nationalism, 
attitudes toward private property, conceptions of 
“modernism,” and policies toward the great powers. 
It must be emphasized that although military re- 
gimes have promoted social change, they have not 
been the only innovators. Change has also been 
introduced by parliamentary regimes and even by 
the monarchies that the military elites often revile. 

Two of the most potent types of change in the 
Near East are secular education and the spread of 
modern media of communication and transport. 
Much of this education and communication has 
been sheer indoctrination, yet they have brought 
new ideas and tastes to large numbers of people 
whose only vision, beyond the immediate need to 
sustain life, is a traditionally religious one. These 
developments have already helped to loosen cus- 
tom, stimulate discussion, and call into question 
institutions and authorities whose bases were pre- 
viously ignored or simply revered. 

The new horizons, together with economic 
changes, have shaken traditional family life, Dom- 
ination by the male head, extreme respect for 
age, great parental authority, close linear and col- 
lateral kinship relations—all these remain strong 
in the Near East as compared with the West, but 
they are beginning to weaken, especially in the 
urban areas. One of the most important develop- 
ments is the increase of secular education of girls 
and the employment of women outside the home 
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and field. Although parents may not welcome 
changes in their family relations, they want for their 
daughters a kind of education and employment 
that arouses not only a desire for greater freedom 
and equality but also the expectation—and even 
the experience—of these conditions. 

New economic relations. Economic changes of 
considerable importance began in the early nine- 
teenth century, as the power of the industrial West 
made itself felt in the introduction of new prod- 
ucts, new methods of production in both manufac- 
ture and agriculture, and international trade. Older 
occupations declined or were transformed; newer 
pursuits advanced. New relationships between so- 
cial classes emerged; in this century such changes 
have been even more significant. In Turkey and 
Iran they have included the elimination of the 
ulema, or clergy, from political control. Egypt has 
experienced the virtual demise of the landed class 
holding vast political power. A small number of 
military leaders have come to power in several 
countries. Based not only on the armed forces but 
also on the upper levels of the civil service, they 
have tried to activate the relatively inert rural 
masses and the volatile but inexperienced urban 
masses, Through these changes religion and kin- 
ship have become less important in the determina- 
tion of the individual’s social class, whereas educa- 
tion and occupation, especially in the modern 
sector of urban life, have assumed increasing 
influence. 

Political change. Politically, the most signifi- 
cant change has been the emergence of Near 
Eastern states into independence under the impe- 
tus of a strong nationalism, which has made a 
bid to replace religion as the most effective emo- 
tional bond among people. Through economic de- 
velopment, social legislation, and agrarian reform 
or through sheer nationalist exhortation by means 
of controlled elections, plebiscites, and mass organ- 
izations bearing some resemblance to political 
parties, national leaders have sought the support 
of the urban and rural masses. For this they have 
used all the modern media of communication 
denied, except in a few countries, to those not 
sharing the regime's outlook. 

: Political “development” has been given a special 
interpretation by authoritarian governments. They 
have not attempted to cultivate individual thought 
and to encourage critical evaluation of alternatives. 
Nor have they encouraged the establishment of 
independent groups of varying interests and view- 
points that would contribute to the political edu- 
cation of their members while exercising some 


influence upon government. Rather, political “de- 
velopment” has meant the silencing of “divisive” 
political parties and the harnessing of interest 
groups to a regime's immediate goals or their con- 
finement to nonpolitical functions. 

Although the authoritarian regimes dominating 
most of the Near East have not directly encouraged 
democracy, some of their innovations have pro- 
moted the very conditions that may later produce 
the demand that political power be shared. The 
expansion of education and the effort to overcome 
learning by rote, the greater freedom of women 
and their emergence from seclusion, the introduc- 
tion of higher standards of public administration, 
the growing freedom of the individual within fam- 
ily life and the development of liberal ideas on 
child rearing, and the encouragement of new and 
freer forms of artistic and literary expression will 
all serve this function. Most Near Eastern govern- 
ments coordinate these developments as carefully 
as they can with their own political aims. But the 
attitudes, expectations, and interests created by 
these efforts at modernization can elude control 
and lead to direct expression in political life. 

“Modernization” in the Near East, extensive as 
it is, can nevertheless be easily exaggerated or 
prematurely noted. Much of it is still confined to 
the cities, and everywhere it is still manifested 
more in external, material culture (automobiles, 
roads, and transistor radios) than in the less pene- 
trable and less noticeable individual attitudes, fam- 
ily structure, interpersonal relations, and religious 
conceptions and behavior. Moreover, some changes 
regarded as firm have often been revealed as only 
superficial. Thus the power of kinship and religion 
is reasserted from time to time in political and 
economic life even in Turkey and Egypt, two coun- 
tries that have undergone more fundamental 
change than any others in the region. In Turkey, 
especially, the influence of Islam as a religion 
continues to surprise not only foreign observers 
but the secular, modern rulers as well. The gulf 
thus revealed between elite and masses may be 
even wider in those countries whose modernization 
began later and has proceeded at a slower pace. 

The traditional integration of different groups 
within the Islamic social and religious system has 
been shattered; the groups, themselves changing 
in nature and goals, are reforming. Nationalism 
and political independence give the quick impres- 
sion of a new kind of civic integration, but it is 
very likely that its extent has been exaggerated 
everywhere in the Near East. Social differences 
seem still to be growing; the way to coordinate 


them into a workable, enduring system of relations 
within the framework of the modern state has not 
yet been found. 

MorroE BERGER 


[See also AFRICAN SOCIETY, article on NORTH AFRICA; 
IsLAM.] 
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It 
ISRAEL 


Israel is a new society as well as a new state. Its 
population, more than 2,500,000 in 1965, is over- 
whelmingly composed of immigrants from some 
one hundred countries and their immediate de- 
scendants. The organization of the new society still 
manifests the commitments and systems of action 
of those who founded it. Zionism has included a 
range of ideologies. The new society came into 
existence and took shape as like-minded individuals 
banded into groups, before or after immigration, 
formed organizations and coalitions, and added to 
their numbers and resources through recruitment 
of new members and allies among potential immi- 
grants and supporters abroad. Differences in orien- 
tation among successive waves of immigrants and 
their supporters gave rise to partisan systems of 
action, These contributed in large measure to the 
formation of the nascent society and structured 
much of its prestatehood organization. 

The social system created in Jewish Palestine 
prior to Israel's independence persisted after state- 
hood. It channeled much of the development of 
the new society, particularly during Israel’s first 
postindependence decade. However, the society 
has been transformed by changes effected since 
statehood, and its social system has undergone 
modification. This article traces the distinctive 
development of a new society and a new state that 
were successfully created on the basis of an an- 
cient national identity. 


Origins of the new society 

Three systems of beliefs and values underlie the 
actions which gave rise to Israel. These are Judaism 
as religion and tradition, Zionism, and socialism. 

Judaism maintained the orientation of Jews to 
Palestine as their one-time and future homeland 
through the millennia of dispersion. The Bible and 
the Talmud describe in detail the ancient way of 
life, while religious law preserves and transmits its 
essentials. The yearly cycle of prayers and cere- 
monies constantly anticipates the return to the 
Holy Land or to the holy city of Jerusalem; the 
Talmud states that the duty of Jews to live in the 
land of Israel is greater than the rest of the law; 
throughout the centuries it was considered a holy 
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duty for Jews to contribute to the support of those 
who had succeeded in remaining in or returning 
to Palestine. Small groups from all over the 
Diaspora had so succeeded, and most orthodox 
Jews sought to have buried with them a bit of earth 
from Palestine. 

Traditional Judaism, with its orientation of re- 
turn to Zion, was the force that had motivated the 
settlement in Palestine of the perhaps 24,000 Jews 
living there in the latter part of the nineteenth 
century, on the eve of the emergence of modern 
Zionism as an organized movement. Supported 
largely by the contributions of Diaspora Jewry, they 
were concentrated in the four holy cities of Jerusa- 
lem, Safed, Tiberias, and Hebron and were dedi- 
cated mainly to the study of religious law. They 
represented both the Ashkenazi and Sephardi tra- 
ditions of Judaism, that is, respectively, the tradi- 
tion that had emerged among the Jews of Europe 
and the tradition that centered in Spain prior to 
the expulsion of its Jews and their dispersion 
mainly to Muslim lands. 

These two traditions differ in liturgy, custom, 
ritual surrounding the prescribed forms of wor- 
ship, to some extent in points of observance, and 
in daily behavior around the prescribed observ- 
ances. They have become associated with much 
else in culture and have become an index of differ- 
entiation among Jews in general and in Jewish Pal- 
estine and Israel. In Israel, for example, the Sephar- 
dim and Ashkenazim each have their own chief 
rabbi and their own congregations. Immigrants 
from ancient Jewish communities in the Muslim 
lands also have their own traditions, although they 
are generally classed as Sephardim. While Sephar- 
dim outnumbered Ashkenazim in Palestine prior 
to the Zionist-inspired immigrations beginning in 
the last two decades of the nineteenth century, this 
numerical preponderance was reversed during the 
development of Jewish Palestine prior to the es- 
tablishment of the state. In modern Israel the 
sociological concomitants of the distinction be- 
tween Ashkenazim and Sephardim involve a more 
overriding distinction between European and Ori- 
ental, which is further discussed below. 

Although traditional Judaism was not a major 
factor in the immigration of the predominantly 
European-derived settlers who built up prestate 
Jewish Palestine, known as the Yishuv, much of 
the poststate mass immigration from the non- 
European countries had a strong Messianic ele- 
ment. By many of the immigrants from Yemen, 
Turkey, Libya, the Atlas Mountains of Morocco, 
and Cochin (India), tidings of a new state of 
Israel were received as the long-awaited “redemp- 


tion from exile,” to which of course they had to 
respond. Also, a religious bloc has existed within 
the Zionist movement almost since its inception, 
and some of the Zionist-motivated immigrants have 
been affiliated with it. 

Religious commitment, therefore, has been a 
factor in the creation of the new society, and its 
influence extends to the institutions of the society. 
It is one of the focuses around which political 
parties crystallized in the Yishuv as well as in 
the Zionist movement, and the religious political 
parties have continued under statehood to work 
toward bringing public law and institutions into 
conformity with religious law. The politics of coali- 
tion government operate to their advantage. They 
have been successful in keeping matters of personal 
status, such as marriage and divorce, under the 
jurisdiction of the rabbinical courts. The Chief 
Rabbinate, which includes the Ashkenazi and Se- 
phardi Chief Rabbis and a Supreme Rabbinical 
Council, is supported by the state. State educa- 
tional provisions include a system of religious as 
well as nonreligious schools. Most public transporta- 
tion does not operate on Saturdays or on some Jew- 
ish holidays. Religious girls are exempt from mili- 
tary conscription. Public organizations observe the 
Jewish dietary laws. Religious minorities have their 
rights guaranteed, and matters of their personal 
status are under the jurisdiction of their clergy; 
the Muslim clergy are also paid by the state. 

Thus, while Israel is very far from constituting 
a state in which religious law is the basis of public 
law, which is the goal of the religious political 
parties, it cannot be considered a wholly secular 
state, much to the chagrin of a portion of its pres- 
ent population. Differences in extent of religious 
commitment make for cleavages within the Jewish 
population; they not only separate the most devout 
from confirmed unbelievers, between whom there 
may be no voluntary interaction, but also provide 
the basis for politically divisive issues involving 
the national life. Conflict around such issues prob- 
ably will remain a characteristic of the internal 
politics of Israel for some time to come. 

While the power of Zionism to evoke a response 
from large numbers of people of diverse socioeco- 
nomic backgrounds cannot be understood apart 
from the significance of the “land of Israel” in 
Judaism as religion, tradition, or both, the emer- 
gence of modern Zionism is associated less with a 
religious Jewish identity than with its breakdown. 
The Age of Enlightenment both effected the polit- 
ical emancipation of Jews in the countries of 
western Europe and introduced new currents of 
thought and aspirations into the Jewish communi- 


ties of these countries. Relative secularization and 
assimilation into the cultures of their countries of 
residence occurred among increasing numbers of 
Jews. However, anti-Semitism continued, even 
while secularization and assimilation created the 
new problem of the meaning of a Jewish identity. 

By the middle of the nineteenth century the idea 
of return to Palestine was advanced both as a solu- 
tion to anti-Semitism in Europe and as a way of 
maintaining and normalizing Jewish identity and 
culture, The earlier public proponents were from 
western Europe and Germany. Simultaneously and 
independently, the same ideas began to emerge in 
eastern Europe, where the mass of European Jewry 
was concentrated, still legally disadvantaged and 
largely carrying on the traditional ways of their 
ancestors. In 1881, when a wave of pogroms began 
in Russia, vast groups of Jews began to emigrate 
to western Europe and the Americas; others joined 
revolutionary movements; another response was 
the initiation of organized Jewish immigration to 
Palestine. 

Between 1882 and 1903 from 20,000 to 30,000 
immigrants, mostly from Russia and Rumania, 
some of them former students, some families with 
moderate means, arrived in Palestine to double its 
Jewish population. Purchasing land, they set up 
more than twenty agricultural settlements, most of 
which are now substantial towns. Their difficulties 
in making a success of farming under the arduous 
local conditions were somewhat ameliorated by aid 
extended by Baron Edmund de Rothschild. Almost 
from its inception, the new society enlisted the 
involvement of many people who were sympathetic 
to its goals but who were not committed to joining 
it. This involvement, which can be seen as a re- 
patterning of traditional orientations in regard both 
to Palestine as homeland and to community re- 
sponsibility, has been a major factor in financing 
the growth of the new society and its development 
into a modern nation. 

Both the establishment of the World Zionist Or- 
ganization in 1897 and the immigration of labor- 
oriented young people, mostly from eastern Europe, 
that started in 1904 gave rise, in different ways, 
to the institutions through which the society and 
the new nation were formed. The World Zionist 
Organization brought together religious, reform, 
and secular Jews from most of the European coun- 
tries and the Ottoman Empire in a wide-flung po- 
litical movement dedicated to the creation of an 
independent Jewish nation in Palestine. The young 
immigrants who arrived between 1904 and World 
War 1, known as the second aliyah, or wave of 
immigration, fused socialist ideals and Zionist as- 
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pirations into an ideology stressing redemption of 
the land, most of it submarginal, through their 
own “nonexploitative” physical labor and their 
stoical endurance and transcendence of the hard- 
ships involved. 

Linking its dispersed membership with settlers 
in Palestine, the World Zionist Organization cre- 
ated agencies to acquire and administer the re- 
sources necessary for development of the new 
society. One of the earliest agencies was the Jewish 
National Fund. It was established in 1901 to pur- 
chase land in Palestine with the contributions of 
sympathizers, retain title to it in the name of the 
Jewish people, and lease it to those who would oc- 
cupy it personally and work it themselves. The 
principle of national ownership of the land of 
Palestine remained in effect after the founding of 
the state, which now has title to almost 95 per cent 
of Israel’s territory. Settlers could lease land at 
almost no cost, and its rehabilitation has been 
financed by the fund and now by the state. There- 
fore, the unoccupied land resources of Israel have 
remained freely available to all immigrants, sub- 
ject to priorities of national development and the 
principles of settlement stressing cooperative en- 
deavor that were introduced by the labor-oriented 
settlers. 

Other organizations, principally the Jewish 
Agency for Palestine, were created later in the his- 
tory of the Zionist movement in conjunction with 
the development of the Yishuy and other changes 
relevant to it, particularly the shift from Turkish 
to British mandatory government over Palestine. 
These organizations served as instruments of volun- 
tary self-government, linked to, yet independent of, 
the British administration and responsible to the 
constituency of the World Zionist Organization, in- 
cluding members of the Yishuy. 

While the governmental apparatus of Jewish 
Palestine, a good part of which was transferred into 
that of the state, developed largely during the man- 
date period, the internal politics of the Zionist 
movement, expressing ideological and cultural dif- 
ferences among its members, were factors shaping 
the nascent society almost from its inception. Jews 
of eastern European origin dominated the move- 
ment and became dominant in the Yishuv, suffus- 
ing both with a style of action characteristic of 
their background. This style involved concern with 
principle as a basis for or rationalization of action, 
the formulation of principles into systems or ideol- 
ogies, and intensity of commitment in the defense 
of ideology and ideologically based action, For in 
the beleaguered Jewish communities of eastern 
Europe (see Zborowski & Herzog 1952), intensity 
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of emotion was both generated and culturally sanc- 
tioned in many areas of life, conspicuously in the 
life of the intellect, and passion was legitimately 
associated with differences of opinion and value. 
A recurrent feature of the internal politics of Zion- 
ism was the development of factions over a variety 
of principles and issues, the crystallization of such 
factions into political parties or movements within 
the Zionist Organization or even, in certain cases, 
outside the organization, and orientation and com- 
mitment to ideology and party almost equal to or 
even superseding orientation and commitment to 
the common goals. 

The issue of the secular versus religious charac- 
ter of the institutions being created in Palestine, 
such as the schools, was an early issue around 
which factions crystallized and has remained an 
enduring one. Other issues followed: some of these 
involved economic policies; a later major issue was 
the degree of militancy of the movement in defend- 
ing the Yishuv against attack. These remained 
fundamental areas of controversy well into the 
years after the establishment of the state. The 
predominance of the Labor party in Israel expresses 
the dominant role in building the new society of 
immigrants committed to an ideological synthesis 
of socialism and Zionism, beginning with the sec- 
ond aliyah. 

The young immigrants of the second aliyah de- 
veloped their specific ideology in rejecting other 
alternatives: conformity with the traditional life 
in which they had been raised, assimilation, revo- 
lutionary activity within Russia, or immigration to 
elsewhere than Palestine—all of which had been 
chosen by far greater numbers of peers. They made 
their way to Palestine without help; and, motivated 
by both ideology and subsistence needs, they turned 
to types of work requiring hitherto alien manual 
skills, such as agriculture and the building trades. 
Both necessity and ideology led them to pool re- 
sources and develop cooperative organizations, in- 
cluding kitchens and laundries in the established 
villages in which they worked as hired laborers, 
employment offices, mutual aid funds, committees 
to receive similarly committed new immigrants 
and tide them over until they could find work, a 
consumers’ cooperative, and sick funds. As land 
was made available to them by the Jewish National 
Fund, various groups began to experiment with 
forms of social organization through which agri- 
cultural settlement could be effected under very 
difficult conditions (see Bein 1952). 

The wholly communal settlement, widely known 
today as the kibbutz, was the first new community 


type to emerge (see Baratz 1954). Other, abortive 
community types for agricultural settlement also 
were tried out; and the next successful one to 
develop, after World War 1, was the moshav ovdim, 
the cooperative smallholders’ settlement. While in 
the kibbutz all property is owned in common, food 
is prepared in a community kitchen and eaten in 
a community dining hall, and children are reared 
together in a series of age-graded children’s houses, 
the moshav is organized around family farms and 
cooperative village economics (see Dayan 1961). 

Of the perhaps forty thousand immigrants who 
constituted the second aliyah, some became urban 
workers rather than farmers; but they shared a 
common commitment to the ideology of labor, to 
cooperative endeavor, to Hebrew as the language of 
Jewish Palestine, and to self-abnegating fortitude 
in implementing their values, The norms and atti- 
tudes they developed and the activities in which 
they engaged coalesced into the role of the pioneer. 
This role became a model for successive waves 
of labor—Zionist-oriented immigrants after World 
War I, a standard of value and emulation for most 
members of the Yishuv, including many not en- 
gaged in agricultural settlement, manual labor, or 
other “pioneering” activities; and it still is a potent 
symbol in the Israel of today. Moreover, for more 
than a decade after the state was established, sur- 
viving members of the second aliyah played a sig- 
nificant role in the building of the nation. National 
leadership was drawn from their ranks, including 
David Ben-Gurion (who was the country's prime 
minister almost continually until 1963), the coun- 
try’s second and third presidents, the first speaker 
of the Israeli parliament (the Knesset), and many 
members of the parliament. 

As for the cooperative institutions created by the 
second aliyah, the agricultural settlements they 
initiated, particularly the kibbutz, multiplied and 
became the bastion of Jewish Palestine in many 
activities central to its existence during the period 
it was under attack, before and after World War 1. 
Together with the other cooperative endeavors, the 
labor—Zionist settlements were the nucleus of the 
Histadrut, the General Federation of Labor estab- 
lished in 1920, which came to dominate the econ- 
omy of Jewish Palestine and whose power has been 
second only to that of the national government 
since the establishment of the state. 


The prestate social system 


‘The immigrants and their background. Only 2 
minority of the world’s Jews became Zionists, at 
least until the establishment of the state, and a 


much smaller minority went to Palestine. During 
the period after World War 1, the majority of the 
immigrants were from eastern Europe, and they 
were motivated by essentially the same orientations 
and commitments that had resulted in the preced- 
ing two waves of immigration. On the one hand, 
neither the Russian Revolution nor the breakup of 
the Austro-Hungarian Empire into independent 
countries had resulted in a diminution of anti- 
Semitism in eastern Europe; if anything, its mani- 
festations in some of the newly independent coun- 
tries increased, abetted by the open support given 
it by their governments. On the other hand, neither 
traditional Judaism, local revolutionary activity, 
nor migration to elsewhere than Palestine offered 
a solution to the immigrants’ quest for a Jewish 
identity not bound to religious observance. 

During World War 1 the Jewish population of 
Palestine was reduced to some 56,000 through de- 
portations and depredations by the Turkish govern- 
ment. This population was augmented by about 
25,000 to 35,000 immigrants from eastern Europe, 
primarily from Russia, between 1918 and 1923, 
when the Soviet Union ended emigration of its 
Jews. This immigration, known as the third aliyah, 
was composed of labor—Zionist-oriented young 
people with essentially the same goals as their pred- 
ecessors of the second aliyah. Poland replaced the 
Soviet Union as the principal source of the some 
82,000 immigrants who arrived between 1924 and 
1931. Known as the fourth aliyah, these immi- 
grants included labor—Zionist-oriented young peo- 
ple, but the majority had a middle-class orienta- 
tion. While some members of the latter segment 
of immigrants founded non-labor—Zionist villages, 
most of them settled in the growing cities and 
engaged in small-scale commerce, industry, and 
crafts. Some 9,200 immigrants also arrived from 
Middle Eastern countries, and about 2,000 from 
the Americas; about 23,000 Jews emigrated 
from Palestine during a period of economic de- 
pression in the 1920s. 

Between 1932 and 1938 more than 217,000 
newcomers, known as the fifth aliyah, arrived in 
Palestine. The majority were from eastern Europe; 
but about 40,000 were from Germany and Austria, 
driven from these countries after Hitler’s rise to 
power, and 11,000 were from Rumania, including 
areas of it that had been in the orbit of Austrian 
culture, Some 7,000 arrived from Yemen and Aden; 
and noteworthy numbers of immigrants came from 
Greece, Czechoslovakia, Hungary, and the Amer- 
icas. Although the British mandate government at- 
tempted to limit Jewish immigration from 1929 
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onward, its policies exempted certain special cate- 
gories of immigrants, such as those bringing capital 
to Palestine. 

The immigrants from central Europe included 
people with capital to invest and with experience 
in finance and administration, scholars and scien- 
tists, and highly trained professionals, such as 
physicians, lawyers, and engineers. They promoted 
the development of industry and commerce on a 
much larger scale than previously had existed; they 
contributed to the arts, sciences and human- 
ities; and they added to the reservoir of skills nec- 
essary for the development of a modern society. 
Between 1933 and 1939, 2,700 industrial enter- 
prises were created by the new immigrants, giving 
employment to some 22,500, as compared to 1,625 
enterprises, with less than 14,000 employees, de- 
veloped up to 1932 (see Zweig 1959), However, 
many highly trained newcomers could not find 
professional employment in the economy of Jewish 
Palestine, which still was geared to the primary 
tasks of agricultural settlement and the creation of 
sources of urban livelihood. During the 1930s 
many immigrants with advanced degrees worked 
as manual laborers in the pioneer tradition. 

From 1939 to 1948 some 76,000 immigrants 
from Europe, mostly from central and eastern 
Europe, entered the country legally, and about 
58,000 were brought in despite restrictive measures 
of the mandatory government, A sizable proportion 
of newcomers after 1945 were young survivors of 
the Nazi extermination policy. Some 17,000 immi- 
grants from non-European lands also arrived dur- 
ing the years of World War 11. 

Country of origin was one index of identity and 
differentiation among members of the nascent so- 
ciety, and it was associated in popular stereotypy 
with differences in behavior. Thus, the manners of 
the German immigrants generally were stiff and 
formal in comparison to the relative absence of 
social and personal distance in the style of inter- 
action prevalent among Jewry of eastern Europe, 
and this gave rise to many jokes at the expense 
of the former. Country, region, or city of origin was 
the basis for the formation of traditional commu- 
nity organizations and religious congregations, 
while immigrants from the same country or cul- 
ture area also created secular associations. How- 
ever, ethnic identity associated with geographical 
origin and preimmigration way of life did not con- 
stitute the principal basis of grouping and differ- 
entiation among the majority of the Jewish 
population of Palestine. Of the voluntary organiza- 
tions which structured the emerging society, those 
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most significant for its development were created 
in the framework of the Zionist movement and its 
politics. 

Institutions and systems of relations. The politics 
of Zionism offers the most comprehensive frame- 
work for describing the institutions, groupings, and 
systems of relations that developed in Jewish Pales- 
tine, This is so because partisan systems of action 
and the ideologies on which they were based largely 
effected the Jewish settlement and development of 
Palestine. The political parties of Zionism expressed 
the spectrum of orientations and commitments 
which motivated the immigration and nation- 
building activities of most members of the nascent 
society, and new factions formed as changing con- 
ditions brought new issues to the fore. The factions 
created organizations to serve the needs of their 
members; they competed within the Zionist move- 
ment for the allocation of resources; and they de- 
veloped organizations of potential immigrants and 
sympathizers abroad. Newcomers and veteran in- 
habitants with different orientations from those of 
the major partisan factions created their own 
organizations, which acted as political pressure 
groups; and some of the associations of people 
from the same country of origin or ethnic tradition 
organized as political parties in the Yishuv. 

The adherents of labor—Zionism established a 
ramifying network of institutions through which 
much of the development of Jewish Palestine was 
accomplished, The principal agencies of this devel- 
opment were the most distinctive creations of the 
nascent society: the communal and cooperative 
rural communities through which much of land 
settlement was effected; and the Histadrut, which 
organized a comprehensive economy of interde- 
pendent cooperative enterprises in which both rural 
settlements and urban workers participated. 

However, the adherents of labor—Zionism by no 
means constituted an ideologically or politically 
united body. Two factions, based on different ideo- 
logical syntheses of socialism and Zionism, existed 
as early as 1906, from almost the beginning of the 
second aliyah. They joined forces in founding the 
Histadrut in 1920 and in expanding the scope of 
its activities and power, while competing politically 
within it. They remained in existence as political 
factions, with different views on a range of policies, 
and the crystallization of other ideological synthe- 
ses of socialism and Zionism contributed to the 
formation of additional labor—Zionist factions. 

The creation of new farming villages was a para- 
mount goal of all the labor—Zionist factions; and, 
during the 1920s, three federations of communal 
settlements came into being. Each of them was 


linked to Zionist youth movements in the Diaspora 
which recruited young people for immigration and 
trained them as agricultural pioneers; and each of 
the kibbutz federations was part of a larger politi- 
cal faction within the Yishuv. Despite various re- 
organizations and realignments, the three kibbutz 
federations still are in existence, and two of them 
constitute the enduring nuclei of the factions whose 
mergers and splits underlie the formation of the 
secular labor—Zionist political parties of Israel 

For while members of all the secular labor— 
Zionist factions supported Mapai, the Labor party 
created in 1930, this alliance did not endure for 
much more than a decade. It enabled the Labor 
party to win control of the most politically signifi- 
cant offices within the Zionist movement. However, 
secessions from Mapai and realignments resulted 
in the existence of two labor—Zionist political par- 
ties by 1944 and three by 1954. Mapai has re- 
mained the largest and most powerful of these. To 
its left is Achdut ha-Avodah. Its origins go back 
to the more revolutionary of the two original labor- 
Zionist factions; and its base is a kibbutz federation 
which initially differed from the others over issues 
of village organization as well as over more general 
aspects of Zionist ideology and policy. To the left 
of Achdut ha-Avodah is Mapam, which has a revo- 
lutionary socialist ideology. Its base is a Marxist- 
oriented kibbutz movement, which developed from 
a youth movement founded in Poland in 1917. 

Although the total population of the kibbutzim 
never reached even 10 per cent of the population 
of Jewish Palestine, and the percentage has been 
far lower since Israel’s independence, the com- 
munal settlements played a major role in the for- 
mation and organization of the nascent society. 
They constituted the majority of new villages which 
expanded the rural base of the Yishuv and popu- 
lated and restored to fertility previously barren 
land. Moreover, they served as way station and 
introduction into the society for many more people 
than those who became permanent kibbutz mem- 
bers. Among immigrants recruited for settlement 
in the kibbutzim, those who became disaffected 
with communal life moved into other sectors of the 
society, while usually retaining allegiance to labor- 
Zionist values and goals, Indeed, the first moshav 
ovdim, the cooperative type of village embodying 
labor—Zionist principles, was founded by such dis- 
affected former kibbutz members, and the kib- 
butzim continued as a source of settlers for the 
moshavim, whose federations did not recruit abroad 
until after statehood. From the 1930s on, the 
kibbutzim were in the fore in providing homes for 
groups of youngsters rescued from Nazi Europe, 


and kibbutz organization facilitated their absorp- 
tion in communal units within the collective settle- 
ments. Moreover, refugees who had evaded the 
mandatory government's restrictions on immigra- 
tion to Palestine could disappear into the kibbutzim 
and go elsewhere, if they so desired, when condi- 
tions permitted. 

Land settlement was not limited to adherents of 
secular labor—Zionist ideology or to the founding 
of kibbutzim, but these furnished the point of ref- 
erence for other developments. Thus, founders of 
the mosh’vei ovdim formulated their type of co- 
operative community partly in reaction against 
collective life. As the number of such settlements 
increased, they too banded into a federation, affili- 
ated with the Histadrut and with the Labor party, 
and competed with the kibbutz federations for the 
land funds for settlement channeled through agen- 
cies of the World Zionist Organization. 

Moreover, segments of the Zionist movement 
committed to orientations other than secular labor- 
Zionism had to come to terms with the latter’s 
appeal and goals in order to maintain their own 
position. The religious party in the Zionist move- 
ment, Mizrachi, developed a labor wing, ha-Poel 
ha-Mizrachi, in 1922. While retaining links with 
Mizrachi, ha-Poel ha-Mizrachi became an independ- 
ent political party and established its own settle- 
ment movements, first a kibbutz federation and 
then a moshav federation, and its own labor or- 
ganization. Even Agudat Israel, an organization of 
ultraorthodox Jews opposed to political Zionism, 
decided in the 1920s to support immigration to 
Palestine on a purely religious basis. It, too, founded 
a labor wing, Poalei Agudat Israel, which engaged 
in land settlement, founded a labor organization, 
and became a separate political party. 

Innovations in land settlement initiated by some 
of the immigrants from Germany followed the by 
then established pattern of action. While many of 
these immigrants who went on the land joined or 
founded villages within the existing spectrum 
of federations, those with a liberal rather than 
labor orientation created their own village federa- 
tion. Other immigrants with capital created a part- 
ly cooperative type of village. These settlements 
and the private farmers formed their own organi- 
zational networks. However, the organizations and 
associations established by settlers with a liberal or 
capitalist orientation were not established primar- 
ily to advance or defend an ideology but rather to 
provide their members with some of the services 
which the Histadrut offered its members. 

The Histadrut’s development involved the same 
processes which characterized land settlement in 
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the nascent society and which also can be noted in 
the development of the Zionist movement: the evo- 
lution of the innovations initiated by relatively 
small groups into networks of interrelated organi- 
zations and systems of action which, under radically 
changed conditions, have persisted as systems. The 
nucleus of cooperative organizations brought to- 
gether when the Histadrut was founded grew into 
the federations of kibbutzim and moshavim already 
noted; an agricultural marketing cooperative, 
which purchases all their agricultural products for 
resale and which developed industrial enterprises 
for processing many of them; a wholesale society, 
from which the villages buy their supplies, which 
engages in retail trade throughout the country, and 
which developed its own enterprises for manufac- 
turing many of the goods it sells; dozens of indus- 
trial and craft unions organized into a central fed- 
eration of unions; transport cooperatives, which 
dominate public transportation by road in Israel; 
housing societies; financial organizations, includ- 
ing one of Israel’s major banks, its largest insur- 
ance company, and provident and pension funds; 
a water company; industrial corporations, includ- 
ing much of Israel’s heavy industry; and service 
organizations. 

These service organizations include restaurants, 
hostels, hotels, and a sick fund, which provides 
all Histadrut members with full medical coverage. 
Prior to the creation of the state school system in 
1953, the Histadrut maintained a network of edu- 
cational institutions whose curricula stressed labor— 
Zionist values. It still maintains a youth movement, 
which is the largest in Israel, and a sports organi- 
zation; it offers vocational and evening classes and 
sponsors cultural activities. As early as 1925, the 
Histadrut published its own daily newspaper, and 
it established a publishing house in 1942, By the 
close of the mandate period, the Histadrut linked 
perhaps 40 per cent of the Jewish population of 
Palestine and possibly 75 per cent of its wage and 
salaried workers. 

As the Histadrut expanded, adherents of ideolo- 
gies other than secular labor-Zionism developed 
their own competitive organizations in the spheres 
of activity relevant to their interests. The two reli- 
gious labor federations have focused primarily on 
agricultural settlement and education. Thus, their 
members have been represented in the Histadrut’s 
trade union section and have been insured with its 
sick fund. Their agricultural villages market their 
produce through the Histadrut’s marketing society 
and purchase supplies from its wholesale society. 
In contrast, the settlers affiliated to the Zionist fac- 
tion with a liberal orientation set up their own 
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marketing cooperative and central purchasing or- 
ganization and have their own agricultural credit 
associations. Their members belong to an inde- 
pendent sick fund, which has urban subscribers. 

Both the labor federation of ha-Poel ha-Mizrachi 
and the nonlabor factions also created their own 
housing societies to serve their members in the 
towns. All the Zionist factions organized youth 
movements and sports organizations and have pub- 
lished daily newspapers. School systems maintained 
by the Zionist religious faction and by the General 
Zionists antedated the formation of the Histadrut- 
supported school system, while the ultraorthodox 
faction has had its own schools. 

The General Zionists emerged as an amalgama- 
tion of members of the Zionist movement who 
lacked a concomitant labor or religious commitment 
and were not involved in the factions created by 
partisans of these commitments. Although the Gen- 
eral Zionists constituted the majority in the Zionist 
movement on the international level, they did not 
organize as a party until after the formation of 
Mapai. Created to promote middle-class interests 
in the development of Palestine, the party had two 
factions: one oriented to the Histadrut, the other 
opposed to it and committed to private enterprise. 
In 1948 the more liberal faction, which included 
the liberal land-settlement movement, seceded from 
the General Zionist party and united with a Yishuv 
political party representing immigrants from cen- 
tral Europe to form the liberal Progressive party. 

Of the major Zionist factions, the Revisionist 
movement was the most dissociated from the 
others. It was organized in 1925 to give organiza- 
tional expression to the belief that Zionism had to 
be a militant political movement committed to the 
rapid creation of a Jewish state on both sides of 
the Jordan River. The Revisionists also favored 
mass immigration, largely middle-class, and the 
investment of private capital to create the basis for 
such a state. After the Nazi rise to power, the Re- 
visionists rejected the official Zionist policy of self- 
restraint in dealing with Arab attacks and restrictive 
mandatory government policy in regard to Zionist 
goals and Jewish immigration. The network of or- 
ganizations created by the Revisionist movement 
included its own labor federation, founded in 1934, 
a health-insurance fund, a housing organization, 
mutual-aid funds, an organization for working 
youth, and a youth movement. Between 1935 and 
World War u, the Revisionists seceded from the 
World Zionist Organization to form their own such 
organization, A Revisionist faction, Irgun Zvai 
Leumi, also organized an underground military or- 


ganization that was independent of the Haganah, 
the illegal defense army that had developed under 
Jewish Agency control. The Irgun armed forces 
were integrated into the Israeli army only after the 
founding of Israel as a state. 

During most of the mandate period, the Jewish 
Agency was the quasi-governmental organization 
linking the Yishuv to the larger Zionist movement, 
The mandatory government controlled the law- 
making and law-enforcement agencies of Palestine 
and administered the country as a whole; but it 
accepted the Jewish Agency (see Halpern 1961) 
as the official body for handling Jewish immigra- 
tion and the development of Palestine as a Jewish 
homeland, although these objectives were progres- 
sively blocked by changes in mandatory policy. The 
Jewish Agency had a political department, which 


was in effect a foreign ministry and became this 
when the state was established, departments of 
labor, and trade and industry, which also became 
government ministries, and departments of immi- 


gration, absorption, land settlement, youth and 
pioneering, education and culture, and those con- 
cerned with its own administration. The strength 
of the Zionist parties in elections to the World 
Zionist Organization determined the allocation of 
portfolios in the Jewish Agency Executive. 

The Elected Assembly was the self-governing 
body of the Yishuv recognized by the mandatory 
government as the official representative of Pales- 
tine’s Jewish population. Its executive body, the 
National Council, levied taxes, maintained a social- 
welfare program, and in the 1930s assumed respon- 
sibility from the Jewish Agency for administering 
the three Zionist-supported Jewish school systems 
in the country. More than twenty political parties 
generally competed in elections to the Assembly. 
In addition to the Zionist factions, these included 
minority parties constituted on an ethnic basis, 
such as a Sephardi and Yemenite party. 

By the end of the 1930s, a stable social organiza- 
tion had developed in the Yishuv, and its groupings 
and systems of relations expressed both the orien- 
tations that had made for immigration to Palestine 
and the background of those who had immigrated. 
The majority of the descendants of the pre-Zionist 
Ashkenazi and Sephardi populations seem to have 
been relatively uninvolved in the Zionist systems of 
action, remaining oriented to religious observance 
and traditional community groupings. Of the Zion- 
ist immigrants and their offspring, the adherents 
of labor—Zionism by then had given to Jewish Pal- 
estine its dominant orientation and organizational 
networks. Adherents of other ideologies, to the 


extent that their numbers and resources permitted 
or interests seemed to demand, set up equivalent 
kinds of organizations to provide parallel services, 
to uphold their adherents’ interests both within the 
country and in the Zionist movement, and to draw 
support from Diaspora Jewry. 

The long history in eastern Europe of voluntary 
Jewish community organization within a hostile 
society gives perspective to the eastern European 
immigrants’ readiness to organize, their skill at 
doing so, and the vitality and democracy of their 
organizations. The scope and intensity of factional 
action can also be related to the background of the 
majority of these immigrants, For both orthodox 
Judaism and the revolutionary movements of east- 
ern Europe represent, in very different ways, total 
systems. Judaism provides regulations for prac- 
tically every detail of life and a dialectic that brings 
these regulations within the sphere of systematic 
debate and analysis. The revolutionary movements, 
which were a point of reference for the labor- 
Zionists who rebelled against religious orthodoxy, 
left no area of behavior immune from discussion, 
reformulation, and incorporation within the scope 
of ideology. Both the divergent labor—Zionist ideolo- 
gies and the organizations created to implement 
them had the ramifying character of total systems, 
within which internal debate and dispute could be 
constant but to which primary commitment was 
expected. The comprehensiveness of party activity 
and the acrimony of factional dissent and split can 
be seen as expressions of cultural continuity with 
eastern Europe. 

Continuity with preimmigration values also con- 
tributed to the liberal political orientation of many 
of the immigrants from Germany. Moreover, many 
of the immigrants from the Middle Eastern lands 
had little involvement in the organized systems 
of action of the Zionist factions; and ethnic iden- 
tity was a basis of grouping in both traditional 
community organizations and in associations which 
acted politically. However, the constituency of the 
Zionist political factions and of the ultraorthodox 
Agudat Israel cut across ethnic origin and tradition 
of worship. Cultural background was subordinate 
to other bases of orientation and differentiation in 
the social organization of Jewish Palestine. 

The transition to statehood. As we have seen, 
the prestate systems of groupings and relations 
involved both the institutionalization of innova- 
tions initiated in the development of Jewish Pales- 
tine and the repatterning, in a new setting, of 
elements of behavior expressing the preimmigration 
orientations of members of the nascent society. 
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These systems of grouping and relationship were 
maintained without major alteration in the tran- 
sition to nationhood. Government ministries were 
formed from departments of the National Council 
administration, the mandatory administration, and 
the Jewish Agency, although the last has re- 
mained in existence and has continued to exercise 
much-augmented functions in such spheres as im- 
migration, absorption of new immigrants, and land 
settlement. The body of law in force during the 
mandate period remained the basis of the law of 
Israel, with the exception of restrictions on Jewish 
immigration and land purchase. 

Crucially, Zionist traditions were followed in 
provisions for the nation’s government. A unicam- 
eral legislature, the Knesset, was established, 
elected by universal adult suffrage on the basis of 
proportional representation, with the country con- 
stituting a single constituency. The voters cast 
ballots for party lists rather than for individual 
candidates, and the proportion of votes given a 
party determines the number of its victorious can- 
didates, counting from the head of the list. The 
president of Israel, elected by majority vote of the 
Knesset, exercises his principal nonsymbolic func- 
tion in asking a Knesset leader to form a cabinet 
and government. In practice, Mapai has been the 
party with the largest number of votes since the 
establishment of the state and, lacking an absolute 
majority, has governed in coalition with varying 
combinations of the other political parties, exclud- 
ing Herut, the successor to the Revisionist move- 
ment. On the level of local government, elections 
also are based on proportional representation ac- 
cording to party lists. The politics of Israel were 
cast in the tradition of the factional politics of 
Zionism. 

While the transition to nationhood was being 
effected, a mass immigration was underway. Israel’s 
population, some 650,000 in May 1948, when the 
state was founded, was more than doubled by the 
end of the year. By 1952, when a temporary slack- 
ening of immigration occurred, the population had 
reached approximately 1,629,500; by 1956 it had 
climbed to 1,872,400; in 1958 it passed the 2 mil- 
lion mark. The new wave of immigrants from 
countries of European culture consisted of the sur- 
vivors of Nazism from eastern Europe, who consti- 
tuted the majority, including almost the entire 
Jewish population of Bulgaria, large segments of 
Yugoslavian Jewry, and smaller numbers from the 
countries of western Europe and the Americas. 
However, the majority of immigrants during the 
1950s were from non-European countries, ranging 
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from Morocco to India; and this flow continues to 
the present, its extent and source partly varying, 
as with immigration from Europe, with events on 
the international scene and their implications for 
Jews of the countries involved. 

Provisions for accommodating this influx, inte- 
grating the immigrants into the nation, and, con- 
comitantly, promoting the country’s development 
have been the focus of organized action in Israel 
since the founding of the state, with only provi- 
sions for national security taking greater priority. 
However, while activities related to security were 
removed from the arena of factional politics as part 
of the initial establishment of the machinery of 
government, this was not the case in regard to im- 
migrant absorption and the various spheres of 
national development. Indeed, much of the socio- 
economic organization of the new nation was con- 
tained within the politically affiliated, ramifying 
networks of institutions developed during the pre- 
State decades; and it was largely through these 
networks that the resources of the nation, substan- 
tially augmented by sympathizers’ contributions 
and other sources of aid, were mobilized and chan- 
neled for development of the new society. There- 
fore, the prestate systems of relations and action 
not only represented the means through which the 
Zionist goal was achieved, but also constituted the 
organizational basis of the new society existing 
after the establishment of the state, 


Development of the new society 


Absorption of immigrants. The mass immigra- 
tion that accompanied Israel’s achievement of in- 
dependence brought cultural differences to the fore 
as a basis of cleavage and grouping within the 
society being constituted. These differences can be 
referred to two overlapping sets of indexes for dis- 
tinguishing the majority of the poststate immi- 
grants from the majority of their predecessors. One 
set applies to the factors motivating immigration 
and the orientations involved, including commit- 
ment to the ideologies expressed in the various 
factional systems of action and relations through 
which the Yishuy had been developed. The other 
set of indexes, which is by far the more consequen- 
tial for behavior and stereotypy in the new society, 
represents cultural background in the broadest 
sense, including the ways of life the immigrants 
had previously followed and the possibilities known 
to them to which values were assigned. 

A considerable Proportion of the poststate immi- 
grants from Europe differed from their ideologically 
committed predecessors in that they looked to 
Israel as the homeland in which to reconstitute 


their lives and had little zeal for reconstituting 
themselves for the mission of realizing an ideology, 
To the majority of newcomers from non-European 
countries, immigration meant their return as 
Jews to the ancient homeland, diffuse expectations 
of a better life among their brethren, and, for some 
from the Arab lands, escape from traditional in- 
ferior status under Islam or from increased hostil- 
ity bred of nationalism and enmity toward Israel. 
The preimmigration universe of many of these 
non-European immigrants had no point of contact 
with that of the European-derived population, 
apart from common identity as Jews and whatever 
knowledge of, and commitment to, religious tradi- 
tion they shared. 

Furthermore, provisions for receiving the mass 
influx, especially between 1949 and 1953, de- 
manded of most immigrants fortitude in the face 
of hardship and willingness to accept and build on 
whatever opportunities were offered, as in the pre- 
state pioneering tradition. However, these provi- 
sions differed from conditions obtaining during the 
mandate period, in that the mass influx swamped 
both organizational channels for the creation of 
primary ties between established settlers and new 
arrivals and the informal spirit of comradeship 
and helpfulness that previously had bridged casual 
contacts between strangers, particularly those mani- 
festing the egalitarian influence of pioneer values. 
The country’s resources, depleted by war and the 
continuing demands of security, were initially 
insufficient, despite aid from abroad, to offer 
newcomers more than minimal conditions for main- 
tenance and essential services. Quartered indis- 
criminately together in scores of temporary camps, 
the majority of new immigrant adults had few 
outside contacts, and for months or in some cases 
even years, they were the subordinate clients of a 
harassed representative of one of the many agen- 
cies on which they were dependent. 

The common goal of these agencies, whatever 
their specific function, was to procure the immi- 
grants’ acceptance of the roles and underlying 
ethos of the settlers of the Yishuv, particularly 
those associated with the model of the pioneer, and 
to effect the immigrants’ adaptation to behavior 
and customary relations associated with these roles. 
This goal was achieved among only a limited mi- 
nority of the newcomers. Other responses to con- 
ditions of reception ranged from anomie to prag- 
matic adaptation to, and exploitation of, the 
possibilities increasingly available as Israel’s devel- 
opment progressed. Many variables entered into 
the differential responses of the newcomers to 
largely similar conditions of reception and provi- 


sions for absorption during 1949 and the early 
1950s (see Eisenstadt 1954; Shuval 1963); one 
such variable was the maintenance of a solidary 
immigrant primary group above the level of the 
nuclear family. However, cultural distance from 
the majority of the established population, focused 
in terms of European versus Oriental provenience, 
has been the most enduring basis of differentiation 
in the new society effected by the mass immi- 
gration. 

The immigrants from the European countries 
shared a common cultural baseline with their ideo- 
logically committed predecessors, and differences 
in goal and orientation could be reciprocally com- 
municated and partially accommodated within the 
universe of shared understandings. Integration of 
such newcomers into the established population 
generally involved little more than their adaptation 
to the kinds of work available and either their ac- 
ceptance of relocation in developing areas of the 
country, with a period of hardship until permanent 
housing and other services could be established, or 
their resistance to such adaptation and relocation 
until more congenial possibilities emerged in the 
course of the country’s development. Indeed, not a 
few such immigrants, after brief specialized train- 
ing courses and the acquisition of Hebrew if neces- 
sary, found employment in the burgeoning govern- 
mental or quasi-governmental bureaucracies; their 
affiliations with particular political parties were not 
always irrelevant to such employment, especially 
in the early years of statehood prior to the estab- 
lishment of the government civil service. Such 
employees, in turn, became representatives of the 
established order and its values in relation to less 
easily integrated members of the mass influx, par- 
ticularly those from non-European lands. 

The background of these non-European immi- 
grants included both the distinctive traditions and 
ways of life of ancient preindustrial Jewish com- 
munities practically untouched by European influ- 
ence (such as those from Yemen, Kurdistan, the 
island of Djerba off southern Tunisia, the Atlas 
Mountains of Morocco, and other rural areas of 
French north Africa and Libya) and the spectrum 
of cultural influences experienced by the previously 
urban populations whose provenience extended 
from Morocco to India (see Patai 1953). In some 
of these countries Jews had experienced an expan- 
sion of economic, vocational, and educational op- 
portunities and aspirations as a result of European 
penetration. Westernized businessmen and profes- 
sionals could and did emerge within two genera- 
tions or less from families of semiliterate traders 
and craftsmen previously settled in the crowded 
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Jewish quarters of Eastern cities, and most of the 
Islamic countries included in their Jewish popula- 
tion a traditional wealthy and educated elite. 

However, the majority of migrants to Israel have 
not been members of either the traditional or the 
more recent elites in their countries of origin, and 
many of those exposed to European influences were 
more familiar with new consumption goods and 
aspirations than with skills directly transferable to 
the Israeli context. Moreover, those with a few 
years of European-style schooling and at some 
stage in the process of transition from a traditional 
way of life to one oriented to accessible European 
models often found their vocational claims and 
aspirations as well as their desired style of life 
disdained by proponents of labor—Zionist values; 
at the same time, such immigrants were included 
in the stereotype of backward or primitive people 
that was often indiscriminately associated with 
non-European origin. 

Immigration from Yemen had been actively en- 
couraged by Zionist emissaries in the prestate pe- 
riod in order to add to the number of Jews in 
Palestine prepared to engage in manual labor; and 
such encouragement also had been given to pos- 
sessors of special skills, such as dock workers from 
Turkey. Nonetheless, not until the poststate mass 
immigration was a concerted effort made to mobi- 
lize the population of non-European provenience 
into the systems of action and relations that had 
resulted in the creation of Israel. The goal of this 
concerted effort, which has engaged most of the 
governmental and voluntary organizations, has 
been the cultural transformation of the immi- 
grants and their children. Partial goals have in- 
volved not only the specific concerns of functionally 
differentiated agencies, in regard to such areas as 
health, education, and vocational training, but also 
attempts at factional ideological indoctrination and 
recruitment of the immigrants into the nation’s 
political parties. 

The poststate land-settlement program saliently 
illustrates both (1) strategies commonly followed 
in the mobilization of new immigrants to imple- 
ment national and partisan goals within the frame- 
work of established institutions and systems of 
relations, and (2) the modification of these insti- 
tutions and systems as a consequence of such 
recruitment and the new context of national devel- 
opment. Land settlement retained its high priority 
among nation-building activities for almost a dec- 
ade following statehood. Israel’s food shortage, 
which lasted until the mid-1950s and entailed 
stringent nationwide rationing, made urgent the 
rapid expansion of agriculture, while national se- 


112 NEAR EASTERN SOCIETY: Israel 


curity demanded the promptest possible settlement 
of strategic vacant areas along its extended fron- 
tiers. Although young immigrants of the classic 
pioneer tradition and young Israelis established 
kibbutzim and some moshavim at the most crucial 
and dangerous locations and although the army 
instituted a special corps to aid in founding frontier 
villages, the new immigrant influx constituted the 
main reservoir of manpower for land settlement. 
Improvisation of new approaches to land settle- 
ment within the institutionalized patterns of action 
began with the decision by the vested land-settle- 
ment authorities to recruit the newcomers, despite 
the majority's noncommitment to the traditional 
pioneer ideologies, the relative maturity and family 
status of many of them in contrast to the youthful- 
ness of the classic pioneer settlers, and their lack 
of training for the classic pioneer roles. A conces- 
sion to these attributes was made in giving primacy 
to the cooperative moshav rather than the com- 
munal kibbutz as the village type for the settlement 
of such immigrants, but the traditional politiciza- 
tion of land settlement was maintained. Thus, 
village sites were assigned to the various moshav 
federations according to a formula involving rela- 
tive strength of the political parties involved; and 
Tepresentatives of these federations competitively 
recruited in the immigrant camps and later ex- 
tended their recruitment activities to immigrants in 
transit to Israel and potential immigrants abroad. 
Once brought as settlers to the nascent villages, 
the recruits were committed to observing the full 
range of cooperative provisions involved in moshav 
organization and to using the services provided 
through the federation’s network. As immigrants, 
particularly from the non-Western lands, rebelled 
against the demands of the roles assigned them 
and against partly fortuitous and partly deliberate 
destruction of their own systems of relations and 
patterns of action, successive improvisations and 
some accommodations were devised by the land- 
settlement authorities to cope with settler non- 
acquiescence in the behavior demanded of them. 
For example, extended family groupings among 
traditional immigrants from non-Western lands 
frequently had been disrupted by the assignment 
of the component nuclear families to different vil- 
lages. This occurred partly as a result of bureau- 
cratic and political factors in recruitment of new 
settlers and partly as a deliberate expression of the 
national policy of “fusion of exiles.” Similarly, peo- 
ple of common cultural background were often 
dispersed, while those of diverse background and 
provenience frequently were settled together, with 
ethnic difference then becoming a focus of village 


factionalism. By the mid-1950s the advantages— 
for settler solidarity, cooperation, and morale—of 
preserving extended family ties and relative homo- 
geneity of background in the peopling of new vil- 
lages were acknowledged by the land-settlement 
authorities, and this policy was incorporated in 
official planning, although often not implemented. 

Several years earlier, classic moshav organiza- 
tion had been modified in the new villages by the 
assignment of interim instructors, recruited by the 
moshav federations. These instructors were to com- 
pensate for settler unpreparedness and dissidence 
by supervising village development and promoting 
role socialization, not excluding ideological indoc- 
trination. While the interests of village develop- 
ment thereby were better served, crises with 
settlers continued. Many immigrants left the vil- 
lages, to be replaced by more recent newcomers, 
who were in turn subjected to the pressures of role 
socialization and, depending on provenience, to the 
pressures of massive cultural transformation. In 
the new villages these pressures involved not only 
vocational retraining and civic education specif- 
ically directed to moshav institutions but also new 
patterns of child rearing, family relations, and 
other intimate areas of daily life. Outside the land- 
settlement framework such pressures, while more 
diffuse, were also exerted on new immigrants by 
representatives of governmental and voluntary 
agencies concerned with employment, housing, and 
health, educational, and welfare services. Indeed, 
most agency representatives seemed to consider it 
not only a right but a duty to exhort newcomers 
from the Oriental lands to abandon their “primi- 
tive” ways. 

By the mid-1950s, however, the relative ineffi- 
cacy of ideology and exhortation to effect rapid 
cultural transformation began to be recognized by 
planners and policy makers throughout the agen- 
cies. Concomitantly, national development had 
advanced to the point of introducing systematic 
and sophisticated planning into the further expan- 
sion of hitherto improvised programs, a process 
that is still underway. The practical benefits to be 
gained through adapting to existing institutions 
were increased and were stressed in programs di- 
rected both to new immigrants and to earlier ar- 
rivals who had been unable or unwilling to acquiesce 
in the plans directed at them. 

Moreover, new immigrants are granted immedi- 
ate citizenship and suffrage, and the competitive 
Politics of Israel offer them a means of gaining 
attention for their own interests as they see them. 
In the land-settlement sector, villagers can and 
have threatened to transfer their villages’ political 


affiliations in order to wring concessions from the 
land-settlement authorities. Settlers in cities and 
those directed to new development towns, initially 
the objects of competitive party recruitment, in- 
creasingly are using the ballot and the possibility 
of bloc voting to gain representation at municipal 
and national policy-making levels. In this way they 
have contributed to the progressive obsolescence 
of ideology in the national ethos, Their incorpora- 
tion into factional politics also has helped to inte- 
grate them into the national society. 

Spheres of national development. The develop- 
ment of Israel also has involved a gradual assump- 
tion by the national government of functions 
previously vested in the factional networks. National 
defense was the first area over which unquestioned 
governmental control was brought to bear, For 
during the prestate period the underground defense 
force, the Haganah, under Jewish Agency direc- 
tion, was not the only illegal military organization 
to be developed. During the country-wide Arab 
attacks of the late 1930s, members of the Revision- 
ist movement, opposed to official Zionist policy of 
nonretaliation, formed the separate military force 
of Irgun Zvai Leumi. When, during World War n, 
both of these forces cooperated with the British 
(apart from their separate arrangements for pro- 
moting illegal immigration), a new guerrilla or- 
ganization formed to fight the mandatory govern- 
ment; after the war the Irgun joined this fight. 

The Haganah, the major and by far the largest 
of the three organizations, remained under civilian 
control and had the support of the Yishuv and of 
Zionist adherents and sympathizers abroad. The 
other two organizations challenged its legitimacy 
as Yishuy representative and also sought support 
abroad. In addition, during the final mandate years, 
an elite corps of shock troops was established with- 
in the Haganah but with a separate command and 
organization. 

With the founding of the state of Israel, its lead- 
ers acted to disband the dissident military organi- 
zations, to bring their members within the Haganah, 
and to foreclose the possibility of separate armed 
factions within the state. This was successfully 
achieved, although not without a clash of arms, 
reiterating the principle of civilian governmental 
control over the military. Indeed, Ben-Gurion took 
the further step of disbanding the Haganah shock 
troops as a separate organization. Thus, the Israeli 
Defense Forces were removed from the sphere of 
political factionalism. Since then, they have largely 
retained their prestate character as a primarily 
civilian force. The Defense Forces’ permanent or- 
ganization under arms is limited to its commis- 
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sioned and noncommissioned regular officers and 
the men and women undergoing their period of 
military conscription. The reserve includes all men 
up to the age of 49 and childless women to the age 
of 34, and it can be mobilized in a matter of hours. 
All frontier villages are part of the army, and their 
male settlers, whether subject to conscription or 
not, undergo special military training, stand guard 
over their villages, and maintain the settlements 
in a state of preparedness against attack, 

National pride in the Defense Forces and in the 
recently established tradition of the Jew who fights 
back is an integrative force in the nation. More- 
over, the army serves more concrete integrative 
functions. It provides a basic education to all con- 
scripts, thereby offering immigrant young men 
skills applicable to civilian life and to the autono- 
mous exercise of their rights as citizens. The 
constant interaction between native Israeli and 
immigrant conscripts of whatever background 
works to break down prejudice while establishing 
personal ties and the ties of shared experience be- 
tween members of all segments of the population. 
The army also supplements land-settlement activi- 
ties. Although most young women of Oriental pro- 
venience have sought exemption from military 
conscription on grounds of religious orthodoxy, 
other women conscripts have had the option, after 
basic training, of serving as schoolteachers in the 
frontier immigrant villages; and the army supple- 
ments, through briefer programs, other educational 
provisions in these villages. Moreover, service in 
Nahal, the special army corps established to found 
or reinforce frontier settlements, is open to con- 
scripts after basic training; some members of Nahal 
have chosen to remain as permanent village settlers 
after demobilization. 

The Defense Forces and the Ministry of Educa- 
tion and Culture also have established a youth 
movement, Gadna, which enrolls young people be- 
tween the ages of 14 and 18 (the age gap between 
free, compulsory schooling and military conscrip- 
tion) and offers them both preliminary quasi-mili- 
tary training and activities in the pioneer and 
scouting traditions. The establishment of such a 
youth movement is in the prestate Zionist tradition, 
maintained after statehood, of building youth or- 
ganizations both in the Diaspora and the homeland. 
Indeed, immigrants in the pioneer tradition have 
been recruited largely through youth movements 
organized by the labor—Zionist factions; and the 
Histadrut as well as practically all the Zionist 
factions supports youth organizations in Israel. 
Their programs generally have combined scouting 
activities, indoctrination in the partisan expression 
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of pioneer values, and training for agricultural set- 
tlement. The Histadrut program also includes vo- 
cational training and serves trade-union functions 
for employed young people. Since statehood, the 
activities of Youth Aliyah, the organization created 
for the rescue of youngsters from Nazi Europe, 
have included education in its special youth groups 
and in its day schools for young immigrants of 
primarily Oriental provenience. 

While the partisan networks continue to chan- 
nel many of the nonacademic programs directed 
to the 14- to 18-year-old age group, basic educa- 
tion was transferred from the jurisdiction of such 
networks to that of the state in 1953. In the pre- 
ceding years, political factions had engaged in 
competitive recruitment among new immigrants 
for the assignment of their children to schools of 
the different trends, the greatest area of cleavage 
and competition being that between proponents 
of secular and religious education. The state school 
system established in 1953 substituted residence 
for political affiliation as the basis for school regis- 
tration. However, two state school systems are al- 
lowed—one secular and the other religious—and 
both are under the jurisdiction of the Ministry of 
Education and Culture. Thus, some competition 
between secular and religious factions still con- 
tinues in the field of education. 

Free and compulsory education in Israel includes 
kindergarten, eight years of elementary school, and 
schools for working youths above the age of 14 
who have not completed elementary education. 
Academic secondary education is neither free nor 
compulsory, but the state provides extensive schol- 
arship aid. Moreover, voluntary organizations, some 
of them politically affiliated, maintain a range of 
agricultural and vocational secondary schools for 
which many scholarships are available. Higher ed- 
ucation in Israel is not under state jurisdiction, 
except for teachers’ colleges, nor are the religious 
academies. 

National social security insurance, in effect by 
1954, and the nationalization of employment of- 
fices, or labor exchanges, in 1959 also assigned to 
the state services that were previously within the 
province of politically oriented voluntary organi- 
zations, principally the Histadrut. However, health 
insurance remains within their domain, although 
national health insurance is a possibility favored 
by many labor~Zionist sympathizers as well as 
other factions in the country. Indeed, the total 
medical coverage provided by the Histadrut’s sick 
fund is the most obvious benefit of Histadrut mem- 
bership today. 


Although assuming some of the Histadrut’s pre- 


state services, the government of Israel contrib. 
uted to strengthening its role in the nation’s e 
omy by the many partnerships into which the 
entered, with or without the further partnershj 
of private capital, in financing development proj- 
ects and new industries. Indeed, by its control over 
the funds made available to Israel by United States 
and international aid, German reparations, and 
funds from sympathizers, the government has 
dominated economic development and capital 
vestment. Its regulation of foreign exchange ina 
country economically dependent on its import sur 
plus further contributed to this domination. 

National economic policy during the first decade 
of the state was largely determined by labor—Zior 
goals and values and the exigencies of nationa 
survival. As to the former, Mapai, the majorit 
party in every government, has consistently re 
tained control over the crucial ministries of defense 
and finance as well as education and culture 
foreign affairs, and the police; and top national 
leadership has remained in the hands of the lea 
ship of the Yishuv, primarily members of the see 
ond and third aliyot. Differences over econo 


om 


this period. While the government contributed to 
the expansion of the Histadrut enterprises, 
latter, in its trade-union capacity, maintained the 
wage and salary levels consistent with government 
tal policy for curbing inflation. 
Social stratification. In the prestate period His 
tadrut wage and salary policy stressed egalitari 
ism. Highly educated professionals, their numbers 
in excess of demands for their skills, accepted 4 
salary level little higher than that of manual laf 
borers, doing so on grounds of ideology, lack Ol 
choice, or both. Moreover, apart from purely tech 
nical fields, the labor—Zionist pioneers had little 
use for professionalism. Having repeatedly com 
founded expert predictions in such matters as Pa 
estine’s absorptive capacity and Israel’s capacity 
for survival, they regarded ideology, devotion, and 
common sense as adequate bases for problem 
solving. Although this attitude began to change if! 
the mid-1950s, with economics one of the first 
fields to gain recognition, egalitarian policies 
regard to salary scale were maintained. 
However, differences in economic status were 
not averted in Israel. Some involved continuity with 
prestate conditions. On the one hand, since before 
the second aliyah some middle-class immigrants: 
have achieved and bequeathed relative prosperity OF) 


consolidated resources they had brought with them; 
on the other hand, among those of Oriental pro- 
venience even veteran immigrants have experi- 
enced greater unemployment and, commonly, a 
continuously lower income level than other sec- 
tors of the population. 

Since statehood the number of wealthy has in- 
creased. Indeed, governmental resources for invest- 
ment and the government-controlled multiple rates 
for foreign exchange (see Rubner 1960) offered 
opportunities to the enterprising and knowledge- 
able for profiting from conditions of development. 
However, the mass immigration also vastly in- 
creased the number of immigrants from Asia and 
north Africa, usually with large families, whose 
lack of transferable skills limited their employment 
possibilities. Even the population within the tradi- 
tional labor—Zionist sectors of the economy experi- 
enced differentiation. The settlers of veteran agri- 
cultural villages flourished; members of certain 
Histadrut-affiliated cooperatives, such as those in 
transport, profited from their near monopolies; 
high officials in the governmental and quasi-gov- 
ernmental agencies had their lot eased by nonmon- 
etary perquisites of office. 

From the mid-1950s on, professionals no longer 
could be restrained by Histadrut leadership from 
pressing claims for higher salary scales, and they 
won their point through going on strike. Some, such 
as sick-fund physicians, were employees of the 
Histadrut itself. Since then, groups of industrial 
workers have gone on strike, including employees 
in Histadrut enterprises. The clash of interests 
within the Histadrut, associated with its functions 
as both company holder and trade-union organiza- 
tion, illustrates the increased plurality of groupings 
and systems of relation and action within the new 
society which partly crosscut the older factional 
groupings. 

In the stratification system of contemporary 
Israeli society, achievement criteria are the dom- 
inant but not exclusive basis for recruitment into 
the nation’s elites. Members of the oldest prestate 
pioneer generations still exercise top leadership. 
Their surviving comrades, even if no longer office- 
holders or active in public life, retain the prestige 
of their accomplishments and whatever informal 
influence that may derive from their personal ac- 
cess to those in authority. 

The nation’s administrative elites and, increas- 
ingly, those holding high as well as middle-range 
office are largely recruited from the native Israelis 
who distinguished themselves in the underground 
forces, the war of independence, the poststate 
Defense Forces, and/or responsible posts in the 
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national development programs, Associated with 
them are members of the same age groups with 
similar accomplishments who arrived in Palestine 
in childhood or early youth. Moreover, professional 
expertise has been increasingly valued since the 
mid-1950s. Israeli administrators, including those 
who had previously scorned higher education or 
whose education had been interrupted by the exi- 
gencies of the transition to nationhood, have sought 
further training both in the expanding national 
institutions of higher learning and abroad. Profes- 
sionally trained immigrants have increasingly been 
recruited into posts demanding their skills, although 
the underutilization of professionals has remained 
a problem and has been a factor in the emigration 
of many of them. Outstanding scholarly, scientific, 
or artistic accomplishments confer unquestioned 
elite status and the possibility of easy assimilation 
into the informal networks of the high-ranking 
population. 

These networks are not limited to those who cur- 
rently hold high-ranking posts, Personal loyalties 
and the loyalties born of shared experience are 
long-lasting in Israel; such ties, as well as the older 
politicized systems of relations, continue to link 
individuals involved in different systems of action 
in the increasingly diversified occupational group- 
ings. Agricultural and pioneering activities still con- 
fer some of the prestige of the prestate period, and 
the villages established in the prestate decades, 
particularly the moshavim, have successfully re- 
tained most of the generation born into them, In- 
deed, many young Israelis, including numbers from 
high-ranking urban families, still join new frontier 
settlements, although most may not remain for 
more than a limited period. 

Stratification is notably visible in the still high 
correlation between non-European provenience and 
relatively low socioeconomic status. However, the 
range of national educational programs, including 
enriched curricula at the elementary-school level 
and special scholarships for secondary education, 
represents a determined effort to break this corre- 
lation. Moreover, the politics of Israel and its man- 
power needs favor the recruitment of qualified 
members of non-European ethnic groups into vis- 
ible prestige positions and political candidacies, 
where such groups still are vastly underrepresented 
in proportion to their numbers in the population. 

The systems of relations institutionalized in the 
prestate decades still operate in contemporary 
Israel, cutting across other bases of grouping and 
cleavage, including those involved in stratification. 
After the first decade of statehood there was a dis- 
cernible trend toward minimization of traditional 
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partisanship and the amalgamation of the estab- 
lished factions into fewer, larger, and politically 
more potent parties based on primary orientations 
toward current issues of national life. For example, 
in 1956 the major religious and religious—labor 
parties merged, although the ultraorthodox parties 
have retained their separate existence. Then in 
1961 the two main middle-class parties, the Gen- 
eral Zionists and the Progressives, merged, thus 
strengthening the claims of private enterprise in 
national economic policy. Finally, the successor to 
the Revisionist movement, hitherto excluded from 
all role in government, joined this coalition of lib- 
eral parties in 1965, adding antiestablishment dis- 
sidents to the constituency of middle-class interests. 
However, the amalgamation of factions was not 
accomplished without some further splintering. The 
opponents of the Revisionists, particularly members 
of the former Progressive party, seceded from the 
liberal coalition, Even more dramatic was the se- 
cession from Mapai; Ben-Gurion, who had retired 
as prime minister and resigned from the leadership 
of the party he helped to found, left Mapai and 
with a group of younger leaders founded a new 
political party. The embattled politics and fluid 
stratification system of contemporary Israel mil- 
itate against crystallization of a closed ruling class. 

The deepest cleavage in Israel is between Jews 
and the Arab population, which numbered 212,400 
in 1965. Arabs have equality before the law, are 
represented in the Knesset, and receive free ele- 
mentary education in Arabic as well as other na- 
tional services and the services of the Histadrut. 
However, those in border areas have lived under 
military government, with their movements super- 
vised to prevent passage back and forth across the 
frontier. While an increased Arab enrollment in 
secondary schools and institutions of higher learn- 
ing is supported by the government, and qualified 
Arabs are recruited into civil service posts unrelated 
to security, the majority remain village farmers. 
They do not serve in the army; and their integra- 
tion into Israeli society through more than formal 
organizational channels, market-place transactions, 
contacts between adjacent Jewish and Arab vil- 
lages, and individual contacts still seems dependent 
on larger issues of Middle East relations. 

The Druze, who sided with Israel in the war of 
independence and voluntarily serve in the army, 
are in a different situation. The respect given their 
traditional ways so far represents the major ex- 
ample of valued cultural pluralism in Israel. 


DorotHy WILLNER 


[See also ANTI-SEMITISM; JUDAISM; REFUGEES, article 
on ADJUSTMENT AND ASSIMILATION; ZIONISM.] 
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NEEDS 
See Drives; MOTIVATION; STIMULATION DRIVES. 


NEGOTIATION 


Negotiation is a form of interaction through 
which individuals, organizations, and governments 
explicitly try to arrange (or pretend to do so) a 
new combination of some of their common and 
conflicting interests. Two types of common inter- 
ests can be distinguished: (1) an identical com- 
mon interest in a single arrangement or object 
which the parties can bring about only, or more 
easily, by joining together and (2) a complemen- 
tary interest in an exchange of different objects 
which the parties cannot obtain by themselves but 
can only grant to each other, 

To realize an identical common interest the par- 
ties must agree on the characteristics of the ar- 
rangement (concerning which they may have 
different preferences) and on the division of gains 
and costs (where their interests usually conflict). 
Two nations jointly constructing a dam and two 
corporations merging into a single firm are ex- 
amples of such arrangements. Complementary 
interests are realized through barters, sales, or 
agreements on mutual tariff concessions. Most 
international negotiations embrace a combination 
of identical common interests and complementary 
interests, whereas business negotiations are pre- 
dominantly concerned with complementary in- 
terests, 

The parties may relate their conflicts and com- 
mon interests explicitly or tacitly. The term “nego- 
tiation” usually refers to the explicit process, with 
proposals and counterproposals. “Tacit bargaining” 
occurs if the parties deliberately arrange a new 
combination of common and conflicting interests 
through hints and guesswork, without explicitly 
proposing terms for agreement. Tacit bargain- 
ing is of great importance in military confronta- 
tions, when negotiation may be difficult because of 
incompatible war aims, domestic opinion, or the 
lack of diplomatic relations. Tacit bargaining can 
help to keep the area of hostilities limited, restrain 
the use of force, and prepare the ground for nego- 
tiations to terminate hostilities. These functions of 
tacit bargaining received little detailed attention 
until the 1950s, when they were analyzed in con- 
nection with studies of limited war, arms control, 
and military deterrence (Schelling 1960). Negoti- 
ation, by contrast, is necessary for more complicated 
forms of collaboration, for most exchanges, and for 
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any arrangement where an explicit agreement is 
essential. 

Negotiating for side effects. The process of ne- 
gotiation can have side effects that do not concern 
the agreement which is ostensibly being negotiated. 
In international diplomacy these side effects are 
far more important than, say, in business or labor— 
management negotiations. Indeed, they are some- 
times the principal or only reason why governments 
participate in an international conference. That is, 
governments may negotiate not so much to obtain 
favorable terms of agreement as to spread propa- 
ganda, to gather intelligence, to influence third 
parties, to divert the opponent from the use of force, 
to deceive him, or to maintain contact with him 
in order to communicate on other matters. (Hence, 
the above definition of “negotiation” allows for the 
fact that parties may only pretend to seek an agree- 
ment.) In areas such as disarmament, where 
public opinion generally favors negotiation, gov- 
ernments may attend conferences just to appear 
virtuous (Spanier & Nogee 1962). One advantage 
of private or secret diplomacy when agreement is 
desired is that it reduces the importance of some 
distracting side effects. 

Models of the negotiating process. Negoti- 
ation can be analyzed in terms of (a) the inter- 
action of some basic choices that the parties must 
make and (b) certain basic moves through which 
each party can influence the opponent's choice. 
Game theory provides mathematical models to rep- 
resent this simplified structure, Like other models 
in social science, they are, of course, of limited use 
in the study of negotiation, because they frequently 
do not represent enough of reality to be relevant 
and sometimes misrepresent even that part of real- 
ity which they address. What a model can easily 
represent is the interaction between each party's 
choice of settling for the opponent's terms or not 
concluding an agreement, provided each party 
maintains unchanging evaluations of its gains and 
losses from each outcome (i.e., provided the utili- 
ties of each outcome remain stable for each party). 

Obviously, negotiation can result in agreement 
only if there exists at least one set of terms that 
each party would prefer to having no agreement. 
Frequently several sets of terms or a range of terms 
among which the parties have conflicting prefer- 
ences meet this condition. The question arises, 
therefore, on which terms the parties will settle— 
who will get his way? 

Commitments and threats. Commitments and 
threats are the two basic moves in negotiation. If 
you make a commitment, you try to alter the op- 
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ponent’s expectations about your future conduct 
by changing your own incentives; if you issue a 
threat, you try to alter your opponent’s expectations 
about his payoffs that would result from his making 
certain choices. A commitment is a move to con- 
vince your opponent that you will maintain your 
current position, or implement your prediction, by 
making it more difficult for yourself not to do so. 
A threat is a prediction addressed to your opponent 
(or to those who might influence him) that he will 
suffer a certain loss if he does not comply with 
your wishes, 

A commitment works as follows: Given a range 
of terms that both you and your opponent prefer 
to not having an agreement, you try to induce your 
opponent to accept those terms within this range 
which are most favorable to you by making it diffi- 
cult, or virtually impossible, for yourself to accept 
any other terms, In other words, you narrow the 
range within which you would prefer agreement so 
that it will include only your most favored terms. 
If your opponent realizes this, he must then expect 
that you would reject an agreement unless he meets 
your favored terms. Of course, the opponent might 
commit himself, too, and the terms of agreement 
will then depend, among other things, on the rela- 
tive strength of these commitments. If the commit- 
ments are strong, no agreement might result even 
though both parties, initially, would have been 
better off with agreement. The binding power of a 
commitment in interpersonal negotiations may 
stem, for example, from a wager, an oath, or an 
accord with a third party not to settle for less than 
one’s preferred terms. In international negotiations, 
governments can commit themselves by staking 
their prestige on a position or by the way in which 
they tie their military or economic resources to it. 

The concept of a threat—more complicated than 
that of a commitment—has been analyzed more 
carefully in modern theories of military strategy 
than in studies on diplomacy. (Military deterrence 
is based on a threat.) The party that threatens 
asserts it will make a special effort to cause the 
Opponent a certain loss should he fail to comply. 
But carrying out the threat (i.e. inflicting this 
loss) may be costly for the threatener as well, per- 
haps even more costly than for his opponent. 
Hence, the question arises of the credibility of the 
threat, that is, whether it will be carried out when 
challenged or whether it will turn out to be a bluff. 
To make a threat more credible, it may be rein- 
forced with a commitment [see DETERRENCE]. 

In negotiations where a party wishes to change 
the status quo in its favor, at the expense of a party 
defending it, the offensive party always needs a 


threat to make the defensive party fear that it 
would lose more by not reaching an agreement 
than by agreeing to accept the detrimental change. 
The classic example of such negotiation is the 
Munich conference of 1938. 

In addition to this rudimentary structure of the 
negotiating process, a number of complications 
must be introduced into any theoretical scheme 
that is to explain or reflect reality, These addi- 
tional features are usually omitted in the formal- 
ized models and are difficult to simulate in labora- 
tory experiments. Laboratory experiments have 
been developed to test models which include the 
more basic features of the negotiating process, such 
as the combination of conflict and common inter- 
ests, the choice between concluding an agreement 
and not concluding one, and the use of threats and 
commitments (Rapoport & Orwant 1962). 

Domestic affairs of the parties. An additional 
complication is the fact that the parties often are 
not unitary decision makers. This is of crucial im- 
portance in international negotiation and is often 
relevant in labor-management negotiation. The ne- 
gotiators have to bargain not only with the oppo- 
nent but also with members of the organization 
that they represent. For example, the union repre- 
sentative in labor-management negotiations must 
be mindful not to lose the support of the rank-and- 
file union members. A corporation executive en- 
gaged in negotiations for a merger must align other 
corporate officers and perhaps important stockhold- 
ers. And diplomats, of course, have their domestic 
opinion, legislatures, and sometimes, even compet- 
ing branches of the executive to worry about. 

Secret diplomacy can keep some of these domes- 
tic groups ignorant of the day-to-day negotiating 
process. This often turns out to be advantageous, 
not just for the chief negotiator but also for the 
organization or the government as a whole by help- 
ing it to defend a broader public interest that 
transcends parochial pressure groups. In the United 
States, the influence of domestic affairs on foreign 
policy is more apparent than in many other coun- 
tries and has received considerable attention (Wes- 
terfield 1955). But domestic pressures can also help 
a negotiator to obtain better terms: they can serve 
as a commitment in that they may convince the 
opponent that he must yield because the first nego- 
tiator could not concede even if he wanted to. 
Indeed, a favorite argument among negotiators, 
both Western and communist ones, is that public 
opinion at home forces them to remain firm. 

f Future bargaining strength. A second complica- 
tion in real-life negotiation is that the parties are 
usually trying not only to get good terms for the 


agreement under negotiation but also to protect 
and improve their future bargaining strength. This 
is particularly true in international negotiation. In 
the language of game theory: international nego- 
tiation is never a self-contained game but is a phase 
vaguely related to a never-ending “supergame.” 
Although each negotiation yields its own payoffs, 
the tactics used in it affect the opponent’s calcula- 
tions in later negotiations and thereby influence 
subsequent payoffs. The reason for this continuity 
is that the opponent will impute to the government 
a certain diplomatic style, certain attitudes toward 
risk taking and the use of force, a degree of political 
will, a certain tendency to bluff or to hold fast to 
a position, and so forth. Of course, this reputation 
is not permanent, and it is often confined to a par- 
ticular subject of negotiation. For example, a gov- 
ernment may be known to haggle over a wide range 
in commercial negotiations but to modify its posi- 
tions very little on political issues. 

Rules and norms. The third complicating fea- 
ture in negotiation (as contrasted with more rudi- 
mentary bargaining situations) is due to the fluidity 
of the rules that regulate the way in which the 
parties conduct negotiations. This fluidity is par- 
ticularly pronounced in international diplomacy. 
In business and labor-management negotiations, 
many rules are quite clear and firm, because they 
are enforced by higher authority. For example, the 
use of force is prohibited or severely controlled, and 
an offer that has been accepted can usually no 
longer be withdrawn. According to the U.S. Na- 
tional Labor Relations Act of 1947, the parties must 
follow some rules even in the way in which they 
defend their positions (Cox 1958). 

However, many rules in interpersonal negotia- 
tion, and practically all the rules in intergovern- 
mental negotiation, are not enforceable. Here the 
concept “norms” may be more appropriate than 
“rules,” Each party follows varying sets of norms, 
depending on the situation and on the type of oppo- 
nent. Friendly parties share a far more extensive 
set of norms than do enemies. Indeed, the violation 
of certain norms is likely to terminate, or at least 
interrupt, a friendly relationship. For example, un- 
ambiguous lies and invective must be avoided, 
explicit threats must not be issued, and agreements 
in principle must not be deliberately misconstrued 
later on. 

Since there is no enforcement, what keeps the 
Parties from violating these norms when it would 
be to their advantage to do so? First, it must be 
observed that the norms frequently are violated. 
Even the most elementary norm of diplomacy, the 
tule that the opponent's negotiator should not be 
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killed, has been violated not only in antiquity but 
also in modern times, The norms are usually ob- 
served for two principal reasons: (1) the parties 
want to induce the opponent to reciprocate and (2) 
they want to facilitate agreement, However, recip- 
rocation is not always felt to be mandatory, nor do 
all the norms observed by diplomats facilitate 
agreement. In these cases, the norms are followed 
not because it is expedient but because they are felt 
to be in keeping with the proprieties. 

Thus, the rules that are more or less observed in 
negotiation resemble the norms or mores studied 
by sociologists. They may be followed because (1) 
of expediency; (2) the negotiator does not want to 
shock others who feel the norms prescribe the 
proper behavior; or (3) the negotiator himself con- 
siders it unseemly or immoral to violate these 
norms, Examples of such norms are that an agreed 
agenda ought to be adhered to, that partial agree- 
ments should not be reversed, that concessions 
should be reciprocated, and that emotional out- 
bursts and physical violence at the conference table 
are to be avoided. 

Instability of evaluations. A final complication 
of cardinal importance in negotiation must be 
added. The way in which the parties evaluate their 
own payoffs and those of the opponent is highly 
uncertain and keeps changing as a result of negoti- 
ation. This is true even in negotiation about simple, 
quantifiable issues, such as a purchase price, and 
it applies with particular force in diplomacy. In 
most negotiations, as is not the case in parlor 
games, there are no firm bench marks that divide 
gains from losses. To evaluate whether certain op- 
tions would be advantageous or not, each negoti- 
ator must have a break-even point in mind that 
divides the plus side from the minus side. These 
points are subjective ideas which keep shifting 
under the impact of negotiation. For instance, a 
party may initially feel that it breaks even as long 
as it preserves the status quo, whereas later on— 
after having faced an aggressive opponent for some 
time—the preservation of the status quo may seem 
like a gain worth concessions on another issue. 

Not only these break-even points keep shifting; 
the scales by which the parties evaluate the magni- 
tude of their gains and losses show the same fluid- 
ity. If a large effort is being exerted to obtain a 
small improvement, this improvement may seem 
like a large gain in the end. And the reverse reac- 
tion is well known in everyday human affairs, 
characterized by the “sour grapes” story. 

Important consequences follow from this insta- 
bility of evaluations. First, the parties cannot know, 
before being faced with a specific choice, what 
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their minimum terms are at which they would still 
accept an agreement. The notion of fixed minimum 
positions for each party is realistic only for short 
time periods and very simple issues, which are 
rarely found in diplomatic negotiation. Second, 
since the break-even point for each party keeps 
shifting as the result of negotiation, the point which 
equally divides the gains and losses between the 
opponents keeps changing, too. Concepts such as 
“fair” division and “equitable solution” are there- 
fore dependent variables of the negotiation process. 
In other words, a view that the payoffs between the 
parties are equal, or “fairly” distributed, is in itself 
an outcome of the negotiating process, It is not an 
objective criterion by which one can evaluate nego- 
tiations. This is also true for mediators or arbitra- 
tors: their image of what is a “fair” division 
depends on how the parties succeed in presenting 
the issues to them. 

Related to changing evaluations is the fact that 
the parties change their goals while they are en- 
gaged in negotiation. For example, when the goals 
become more modest, a settlement may turn out to 
be satisfactory that would have initially been found 
to be unacceptable. This relationship between one’s 
goals and one’s evaluations of the outcome achieved 
has been studied by psychologists using the concept 
“level of aspiration” (Siegel & Fouraker 1960; 
Frank 1941), 

Compromise and focal points. Compromise is 
often considered essential for negotiation. And a 
settlement is often called “fair” just because it has 
been reached through compromise. However, there 
are several other ways of reaching agreement, and 
whether or not a particular compromise accords 
with given standards of fairness must depend on 
the intrinsic merits of the positions. All that can 
be observed in a compromise is that the parties 
have revised their positions through concessions 
until they have met in agreement. Yet one side may 
have retreated from an extravagant position while 
the other one has made sacrifices from a modest 
position. 

Certain salient features in the area under dis- 
pute often provide a focal point where the expecta- 
tions of the parties meet, regardless of whether 
their positions converge through a compromise or 
through some other maneuver (such as one side 
catching up with rising demands by the other side). 
Round figures tend to be a likely point for settling 
monetary issues, and geographic latitudes or a river 
for territorial issues, Also, precedents from similar 
Se frequently Betvelas such focal points. 

influence of mediators is due largely to the fact 


that by selecting a possible point of agreement, they 


create a focal point. 
FRED CHARLES IKLE 


[See also DIPLOMACY; GAME THEORY; INTERNATIONAL 
CONFLICT RESOLUTION; LABOR RELATIONS, article on 
COLLECTIVE BARGAINING. Other relevant material 
may be found in DETERRENCE; DISARMAMENT; IN- 
TERNATIONAL RELATIONS; NUCLEAR WAR; PEACE; 
SIMULATION. | 
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NEIGHBORHOOD 


The neighborhood is ordinarily viewed as a 
smaller version of the local community. The latter 
has been conceived of in two distinct ways: first, 
it has been defined as simply a locality and the 
people living in it; second, it has been seen as a 
social fact, a type of group. Both notions have their 
roots in folk theory. It is common in most cultures to 
think of spatial collections as units of some impor- 
tance; and the belief that spatial propinquity leads 
to social interaction and the development of shared 
values and action patterns is also widespread. 

Social and spatial aspects. Thus the local com- 
munity is often discussed as a concrete social unit. 
To it are imputed meaningful social bonds, the gen- 
eration of social value, and psychological support 
for the individual. In short, the local community is 
seen as a primary group. Formalized by Tonnies 
(1887) as a Gemeinschaft, the notion that the 
small, spatially defined collective is a primary com- 
munity has been used in rural sociology to describe 
the open-country neighborhood and the village. It 
has also been used in urban sociology to mark out 
significant subunits within the urban fabric and by 
city planners to mark out the boundaries of residen- 
tial enclaves, with the aim of reinforcing the pri- 
mary-group aspect of the subarea. This last calls 
attention to the normative aspect of the term: 
planners and others frequently are concerned with 
maximizing certain social values through the ma- 
nipulation of spatial boundaries. The “neighbor- 
hood unit” is, for city planners, a form of the self- 
fulfilling prophecy (Dewey 1950). 

The more useful approach is to separate the 
spatial aspect from the social and then to make 
their relationship problematic. People are concen- 
trated in space for various reasons, and, given a 
whole, one may subdivide it in any way that suits 
his purpose, Some ways of subdividing space, how- 
ever, seem roughly congruent with the boundaries 
of social groups, including such boundaries as nat- 
ural obstacles, man-made obstacles, and sharp 
changes in socially relevant characteristics of the 
population (Gibbs 1961, part 1). The key question 
then becomes this: In what sense are these popu- 
lations, so bounded, creating and maintaining 
group structure? or, in other words, What is the 
relevant form of group? 

Functional interdependence. One way to define 
a human group is to specify that “a social group is 
an aggregate of individuals who exist in a state of 
functional interdependence, from which evolves a 
flow of communication, and a consequent ordering 
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of behavior” (Greer 1955, p. 18). In what ways, 
then, and under what circumstances, can the small 
local community or the neighborhood constitute a 
social group? What are the grounds for interde- 
pendence, the nature of the communication flow, 
and the mechanisms producing ordered behavior 
and the patterns resulting from it? 

The nature of interdependence among a spa- 
tially defined aggregate varies as widely as the 
human needs which require cooperation for their 
fulfillment. The spatially defined populations may 
be interdependent for their polity (protection and 
the maintenance of order); for their economy (pro- 
duction and distribution of goods); and for their 
familial structure (mating, reproduction, and as- 
signment of kin rights). Equally important, such 
populations may be internally dependent for the 
tasks made necessary by the simple facts of density 
and propinquity, These include publicly assigned 
functions, such as education, and publicly created 
structures, ranging from temples to roads and 
sewers. The range of bases for interdependence is 
empirically and theoretically this wide. 


The primary community 


In the past the organizational scale of societies 
has gone through cycles of expansion and con- 
traction. Thus the city-state would be greatly aug- 
mented through the integration of peasant villages, 
then would slowly unwind back into its constituent 
parts, the same villages. Because so many tasks 
could be and were carried out in the peasant vil- 
lage, it had a greater survival value than the elab- 
orate organizational networks of the state. Even 
in the classical periods of urban expansion, such 
as that of Rome in the first century A.D., 90 per cent 
of the inhabitants of the Mediterranean Basin were 
illiterate peasant villagers. 

The small spatial unit of the village encompassed 
most of the functions of society, the needs and 
tasks of men. The intensive interaction of the vil- 
lagers generated another ground for interdepend- 
ence—the social process, or interaction as a value 
in itself (MacIver & Page 1949; Greer 1955). There 
is, as Homans puts it, a “social surplus” resulting 
in any working group, and that surplus can be 
used to create rich and complex meaning. This 
valued interaction among a constant population, 
within a common set of norms and a common view 
of the world, is much of what is meant by the term 
“primary community” (Homans 1950). 

Industrialism and community dependence. Since 
the widespread introduction of nonhuman energy 
resources into Western societies, the picture has 
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changed radically. The improvements in agricul- 
ture on the one hand and of transportation on 
the other have allowed vastly increased scope and 
intensity of interdependence and made the vil- 
lager anomalous. The urbanization of the great 
majority of the population in entire nations has 
resulted. While the local community, bounded by 
sparsely settled land and trafficking with farmers 
and ranchers, still exists in these highly developed 
societies, it has nothing like the autonomy of the 
peasant village. It is dependent, in almost all re- 
spects, upon the larger national system and is in 
continuous communication with that system. 

The remaining cohesive force of the small set- 
tlement rests chiefly upon propinquity and the 
boundaries resulting from an environment of 
sparsely settled land. A small number of people 
interacting more with each other than with any 
outsiders tend to generate some of the differen- 
tiated norms and shared values of the archetypal 
“community” (Withers 1945). But the greater in- 
dependence of the local system enjoyed by the 
villager who is also the citizen of a large-scale so- 
ciety critically weakens the community's control 
over his behavior. 

Boundaries, isolation, and inescapable interde- 
pendence are thus the key variables producing 
“community.” Each varies considerably for the vil- 
lages and small towns in even so large-scale a 
society as the United States today; for example, 
compare the hamlets of the rural South with the 
small towns on the resort coasts of New England 
and the West. More important than this variation, 
however, is the immense difference between the 
entire range of American towns and the peasant 
village as portrayed by such authors as Ténnies, 
Robert Redfield, and Ralph Turner (see Vidich & 
Bensman 1958), 


Urban neighborhoods 


The typical inhabitant of a large-scale society, 
however, does not live in a village, hamlet, or 
open-country neighborhood. He lives in a differen- 
tiated part of an extensive urban complex. The 
local community is, for him, a more or less differ- 
entiated neighborhood with whatever place names 
and unique characteristics obtain. Such neighbor- 
hoods have even less autonomy and functional 
significance than do the remaining small towns 
and villages; the neighborhood boundaries are usu- 
ally blurred, reducing ineluctable interaction. These 
neighborhoods’ usefulness to the resident may be 
little more than the provision of a house lot and 
a way to get to it. Yet here again there is great 
variation, by subarea, within a metropolis. 


Natural areas. The Chicago sociologists of the 
1920s and 1930s placed their emphasis upon “nat- 
ural areas.” These were conceived as neighborhoods 
or residential enclaves marked off by sharp differ- 
ences in population characteristics or by physical 
barriers, which are either geographical (as in the 
case of Lake Michigan) or man-made (as in the 
case of the elevated railway). Within such areas 
they noted homogeneity by social class and ethnic 
background; between them they noted great diver- 
sity. Thus, they carried out “neighborhood ethnog- 
raphy” in the Ghetto, the Gold Coast, the Slum, 
the Hobohemia, and other exotically named places 
(see Wirth 1928; Zorbaugh 1929; Anderson 1923), 

Such studies, designed to maximize uniqueness 
in the individual case and heterogeneity among 
cases, have been criticized as pictures of the whole, 
no matter how graphically they illustrated varia- 
tions in the urban ambit. However, it has been 
pointed out that most subareas of the city are not 
“natural areas,” or else they are natural as far 
too huge to constitute any kind of local commu- 
nity; these subareas are not, therefore, studied by 
those who emphasize natural areas, and thus their 
picture of urban subareas is skewed toward the 
deviant neighborhood. Further, the relationship 
between bounded places and distinctive, homoge- 
neous populations is not, usually, a very strong one 
(Hatt 19452). 

Census tracts, The Chicago school had an im- 
portant influence on the Bureau of the Census, 
encouraging the collection of data by small areas. 
The basic unit used, the census tract, was a rough 
approximation of what were believed to be natural 
areas. These units have several advantages for the 
urban analyst. They are small, including around 
seven thousand people, and their boundaries re- 
main constant (or can be reconstructed) from one 
decennial census to the next, allowing compara- 
bility. A large body of data on population and hous- 
ing is collected and reported by census tract. More- 
over, data are reported by tracts for the entire 
metropolitan area, including the central city which 
gives the area its name, the suburban municipal- 
ities, and the unincorporated portion of the urban 
fringe. The tract unit allows comparison of impor- 
tant characteristics over time and between metro- 
politan areas for the entire residential population; 
at the same time it allows the urban whole to be 
analyzed by meaningful subunits. 


Typologies of urban subpopulations 

E. Shevky has developed a typology of urban sub- 
populations based on three indexes which, taken 
together, account for most of the variation among 


tracts on the attributes reported by the U.S. census 
(Shevky & Williams 1949; Shevky & Bell 1955). 
These three dimensions are social rank, or social 
position that is interpreted by the society as social 
class; ethnicity; and urbanism, or “life style.” The 
first two are readily recognizable; they were major 
variables in the work of the early Chicago sociolo- 
gists. “Urbanism,” however, is a new dimension. 
One may say that it takes Louis Wirth’s ideal type, 
“urbanism as a way of life,” and turns it into a 
variable with values ranging from “high urban” 
(the apartment dwellers of the center city, child- 
less, houseless, with both adults in the labor force ) 
to “low urban” (the family-centered life of the hor- 
izontal suburban neighborhoods). The dimensions 
have a high degree of independence; thus we can 
control for two dimensions and study the variable 
effects of the other (Greer & Kube 1959; Bell & 
Force 1956). 

Each of these dimensions is important in ac- 
counting for differences between subareas in the 
associational patterns within such subareas. With 
increasing social rank there is increasing likelihood 
of membership in local voluntary organizations and 
attention to local news in the small, local commu- 
nity newspaper. Ethnicity indicates not only the 
likelihood of a variant culture but also differential 
association with fellow ethnics and, therefore, rel- 
atively less involvement outside the ethnic group. 
With respect to life style, the more “urban” the tract 
population, the less its average involvement with 
neighbors, local organizations, and the local news. 

Community participation. Another long-term 
concern of urban analysts has been the identifica- 
tion of distinctive social types concentrated in dif- 
ferent sorts of neighborhoods. This began with 
studies of “social disorganization,” variations in 
family structure, and the like. Using the Shevky 
approach, three social types have been identified. 
The “neighbor” is one who is involved only with 
the small-scale door-to-door world of the immedi- 
ate neighborhood, the few adjoining blocks. The 
“community actor” may or may not be involved with 
his neighborhood, but he is involved, through volun- 
tary organizations, in the larger enclave, the “local 
community.” Such local communities are incorpo- 
rated in the suburban fringe as municipalities, but 
even when they are subareas of a larger city, they 
are still named places. Their image is in large part 
a creation of the local community press, and it is, 
in turn, a major channel for communication con- 
cerning public affairs (Janowitz 1952). The third 
type, the “local isolate,” is involved at neither the 
neighborhood nor the local community level. 

The proportion of community actors and of local 
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isolates has been found to vary systematically with 
social rank, ethnicity, and urbanism, and this af- 
fects the local polity. Community actors are much 
more apt to be informed and to participate in local 
public affairs than either of the other types, and 
the proportion of the population falling in this cate- 
gory increases as social rank increases and as ur- 
banism decreases. Thus, the local area is most im- 
portant as a social fact, as the scene for a spatially 
inclusive social group, in the more prosperous 
neighborhoods of the family-centered. These are 
often, but not always, suburban. 

The variation in the proportion of neighbors and 
community actors reflects the importance of the 
local area as a social system. Where it is important, 
one can see the functional bases in the necessities 
for a style of life that are provided by local organ- 
izations. Many of the major institutions of Ameri- 
can society are administered by small local areas— 
religion, education, police and fire protection, re- 
tail distribution, political participation, and so forth. 
The organizations carrying out these functions 
foster the development of auxiliary groups, that is, 
voluntary organizations such as parent-teacher 
associations, political clubs, local businessmen’s 
associations, and church-sponsored clubs for all age 
ranges. Such organizations focus the interests of 
the individual householders upon the public bus- 
iness relevant to their household needs. Thus the 
increase in community actors with the decrease in 
urbanism reflects the importance of the physical 
plant and social order (for example, at schools, 
parks, playgrounds) for a familistic, home-focused 
and child-focused population. Contrariwise, the ex- 
tremely “urban” populations (childless apartment 
dwellers of the central city) have little need for the 
local community aside from housekeeping chores 
which are carried out routinely by the municipal 
bureaucracies. 

Increasing social rank is strongly related to in- 
creasing community participation only when urban- 
ism is held constant. Thus the population of high 
social rank but high urbanism participates outside 
the local area—in metropolis and nation. “Social 
rank” indicates only that a person possesses more 
of the economic and cultural resources for organi- 
zational participation, while “life style” indicates 
both the usefulness of such participation at the in- 
dividual level and opportunities to participate at 
the collective level (Greer 1962a; 1962b; Greer & 
Orleans 1962). 


The changing character of neighborhoods 


Neighborhoods do not retain a given character 
indefinitely; yet they may last a long time. Change 
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in social character is due to the interaction of two 
broad classes of factors: housing resources and 
housing preferences. The former indicates the kind 
and quantity of housing available, the latter the 
housing deemed suitable for carrying on a given 
type of life style [see Housine, article on SOCIAL 
ASPECTS]. Thus in the cities of the railway age, the 
time cost of movement within the city put a high 
premium on centrality and density of housing for 
most of the urban population; the result was the 
great neighborhoods of high-rise tenements and 
multistoried town houses in the center of most 
older American cities. Such structures allowed a 
maximum of privacy, given the necessity for dense 
land use. With shifts in the time cost of travel 
brought about by automobiles and trucks, however, 
neither density nor centrality retained the same 
high value. Consequently housing could be and 
was built in dispersed “developments” of single- 
family units, far out on the peripheries of the cities. 
The preferences of familistic populations could be 
satisfied through the increasing resources in land 
made possible by faster transportation. Conse- 
quently, many of the older neighborhoods slowly 
move downward in the preference hierarchy of con- 
temporary Americans. Their declining value and 
price make them accessible to populations with 
fewer resources (the poor) and to populations with 
less access to the free market (the segregated pop- 
ulation); thus, we have the “invasion and succes- 
sion” phenomena underlined by students of urban 
neighborhoods. 

However, it is important to remember that pref- 
erences can and do change, as is demonstrated in 
the cases of central-city neighborhoods that are 
renovated and move upward to higher values. Re- 
sources also change, because of both technological 
innovation and the steady increase in personal in- 
come in most Western societies. These two factors 
together have produced changes in transportation 
that make accessible a greater supply of urban land 
than can be used at present, hence, the difficulty 
of “rebuilding the central city.” A deterministic the- 
ory of neighborhood change, based upon a time- 
bound assessment of a few factors, is apt to be 
inadequate for either prediction or control. 

The general belief is that urban neighborhoods 
are declining to the point of triviality as social sys- 
tems. Certainly the major functions of economic 
production and political control seem to be not just 
metropolis-wide but nation-wide in their jurisdic- 
tion and relevances. Yet other functions of the 
urban neighborhood seem more important than 
ever. The household remains a major center for 
child-rearing, for consumption (including cultural 


consumption by means of the mass media), and 
for social gatherings and conversation. The cluster 
of households with similar functions in the new 
areas of suburbia tends to produce a rich texture 
of social interaction through voluntary organiza- 
tions—usually those relating to school, church, 
shopping center, and neighborhood maintenance. 
Furthermore, these suburban enclaves of like-mind- 
ed people are very frequently incorporated as sep- 
arate municipalities. 

Such municipalities conserve the social charac- 
ter of the local communities, based as they are 
on a consensus concerning the desirable state of 
things; yet they also prevent governmental inte- 
gration for the entire metropolis and, therefore, a 
working consensus on its desirable state. The units 
flourish at the expense of the whole texture of the 
metropolis. But as Robert Maclver has pointed out, 
“The claims of the smaller and of the greater com- 
munity have been in antagonism all through his- 
tory, for history is in large part the record of the 
widening of community” (1917, p. 254). The prob- 
lem remains the conservation of autonomous value 
in the small area at the same time that one reaps 
the advantages of integration in larger units. 


SCOTT GREER 


[See also Crry, article on FORMS AND FUNCTIONS; 
Community; VILLAGE.} 
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I 
EVOLUTIONARY AND BEHAVIORAL 
ASPECTS OF THE BRAIN 

The study of the evolution of the brain and be- 
havior is a paradox. We cannot study behavior in 
fossils or the internal structure of the brain of 
organisms long dead. In our lifetimes, we experi- 
ence but a very brief interlude in the long evolu- 
tionary process. What leads us to believe that it is 
possible to investigate the evolution of the brain 
and its relation to behavior? 

An answer lies in the encouragement investi- 
gators obtain from examining the brains of those 
contemporary organisms which have close links 
with their fossil ancestors. Patterns in the brain are 
found which are common to these and different 
species suggesting parallels with common ancestry. 
Comparative behavior provides similar encourag- 
ing parallels. In practice, it is reasonable to suggest 
that comparative neurology and comparative be- 
havior have information to offer which is relevant 
for evolutionary theory and, conversely, that evolu- 
tionary perspective provides a framework for mak- 
ing sense out of comparative material. 


Historical highlights and approaches 


C. L. Herrick was moved by such considerations 
when he founded the Journal of Comparative Neu- 
rology in 1891. His original vision was that the study 
of comparative neuroanatomy could be integrated 
with the study of comparative behavior. 

Students of C. L. Herrick, such as C. J. Herrick 
and G. E. Coghill, continued in the tradition of the 
elder Herrick after his death. C. J. Herrick (1948) 
concentrated on the connections found in amphib- 
ian nervous systems, relating his information to 
the findings of workers who used other forms and 
interpreting the possible significance of the de- 
scribed connections. Coghill (1929) spent much 
of his life in the study of the development of the 
nervous system of the salamander, in conjunction 
with a study of the development of behavior. Both 
Coghill and C. J. Herrick worked in an era of con- 
siderable interest in comparative neuroanatomy. 
Much of the work of the era was summarized by 
Ariéns Kappers, Huber, and Crosby (1936). 

Concurrently, others studied comparative behav- 
ior effectively without a simultaneous study of the 
structure of the nervous system, but they did not 
overlook the possibility of relating behavior to the 
nervous system. Among these were Yerkes (1916), 
Hunter (1913), Loeb (1918), Noble (1931), and 
Parker (1919), Characteristically, their studies 
were concerned with discriminative capacities, 
learning abilities, tropisms, stereotypical behavior, 


and sensitivities in various species. [See the biog- 
raphies of HUNTER and YERKES.] 

A third group, frequently employing methods of 
intervention in the nervous system as a means of 
studying the relation of the nervous system to be- 
havior or various units of behavior, was led by 
such outstanding men as Lashley, Sherrington, and 
Pavlov. Lashley (1929) was noted for his studies 
of the role of the mass of cerebral tissue in learn- 
ing and intelligence. Sherrington (1906) was con- 
cerned primarily with the problems of the organ- 
ization of the nervous system in the regulation of 
reflex actions. Pavlov (1927) had a similar broad 
concern but an entirely different approach, stress- 
ing the general concept that the study of the elicita- 
tion or suppression of reflexes by systematically 
paired, concurrent stimulation was the key to un- 
derstanding the role of the cerebral cortex. [See the 
biographies of LASHLEY; SHERRINGTON; PAvLov,] 

Current status. Studies of comparative neurol- 
ogy and comparative behavior continue today both 
as separate ventures and as combined enterprises. 
Although the work of many of the men whose 
names are mentioned here and of numerous con- 
temporary workers has never been specifically ad- 
dressed to the problems of the evolution of the 
brain and behavior, much of the data is relevant. 

The information accumulated and available is 
overwhelming but riddled with hidden error, gaps, 
and misconceptions. Since the work on behavior 
and the nervous system is being done on various 
species, we are obliged to make explicit our views 
of the relationships of these organisms, including 
man, if we hope to extrapolate results from one 
species to another. Specifically, we must clarify the 
evolutionary perspective with which comparative 
work is to be viewed. 

Attributes of the central nervous system. The 
common origins of the various vertebrate species 
are reflected in the organization of the brain and 
spinal cord. Man, as would be expected, possesses 
the neuroanatomical characteristics of the verte- 
brates in general, the mammals in particular, and 
the primates especially. Those portions of man’s 
nervous system which are common to the verte- 
brates have been described as primitive, and those 
which man shares with mammals—pre-eminently 
with the primates—are referred to as recent por- 
tions of the nervous system. For example, man pos- 
sesses “paleocortex” (primitive outer cell layers of 
the forebrain) and “neocortex” (more recently 
evolved outer cell layers of the forebrain), Among 
the primates, progressively greater proportions of 
neocortex are found in successively evolved repre- 
sentatives (Clark 1960). 
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Sensorimotor and central systems of connections 


The portions of the nervous system most directly 
significant for behavior are the connections formed 
by the processes of the billions of nerve cells. Mi- 
croscopic examination of suitably prepared biolog- 
ical material reveals the connections to be system- 
atically organized, Systems of connections exist 
within the central nervous system which allow 
identification of the sensory routes, the motor path- 
ways, and the organization of central connections. 
It is a common feature of the vertebrates that the 
sensory pathways have a continuity with motor 
pathways at various levels. We speak, therefore, of 
sensorimotor continuities which form the basis of 
simple and complex reflexes. Together with the 
action of central systems, the outcome of activity 
of higher continuities may become considerably 
more elaborate than the reflexes. 

First-level sensorimotor continuity. The sim- 
plest reflex is formed by the continuity of sensory 
cells and motor cells, the sensory cells sending their 
processes directly to the motor cells. Such reflexes 
are represented in the spinal cord and in the brain 
stem and are common to all vertebrates. The reflex 
closing of the jaws as they tend to open of their 
own weight is an example of the simplest reflex, 
the two-neuron or monosynaptic reflex. Stretch re- 
ceptors in the jaw muscles conduct the excitation 
directly into the brain via sensory neurons. The 
muscle stretched is the same muscle that is there- 
by stimulated to contract. 

The simplest reflexes are segmental reflexes, the 
afferent and efferent neurons belonging to the same 
segment or segment fraction. Vertebrate segmen- 
tation is comparable in terms of embryological 
origins, and the concept of segmentation is used to 
include the structures related to each of the fifth, 
seventh, ninth, and tenth cranial nerves. The jaw 
muscles, for example, which possess an afferent 
and efferent supply from the same segment (fifth 
nerve), tend to remain reflexly in a steady state 
as they respond to the steady pull of gravity. 

The significance of the reflexes of a segment 
fraction is more far-reaching than indicated thus 
far, The first-level reflexes are the reflexes of sus- 
tained activity (tonic reflexes), and they are gen- 
erally opposed by the second-level reflexes. The 
reflexes of the first and second levels form a re- 
ciprocal dichotomy. (We are dealing here with only 
the first half of the dichotomy.) Tonic reflexes are 
supported by cutaneous stimulation as well as 
by stretch stimulation. The skin is systematically 
supplied by spinal nerves and cranial nerves in a 
segmental pattern. In all vertebrates, particular 
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portions of the body surface belong to distinct seg- 
ments or segment fractions. As in the stretch 
reflex, it is reasonable to suggest that moderate 
stimulation of a given area of the body surface 
results in reflex action which is prolonged and 
keeps the stimulated body surface in continuing 
contact with the stimulus. The extensor muscles 
of the limbs, in response to stimulation of the toe 
pads, reflexively support the standing posture of 
the quadruped and thus enhance the continuing 
stimulus. The radial nerve which innervates the 
dorsal surface of the hand also supplies the ex- 
tensor muscles in the upper limb. The extensor 
posture of the ape’s forelimb, as he rests his weight 
on his knuckles, is consistently categorized in these 
terms. The median nerve supplies the palmar 
surface and the flexor muscles of the hand. The 
grasp reflex of the human infant is understand- 
able in a similar way. The value of the grasp re- 
flex to the primate clutching his mother’s hair as 
she carries him through the trees can be readily 
appreciated. 

Second-level sensorimotor continuity. The re- 
flexes of the second level form the other half of 
the reciprocal dichotomy. Each of the first-level 
reflexes can be inhibited and replaced by an 
antagonistic action when the stimulus strength 
changes from moderate to intense. A thorn in the 
knuckle of the ape prevents extension and pro- 
duces flexion. A beesting to the palmar surface will 
break the grasp reflex and produce opening (ex- 
tensor action) of the hand. These are examples of 
the basic withdrawal reflexes which depend upon 
inhibition of the simpler reflexes in order to ap- 
pear. They are polysynaptic reflexes which also de- 
pend upon sensorimotor continuities in the spinal 
cord and brain stem. It is reasonable to regard 
these second-level reflexes as discharging motor 
neurons in segment fractions adjacent to the stim- 
ulus and inhibiting motor neurons in the same 
segment fraction as the originating stimulus. 

The second-level continuities are common to all 
vertebrates. In vertebrates without appendages the 
form of the first-level and second-level reflex or- 
ganization need not differ fundamentally from that 
found in the other vertebrates. It is characteristic 
of all vertebrates that the cutaneous portion of the 
fifth nerve, which provides the nerve supply to 
the face, extends its fibers into the cervical spinal 
cord, where contact is established with motor neu- 
rons controlling the neck musculature. The ar- 
rangement allows for reflex turning of the head 
toward or away from the side of cutaneous stimula- 
tion. When the stimulus to one side of the face 
is strong, the neck musculature reflexly turns the 
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head away from the stimulus. When the stimulus 
is moderate, the neck musculature turns the head 
toward the side of the stimulus. It is instructive 
to note with regard to the above statements that 
the larval form of the primitive eel, Petromyzon 
marinus, shows reactions comparable to those of 
the human infant. The infant turns its head to- 
ward the nipple as it contacts the cheek but turns 
its head away if the cheek is pinched. The larval 
form of Petromyzon marinus burrows into the mud 
of the river bed. The burrowing is accomplished 
by undulatory movements of the entire body as the 
organism penetrates the mud head first. It is logical 
to expect that, as it burrows, it presses the side 
of its head first to one side and then to the other 
side of the hole it makes as it burrows. It would 
be consistent with effective burrowing for the larva 
to continue to press to one side until the cutaneous 
stimulus became strong on that side and then to 
reverse its pressing, the pressing continuing on the 
other side until the stimulus there became strong, 
and so on to produce successive undulations start- 
ing near the head and proceeding tailward down 
the body until the animal has gone far into the mud 
by the forward propulsion of these undulations. 
Copulatory reflexes, which are, of course, essen- 
tial for species propagation, may be viewed as 
special instances of alternating first-level (tonic, 
thrusting) and second-level ( withdrawal) reflexes. 
Third-level sensorimotor continuity. A continu- 
ity of sensory to motor fibers is established through 
the medial reticular formation of the brain stem 
in all vertebrates, Sensory fibers of the dorsal root 
ganglia connect with interneurons of the spinal 
cord. Many of these interneurons send their proc- 
esses directly to the large cells of the medial reticu- 
lar formation. It should be said parenthetically that 
the brain-stem reticular formation consists of a 
meshwork of cell bodies and fibers in the core of 
the brain stem, and it is in the medial part thereof 
that the large cell bodies are situated. The large 
cell bodies send processes back to interneurons of 
the spinal cord, where contact with motor neurons 
can be made. The medial reticular formation is 
also influenced by cranial nerves, which directly 
contact the dorsolateral portion of the reticular 
formation. The dorsolateral portion, in turn, con- 
nects with the medial reticular formation. 
j The significance of the third level of continuity 
lies in providing a route through which existing 
reflex activity may be enhanced and competing, 
antagonistic reflex activity may be weakened. It is 
a route through which stimulation arising in one 
segmental fraction may influence sensorimotor 


action in more widely distributed segments or 
many segments at once. In concert with interseg- 
mental spinal connections, the third level stabilizes 
the stereotyped whole-body postures of terrestrial 
forms and the stereotyped whole-body movements of 
aquatic forms. In stabilizing the upright posture of 
the vertebrate, the reticular role is heavily depend- 
ent on the action of the vestibular nerves. Vestibular 
reflexes, stimulated by the pull of gravity, define 
the symmetrical, upright posture of the head and 
body. Magnification of the action is achieved 
through level 3. Except for primates in which even 
higher levels of sensorimotor continuity are re- 
quired for effective standing, level 3 is adequate 
for maintaining an exaggerated, crude, upright 


position. 
When the organism is ill or weary, however, the 
upright posture collapses. The mechanism of the 


action is unknown, but it is evident that the ves- 
tibular reflexes and the actions of level 3 are con- 
siderably diminished under such circumstances, It 
is possible that a source of inhibition on the medial 
reticular formation rather than simply an exhaus- 
tion of reticular activity enforces the resource- 
restoring condition of rest. A logical origin of the 
inhibition may be the visceral afferent input 
reaching the dorsolateral reticular formation di- 
rectly through the vagus or glossopharyngeal nerves, 
which carry afferent responses to a visceral “crisis” 
(chemical imbalance ). 

Fourth-level sensorimotor continuity. _Interneu- 
rons of the spinal cord, serving spinal afferents, 
and neurons of the brain stem, serving cranial 
nerve afferents, distribute axons to the cerebellar 
cortex. The cerebellar cortex, in turn, connects 
with deep cerebellar nuclei, which send axons to 
the medial reticular formation, These connections 
form the fourth level of sensorimotor continuity. 
The arrangement is one which allows sensory feed- 
back to have a regulative impact on the actions 
of the medial reticular formation. Broadly speak- 
ing, sensory elements responding to muscle ten- 
sion and elements responding to moderate cutane- 
ous stimulation are sources of negative feedback. 
Elements responding to intense stimulation such 
as pain are sources of positive feedback, which 
serves to sensitize all reflexes except the first-level 
reflex to the local stimulus. 

The vestibular nerve connects directly with the 
fourth level. Typically, the result of vestibular ac- 
tivity is dependent on asymmetry of stimulation 
of the vestibular receptors on the two sides. Thus, 
if the right side of the head is lower than the left, 
the extensors of the right side are stimulated, while 
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the flexors of the left side are stimulated. The 
action is magnified by the vestibular contribution 
to the fourth level, the fourth level acting on the 
third level. The balance between the two sides of 
the body which is achieved is characteristic of the 
role of the cerebellum in equalizing the muscle 
tone (sensitivity of postural reflexes) on the two 
sides of the whole body during stationary postures. 
Although the cerebellar level is represented in 
all vertebrates, it becomes progressively more elab- 
orate as the vertebrates themselves exhibit greater 
differentiation of segmental structures. 
Fifth-level sensorimotor continuity. The mid- 
brain tectum and associated central gray also re- 
ceive directly from rostrally conducting interneu- 
rons of the spinal cord and brain stem. The tectum 
sends fibers to the reticular formation and upper 
spinal cord. The fifth level of sensorimotor con- 
tinuity is thus formed. Optic, auditory, vestibular, 
and somatic sensory systems converge on the mid- 
brain tectum and/or central gray. These sensory 
systems provide guidance for primitive, whole-body 
reactions. The reactions, like the isolated reflexes 
of the first and second levels, fall into two mutually 
exclusive categories. They consist of whole-body 
reactions toward or away from a stimulus. Toward- 
turning and away-turning form one dichotomy, 
while forward lunging and backward progression 
form another, The importance of this level for 
food-seizing, whole-body approach (including sex- 
ual approach), and whole-body withdrawal is evi- 
dent. Coincident with approach and withdrawal, 
the midbrain (central gray particularly) is impli- 
cated in the linkages which define “distress” or 
“well-being” in terms of concurrent intrinsic reac- 
tions such as “distress calls,” “cooing” vocalization, 
autonomic changes, and endocrinological reactions. 
Fifth-level connections are common to all verte- 
brates, but they are especially well developed in 
birds. The fifth level is probably most important to 
mammals when they are as yet immature. When 
the seventh level (recently evolved forebrain level) 
reaches maturity in mammals, it provides a more 
elaborate and flexible guidance system for whole- 
body activities. In the long history of the verte- 
brates, the functioning fifth level probably made 
possible the evolution of the seventh level, which, 
in a sense, supersedes the fifth level. In the ontog- 
eny of the mammal, the immediately functioning 
fifth level allows for the more gradual development 
of the seventh. 
Sixth-level sensorimotor continuity. The same 
order of sensory interneurons which entered into 
connection with levels 3 and 5 connects with the 


primitive (mid-line and intralaminar) thalamus. 
The primitive thalamus, besides having numerous 
central interconnections, connects with the stria- 
tum and hypothalamus. Both the striatum and hy- 
pothalamus connect with the midbrain level. It is 
suggested that the striatum paces somatic activity 
and the hypothalamus paces the concurrent vis- 
ceral activity, both the striatum and hypothalamus 
being dependent on the connections through the 
primitive thalamus. The primitive thalamus is, 
thus, critical in adjusting the rate of reaction of 
the organism. Perhaps because of its systematic 
central connections, the presumptive outcome of 
the activity conducted through the primitive thala- 
mus would be to increase or decrease the rate of 
sustained withdrawal or approach at the same time 
concurrent visceral activity is intensified or re- 
tarded. To be of survival value, the outcome must 
be in accord with the demands placed on the or- 
ganism, as these demands are represented by the 
convergence of sensory input to the thalamus. 

Level 6, together with its associated primitive 
forebrain connections, is represented in all verte- 
brates. The contribution of the olfactory nerve is 
represented at the sixth level as it reaches the stri- 
atum, hyypothalamus and paleocortex (or its coun- 
terpart in primitive forms) almost directly. On 
theoretical grounds, olfactory stimulation is thus 
in a position to determine the rate of visceral and 
somatic activity. The contribution to the cortex is 
likely to play a vital role in the memory of spe- 
cific odors. Various odors are, or become, important 
for determining whether the organism hastens its 
approach or retreat. Other substances indicate 
whether the organism is in territory where it usu- 
ally comes to rest. 

Seventh-level sensorimotor continuity. Sensori- 
motor level 7 and its associated central connections 
evolved out of those in level 6 for the most part, 
but links with all other levels are apparent. 

Just as there are second-order neurons connect- 
ing directly with the primitive thalamus, so there 
are second-order sensory neurons connecting di- 
rectly with the neothalamus, the neothalamus 
being defined as those portions of the thalamus 
which have major projections directly to the neo- 
cortex. The neocortex is understood as that part 
of the cerebral cortex which is found in mammals 
only. Parts of the neocortex, in turn, send fibers 
to the striatum, hypothalamus, midbrain tectum, 
cerebellar cortex, reticular formation, the inter- 
neurons of the spinal cord, and, in primates, the 
motor cells themselves. The last connection indi- 
cates the direct way in which the neocortex can 
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control motor activity. Level 7 is a sensorimotor 
level richly interwoven with reciprocal central con- 
nections of the neothalamus and neocortex. Within 
broad limits, every level below level 7 can be in- 
hibited or facilitated by the activity of level 7. 

Level 7, together with its associated, regionally 
specialized, central connections, contributes a vast 
enrichment of permutations and combinations to 
the sensory and central control of sensorimotor 
integration. The regionally specialized connections, 
influenced by lower levels, limit the likely character 
of organized activity. Reflexes and tendencies of 
lower levels are incorporated in body-wide activities. 

However, the regional specialization in the fore- 
brain is, at present, only crudely identified. Agree- 
ment on the significance of various neocortical 
regions, for example, is not universal. Regions re- 
ceiving and processing somatic, visceral, optic, 
auditory, gustatory, and olfactory input are de- 
limited. Zones especially important for speech have 
been outlined. Organization of motor activity has 
been linked with the frontal areas. The connections 
of the forebrain important for such functions as 
learning, memory, emotion, attention, and self- 
Programming (establishing a hierarchy of pre- 
ferred directions and order of complex behavior ) 
are not at all clear. [See the biographies of Broca 
and FLOURENS.] 


Unraveling the behavioral significance of the 
organization of the forebrain is an enormous chal- 
lenge. It required many millions of years to evolve 
the existing advanced vertebrate brain. We should 
not expect to find the task of understanding the 
present end products an easy one. It will require 
many years of integrated, cooperative investigation, 
thought, and communication to reconstruct the 
evolution of the brain and to understand its full 
significance for behavior. 


WALTER Riss 


[Other relevant material may be found in Evotution, 
articles on HUMAN EVOLUTION and EVOLUTION AND 
BEHAVIOR. ] 
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n 
STRUCTURE AND FUNCTION OF THE BRAIN 


The primary function of the central nervous 
system, which includes both spinal cord and brain, 
is the process of neural integration for the expres- 
sion of adaptive responses. Sensory information, 
arising from physical stimuli in the external and 
internal environment, reaches the central nervous 
system via the peripheral and cranial nerves in the 
form of bioelectrical nerve impulses. These sensory 
impulses are integrated with central neurophysio- 
logical activities of the brain to produce appro- 
priate motor response patterns. 


The nerve cell 


The primary morphological element for infor- 
mation transmission in the nervous system is the 
neuron, a special type of cell with projections ex- 
tending from the cell body. In physiology, those 
processes which conduct electrochemical impulses 
toward the cell body are called dendrites and are 
essentially protoplasmic extensions of the cell 
body. They usually extend a short distance and 
tend to branch profusely. In this fashion a great 
increase in the surface area of the nerve cell is 
provided, so that many other neurons may be 
linked with it. There are, however, large numbers 
of neurons in the nervous system which have no 
dendritic processes. The single projection which 
carries information away from the cell body is 
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referred to as the axon or nerve fiber. The axon is 
frequently of considerable length, and although it 
may possess collateral branches along its extent, 
the greatest branching is at its termination. Axons 
either terminate in muscle or form junctions with 
other neurons, Nerve fibers are generally covered 
by a fatty sheath called myelin, and in the case of 
axons lying outside the central nervous system, 
the myelin is in turn surrounded by a thin mem- 
brane known as the neurilemma. The neurilemma 
participates in regeneration of peripheral axons 
following injury or transection of the fiber, and 
since there is an absence of neurilemma in the 
central nervous system, regeneration does not nor- 
mally occur there to any significant degree. While 
there are many structural types of nerve cells and 
great diversity in the arrangement of their projec- 
tions, neurons may be functionally divided into 
three classes. These are sensory (afferent) neurons, 
motor (efferent) neurons, and associational or in- 
ternuncial neurons. (Figure 1 provides a diagram 
of a motor neuron.) Exceptions to this schematiza- 
tion as discussed below are special efferent fibers 
which are found in sensory systems but do not 
serve a motor function. 

The nerve impulse. The ability of neurons 
to conduct nerve impulses along their length is 
largely a function of the cell membrane. When 
the neuron is in the resting state the cell membrane 
is said to be polarized—that is, a steady electro- 
chemical potential difference is maintained over 
the membrane by virtue of its selective ability to 
prevent sodium ions from passing from extracel- 
lular to intracellular space and to permit potassium 
ions to move inward through the membrane. As a 
result of this differential, the concentration of ions 
insfde the cell becomes electrically negative with 
respect to the outside, and the magnitude of this 
difference in potential is usually between —50 and 
—90 millivolts, 

When an appropriate stimulus is applied to a 
nerve fiber, the membrane at the point of stimula- 
tion suddenly becomes permeable to sodium ions, 
and these move across the membrane and inside 
the fiber. This process of membrane depolarization 
is represented by a rapid reversal of the resting 
potential from, for example, —70 millivolts to +40 
millivolts. This change of about 110 millivolts is 
referred to as the nerve-impulse, or nerve-action, 
potential. Immediately following depolarization, 
the permeability of the membrane to potassium in- 
creases, and potassium ions shift from inside to 
outside the cell causing a reversal of the nerve- 
action potential and a restoration of the normal 
resting level (repolarization). The occurrence of 
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the nerve-action potential and the return to the 
resting potential require from 0.5 millisecond to 
2.0 milliseconds. When an action potential is gen- 
erated in a segment of a fiber, the resulting entry 
of ions into the cell from outside produces depo- 
larization of the adjacent membrane as well. This 
marginal depolarization acts as the stimulus to a 
further breakdown in permeability along the mem- 
brane, and the process continues as a chain reac- 
tion along the length of the fiber. In this manner 
the nerve impulse is propagated over the length 
of the fiber in a fashion not unlike that in which 
ignition travels along the fuse of a firecracker. The 
range of velocity of the nerve impulse is very wide, 
varying from approximately 1 to 300 meters per 
second, depending upon the diameter of the fiber 
and other factors. The size of the nerve-action po- 
tential is not a function of the strength of the 
stimulus applied to the nerve. It is true that the 
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Figure 1 — Diagrammatic drawing of a neuron 


Source: Adapted from Physiological Psychology, Third 
Edition, by Clifford T. Morgan. Copyright © 
1965 by McGraw-Hill, Inc. Used by permission 
of McGraw-Hill Book Company. 
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magnitude of the nerve-action potential may vary 
from fiber to fiber, but within a neuron and under 
constant conditions it is usually invariant, occur- 
ring in full amplitude or not at all, although there 
are certain exceptions to this rule. Also of im- 
portance is the fact that the nerve impulse does not 
decrease in size as it sweeps along the fiber. This 
all-ornone principle applies only to the nerve- 
action, or spike, potential of the axon. There is 
another kind of potential change which immedi- 
ately precedes the generation of the nerve impulse 
whenever a stimulus—physiological or artificial— 
is applied to a neuron. This is the local excitatory 
potential, which is graded in amplitude and, thus, 
does not function in an all-or-none fashion. The 
size of the excitatory potential is proportional to 
the magnitude of the stimulus applied and repre- 
sents a partial depolarization of the cell membrane. 
It is propagated only in a weak and decremental 
fashion, if at all. If the stimulus is not very strong, 
the local potential will decay rapidly and no further 
neural events will occur. If, however, the stimulus 
is sufficiently strong, the local potential will in- 
crease in amplitude and reach the threshold for 
triggering the all-or-none nerve-action potential. 

The spike potential is followed immediately by 
a brief period during which the axon is absolutely 
refractory and will not respond to any stimulus, 
regardless of magnitude. However, the fiber very 
soon enters a period of relative refractoriness 
before returning to the resting level. During the 
relative refractory period the fiber will respond to 
stimulation, but only if it is of greater strength 
than is required by the resting nerve. Although the 
refractory period lasts only a matter of millisec- 
onds, it is nonetheless clear that the rate of dis- 
charge possible in a given fiber is limited by the 
presence of periods of absolute refractoriness. The 
exact duration of the refractory period is a func- 
tion of nerve diameter and structure, and the re- 
covery rate in the most rapidly conducting fibers 
permits them to discharge nerve impulses at a 
frequency of approximately one thousand per 
second. 

Synaptic transmission. The synapse is a func- 
tional connection between two neurons in which 
the axon endings of one cell make contact with 
the surface of a dendrite or cell body of another 
neuron. Since there is considerable branching 
toward the end of the fiber, an axon may establish 
Junctions with many cells, Reciprocally, the body 
of a neuron may have in contact with it a very 
large number of axon endings arriving from many 
other cells. The axon terminals, which are often 


referred to as end buttons or knobs, make a slight 
indentation in the postsynaptic cell body. There is, 
in fact, a narrow separation between the axon ter- 
minals of the presynaptic cell and the dendrites 
and cell bodies of the postsynaptic neuron. This 
separation is only a few hundred angstroms in 
width and is referred to as the synaptic cleft. 
When the action potential reaches the axon 
branches and endings resting on another neuron, 
a chemical substance is released by the presynaptic 
terminal. Electron microscopy has revealed that 
the presynaptic terminals contain tiny vesicles 
which are related to the formation and release of 
these transmitter substances. The released chemi- 
cal agent then acts upon the membrane of the 
postsynaptic cell to alter its permeability to ions. 
The presynaptic terminals may be either excitatory 
or inhibitory. In the case of excitatory synapses the 
transmitter substance presumably produces a de- 
polarization of the postsynaptic membrane, which 
is represented by a graded potential. If the input 
to the postsynaptic cell becomes sufficiently strong, 
a nerve-action potential will be generated when 
threshold is reached, and it will be propagated 
down the axon, This will occur when there is spa- 
tial and temporal summation of a number of slowly 
decaying graded potentials which have been pro- 
duced sequentially and simultaneously by a volley 
of incoming signals over many presynaptic ter- 
minals. In the case of inhibition at the synapse, 
the chemical transmitter produces hyperpolariza- 
tion and stabilization of the postsynaptic cell mem- 
brane. As a consequence there is decreased re- 
sponsiveness to excitatory volleys and depression 
of activity in the cell. It is presently believed that 
the transmitter substance released at excitatory 
terminals is different from that released at inhibi- 
tory terminals and that the excitatory substance 
produces depolarization by a nonspecific increase 
in permeability to all ions, while the inhibitory 
substance produces hyperpolarization through a 
Specific increased permeability to potassium ions. 
The excitatory and inhibitory substances them- 
selves have not as yet been identified. Two sub- 
stances involved in synaptic transmission (acetyl- 
choline and norepinephrine) have been known for 
some time, but their precise relationship to excita- 
tory and inhibitory actions is not well understood. 


Development of the central nervous system 


On the back, or dorsal, surface of the embryo 
there lies a sheet of cells called the ectoderm. In 
early development this layer becomes thickened, 
and because cells lying toward the edge of this 
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plate have a more rapid growth rate than cells in 
the middle, a neural groove is formed. As this 
groove becomes deeper, it eventually closes off on 
top, forming a neural tube which extends from 
head to tail of the embryo. The tube then detaches 
itself from the remainder of the ectodermal plate. 
It is from this neural tube and a group of detached 
ectodermal cells lying along its side—called the 
neural crest—that the whole adult nervous system 
is formed, 

In the developing embryo the brain first becomes 
apparent as an enlargement at the anterior end of 
the tube; the remainder of the tube becomes the 
spinal cord. With further growth the embryonic 
brain differentiates into three vesicular subdivisions: 
the forebrain (prosencephalon), midbrain (mes- 
encephalon) and hindbrain (rhombencephalon). 

Six weeks after fertilization there is further sub- 
division of these primary vesicles. The prosen- 
cephalon divides into an anterior portion, the telen- 
cephalon, or endbrain—which will eventually form 
the cerebral hemispheres (cerebral cortex, rhinen- 
cephalon, and basal ganglia )—and a posterior part, 
the diencephalon, or “twixtbrain” (thalamus and 
hypothalamus), The mesencephalon remains rela- 
tively small and undifferentiated, but from it will 
be derived its constituent components: the teg- 
mentum, colliculi, and cerebral peduncles. The 
rhombencephalon divides into anterior and pos- 
terior portions, the metencephalon, or afterbrain 
(cerebellum and pons), and myelencephalon, or 
marrowbrain (medulla oblongata), respectively. 
The long, tail-like remainder of the neural tube 
will become the adult spinal cord. A schematic 
representation of these developmental stages of the 
brain is given in Figure 2. 

The cavity which extends throughout the length 
of the embryonic neural tube forms, in the fully 
developed fetus, the ventricles of the brain and 
central canal of the spinal cord, which contain 
cerebrospinal fluid. 

In mammals, one of the most remarkable events 
during development is the expansion and elabora- 
tion of the cerebral cortex. The degree of develop- 
ment of the cerebral mantle is greatly out of 
proportion to that of the rest of the brain. Its rela- 
tionship to brain stem is not unlike that of the cap 
of a mushroom which has grown over, around, 
and partially down the sides of the stem. In addi- 
tion, in higher mammals the surface of the cortex 
is further increased by numerous foldings or con- 
volutions. A groove created by these foldings is 
referred to as a fissure or sulcus, and the ridge 
formed between two fissures is called a gyrus. 
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Fully two-thirds of the cortex in humans lies buried 
in fissures. The importance of the cerebral mantle 
in human behavior is to some degree expressed 
by the fact that it weighs as much as all other 
central-nervous-system structures combined. 
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Figure 2 — Schematic representation of development of 
the embryonic and fetal brain 


Source: After Patten in Gardner, Fundamentals of Neurology, 
4th ed., Philadelphia, W. B. Saunders, 1963. 
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Organization of structural elements 


The term “gray matter” is used to refer to neu- 
rons and, specifically, to areas where a large num- 
ber of cell bodies are collected together, since their 
natural color is a brownish gray. In contrast, most 
nerve fibers appear distinctly white, as they assume 
the color of the myelin sheath which covers them. 
When bundles of nerve fibers course together in 
the central nervous system, they are referred to as 
white matter, and these pathways of fibers connect- 
ing one area of the nervous system with another 
are called tracts within the central nervous system 
and nerves in the peripheral nervous system. A 
tract which connects a structure on one side of the 
brain with the homologous area in the opposite 
hemisphere is called a commissure. 

A distinct aggregate of cell bodies is referred to 
as a nucleus when it lies within the central nervous 
system and as a ganglion when it is in the pe- 
riphery. In many areas of the brain, neurons are 
distributed in a less concentrated fashion and oc- 
casionally they are arranged in layers, or lamina— 
as in the cerebral cortex, for example. The term 
“center” has frequently been applied to those areas 
or nuclei in which it has been demonstrated that 
a substantial number of neurons play a common 
role in a particular physiological or behavioral 
function. This term is roughly correct if it is taken 
to mean that the area is an important link or locus 
of elaboration in the execution of the response 
under consideration, but it has often been misused 
to imply that a center functions in an autonomous 
manner or in isolation from other areas of the 
brain. Regions near to and distant from an im- 
portant nucleus or center may influence and modu- 
late the particular behavior by virtue of connections 
with the center. No central-nervous-system struc- 
ture operates entirely independently of other areas 
or is solely responsible for producing a response, 
For example, by priming the center with excitatory 
or inhibitory impulses remote structures partici- 
pate in the determination of whether or not a re- 
sponse is to occur at any given point in time. Under 
special circumstances, such as injury to or destruc- 
tion of the center, other areas may become capable 
of producing the behavior through alternate path- 
ways. In this framework, the center is important 
because it is an area for the collection and inte- 
gration of neural information concerned with a 
behavioral mode and because it may be a critical 
link in the normal creation of a response. 

: In the peripheral nervous system a distinction 
is made between somatic and autonomic motor 


nerves, both of which have their origin of action 
in the central nervous system. The somatic motor 
nervous system consists of those efferent motor 
nerves which have their cell bodies in the spinal 
cord or brain but whose axons extend into the 
striated muscles attached to the skeleton. The 
autonomic nervous system is concerned with re- 
sponses of smooth and syncytial muscles. These 
include heart, blood vessels, glands, gastrointes- 
tinal tract, genitourinary structures, irises, and 
other internal structures. Cell bodies of the neurons 
of the peripheral autonomic nervous system lie in 
ganglia outside of the brain and spinal cord but 
are under the influence and control of autonomic 
fibers originating within the central nervous system 
and exerting their influence through synapses in 
the ganglia. The distinction between somatic and 
autonomic nervous systems is sometimes a useful 
one, but fundamentally it is artificial. The reason 
for this is that within the central nervous system 
the areas and pathways responsible for somatic 
and autonomic responses are not clearly differ- 
entiated. While it is true that a nucleus, area, or 
pathway within the brain may function predomi- 
nantly in either autonomic or skeletal muscular 
responses, it is also the case that neural activity 
in such a structure is likely to affect both types 
of peripheral response mechanisms. 

The cell bodies of sensory neurons entering the 
spinal cord lie in ganglia outside the central nerv- 
ous system, and their fibers may be either somatic 
or visceral, depending upon the site of termination 
in the periphery. Somatic afferent fibers convey 
tactile, pressure, pain, or temperature information 
from skin, muscle, tendon, and joint receptors to 
the cord and brain. Visceral afferents transmit 
sensory impulses from the internal organs, glands, 
and blood vessels. Sensory activity in the visceral 
afferents is in close functional relationship to the 
autonomic nervous system, as is somatic sensory 
activity to the somatic motor system. 

The reflex. A simple reflex arc consists of at 
least two or three neurons and provides a mecha- 
nism whereby a relatively fixed and rapid behavior 
pattern may occur in response to an appropriate 
sensory stimulus. 

A two-neuron reflex arc comprises a receptor 
and an afferent neuron bearing information about 
the stimulus to the central nervous system and a 
motor neuron with which the sensory cell synapses 
and through which the response is produced. This 
simple reflex pathway is referred to as a mono- 
synaptic reflex arc, since only one synaptic junc- 
tion is involved. An example is the well-known 
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knee jerk. This reflex may be produced by a tap 
on the tendon just below the kneecap. This elicits 
a slight stretching of the muscle fibers and results 
in stimulation of muscle-spindle receptors. Sensory 
impulses are conducted into the spinal cord, where 
motor neurons are excited, their discharge produc- 
ing quick, phasic contractions of the muscles and 
a consequent “jerk” of the leg. Monosynaptic re- 
flexes also include arcs in which there are more 
sustained or tonic contraction of muscle, and these 
play an important role in the control and mainte- 
nance of posture. 

In a three-neuron or multineuronal arc there 
may be one or many neurons interposed between 
the sensory and motor cells. In general, the greater 
the number of these interposed links (interneu- 
rons) between the sensory input and final motor 
outflow, the more complex and less stereotyped the 
reflex. 

The sensory impulses which produce a reflex 
may also be propagated over fibers that reach the 
cerebral cortex and, thus, result in awareness of 
the stimulus. However, this sensation is incidental 
to, and not critical for, the elicitation of the reflex. 
Furthermore, although a reflex pathway may func- 
tion without the participation of other spinal or 
brain areas, this potential for independence is un- 
usual in the normal activity of the organism. Re- 
flexes which involve only one segment of the spinal 
cord are referred to as segmental reflexes; but as 
already pointed out, most reflexes include the par- 
ticipation of more than one spinal segment (inter- 
segmental reflexes) and frequently the brain as 
well (suprasegmental reflexes). Suprasegmental 
reflexes may be extremely complex, in the sense 
that organization and coordination of the rela- 
tively simple spinal and intersegmental reflexes are 
carried out by the higher centers. For example, the 
postural supporting and shifting reactions, which 
are required for walking and which involve alter- 
nating contraction and relaxation of the limbs, 
are exquisitely integrated by delicately balanced 
mechanisms in the brain stem. A reflex may be 
somatic or autonomic, and it may involve either 
spinal or cranial nerves or a combination of both. 

Examples of reflexes, in addition to the postural 
adjustments mentioned briefly above, are with- 
drawal of a limb from a painful stimulus before 
the pain is appreciated as a sensation; the con- 
strictor response of the iris in response to bright 
light; coughing; sneezing; gagging; vomiting; ad- 
justments in heart rate, blood pressure, and gastro- 
intestinal activity. It is evident from this represent- 
ative, but incomplete, listing that reflexes are 
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appropriate moment-to-moment adaptations of the 
organism upon which its very survival depends. 

The importance of reflexes for behavior in gen- 
eral may be elaborated and summarized by the 
following statements: reflexes are innate in the 
sense that they are the result of genetically trans- 
mitted neural mechanisms; they are hierarchically 
organized so that very simple reflexes can be com- 
bined, by the activity of brain centers, into more 
complex response patterns; while the most simple 
reflexes within an organism are fixed and stereo- 
typed, complex reflexes tend to be variable and can 
be modified by experience; finally, phylogenetically, 
nervous systems have evolved from diffuse and 
crude reflex mechanisms to more specialized link- 
ages capable of clearly differentiated response pat- 
terns. In higher species the borderline between 
elaborate adaptive reflexes and complex flexible 
behavior which can be learned is not clear. In man, 
evolution culminates in the capacity for a high 
degree of symbolization and the appearance of lan- 
guage and speech. This capacity includes the ability 
to abstract general principles from specific items 
of information and memory and the development 
of an extraordinarily complex social organization 
accompanied by the ability to transmit it. 

From the foregoing, it is evident why the reflexes 
are commonly considered the basic units of the 
nervous system and the functional elements from 
which all other behaviors are derived. We have 
observed an apparent structural, physiological, and 
behavioral continuity extending from the simple 
segmental reflex to complex human behavior. How- 
ever, whether this continuity between reflex and 
symbolic capacity is actual or illusory remains 
today very much an open question, for although 
there obviously exist common underlying structures 
and processes, it is not possible to describe or ac- 
count for a large part of motivated behavior and 
learning—much less ideation, reasoning, or ab- 
straction—in terms of basic reflex mechanisms. 

The cranial nerves. In addition to paired spinal 
nerves which are arranged along the vertebral 
column, 12 pairs of cranial nerves enter the brain 
stem and cerebrum directly. 

From each spinal segment project four types of 
nerve fibers, which may be differentiated function- 
ally: the general somatic and visceral afferents 
and efferents. Some of the cranial nerves also con- 
tain these same four functional fiber types, but 
others may carry only one or two kinds. In addition, 
the term “special sensory afferents” is applied to 
fibers transmitting taste, olfactory, visual, auditory, 
and vestibular sensations. The first two of these are 
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considered to be visceral in origin and the remain- 
der somatic, Further, the motor fibers which supply 
the striated muscle of the head and neck fall in 
the unique category of special visceral efferent 
fibers. Table 1 describes the origin (or termination) 
and primary functions of each nerve. (The cranial 
nerves are traditionally designated by Roman nu- 
merals as well as by name.) 

It should be noted that three of the cranial nerves 
are purely sensory in their activity; the Ist, IInd 
and VIIIth. The VIIIth cranial nerve actually medi- 
ates two sensory modalities, hearing and equilib- 
rium. By rigorous definition only the XIth cranial 
nerve has solely a motor function. Nerves V, VII, 
IX, and X contain mixed afferent and efferent fi- 
bers, and both the sensory and motor components 
of these nerves play critical, or substantially im- 
portant, roles in the effective functioning of the 
organism. The sensory components of nerves III, 


IV, VI, and XII are of considerably less practical 
significance. 


Subcortical divisions 


It is important to realize that the whole of the 
central nervous system is a continuous entity in 
which the component structures rarely, if ever, op- 
erate in total isolation, Although we speak of the 
spinal cord and higher structures as mediating or 
being responsible for particular functions, they 
almost invariably do so with some degree of co- 
operation from many other areas. As a general 
principle the simplest reflexes and behavior pat- 
terns are present at the lower levels of the central 
nervous system, and more complex activities appear 
in an integrated form only when higher structures 
are functioning in a normal manner. 

Spinal cord. A basic function of the spinal cord 
is the conduction of nerve impulses to and from 


Table 1 — The cranial nerves 


ORIGIN 
NUMBER NAME (OR TERMINATION) 
| Olfactory Telencephalon (ventral Smell 
thinencephalon) 
I Optic Thalamus Vision 
MW Oculomotor Midbrain 
IV Trochlear Midbrain 


Ocular muscle proprioception 


FUNCTIONS 


Sensory Motor 


Eye movement 

Eyelid elevation 
Contraction of iris 
Accommodation of lens 


v Trigeminal 


M Abducens 
Vil Facial 
Vill Vestibulocochlear 


(a) Vestibular division 
(b) Cochlear division 


IX Glossopharyngeal 


x Vagus 


XI Spinal accessory 


xii Hypoglossal 


Midbrain and pons 


Pons 


Pons 


Medulla 


Medulla 


Medulla 


Medulla and spinal cord 


Medulla 


Ocular muscle proprioception 

Touch, pain, and temperature sen- 
sation from face, jaw, head, ear, 
cornea, sinuses, teeth, gums, 
tongue, mucous membranes of 
mouth, and meninges 

Proprioception of masticatory 
muscles 

Ocular muscle proprioception 

Taste (anterior two-thirds of tongue) 


Equilibrium 
Hearing 
Taste (posterior one-third of tongue) 


Cutaneous sensation of external ear 

Sensation from pharynx, larynx, 
trachea, and esophagus 

Visceral afferents from thoracic and 
abdominal cavity linings and 
organs 

Taste 


Position sense of tongue 


Eye movement 
Muscle movement of jaw, middle 
ear, and palate 


Eye movement 

Muscle movement of face, scalp, 
and middle ear 

Secretion of lachrymal and salivary 
glands 


Swallowing 

Salivary secretion 

Control of activity of heart, gastro- 
intestinal tract, blood vessels, 
and other visceral organs 
thorax and abdomen 

Control of pharyngeal, laryngeal, 
and esophageal muscles 


Muscles of pharynx, larynx, and 
palate 

Muscles for flexing neck, rotating 
head, and raising shoulders 

Tongue movements 
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higher levels of the central nervous system. Except 
for nerve activity carried over the cranial nerves, 
all remaining sensory information and motor out- 
flow travels through the spinal cord and the nerves 
which extend outward from it to the periphery. 
Spinal functions have often been determined by 
studying animals in which the cord has been ex- 
perimentally severed or in humans in whom spinal- 
cord transection has been the result of accidental 
injury. In man there are 31 pairs of nerves ar- 
ranged along the length of the spinal cord, which 
is encased in the bony vertebral column. Each pair 
of nerves lends to the cord the appearance of ex- 
ternal segmentation. These 31 segments have been 
designated as follows, proceeding from the top 
(rostral ) portion of the cord to the bottom ( caudal) 
end: 8 cervical, 12 thoracic, 5 lumbar, 5 sacral, and 
1 coecygeal. If the spinal cord is completely tran- 
sected above the fourth cervical segment, death oc- 
curs immediately, since the nerve fibers originat- 
ing in the respiratory centers of the lower brain 
stem are severed from their connection with the 
muscles of respiration. Humans may survive inter- 
ruption of the spinal cord at lower levels. If the 
transection is complete, voluntary movements are 
permanently lost below the level of the transection. 
The ability of the isolated cord to maintain postural 
reflexes and motor activity depends upon the spe- 
cies of organism and the type of reflex under con- 
sideration, For example, in lower forms such as 
amphibians and reptiles, reflex withdrawal of limbs 
in response to noxious stimuli and reflex swimming 
movements are retained to a remarkable degree. 
In some mammals, such as cats and dogs, occa- 
sional and usually brief periods of reflex standing 
may occur if adequate assistance and support are 
provided, Other basic postural and motor reflexes, 
such as running movements, may also be present, 
but these, as well as the standing reflexes, are 
highly variable. 

In man, although the motor neurons of the cord 
are still connected with skeletal muscle, all reflexes 
are lost during the period referred to as spinal 
shock, which often lasts for several weeks after 
transection. Gradually, some reflexes reappear— 
first flexion of the limbs, then the reflexes of exten- 
sion. These soon become exaggerated, since, follow- 
ing transection, reflexes originating in the cord be- 
low the cut function without the normal inhibitory 
influences from higher levels of the central nerv- 
ous system. Eventually, either flexion or extension 
spasms will predominate clinically. All sensation, 
which is normally conducted from the periphery to 
levels of the cord below the transection, is perma- 
nently absent as the result of severance of sensory 
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fibers in the cord ascending to the brain. Autonomic 
functions—for example, those of the urinary blad- 
der and rectum—are seriously disturbed immedi- 
ately following interruption of the cord but often 
show some recovery, indicating the ability of the 
cord to integrate, at least partially, certain aspects 
of these reflexes. In summary, it is clear that al- 
though certain basic postural, motor, and auto- 
nomic reflexes remain or recover after section of 
the cord, these have only limited autonomy and 
require the higher centers for their full and adap- 
tive coordination. 

The brain stem. In textbooks and research lit- 
erature the term “brain stem” is used differently by 
various authors. The most encompassing definition 
includes all of the central nervous system above 
the spinal cord except the cerebral and cerebellar 
cortices. In this context the brain stem is com- 
posed of the medulla, pons, midbrain, thalamus, 
hypothalamus, and the basal ganglia of the telen- 
cephalon. At the opposite extreme many authorities 
would include only the medulla, pons, and mid- 
brain as constituents of the brain stem. Still others 
would add the diencephalon (thalamus and hypo- 
thalamus) to this list but omit the basal ganglia 
from the definition. Frequently, a distinction has 
been made between the lower brain stem (medulla 
and pons) and the upper brain stem (midbrain and 
diencephalon). 

Externally, the most caudal portion of the brain 
stem, the lower medulla, appears as a continuation 
and expansion of the spinal cord. Internally, how- 
ever, at the microscopic level, there are many im- 
portant differences between brain stem and cord. 
These include the appearance of nuclei serving the 
cranial nerves and the development of large cell 
aggregations and relay nuclei with highly complex 
interconnecting fiber systems. Part of these form 
the reticular formation, the phylogenetically an- 
cient core of the brain stem, which begins in the 
medulla and projects its most rostral fibers in- 
to the diencephalon. Functionally, as mentioned 
above, the brain stem represents an organization 
of physiological and behavioral response patterns 
that not only provides for the suprasegmental co- 
ordination of spinal reflexes, but adds distinctive 
integrating features of its own. Each of the main 
subdivisions of the brain stem will be discussed in 
more detail. Figure 3 depicts the location of the 
brain-stem components. 

Medulla oblongata and pons, The medulla con- 
tains, in addition to relay nuclei and fiber tracts 
conveying information between higher stations of 
the brain and the spinal cord, aggregates of neu- 
rons which are of great importance for the basic 
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Figure 3 — Sagittal view of the right side of the brain 


maintenance of the organism. Certain of these are 
concerned with visceral or autonomic functions, 
and in many of these activities the medulla and 
pons collaborate closely. These areas are funda- 
mental to the normal regulation of cardiac activity, 
constriction and dilation of blood vessels, respira- 
tion, and gastrointestinal functions of motility and 
defecation. Protective reflexes, such as coughing, 
sneezing, gagging, and vomiting, and ingestive re- 
flexes, such as sucking, swallowing, and salivation, 
are mediated through these same lower-brain-stem 
levels. This is not to say that higher-brain struc- 
tures do not also modulate and provide an even 
more elaborate functional assembly of these re- 
flexes, for they do; but the medulla and pons to- 
gether can carry out most of these functions in a 
manner adequate for survival of the organism in 
the absence of the rest of the brain. 

As already noted, suprasegmental regulation is 
also essential for appropriate postural adjustments 
and motor coordination, The medullary nuclei and 
tracts play an intermediary role in the control of 


Source: Adapted from Brooks, $. M.: Integrated 
Science, ed. 2, St. Louis, 196, The C. V. 
Mosby Company, modified from Rogers, 
T. A: Elementary Human Physiology, 
New York, 1961, John Wiley & Sons, Inc. 


spinal neurons concerned with muscle tone. The 
role of the reticular formation, which has its lower 
levels in the medulla and pons, will be discussed 
in a later section. The pons lies immediately above 
the medulla, and cell groups within it exert con- 
trol over the medullary respiratory centers, The 
pons also aids in the elaboration of the expressive 
aspects of ingestive and emotional behaviors, since 
nuclei of the cranial nerves responsible for jaw 
and facial movements and the secretion of tears 
and saliva are present at this level. Additional in- 
hibitory and facilitatory control over postural mech- 
anisms of the skeletal musculature is also imposed 
upon medulla and cord by the pons. Finally, struc- 
tures and fiber tracts within the pons act as inter- 
mediaries in integrating neural activities of the 
cerebellum and cerebral hemispheres, and trans- 
verse fibers extending across the pons provide 
communication between the two hemispheres of 
the cerebellum. 

Midbrain. The midbrain, like the pons and me- 
dulla, contains nuclei of cranial nerves and many 
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fibers in passage, both of which provide communi- 
cation between higher and lower levels of the cen- 
tral nervous system. In addition, the roof, or dorsal, 
region of the midbrain is referred to as the tectum 
and contains two pairs of protuberances, the an- 
terior, or superior, colliculi and the posterior, or 
inferior, colliculi. These bilateral pairs of eleva- 
tions are collectively known as the quadrigeminate 
bodies and are essentially aggregations of gray 
matter. The remainder of the midbrain is composed 
of the centrally located tegmentum—which in- 
cludes the midbrain portion of reticular formation 
—and large fiber bundles, which for the most part 
are laterally placed. The quadrigeminate bodies 
are primarily sensory reflex centers. The superior 
colliculi receive fibers directly from the retinas of 
the eyes, and although in higher mammals other 
pathways and structures mediate complex proc- 
esses such as pattern perception, the colliculi none- 
theless participate with adjacent regions in certain 
reflexes such as blinking in response to novel visual 
stimuli, pupillary constriction to light striking 
the eyes, and certain reflex conjugate eye move- 
ments, 

The inferior colliculi receive fibers from the audi- 
tory pathways which enter the medulla and ascend 
through the pons to the midbrain. These collicular 
midbrain stations are concerned with startle re- 
flexes and eye, head, and body-orienting reactions 
to unexpected noises. An additional level of skeletal 
muscle control is also contributed by other nuclei 
within the midbrain. 

Cerebellum. The cerebellum, consisting of two 
large hemispheres and a midline structure called 
the vermis, is perched over the brain stem and 
attached to it by heavy fiber tracts, the cerebellar 
peduncles (see Figure 3). The cerebellum also has 
connections with the spinal cord and cerebral cor- 
tex. The outer surface of the cerebellum is covered 
by a cortex, as is the cerebrum, and its interior con- 
sists of fiber tracts and several subcortical nuclei. 
One of the more clearly understood functions of 
the cerebellum is its role in producing smooth, well- 
timed, and precise motor movements. It also plays 
a prominent role in maintaining appropriate pos- 
ture and balance by utilizing sensory information 
from the vestibular mechanisms of the inner ear 
and proprioceptive, or position sense, receptors 
in the musculature of the body. Projections from 
tactile receptors in the skin are another major 
source of input to the cerebellum. Experimental 
studies of electrical stimulation of the cerebellum 
indicate it is also active in the regulation of respi- 
ration and certain autonomic responses. 

Thalamus. Sensory information from every 
modality, with the exception of olfaction, is relayed 
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through thalamic nuclei before being transmitted 
upward to the telencephalon. Figure 3 shows the 
location of the thalamus. From these thalamic sen- 
sory nuclei, impulses are projected to appropriate 
and specific receiving areas in the cerebral cortex. 
In addition, neural information exchanged between 
the cerebral cortex, on the one hand, and the brain- 
stem reticular formation and cerebellum, on the 
other, reaches thalamic stations en route. It is im- 
portant to emphasize that the thalamic relay nuclei, 
like other synaptic stations in the central nervous 
system, do not simply send on raw information in 
an unchanged form from one part of the brain to 
another; but, in a manner not well understood, 
modify the neural discharge patterns and their 
temporal relationships in such a way as to provide 
simultaneously for the apparently contradictory 
processes of both increased selectivity, on the one 
hand, and integration of neural information from 
several sensory modalities, on the other. The cru- 
cial role of the thalamus in functions of the retic- 
ular system will be discussed in a later section. 

Hypothalamus. The hypothalamus is perhaps 
best thought of as a major center for the integra- 
tion of a variety of motivated behaviors, such as 
feeding, drinking, sexual activities, and emotional 
responses. It also is of major importance in the 
regulation of autonomic responses which are not 
only critical for the moment-to-moment physiolog- 
ical equilibrium of the organism but are also com- 
ponents of the motivated behaviors themselves. 
Figure 3 shows the location of the hypothal- 
amus. 

The functional assembly of these visceral re- 
sponses and motivated behaviors depends upon the 
integrity of the hypothalamus. For example, in a 
series of classical experiments which were designed 
to determine the central-nervous-system level essen- 
tial for coordinated emotional behavior, all struc- 
tures above the hypothalamus were severed or re- 
moved surgically in experimental cats. Following 
recovery from the operation, an intact and well- 
coordinated pattern of defensive—aggressive display 
was still present. When, however, a brain-stem 
transection was made behind the hypothalamus, 
the integrated rage response—which in the cat con- 
sists of baring of the teeth, hissing, spitting, extru- 
sion of the claws, striking out with the claws, 
arching of the back, elevation of body hairs, and 
sometimes urination and defecation—was no longer 
present in an integrated pattern. Disjointed and 
isolated elements of emotional behavior in response 
to tactile stimuli were still occasionally expressed 
through receptive and motor mechanisms in the 
brain stem below the hypothalamus, but the coordi- 
nation of these responses, which makes the be- 
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havior protective and adaptive, was permanently 
lost. It is worth noting that the defensive—aggressive 
response pattern consists of an assortment of skele- 
tal motor and autonomic reflexes which are exquis- 
itely combined and organized by the hypothalamus 
into a precisely timed sequence of actions. It should 
be pointed out, however, that although the hypothal- 
amus is critical for the integration of this and other 
behaviors, it too responds to higher influences from 
the basal ganglia and cortex of the cerebral hem- 
ispheres. 

Autonomic activities. The regulation of auto- 
nomic activities is maintained by the interplay of 
two major regions of the hypothalamus. Experi- 
mental studies have demonstrated that electrical 
stimulation of the posterior hypothalamus results 
in the expression of a variety of visceral activities 
that, in general, mobilize the organism for action 
and the expenditure of energy. The hypothalamic, 
lower-brain-stem, and peripheral structures which 
mediate this autonomic pattern of mobilization are 
referred to as the sympathetic system, and the ac- 
tivities of these regions are conveyed to peripheral 
organs through the thoracic and lumbar segments 
of the spinal cord. The second subsystem of the 
autonomic nervous system is the parasympathetic, 
which has its outflow to the periphery through the 
cranial nerves and sacral region of the spinal cord. 
Parasympathetic responses are integrated primarily 
in the anterior part of the hypothalamus, in general 
are opposed to sympathetic actions, and are asso- 
ciated with conservation of energy, relaxation, and 
sleep. Sympathetic and parasympathetic fibers 
often innervate the same peripheral structure, and 
while one system may elicit excitation of the func- 
tion of that organ, the other system may produce 
inhibition or a decrease in activity. In the sympa- 
thetic nervous system, the chemical mediator at the 
end organ is norepinephrine, an adrenalinlike sub- 
Stance. In the parasympathetic system, the sub- 
stance released at the junction of nerve endings 
and smooth muscles is acetylcholine. An exception 
exists in the case of the sweat glands, which are 
all innervated by sympathetic fibers but utilize 
acetylcholine as the chemical transmitter in tem- 
perature-reduction sweating and norepinephrine in 
emotional sweating. The sympathetic system pos- 
Sesses preganglionic fibers which synapse in gang- 
lia arranged in chains running parallel and close 
to the cord, just outside the vertebral column. From 
these ganglionic synapses, postganglionic fibers ex- 
tend to the end organs. An exception to this is the 
innervation of the adrenal medulla, which secretes 
adrenalin into the general circulation and thus 
affects other sympathetic organs, The adrenal me- 


dulla receives preganglionic fibers directly from the 
central nervous system. The parasympathetic pre- 
ganglionic fibers, on the other hand, are very long, 
since the ganglia of this system do not lie beside 
the cord but are in close proximity to the end 
organs; thus, parasympathetic postganglionic fibers 
are relatively short. [See INFANCY, article on THE 
EFFECTS OF EARLY EXPERIENCE; STRESS.] 

Usually an autonomic response pattern is a 
highly complex and integrated sequence of both 
sympathetic and parasympathetic reflexes. For ex- 
ample, in the male, successful completion of the 
sexual act requires first a parasympathetically 
evoked relaxation of the arteries of the erectile tis- 
sue of the penis, which produces a slowing of venous 
outflow and, consequently, erection. The flow of 
semen into the posterior portion of the urethra and 
final ejaculation is a sympathetic response. During 
this process another sympathetic reflex in the 
sphincter of the bladder prevents spermatozoa from 
entering that structure and simultaneously blocks 
urination. Finally, after completion of ejaculation 
an increase in sympathetically produced tone of 
the arteries of the cavernous tissue elicits detu- 
mescence and a decrease in bladder-sphincter tone, 
permitting normal urination. 

Metabolic and endocrine response. In addition 
to sympathetic and parasympathetic regulating 
areas in the hypothalamus, there are reciprocal 
control areas for other behaviors. For example, the 
ventromedial nucleus of the hypothalamus is an 
inhibitory center for eating behavior, while a more 
lateral region is concerned with excitation of this 
behavior. Evidence of the existence of critical areas 
such as these often comes from experimental 
studies in which small regions of the brain are 
either destroyed or electrically stimulated in experi- 
mental animals. For example, in the case of the 
ventromedial nucleus, stimulation produces imme- 
diate cessation of eating for the duration of the 
stimulus, and destruction of this nucleus results 
in overeating and, eventually, obesity, The over- 
eating following this lesion presumably occurs be- 
cause the excitatory center in the lateral hypothal- 
amus is released from the inhibitory control of the 
ventromedial nucleus, Conversely, stimulation of 
the lateral hypothalamus elicits the onset of eating. 
and a lesion in this area leads to starvation. 

As part of its controlling role in the expression 
or inhibition of eating, drinking, sexual behavior, 
and additional metabolic functions, the hypothal- 
amus exerts an important influence over the major 
endocrine gland, the pituitary, which lies at the 
base of the brain and is connected with the hypo- 
thalamus by a stalk containing neural fibers which 
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extend from the hypothalamus down into the pos- 
terior lobe of the gland (see Figure 3), The anterior 
lobe of the pituitary is influenced by the hypothal- 
amus through a vascular system, so that chemical 
substances released by secretory cells in the hypo- 
thalamus may act upon the anterior pituitary. The 
pituitary, in turn, plays a central role in the con- 
trol of the several other endocrine glands with 
which it has reciprocal relations. When hormones 
of these glands enter the general circulation, they 
act on not only the pituitary itself but the hypo- 
thalamus as well, to modify its activity. Thus, an 
equilibrium of bodily functions concerned with 
metabolism and consummatory behavior is main- 
tained by a circular system of neural and hormonal 
checks and balances, with the hypothalamus and 
pituitary performing the central integrating steps 
in these processes, 

Basal ganglia. Just as the cerebellum possesses 
nuclei which lie within and below its cortex, so 
the cerebral hemispheres (telencephalon) contain 
aggregates of gray matter which lie beneath their 
cortices. These subcortical structures are collec- 
tively known as the basal ganglia and are composed 
of the caudate nucleus, putamen, globus pallidus, 
claustrum, septal nuclei, and the amygdala, which 
consists of a complex of five closely related nuclei 
lying within the temporal lobe. Frequently, the 
term “lenticular nucleus” is used in referring to the 
putamen and globus pallidus collectively. The term 
“corpus striatum” is used to include the lenticular 
nucleus along with the caudate. 

The traditional view of the basal ganglia and, 
in particular, the structures constituting the corpus 
striatum is that they are primarily concerned with 
coordination of somatic motor movements, espe- 
cially control of postural reflexes and adjustments. 
These structures are complexly connected with cer- 
tain motor areas of the cerebral cortex, the thal- 
amus, the reticular formation, and motor structures 
lower in the brain stem. The corpus striatum, with 
these other areas, forms the so-called extrapyram- 
idal motor system. This series of structures is dis- 
tinguished from the pyramidal system, whose fibers 
also originate in motor areas of the cortex but 
course directly to the motor neurons of the cranial 
nerves and spinal cord. The pyramidal system is 
responsible for individual and distinct phasic move- 
ments of the body and limbs, while the extrapy- 
ramidal system functions to produce coordination 
of these. Thus, normal somatic motor patterns re- 
quire the integrated activity of three apparently 
separate systems, which actually share overlapping 
structures and functions: the pyramidal system, 
the extrapyramidal system, and the cerebellar sys- 
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tem discussed earlier. Recent studies indicate that 
parts of the corpus striatum may also play a role 
in the mediation of attention, learning, and emo- 
tional responses, although the extent and specificity 
of these activities is not yet clear. [See ATTENTION 
and EMOTION.] 

The amygdala appears rather distinct from the 
rest of the basal ganglia, in terms of its location in 
the temporal lobe, its connections, and its func- 
tions. This complex of nuclei receives fibers from 
the olfactory pathways and has strong connections 
with the hypothalamus. Functionally, it is prom- 
inent in the control of autonomic activity, emo- 
tional responses, and probably feeding behavior. 
To a certain degree, it appears to replicate some 
hypothalamic functions. The septal nuclei are also 
connected with the hypothalamus and, like the 
amygdala, exert an influence over autonomic activ- 
ity and affective behavior. Experimental evidence 
suggests that the septal and amygdaloid nuclei 
carry out some of their activities by exercising 
modulatory control over hypothalamic integrative 
mechanisms. Investigations of the claustrum have 
as yet revealed little of its functions. 


The cerebral cortex 

The cerebral cortex is a convoluted, or folded, 
mantle of gray matter arranged as layers of cells 
over the surface of the cerebral hemispheres. It 
virtually covers and surrounds the upper levels of 
the brain stem, The cortex has an average thick- 
ness of about 2.5 millimeters, but this varies con- 
siderably from one region to another. Fibers leading 
to and from subcortical areas and connecting corti- 
cal areas with each other make up the white mat- 
ter within the hemispheres. The corpus callosum, 
a massive bundle of fibers, crosses the midline of 
the hemispheres and provides a direct connection 
between the right and left cerebral cortices. 

The cortex represents the most recent. and 
highest development of the nervous system, In man 
it is proportionally larger than in any other species, 
and to the cortex are attributed those activities 
which are most distinctly human: complex learn- 
ing and reasoning; symbolization, abstraction, and 
generalization; and the development of language 
and speech. While a substantial portion of the 
cortex is given over to sensory receiving areas and 
regions concerned with the initiation and control 
of motor responses, the higher integrated activities 
just mentioned depend upon other cortical regions 
—the so-called cortical association areas—as well; 
these regions are critical in the mediation of com- 
plex perceptions and cognitions. 

At the gross anatomical level, the cortex is di- 
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vided into lobes. In addition, areas which are pre- 
sumably different in their cellular architecture have 
been traditionally assigned numbers (Brodmann’s 
areas), In some instances Brodmann’s areas cor- 
respond to functional zones, but in other cases they 
do not. The relationship of the lobes and functional 
areas is shown in figures 4 and 5. 

Motor cortex. The frontal lobe, which extends 
from the anterior tip of the hemispheres to the 
central fissure, contains two major subdivisions: 
the posterior portion, which is primarily motor in 
function, and the anterior part, which is associa- 
tional. Electrical stimulation of the strip of cortex 
(Brodmann’s area 4) just anterior to the central 
fissure produces discrete and isolated motor re- 
sponses of separate body parts. This is the primary 
motor cortex, and it is organized somatotopically, 
or in terms of body regions. For example, beginning 
at the uppermost part of the motor strip and pro- 
gressing downward along its surface, electrical 
stimulation will successively produce movements 
of the toes, foot, leg, trunk, thorax, shoulder, 


Fissure of Rolando 


Frontal lobe 


Fissure of Sylvius- 


Temporal lobe 


Figure 4 


fingers, neck, and face. These responses are medi- 
ated by the direct connection of motor nerve cells 
in area 4 to spinal motor neurons or cranial-nerve 
nuclei, Stimulation of the right motor cortex will 
produce these responses on the left side of the 
body, and vice versa, since the pyramidal tract in 
which these fibers travel decussates or crosses over 
to the opposite side of the brain before reaching the 
motor cells of the spinal cord. Destruction of parts 
of the primary motor strip will produce paralysis 
of that part of the body which is represented in the 
injured portion. 

The premotor area (6) has fewer connections 
with the pyramidal tract than does area 4 but sends 
a large number of fibers to the extrapyramidal 
motor system via the basal ganglia and reticular 
formation. Often, stimulation of this area does not 
produce an observable effect, although complex 
motor responses and patterns have been reported 
even in the absence of area 4. Some investigators 
assign a predominantly inhibitory function to area 
6, Area 6 also extends mesially over and down 


Parietal lobe 


Occipital lobe 


— The lobes of the cerebral hemispheres and the landmarks separating them 


Source: Adapted from Introduction to Psychology. 
Ird Edition, by Ernest R. Hilgard, copyright, 
1953, © 1957, 1962, by Harcourt, Brace & 
World, Inc., and reproduced with their 
Permission. 
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Central fissure 


Somatosensory receiving area 
(3, 1, 2) 
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Visual receiving area 


(17) 


Temporal association area 


Figure 5 — A diagrammatic representation of the major areas of the cerebral cortex* 


* Numbers refer to Brodmann’s areas. 


along the midline. Its extent there is referred to as 
the supplementary motor cortex, since slow, CO- 
ordinated responses can be elicited by stimulation. 
These consist of the gradual assumption and main- 
tenance of a particular posture, in contrast to the 
rapid, phasic, and highly localized responses ob- 
tained from area 4, Area 8 is often called the 
frontal eye field, since horizontal and vertical con- 
jugate eye movements result from stimulation of 
this region. 

It is of interest that a variety of autonomic re- 
sponses may be obtained from all of these motor 
areas in addition to other parts of the cortex such 
as the tip of the temporal lobe and the undersur- 
face of the frontal lobe (orbitofrontal cortex). 
These responses are, in the main, examples of 
critical modulation and control of amygdaloid and 
hypothalamic integrative mechanisms for visceral 
activity. However, certain vascular responses origi- 
nating in the motor cortex may bypass these sub- 
cortical centers and directly control some activities 
of the blood vessels. 

Sensory cortex. The anterior portion of the 
parietal lobe immediately behind the central fissure 
(Figure 5) is called the postcentral gyrus and con- 


tains the primary somatosensory receiving region 
(areas 3, 1, and 2). These areas receive sensory 
information directly from the posterioventral nu- 
cleus of the thalamus and, thus, are important in 
the initial perception, discrimination, and localiza- 
tion of stimuli exciting nerve endings and receptors 
on the surface of the body and from within its 
somatic musculature. However, a primitive con- 
sciousness of certain somatic sensations probably 
exists in man at the thalamic level, even in the 
absence of cortex. These thalamic sensations are 
at best poorly differentiated and not well localized. 
For example, a patient with a lesion in the primary 
somatosensory receiving area of the cortex may 
indicate that he appreciates the fact that stimula- 
tion has occurred but may have great difficulty in 
distinguishing degrees of intensity of stimulation 
or in localizing the stimulated point on the body. 
[See PAIN; SKIN SENSES AND KINESTHESIS.] 
Fibers carrying the somatic sensations of touch, 
pressure, pain, proprioception, and warmth and 
cold are not well separated in either the thalamus 
or cortex, but the cortical receiving area is topo- 
graphically organized in a manner similar to that 
of the motor cortex: lower extremity representa- 
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tion at the uppermost portion of the sensory area, 
with upper extremities and head represented at 
the basal portion of the region, Our knowledge of 
topographical precision within these sensory areas 
is primarily the result of electrophysiological map- 
ping studies in which punctate physical stimuli are 
applied to the surface of the body and the resultant 
electrical potential changes, called evoked sensory 
potentials, are recorded with small electrodes on 
the surface of the cortex. A distinct and localized 
evoked potential occurs at the specific cortical area 
which represents the part of the body being stim- 
ulated. 

Although the reader may have gained the im- 
pression from the foregoing that the central fissure 
forms an absolute boundary between motor and 
sensory cortex, this is not actually the case. Motor 
responses can be elicited from the sensory areas, 
and sensations have a degree of representation an- 
terior to the central fissure. However, the responses 
become fewer and weaker as we progress from 
motor into sensory cortex or vice versa. It also is 
appropriate to point out at this juncture that taste, 
in addition to the somatosensory modalities men- 
tioned above, is also represented in the parietal lobe 
at the base of the postcentral gyrus. [See TASTE 
AND SMELL.] 

There also exists a secondary somatosensory 
area which yields a mirror image of the topography 
of the primary region. It is also located near the 
base of the primary region, but its functions are 
still obscure. The remainder of the parietal lobe is 
primarily associational in function, 

Visual impulses, originating in the retina, reach 
the occipital lobe of the cortex after relay through 
a specific thalamic nucleus called the lateral genic- 
ulate body. Figure 5 shows this visual receiving 
area, or striate cortex (area 17), to be located on the 
lateral surface of the occipital pole. However, much 
of area 17 extends into the midline occipital cortex 
and lies along the mesially located calcarine fissure. 
The visual cortex, too, is topographically organized. 
Stimulation of the eye with fine beams of light 
while recording evoked potentials in area 17 dem- 
onstrates that there is a point-to-point representa- 
tion in the cortex for retinal areas. In addition, 
fibers from the medial half of each retina cross 
over before reaching the lateral geniculate body 
and go to the opposite side of the brain, while those 
for the lateral half of the retina do not. Through 
this arrangement, visual stimuli appearing in the 
tight visual field (thus producing stimulation of 
the medial retina of the tight eye and the lateral 
retina of the left eye) are represented on the striate 
cortex of the left hemisphere. Conversely, objects 


in the left visual field are represented in the right 
hemisphere. [See VIsIoNn.] 

Complete destruction of area 17 in man leads 
immediately to permanent total blindness. This is 
not quite the case for lower mammals, since even 
after total removal of the visual receiving area 
these species are still capable of primitive bright- 
ness discrimination, although pattern, or form, 
vision is lost. The occipital association cortex has 
traditionally been thought to be concerned with 
visual integration. However, this statement is 
something of an oversimplification, since the func- 
tion of these regions is considerably more than 
that of the organization of visual perception, 

The primary receiving area for audition (area 
41) is located on the upper surface of the tem- 
poral lobe (Figure 5). Its input arises from the 
medial geniculate body of the thalamus, but since 
the auditory paths cross at several points before 
reaching the medial geniculate body, bilateral rep- 
resentation for hearing is a more complex matter 
than that for vision. In view of multiple cross- 
ings of auditory fibers in the brain stem en route 
to the lateral geniculate body, it is remarkable 
that tonotopic organization is present in area 41. 
That tonotopic organization is retained from re- 
ceptor to auditory cortex is indicated by the fact 
that when small areas of the cochlea, which are 
responsive to a relatively narrow range of tone 
frequencies, are stimulated, evoked potentials 
appear in an orderly spatial arrangement over the 
auditory cortex. The temporal association areas 
are extremely complex in their function and will 
be discussed along with the other association areas. 
[See HEARING.] 

The receptors for the sense of balance or equi- 
librium are located in the labyrinthine portion of 
the inner ear, and the cortical representation for 
equilibrium lies in or near that for audition, area 
41 of the temporal lobe. 

Considered from the phylogenetic point of view, 
many sensory and motor functions have under- 
gone a process known as corticalization. That is, 
functions which in lower forms appear to be medi- 
ated subcortically depend in higher species on the 
integrity of the cortex. The instance cited, showing 
visual losses to be greater in man than in lower 
animals following destruction of the striate cortex, 
is an example of corticalization. In the motor 
sphere this evolutionary process is equally evident. 
For example, in the rat the motor cortex is not 
well organized either cytoarchitectonically or phys- 
iologically. If it is completely removed, it is dif- 
ficult for even the sophisticated observer to dis- 
cern any motor impairments. In the cat or dog, 
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where there is greater organization and finer 
differentiation of motor areas, motor weakness 
and a substantial loss of movement follow injury 
to this region. The recovery, however, is usually 
prompt although not necessarily complete. In 
primates—especially man—the reliance upon cor- 
tex for normal activity is even more crucial, and 
motor-cortex injury produces greater and more 
enduring paralysis than in any of the species 
described above. 

Association cortex. The association regions of 
the cerebral cortex are a comparatively recent 
evolutionary development. For example, the rela- 
tively simple and unconvoluted cortex of rodents 
is primarily sensory and motor in function. In 
these species areas of association cortex are few 
in number and very small in size, In carnivores 
which are more advanced in the evolutionary hier- 
archy there is a substantial increase in the amount 
of cortex given over to associational functions. 
When the phylogenetic level of primates is reached 
there is an abrupt and dramatic increase in the 
extent of association cortex. This is especially evi- 
dent in man, in whom the greater part of the cere- 
bral cortex is associational. 

It has been pointed out that upon the associa- 
tion cortex depend those higher functions which 
best distinguish man from the other primates and 
primates from lower mammalian species. While 
the level at which man is able to segregate and 
assemble elaborate perceptions and memories for 
the solution of complex problems is indeed im- 
pressive, it is not unique. Other primates and 
mammals considerably down the scale from man 
also have the capacity to reason, but in an atten- 
uated form and to a lesser degree. Man’s superior 
ability to manipulate symbols, to isolate concep- 
tually a dimension of an object from that object 
(abstraction), and to derive a principle through 
the categorization of these abstractions (general- 
ization) is quantitative rather than absolute. It is 
reasonable to surmise that the magnitude of these 
gifts is related to the extent and degree of elabora- 
tion of association cortex. 

How then does the association cortex perform 
these remarkable integrative tasks? Although the 
precise neurophysiological and chemical mecha- 
nisms are far from clearly understood, it is recog- 
nized that for any behavior above the level of a 
simple innate reflex, associational systems, cortical 
and subcortical, must first bring together stored 
information (memory) and incoming sensory data 
from many modalities. Then, through the as yet 
poorly understood neuronal processes of data se- 
lection, interaction, and matching and comparison, 
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a choice is made which, represented by a particular 
pattern of neural activity, “commands” the motor | 
system to effect a specific response. There are sev- 
eral factors which, when taken together, are as- 
sumed to determine the “decision” of associational 
mechanisms. The most basic of these are the 
strength and pattern of the sensory input and the 
efferent connections of the participating associa- 
tion areas, Of course, ongoing central activities 
which represent the effects of past experience and 
the motivational state of the organism will also 
function as determinants of output by interacting 
with the effects of sensory stimulation. During this 
process the responsivity of individual neurons in 
associational systems will be controlled also by the 
existing level of arousal and attention within the 
brain. 

It has been pointed out that the sensory and 
motor cortices actually overlap. There is also no 
absolute separation of associational functions from 
either sensory or motor regions. Indeed, there is 
evidence that some integrative activities commonly 
attributed to the association cortex are also ex- 
hibited by sensory and motor areas. The com- 
plexity and inseparability of sensory, motor, and 
associational functions is emphasized not only by 
these facts but by the existence of descending 
fibers from many cortical areas and other regions 
of the brain which impose their activity upon lower 
sensory stations, even those as remote as the re- 
ceptor. Through this arrangement, ongoing central 
activity controls and modulates incoming sensory 
impulses long before they reach the receiving areas 
of the cerebral cortex. Furthermore, the ongoing 
activities in these higher levels of the brain which 
modulate sensory activity are themselves modified 
by the sensory input. Thus, it is evident that com- 
plex feedback circuitry exists throughout the cen- 
tral nervous system for the control and balance 
of sensory, motor, and integrative activities. 

Frontal lobe. The prefrontal region, or associa- 
tion cortex of the frontal lobe, lies rostral to those 
areas from which motor effects can be elicited. 
Neurons of the prefrontal region make no fiber 
contribution to the pyramidal motor pathways, nor 
do they receive impulses directly from thalamic 
sensory-relay nuclei. They do, however, have con- 
nections with other subcortical structures, includ- 
ing the thalamus, hypothalamus, corpus striatum, 
midbrain, lower brain stem, and cerebellum, as 
well as with many cortical areas. 

In monkeys, experimental injury to portions of 
the prefrontal cortex may result in a number of 
behavioral alterations, including hyperactivity and 
stereotyped pacing, increased distractability and 
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inattention, response perseveration, and possibly 
disruption of short-term memory. A combination 
of these disturbances may account for the difficulty 
that monkeys with lesions of the prefrontal cortex 
have in solving problems which require sequential 
responding or in holding and retrieving informa- 
tion over a delay period between stimulus presen- 
tation and response, during which delay the stim- 
ulus object is not visible to the animal. As is 
generally the case with injury to the association 
cortex in monkeys and lower species, symmetrical 
bilateral lesions produce much greater defects than 
do unilateral ones. In fact, in some instances uni- 
lateral lesions may result in no observable loss 
of function. Another consequence of prefrontal 
damage is in the sphere of emotional behavior. 
Normal monkeys and chimpanzees working on 
very difficult discrimination problems often display 
emotional disturbances, including violent temper 
tantrums, as a manifestation of the frustration 
resulting from the many errors made. After pre- 
frontal lobotomy, a surgical procedure which 
severs the connections between the prefrontal lobe 
and subcortical areas, the animal no longer be- 
comes upset with its errors and works calmly and 
quickly on the problem, even though the lobot- 
omy may have actually increased the frequency of 
errors, 

These observations on emotional behavior led 
to the development of the prefrontal lobotomy 
in man for the relief of neurotic and psychotic 
symptoms. This operation has been almost entirely 
replaced in recent years by the verbal and experi- 
ential therapies, electroshock treatment, and the 
psychotropic drugs. However, variants of the lo- 
botomy procedure are still occasionally employed 
in the alleviation of intractable pain in terminal 
diseases. The behavioral effects of lobotomy in 
man are highly variable, and reports have been 
frequently contradictory. There is no clear evi- 
dence that prefrontal lobotomy significantly alters 
the intelligence of man, and it is possible that the 
defects which have been shown by some studies 
may actually reflect motivational and attitudinal, 
rather than intellectual, changes. 

The prefrontal lobotomy has been considered 
most effective in disorders characterized by emo- 
tional tensions, such as schizophrenic anxiety, 
agitated depression, and obsessive tension. Since 
after prefrontal surgery the patient becomes rela- 
tively unconcerned about his previous problems, 
anxieties, and pain, this has been the rationale 
for this operation in the above-mentioned disorders 
as well as in cases of untreatable pain. This should 
not be taken to mean that the patient is no longer 


capable of displaying emotion. Indeed, he may be 
emotionally labile and occasionally hyperreactive, 
but the aspect of suffering and caring about his 
psychological and physical difficulties (which may 
still exist) is virtually eliminated. [See MENTAL 
DISORDERS, TREATMENT OF, article on SOMATIC 
TREATMENT.] 

This solution to the problem of anxiety has 
its unfortunate aspects as well, for the normal 
sense of responsibility for one’s self, for one’s work 
and home, and toward others, may practically dis- 
appear. This disintegration of social conscience 
may be accompanied by a high degree of distract- 
ability, inability to plan ahead, and inappropriate 
or socially unacceptable emotional reactions. It is 
possible that some of the signs produced by pre- 
frontal-lobe lesions represent a release of dien- 
cephalic activity from cortical control. 

Parietal, temporal, and occipital lobes. In mon- 
keys, proprioceptive and tactile roughness and 
form discriminations which had been learned pre- 
operatively were impaired after bilateral destruc- 
tion of the posterior parietal cortex, but retraining 
was possible except in the case of very complex 
tactile-form discriminations. These findings of 
somatosensory-discrimination losses attributable to 
parietal-association-cortex damage are in accord 
with the spatial proximity of this region to the 
primary somatic receiving area. 

In the temporal lobe of cats, bilateral lesions 
have been made in association cortex adjacent to 
the primary and secondary auditory receiving 
areas. While such lesions do not disrupt a simple 
intensity discrimination, there is a disturbance of 
both frequency and tonal pattern discrimination. 
However, the frequency discrimination can be re- 
learned, but the pattern discrimination cannot. The 
Severity of disruption in this instance is clearly 
related to task complexity, which falls neatly into 
the order of intensity discrimination, frequency 
discrimination, and pattern discrimination, This 
is a good example of the increasingly important 
role association cortex plays as behavior becomes 
more elaborate, 

It is useful to treat the parietal, temporal, and 
occipital association cortices together, since in 
many instances the functional boundaries are not 
clear and lesions may produce similar or closely 
related functional losses. Relatively large lesions 
of the parieto-temporo-occipital association cortex 
in monkeys result in a loss in previously acquired 
visual-form-discrimination ability, with some im- 
pairment of relearning as well. It is to be empha- 
sized that in order to produce consistent deficits, 
the lesions usually have to be quite extensive. More 
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restricted lesions do not have this effect. In con- 
nection with this point, it is interesting that bi- 
lateral injury to the inferotemporal, or ventral, 
cortex, which is not adjacent to the visual receiv- 
ing areas but does have fiber connections with 
the occipital cortex, is crucial for the retention of 
visually guided discrimination behavior. Somas- 
thetic and auditory discriminations do not appear 
to be affected by lesions. in the inferotemporal 
region, 

It is important to note at this point that many 
of the foregoing deficits appear specifically related 
to the memory of learned tasks, since relearning 
can often take place after the cortical removals 
mentioned above. In a few instances the ability to 
learn is disturbed as well. It has been emphasized 
that bilateral lesions are usually required to pro- 
duce a significant deficit in memory or learning 
in monkeys, In the realm of human behavior, how- 
ever, certain functions appear to be highly lateral- 
ized, and unilateral damage to the cortex may 
produce profound defects. 

Hemispheric dominance. The fact that the left 
and right sides of the brain appear grossly identical 
does not mean that there is necessarily equivalence 
of function for the two hemispheres. For example, 
damage to the right parieto-occipital cortex of 
man often produces impairment of performance 
of nonverbal perceptual motor tasks, while similar 
injuries to the left hemisphere may be without 
consequence with respect to this class of behavior. 
For many complex human activities there is a 
dominant hemisphere, but for a particular indi- 
vidual the dominant hemisphere is not necessarily 
the same for all such activities. Because the py- 
ramidal motor pathways cross as they descend, in- 
dividuals who are unequivocally right-handed are 
cerebrally dominant in the left hemisphere for this 
function. In fact, most people, whether right- 
handed or left-handed, show left cerebral domi- 
nance for speech, but not all do; for example, in- 
dividuals who are dominant for speech in the right 
hemisphere are more likely to be left-handed. Thus, 
for a particular individual the hemisphere which 
is dominant for speech may or may not be the one 
that is dominant for handedness. 

If the dominant hemisphere is injured, there is 
a tendency for the homologous cortical area of 
the opposite hemisphere to assume its function. 
Thus, if the area of the left hemisphere which 
controls speech is severely injured, this function 
will be lost, but it may be recovered if comparable 
regions of the right hemisphere can take over. 
Whether recovery of an activity occurs in a par- 
ticular case is largely a function of the age at 
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which the damage occurs. Plasticity is especially 
pronounced in early life and declines with maturity 
and age. Recent evidence indicates that there are 
persons in whom speech is represented bilaterally 
in the cortex. In these cases, it is not yet clear 
whether both hemispheres have come into use as 
a result of injury to the one which was originally 
dominant. 

Equipotentiality, mass action, localization. Ex- 
periments with monkeys have shown that even 
lesions which destroy both prefrontal lobes will 
not produce the prefrontal-lobotomy syndrome de- 
scribed earlier if the operation is done very early 
in life. This illustrates that the central-nervous- 
system plasticity involved in the restitution of 
certain abilities does not necessarily require the 
presence of an anatomically intact and comparable 
region in the opposite cortex. Other, as yet unde- 
termined, structures of the hemispheres are some- 
times able to assume the role of regions which 
are morphologically quite different. Many findings 
such as these have led to the concept of func- 
tional equipotentiality of cortical areas, which is 
almost certainly an oversimplification. But if the 
limits of plasticity are taken into account, the 
“equipotentiality” notion has merit, especially in 
lower mammals, in which structural and func- 
tional differentiation is not so prominent and re- 
covery is more frequent and rapid. 

For many years, in neurology and psychology, 
a lively debate has raged over the degree to which 
behavioral functions are localized in specific areas 
of the hemispheres, At one extreme, some scien- 
tists have taken the view that functions and sub- 
functions are highly localizable. Proponents of 
strict localization have created detailed maps of 
the cortex relating precise cortical regions to a 
myriad of specific complex psychological activities. 
At the other end of the spectrum are those who 
believe that except for the primary sensory and 
motor areas, the cortex functions as a whole in 
practically all psychological activities and that 
localization is a rare finding. Evidence for this 
“mass action” viewpoint is often taken from rat 
studies in which large amounts of several cortical 
areas had to be removed before a deficit in learn- 
ing or memory was noted, as well as from the fact 
that in humans certain psychological-test perform- 
ances are disturbed regardless of the location of 
the lesion. The concepts of mass action and equi- 
potentiality are obviously closely related. As has 
often been the case in similar scientific controver- 
sies, the evidence of both sides is frequently valid 
and not so mutually exclusive as the proponents 
of the extremes would sometimes have us believe. 
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Recent studies on human cortical injuries and elec- 
trical stimulation of the brains of patients indi- 
cate that some psychological functions are rather 
well localized, although the precise region on the 
cortex may vary somewhat from individual to 
individual. For other classes of behavior, specific 
regions have not been found, and it is reasonable 
to assume that certain complex intellectual activi- 
ties require roughly equivalent participation of 
neuronal circuits from many areas of the cortex. 

Speech areas. The cortical areas responsible for 
speech and language are of considerable impor- 
tance, since these are the activities above all others 
that come very close to being truly unique to man. 
They provide one of man’s immeasurable advan- 
tages over other species: the verbal and written 
communication of acquired knowledge, beliefs, 
and attitudes or, more simply, the transmission of 
culture, 

The speech and language areas are located in 
the frontal, temporal, and parietal lobes, usually 
lateralized to the left hemisphere. Lesions in, or 
electrical stimulation of, the frontal area, first 
described by Broca, which lies near, but not in, 
the primary motor area may result in motor 
aphasia, and inability to use spoken words prop- 
erly. In aphasia, which may be categorized as being 
motor (expressive) or sensory (receptive), there 
is no injury to the primary sensory receiving appa- 
ratus, no paralysis of the muscles used in speech, 
and no damage to the area of the primary motor 
cortex controlling these muscles. Aphasia, then, 
is not a disorder of articulation. It is an ideational 
or associational disorder involving cortical struc- 
tures and systems concerned with the use of lan- 
guage in thinking. [See LANGUAGE, article on 
SPEECH PATHOLOGY; the biography of Broca.] 

In cases of injury to Broca’s area and the region 
immediately above it, there is also frequently pres- 
ent a loss of ability to write ( agraphia) associated 
with the speech deficit. It appears, then, that just 
as Broca’s area is critical in the organization of 
sounds and words for the spoken expression of 
concepts, so is this adjacent superior region im- 
portant in the arrangement of words for writing. 

Lesions in the temporal-lobe portion of the 
Speech area, adjacent to the auditory receiving 
cortex, produce an inability to understand spoken 
language (word deafness), although sounds can 
be normally heard. This loss constitutes a sensory 
aphasia, as does the inability to understand written 
language (alexia) as a result of injury to the pos- 
terior parietal Speech area close to the visual cor- 
tex. While this classification is undoubtedly an 
oversimplification and while it is rare that the 


aphasic difficulties and lesions of an actual patient 
correspond perfectly with this schematic presenta- 
tion, it is a useful frame of reference for further 
consideration of the problem. [See READING pis- 
ABILITIES. ] 


Rhinencephalon, limbic system, and 
reticular formation 


“Rhinencephalon” literally means “nose-brain”; 
it is basically composed of those structures con- 
cerned with olfaction. These are the olfactory bulbs 
and pathways, the amygdaloid nuclei, the three- 
layered cortex (paleocortex) on the ventral and 
mesial surface of the temporal lobe, and the hip- 
pocampus (archicortex ), which is folded under the , 
neocortex, which comprises the remainder of the 
covering of the hemispheres and has been dis- 
cussed in the previous section. The paleocortex and 
archicortex are relatively primitive structures in 
terms of point of appearance in evolution and de- 
gree of morphological differentiation. [See TASTE 
AND SMELL.] 

Although the involvement of rhinencephalic 
Structures in olfaction has been assumed and 
studied for many years, it was not until the past 
few decades that their role in emotional and vis- 
ceral activity was also recognized. Out of such in- 
vestigations of structures concerned with emotion 
has grown the concept of a limbic system. A rather 
loose term, “limbic system” is often defined some- 
what differently by various investigators. It arose 
from the even older term “limbic lobe,” which 
Broca used to designate the cingulate cortex in 
addition to most of the rhinencephalon. Broca as- 
signed no specific function to this series of inter- 
connected structures, which form a ring, or 
“limbus,” around the brain stem, but noted its 
presence as a common denominator of the mam- 
malian and vertebrate brain. The modern concept 
of the limbic system is more inclusive than either 
that of rhinencephalon or limbic lobe. The limbic 
system is now generally taken to include many 
Structures, from brain stem to cortex, that are 
connected anatomically and are involved in the 
expression of affect and other motivated behaviors 
of which emotion is a part. Some of these struc- 
tures are the orbitofrontal cortex, cingulate cortex, 
hippocampus, ventral paleocortex, septal nuclei, 
amygdala, several hypothalamic and thalamic nu- 
clei, and certain midbrain areas. The limbic syste™ 
also has strong connections with the reticular 
formation. 

It should not be assumed that structures of the 
limbic system participate only in states of emo- 
tional experience or expression. Many components 
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of the limbic system are also active in the control 
of autonomic activities, which, according to the 
particular instance, may or may not be involved 
in affect—for example, digestion, cardiac and 
vascular regulation, and respiration. In addition, 
one of the central structures of the limbic system, 
the hippocampus, plays a crucial role in the process 
of memory storage. Before the importance of the 
hippocampus in recent memory was appreciated, 
a few patients underwent bilateral removal of this 
structure in an attempt to relieve severe seizures 
or psychotic behavior. Following these operations, 
the patients usually had a good memory of events 
that had occurred up to a few months or weeks 
before the surgery but virtually no memory of 
events occurring after that time. Some of the pa- 
tients have been followed carefully for several 
years and continue to show a profound inability 
to remember events which have occurred since the 
operation, 

It is clear from these unhappy clinical observa- 
‘tions that the hippocampus is vital in recording 
perceptual impressions and laying down the mem- 
ory trace. It is also evident that this structure is 
not crucial for the retention, retrieval, or expres- 
sion of stored material that has been well estab- 
lished, since memory of events long past was 
adequate in these patients. It is presently hypoth- 
esized that the human hippocampus acts as the 
first-stage recorder of experience but that transfer 
of accumulated information to other systems is 
soon effected for long-term storage. 

The reticular formation is an intricate and amaz- 
ingly complex network of neurons forming a core 
of the brain stem and extending from the medulla 
up to the diencephalon. In the thalamus, fibers 
from the brain-stem reticular formation synapse in 
cell groups of nonspecific nuclei. These nonspecific 
nuclei—so called to distinguish them from the 
specific sensory nuclei of the thalamus—relay in- 
formation to virtually all areas of the cerebral 
cortex. The thalamic nuclei and fibers which pro- 
ject reticular activity to the cortex are collectively 
known as the diffuse thalamocortical projection 
system, Brain-stem reticular activity is also pro- 
jected to the cortex via the hypothalamus and cer- 
tain limbic-system structures. 

The reticular formation has both descending 
and ascending fibers mediating activities which 
may be placed in four functional categories: 
(1) control of postural-reflex tonus by inhibition 
or facilitation of motor neurons in the spinal cord; 
(2) participation in the control of certain auto- 
nomic reflexes and respiration; (3) mediation of 
efferent activity in the central control of sensory 
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input; and (4) regulation of the general state of 
excitation in the cerebral cortex. This last function 
is primarily executed by the ascending components 
of the reticular formation. Arousal, wakefulness, 
and sleep are to a large extent controlled by the 
reticular formation and may be measured by the 
electroencephalogram (EEG), The EEG is a re- 
cording of electrical activity in the cerebral cortex 
by the application of electrodes to the scalp, skull, 
or cortex itself. The potentials thus recorded are 
amplified about a million times for visual inspec- 
tion on a cathode-ray oscilloscope or inkwriter, and 
each lead or channel represents the activity of a 
large number of cortical neurons. When a subject 
is in a highly aroused or excited state the EEG 
shows desynchronized low-amplitude (voltage), 
high-frequency activity (beta waves). A subject 
in a state of relaxed wakefulness whose eyes are 
closed displays a more synchronous rhythm that 
is of higher amplitude and lower frequency (alpha 
waves). As the individual falls into sleep, the fre- 
quency decreases further and the amplitude be- 
comes even greater. This rhythm is most evident 
in deep sleep, in which the EEG is in a state of 
hypersynchrony (delta waves). A stage of sleep 
not recognized until the 1950s, called paradoxical 
sleep, is reflected in low-voltage, high-frequency 
activity that is virtually indistinguishable from the 
excited waking state. There is considerable con- 
troversy as to whether paradoxical sleep is deeper 
or lighter than traditional deep sleep of the delta 
variety, but there is little question that much 
dreaming occurs during this stage. [See DREAMS 
and SLEEP.] 

The brain-stem reticular formation receives col- 
lateral fibers from the laterally placed ascending 
specific sensory pathways. Increased activity in 
these specific sensory tracts is crucial in producing 
a change in the pattern of firing of the brain-stem 
reticular-formation neurons, which in turn pro- 
duces desynchronization (activation) of the corti- 
cal EEG rhythm. The appearance of this desyn- 
chronization of EEG activity is coincident with 
behavioral arousal and increased awareness. Thus, 
the sensory collaterals leading into the reticular 
formation form the anatomical pathway by which 
a sensory stimulus arouses a sleeping individual 
to wakefulness. Arousal from sleep can also be 
produced by direct electrical stimulation of the re- 
ticular formation, as well as by sensory stimulation. 

There is no question that the arousal function 
is normally mediated by the reticular formation 
rather than by the specific sensory pathways. If, 
in experimental animals, the reticular formation is 
destroyed at a level rostral to the point at which 
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the sensory collaterals project into it, the subject 
enters an enduring state of coma, for although 
specific sensory information will reach the cortex 
over the lateral pathways and sensory systems will 
continue to fire into the reticular formation, as- 
cending reticular discharges are blocked by the 
transection from reaching the cerebral cortex. In 
this situation, evoked potentials, representing spe- 
cific sensory information provided to the cortex via 
the lateral afferent pathways, continue to be elic- 
ited; but generalized EEG desynchronization of the 
cortex is not sustained. These facts are of con- 
siderable importance, since without generalized 
cortical activation produced by reticular-formation 
activity, there can be little or no cortical integra- 
tion of the sensory information which arrives over 
the specific pathways. Under these conditions 
information is probably neither retained by the 
central nervous system nor integrated with other 
incoming sensory data or stored perceptions and 
memories. 

If, on the other hand, the specific sensory path- 
ways are cut at a level above the point at which 
their collaterals enter the reticular formation, cor- 
tical arousal and consciousness is still possible, but 
the animal is awkward in its movements and lim- 
ited in its abilities, since the cortex is deprived of 
specific sensory information, 

FREDERICK A, KING 


[Directly related are the entries HEARING; LEARNING, 
article on NEUROPHYSIOLOGICAL ASPECTS; PSYCHOL- 
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SKIN SENSES AND KINESTHESIS; TASTE AND SMELL; 
Vision. Other relevant material may be found in 
GESTALT THEORY; MENTAL DISORDERS, articles on 
ORGANIC ASPECTS and BIOLOGICAL ASPECTS; MENTAL 
DISORDERS, TREATMENT OF, article on SOMATIC 
TREATMENT; and in the biographies of Broca; 
FLOURENS; LASHLEY.] 


BIBLIOGRAPHY 

Brazier, Mary A, B, (1951) 1960 The Electrical Activ- 
ity of the Nervous System. 2d ed. New York: Macmil- 
lan. > A readable and concise account of neuro- 
physiology. 

Brooks, S. M. (1961) 1966 Integrated Science. 2d ed. 
St. Louis, Mo.: Mosby. 

Devutscu, J, ANTHONY; and Deutscw, Diana 1966 
Physiological Psychology. Homewood, I1.: Dorsey. 

Fern, Jonn; Macoun, H. W.; and Harr, Victor E. (edi- 
tors) 1959-1960 Handbook of Physiology. 3 vols. 
Section 1: Neurophysiology. Washington: American 
Physiological Society. > A series of articles on virtu- 
ally all aspects of neurophysiology and its behav- 
ioral correlates. 

GARDNER, ERNEST (1947) 1963 Fundamentals of Neu- 
rology. 4th ed. Philadelphia: Saunders. + An excel- 


lent basic text on the structure and function of the 
nervous system, 


Hizcarp, ERNEST R. (1953) 1962 Introduction to Psy- 
chology. 3d ed. New York: Harcourt. 

Lurma, A. R. 1966 Higher Cortical Functions in Man, 
New York: Basic Books. > A review of studies on 
brain-damaged patients, including original Russian 
contributions. 

McCreary, ROBERT A.; and Moore, RoBERT Y. 1965 
Subcortical Mechanisms of Behavior: The Psychologi. 
cal Functions of Primitive Parts of the Brain. New 
York: Basic Books. > A summary of the subcortical 
organization of arousal, motivation, and emotion. 

Macoun, Horace W. (1958) 1963 The Waking Brain. 
2d ed. Springfield, Ill.: Thomas. -> An excellent treat- 
ment of the anatomical and physiological bases of 
waking, sleeping, and attention. 

Morcan, CLIFFoRD T. (1943) 1965 Physiological Psy. 
chology. 3d ed. New York: McGraw-Hill. -> A stand- 
ard and highly readable text on the biological basis 
of behavior, 

Oswatp, IAN 1962 Sleeping and Waking: Physiology 
and Psychology. Amsterdam and New York: Elsevier. 
> A general volume on sleep from both the physio- 
logical and the behavioral viewpoints. 

PEELE, TALMAGE L. (1954) 1961 The Neuroanatomic 
Basis for Clinical Neurology. 24 ed. New York: Mc- 
Graw-Hill. > An advanced text on neuroanatomy with 
clinical interpretations. 

PENFIELD, WILDER; and Ronerts, LAMAR 1959 Speech 
and Brain-mechanisms. Princeton Univ, Press. > An 
exhaustive account of the cerebral mechanisms of 
speech and language functions, based upon clinical 
observations of the brain-damaged and upon electrical 
stimulation of the human cortex. 

PONTIFICIA ACCADEMIA DELLE Scienze, Rome 1966 
Brain and Conscious Experience, Edited by J. C. 
Eccles. New York: Springer. > A fascinating sym- 
Posium of the research and thoughts of 22 dis- 
tinguished scientists on the neurology of conscious- 
ness. 

Rucu, THEopore C. et al. (1961) 1966 Neurophysiol- 
ogy. 2d ed. Philadelphia: Saunders, -> A thorough 
and advanced text on the electrophysiology and func- 
tion of the nervous system. 


m 
BRAIN STIMULATION 


One of the most powerful tools for the study of 
the interaction between brain function and be- 
havior has been stimulation of the brain. Experi- 
menters have employed a variety of techniques in 
order to induce an excitation or inhibition of the 
activity of the nervous system. Some of these in- 
clude the introduction of chemicals and drugs into 
the brain. Others have employed heat and cold. 
However, by far the most widely used technique 
has been the application of electric currents to 
brain tissue. The use of these various methods of 
brain stimulation has resulted in an impressive 
body of data on the structure and function of the 
brain and its role in the control of behavior. Using 
these methods, experimenters have been able to 
stimulate the brain in one region and record the 
brain’s response in another region, thus determin- 
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ing the functional connections between brain areas. 
Others have stimulated various brain sites and ob- 
served the overt responses of the organism under 
study. These responses span the range of behaviors 
from simple reflexes to complex patterns of re- 
sponses. Still other investigators have studied the 
effects of stimulation of specific brain areas on 
ongoing behavior in order to determine whether 
that particular brain area is involved in the main- 
tenance of the behavior in question. Space does not 
permit us to summarize all of the various methods 
of stimulation that have been utilized. Therefore, 
discussion will be limited to the effects of electrical 
stimulation, since this has been the method most 
generally employed. 

History. The study of the effects on behavior 
of electrical stimulation of the brain is as old as the 
study of electricity itself. Count Alessandro Volta 
(1745-1827), while observing the results of the 
application of electric currents to his various sense 
organs, passed the current through his own brain. 
Fortunately, he survived the experiment. Since that 
time, but particularly in the present century, many 
investigators have utilized electric currents to map 
pathways through the nervous system and to study 
the effects of stimulation of certain brain areas on 
behavior. Advances in electronic engineering have 
made available to the researcher more and more 
precise instrumentation, and it is now possible to 
stimulate tiny areas deep within the brain with 
only minimal damage to neural tissue. 

In 1870, G. Fritsch and E. Hitzig reported that 
an electrical stimulus applied to certain regions of 
the cerebral cortex could elicit movements of the 
face, arms, and legs of experimental animals. In 
1899, Charles S. Sherrington utilized electrical 
stimulation to demonstrate the reciprocal innerva- 
tion of flexor and extensor muscles. Although ex- 
periments such as these shed a good deal of light 
on the functioning of reflex pathways, little was 
learned about voluntary behavior since exposure of 
such large areas of the nervous system required 
extensive restraint and anesthesia. The work of 
W. R. Hess, however, represented a major tech- 
nical advance with the introduction of the first 
practical technique for permanent implantation of 
electrodes into the brain. The development of these 
chronic electrodes made possible the study of 
stimulation effects involving deep structures in 
awake, unrestrained, unanesthetized animals. 

Methodology. Hess’s electrode technique is still 
the basic method employed, although many labora- 
tories have carried out modifications to suit their 
own particular needs. Generally, thin, rigid, stain- 
less steel or platinum wires, insulated except at the 
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tip, are lowered into the brain through a small hole 
in the skull. The animal is deeply anesthetized. Its 
head is held in a metal frame called a stereotaxic 
instrument. This frame, a modification of a device 
originally conceived by Horsely and Clark (1908), 
not only provides firm support for the head during 
surgery, but also allows the experimenter to locate 
deep cerebral structures with a fair degree of ac- 
curacy. This is accomplished by means of a three- 
dimensional system of coordinates that locates any 
given brain structure by its position relative to 
some zero point. This is analogous to locating a 
region deep within the earth by means of its lati- 
tude, longitude, and distance from the surface. 
Stereotaxic atlases, or brain maps, of a number of 
common laboratory animals and of man are avail- 
ble. Many of these may be found in a book edited 
by Sheer (1961) along with illustrations of stereo- 
taxic instruments and detailed descriptions of 
electrode techniques. 

Because of the variability in skull dimensions 
among animals, the stereotaxic technique is not 
completely accurate. It is necessary, therefore, at 
the termination of the experiment to sacrifice the 
animal and verify the exact location of the elec- 
trode tip by microscopic examination of the brain. 

An important factor in the study of brain stimu- 
lation is the nature of the electrical stimulus. Di- 
rect current generally destroys nerve tissue. Indeed, 
prolonged application of direct current has often 
been used to produce lesions for experimental or 
clinical reasons. However, brief pulses of direct 
current, alternations of positive and negative 
pulses, and sine waves have all been used with 
good results in brain stimulation studies, provided 
that the current is not excessive. 

Sleep, wakefulness, and emotion. Using the 
technique of chronic implanted electrodes, Hess 
systematically explored the diencephalon of the 
cat. He has summarized his findings in two recent 
works (1949; 1954). In brief, Hess observed that 
stimulation of the massa intermedia of the thala- 
mus resulted in a progressive decrease of activity 
followed by sleep. The animal could be aroused 
from this sleep by some external stimulus, but once 
the external stimulus was removed, the animal 
returned to sleep. Stimulation of the posterior hy- 
pothalamus, however, resulted in immediate wake- 
fulness and a state of excitation. With stronger 
stimulation in this area, the cat would hiss, bare its 
teeth and claws, arch its back, and show all the 
signs of rage and fear. Upon termination of the 
stimulation, the rage reaction ceased. Moreover, if 
the cat was provoked during this period of stimula- 
tion, a highly organized attack reaction would be 
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directed toward the provocative object. Egger and 
Flynn (1962) have described a study, however, in 
which stimulation of the lateral nucleus of the 
amygdala suppressed the attack reaction elicited 
by hypothalamic stimulation. [See EMoTION and 
SLEEP.] 

Motivation. A variety of motivational effects 
have been evoked by electrical stimulation of the 
brain, particularly in the region of the hypothala- 
mus. Perhaps the most striking of these are the 
effects on food and water intake. 

Food intake. Delgado and Anand (1953) have 
demonstrated that cats stimulated in the lateral 
hypothalamic area will perform gnawing and chew- 
ing movements and will increase their food intake. 
It is interesting to note that the increased eating 
is not necessarily directed toward edible objects. 
Miller (1960) reports that upon presentation of 
stimulation to the hypothalamic feeding area, rats 
will gnaw on blocks of wood, sticks, etc., when food 
objects are not available. This behavior has been 
described as “stimulus-bound” eating. Miller also 
described a study in which rats with electrodes in 
the lateral hypothalamus had been trained to press 
a lever for food. The animals were later food- 
satiated and stimulated in the hypothalamus. Im- 
mediately following the onset of stimulation the 
animals began to press the food lever, When the 
Stimulation was terminated, the rats immediately 
stopped lever-pressing. 

Suppression of food intake by stimulation has 
also been reported. An investigation by Wyrwicka 
and Dobrzecka (1960) demonstrated that stimula- 
tion of the ventromedial nucleus of the hypothala- 
mus of hungry goats resulted in immediate cessa- 
tion of eating. 

Water intake, Less work has been done in the 
area of central nervous system control of other 
motivational mechanisms by means of stimulation. 
Chemical stimulation of the hypothalamus has been 
demonstrated to control water intake effectively 
(Grossman 1960). There is also evidence that elec- 
trical stimulation can influence the thirst mecha- 
nism (Andersson & McCann 1955). 

Sexual behavior. Vaughan and Fisher (1962) 
have demonstrated that stimulation of the lateral 
anterior hypothalamus results in a marked increase 
in sexual capacity in male rats. MacLean and Ploog 
(1962) have reported that stimulation of a number 
of forebrain and diencephalic areas in monkeys re- 
sults in erection of the penis and ejaculation. [See 
SEXUAL BEHAVIOR, article on ANIMAL SEXUAL BE- 
HAVIOR. | 
n Learning. The application of the techniques of 
intracranial stimulation to the study of the neuro- 
psychology of learning has been a fruitful one. The 


typical approach has been to compare the perform- 
ance of experimental organisms on a learning task 
with and without stimulation of some neural areas, 
This method has advantages over the traditional 
lesion technique, which requires elaborate control 
groups to assess the possible extraneous effects of 
surgical shock, blood loss, intracranial pressure, 
etc. In a stimulation study, each animal frequently 
serves as its own control, since the effects of stim- 
ulation are almost always reversible. However, 
certain learning tasks may necessitate the use of 
a control group with nonstimulated electrodes. 
Experiments employing electrical stimulation of 
the brain to investigate learning are numerous, A 
few examples of recent studies may serve to illus- 
trate some of the major trends in current research, 

Delayed alternation and visual discrimination. 
Rosvold and Delgado (1956) trained monkeys to 
perform a delayed-alternation task in which the 
animals were required to seek a peanut reward 
under the right food cup on one trial, under the 
left food cup on the next trial, under the right on 
the next, etc. A five-second delay was interposed 
between trials. Failure to alternate responses on 
successive trials was regarded as an error, The 
monkeys were also trained to perform a simple 
visual discrimination. Following stabilization of 
performance on these tasks, the monkeys were 
implanted with multiple-lead electrodes. Upon re- 
covery from surgery, the animals were again tested 
on the delayed-alternation and visual discrimina- 
tion problems. During a part of the testing session, 
electrical stimulation was delivered through one of 
the electrode leads. The experimenters found that 
performance declined markedly on the delayed- 
alternation test when the lead stimulated was in 
the caudate nucleus, However, the same stimula- 
tion failed to interfere with the visual discrimina- 
tion task. A later study by Buchwald and others 
(1961) revealed that stimulation of the caudate 
nucleus does interfere with acquisition of a visual 
discrimination task, but does not alter performance 
once the task has already been learned. 

Operant conditioning. In another type of eX 
periment, Knott and others (1960) trained cats to 
press a lever for meat reward in a modified Skinner 
box, an apparatus provided with devices, e.g. le 
vers or buttons, whose appropriate manipulation 
leads to some desired outcome. Electrodes were iM- 
planted in a number of deep cerebral structures. 
Continuous low-intensity stimulation was delivered 
during the lever-pressing task. The experimenters 
reported that stimulation of the hippocampus, cau- 
date nucleus, and thalamus failed to alter the lever- 
pressing rate. Stimulation in the septal area, how 
ever, resulted in a cessation of response during 
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stimulation. The animals resumed pressing the 
lever following termination of the stimulation, but 
only after a prolonged delay. Hypothalamic stimu- 
lation generally had the same effect as septal stim- 
ulation, except that the post-stimulation delays 
were shorter. The authors concluded that these 
data support the notion that certain neural path- 
ways critical to the mechanisms of learning and 
retention become “occluded” by the stimulation. 
The data could also be interpreted, however, in 
terms of interfering emotional responses evoked 
by stimulation of the hypothalamus and the septal 
area. 

Classical conditioning. A third type of investi- 
gation utilizes the classical Pavlovian conditioning 
paradigm except that both the conditioned stimu- 
lus (CS) and the unconditioned stimulus (US) are 
presented to the subject by means of implanted 
electrodes. Doty and Giurgea (1961) implanted 
electrodes in the motor area of the cerebral cortex 
of dogs, cats, and monkeys. Stimulation of these 
electrodes served as the US. The unconditioned 
response was the particular limb movement that 
resulted from the stimulation of a direct motor- 
pathway. Stimulation in some other cortical area 
served as the CS. Six to ten pairings of the CS and 
the US were made daily. The experimenters were 
able to obtain clear-cut evidence of conditioned 
reflexes with cortical stimulation as the US. 

Aside from their implications for learning the- 
ory, the experiments of Doty and Giurgea are im- 
portant because they suggest the possibility of 
studying the electrophysiology of the conditioning 
mechanisms in a simple form with known and 
well-defined inputs. 

Speech and memory. A series of remarkable 
experiments have been carried out by Penfield and 
Roberts (1959). These investigations have involved 
the electrical stimulation of the cerebral cortex of 
awake humans. The studies were carried out during 
the course of neurosurgical procedures for the treat- 
ment of epilepsy. During the operation, the comfort 
of the patient was maintained at all times by the 
careful administration of local anesthesia. In addi- 
tion to yielding highly detailed maps of the sensory 
and motor regions of the human cerebral cortex, 
this research has also indicated the location of 
areas that are involved in the mechanisms of speech 
and memory. Penfield has described regions on the 
temporal lobe in which electrical stimulation will 
result in the recall of vivid experiences of sights 
and sounds. The patients report that they feel as 
if they were reliving those events. Moreover, a spe- 
cific memory can be repeated by interrupting the 
stimulation and then quickly reapplying it. Penfield 
suggests that memory is organized on the temporal 
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lobe in somewhat the same manner in which elec- 
trical impulses representing visual and auditory 
patterns are stored on magnetic tape. Thus, when- 
ever a particular region of the brain is made to 
yield up its stored memories, the memories are 
recalled with the same vividness and clarity as 
when they were originally stored. He further sug- 
gests the existence of mechanisms in the brain that 
inhibit the retrieval of those stored memories, the 
electrical stimulation somehow bypassing or other- 
wise nullifying the influence of these inhibitory 
mechanisms. 

Penfield and his coworkers have also mapped 
areas of the cortex that upon stimulation produce 
vocalization, hesitation and distortion of speech, 
repetition of vocal sounds, interference with speech, 
and complete arrest of speech. In addition, stimula- 
tion of certain areas results in an inability to name 
specific objects although the remainder of the 
speech mechanism seems to be unimpaired. These 
areas are mainly included in the parieto-temporal- 
occipital cortex, although the effects have also been 
obtained from several other well-defined cortical 
regions, 

Self-stimulation. In an attempt to study the 
effects of subcortical stimulation on learning in 
rats, Olds and Milner (1954) placed a rat with a 
forebrain electrode in an open field maze. They 
observed that if the rat received stimulation in a 
particular place in the maze, it would spend more 
and more time in that place. The stimulation 
seemed to have a rewarding effect on the rat's 
behavior. Next, they placed the rat in a T maze. 
The animal learned to go to that arm of the maze 
in which it was rewarded with brain stimulation, 
Microscopic examination of the rat's brain revealed 
that the electrode tip was in the vicinity of the 
anterior commissure. Rats used subsequently were 
trained to deliver the stimulation to themselves by 
pressing a lever in a Skinner box. These animals 
would deliver several hundred stimulations per 
hour to themselves. The stimulation served as the 
sole reward for pressing the lever; food and water 
were never present in the box. 

Since the original report by Olds and Milner, 
many of the variables affecting the self-stimulation 
phenomenon have been explored. Some of the prin- 
cipal variables have been the animal species; the 
location of the electrodes in the brain; the intensity, 
frequency, and duration of stimulation; the moti- 
vational and emotional state of the organism; and 
the schedule of reward. In addition, the effects of 
drugs on self-stimulation have also been studied. 

Species. Self-stimulation of the brain has been 
clearly demonstrated in the following animals: rat, 
dog, cat, pigeon, monkey, dolphin, guinea pig, gold- 
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fish, and man. At this point, experimenters would 
be more interested in learning of a vertebrate in 
which the phenomenon could not be demonstrated 
than in further additions to the list of positive 
instances, 

The demonstration of self-stimulation in man is 
of particular interest since this is the only species 
that can provide verbal reports about the subjective 
experience of brain stimulation. Sem-Jacobsen and 
Torkildsen (1960) have reported cases in which 
electrodes were implanted in the brains of human 
patients for several months in the course of treat- 
ment for Parkinson's disease. During exploratory 
stimulation, the experimenters encountered several 
regions of the forebrain in which the patients 
seemed to enjoy the stimulation. They would smile 
or grin and express a desire for repeated stimula- 
tion, If given an opportunity to press a button to 
deliver the stimulation to themselves, the patients 
would press the button often. Their verbal reports 
ranged from descriptions of tickling sensations to 
expressions of satisfaction and euphoria. 

Anatomical variables. In general, self-stimula- 
tion has been demonstrated in those structures of 
the brain that form part of the limbic system or 
have strong anatomical connections with it (Olds 
1956). Application of stimulation in purely sensory 
or motor areas does not appear to have any reward- 
ing effect. Moreover, there are areas in which the 
effect of the stimulation appears to be punishing, 
and animals will learn to escape and avoid stimu- 
lation in these areas (Brown & Cohen 1959). Many 
of these negative regions correspond to the areas 
that, upon stimulation, produce rage and attack 
reactions, 

Within the positive reward system, rates of self- 
stimulation vary widely from area to area and from 
species to species. Thus, a rat may self-stimulate 
800 to 1,000 times per hour when the locus of 
stimulation is the septal area, 2,000 times per hour 
when it is the hypothalamus, and 4,000 times per 
hour when it is the tegmentum, Furthermore, the 
reported rates of responding may vary considerably 
among laboratories because of differences in ap- 
paratus. For example, it is possible to obtain higher 
response rates with a telegraph key lever than with 
a microswitch lever. 

If an animal presses a lever 1,000 times per hour 
for septal stimulation and 2,000 per hour for hypo- 
thalamic stimulation, is it safe to conclude that the 
hypothalamic stimulation is more rewarding to the 
animal? An experiment by Hodos and Valenstein 
(1962) Suggests that it may not necessarily be. 
Rats were given a choice, in a two-lever box, be- 
tween septal stimulation and hypothalamic stimu- 


lation. These were presented at various intensities, 
When presented with high-intensity septal stimula- 
tion on one lever and low-intensity hypothalamic 
stimulation on the other lever, the animals showed 
a clear preference for the lever producing septal 
stimulation once they discovered it, even though 
their response rates were much higher on the lever 
producing hypothalamic stimulation. Considering 
these findings, it would seem hazardous to draw 
general conclusions about the relative reward 
values of stimulation in different neural areas 
based solely upon rate of responding. Direct- 
preference tests at several intensities of stimula- 
tion provide data that are less subject to interpretive 
ambiguities resulting from the interfering influ- 
ence of motor side-effects produced by the stim- 
ulation. 

Stimulation variables. Reynolds (1958) dem- 
onstrated that as intensity of stimulation increased, 
the rate of responding increased, passed through a 
maximum, and then declined. Keesey (1962) con- 
firmed this finding and, in addition, reported sim- 
ilar effects with variation in stimulation frequency 
and duration. 

This decline in self-stimulation rate with high 
intensities of stimulation should not necessarily be 
interpreted as a decline in the rewarding properties 
of the stimulation. Hodos and Valenstein (1962) 
reported that when given the choice between two 
intensities of stimulation in the same area, rats 
consistently chose the higher of the two intensities, 
even though they self-stimulated at a lower rate for 
the higher intensity. 

Motivational factors. Brady and others (1957) 
observed that animals would self-stimulate faster 
when deprived of food or water than they would 
when they were satiated. This was later confirmed 
by Olds (1958), who further reported that injec- 
tions of male sex hormones in castrated male rats 
also increased the rate of electrical self-stimulation. 
However, Hodos and Valenstein (1960) failed to 
find any effects on the septal self-stimulation rate 
in spayed female rats injected with female sex 
hormones. Recently, Hoebel and Teitelbaum (1962) 
have demonstrated an interesting correlation be- 
tween the hypothalamic areas controlling feeding 
behavior and those yielding self-stimulation. They 
Suggest that the feeding system may control self- 
stimulation in a manner similar to its control of 
food intake. 

Emotion. Brady and Conrad (1960b) demon- 
strated an interesting effect of self-stimulation on 
emotional behavior. Rats were trained to press 4 
lever for either intracranial stimulation or water 
reward. Periodically, an auditory stimulus was pre- 
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sented for a period of five minutes. At the termina- 
tion of this stimulus, a painful electric foot-shock 
was administered to the animals. When the ani- 
mals were pressing the lever for water, presenta- 
tion of the auditory stimulus produced a clear sup- 
pression in the rate of responding. This has been 
described as conditioned “fear” or “anxiety” in pre- 
vious studies. No conditioned “fear” response to 
presentation of the auditory stimulus could be 
elicited, however, when the animals were lever- 
pressing for brain stimulation in the medial fore- 
brain area. Moreover, the possibility that the ani- 
mals were unable to hear the auditory signal during 
the brain stimulation has been eliminated by Beer 
and Valenstein (1960), who showed that rats could, 
with little difficulty, make auditory discriminations 
when the auditory signals were presented simulta- 
neously with brain stimulation. 

Reward schedules. Sidman and others (1955) 
presented data illustrating the point that behavior 
rewarded by brain stimulation on either a fixed- 
ratio or a variable-interval schedule generally has 
the same characteristics as food-rewarded behavior 
on the same schedule, However, Brodie and others 
(1960) reported that fixed ratios of more than 20 
responses to each stimulation are difficult to main- 
tain in monkeys unless very slow and gradual train- 
ing is permitted, Fixed ratios of several hundred 
responses to each reward are not at all uncommon 
when animals are rewarded with food. Moreover. 
these researchers report far less resistance to ex- 
tinction of performance when brain stimulation is 
the reward than has often been observed when the 
schedule is equivalent but the reward is food. 

In a study of timing behavior, Brady and Conrad 
(19602) required monkeys to space their responses 
at least 20 seconds apart in order to receive a re- 
ward. In the case of food reward or anterior tha- 
lamic stimulation reward, the animals had no diffi- 
culty in delaying their responses for the required 
period of time and thereby received a large percent- 
age of the possible rewards. The most frequent 
interresponse time was 20 seconds. However, in 
the case of stimulation of the globus pallidus, the 
animals were unable to delay their responses suffi- 
ciently and thereby received few rewards. The most 
frequent interresponse time was 10 seconds. There 
is some suggestion that the stimulation may have 
interfered with the mechanism of time perception 
in these animals. 

Pharmacological effects. Olds and others (1956) 
as well as other workers have demonstrated that 
Some drugs can affect self-stimulation perform- 
ance. Chlorpromazine and reserpine, both tranquil- 
izing drugs, each depressed the self-stimulation rate 
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in some areas, but not in others. However, pheno- 
barbital was not observed to produce any specific 
effect on performance. 

Some general problems. When an experimenter 
produces a change in behavior by stimulation of a 
brain area, there is a great temptation to speculate 
on the possible role of that area in the mechanisms 
underlying the behavior. However, such specula- 
tions should be made cautiously, because the elec- 
trical stimulus may not have the same effect on all 
neural areas. For example, we have seen that stim- 
ulation of the caudate nucleus results in a deficit 
in delayed-alternation performance and that stimu- 
lation of the amygdala suppresses rage reactions. 
These behavioral deficits are the same as the effects 
of lesions in those neural areas. Therefore, it seems 
likely that the electrical stimulation had a sup- 
pressing effect upon normal function. On the other 
hand, we have seen that electrical stimulation of 
the lateral hypothalamus yields eating responses 
while ventromedial hypothalamic stimulation re- 
sults in suppression of eating. These effects are 
the opposite of those observed when lesions are 
made. Presumably, in this case, the stimulation was 
augmenting the activity of the feeding areas. Thus, 
the effects of stimulation studies alone are not suf- 
ficient for determining the role that a cerebral 
structure may play in behavior. 

A second and related problem is that of attempt- 
ing to generalize from the highly unnatural type of 
stimulation that experimenters introduce into the 
brain to the normal types of physiological events 
present in the nervous system. The two may not 
necessarily have the same effect on neural tissue, 
and cautious interpretation of data is required. A 
detailed discussion of the problems of the interpre- 
tation of the behavioral effects of electrical stimu- 
lation of the brain may be found in a recent paper 
by Valenstein (1964). 

A third problem is that of the possible interaction 
between brain stimulation and other environmental 
variables that may be affecting the behavior under 
study. We have seen that caudate nucleus stimula- 
tion will markedly interfere with visual discrimina- 
tion performance if the animal is still in the process 
of acquiring the discrimination. However, the ef- 
fects will be scarcely detectable once the animal 
has mastered the problem. Similar difficulties may 
arise in the study of emotional behavior, memory, 
and perception. Therefore, a thorough knowledge 
of the environmental variables that influence be- 
havior is essential before attempting to study the 
effects of stimulation on behavior. 


WILLIAM Hopos AND JOSEPH V. BRADY 
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[Other relevant material may be found in LANGUAGE, 
article on SPEECH PATHOLOGY; LEARNING, article on 
NEUROPHYSIOLOGICAL ASPECTS; MENTAL DISORDERS, 
article on ORGANIC ASPECTS; PSYCHOLOGY, article on 
PHYSIOLOGICAL PSYCHOLOGY; SLEEP.] 
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Iv 
ELECTROENCEPHALOGRAPHY 


Electroencephalography derives its name from 
the fact that it provides a graph or recording of 
the electrical activity of the brain, or encephalon. 
In human subjects—except when the brain surface 
is exposed during a surgical operation—the electro- 
encephalogram (EEG) is always recorded from 
electrodes distributed over the surface of the scalp. 
A variety of electrodes have been used in the past, 
but at the present time two principal types are in 
use, One type consists of silver dish-shaped or cup- 
shaped electrodes, 5 millimeters in diameter, under 
which a conductive paste or electrode gel serves as 
an electrolyte between the surface of the scalp and 
the electrode. The other type consists of a fine- 
gauge stainless-steel needle, which is painlessly 
inserted through the scalp. The electrodes are con- 
nected by fine insulated wires to a plug-in box 
which identifies numbered locations on the surface 
of the head, usually according to a system agreed 
upon by the International Society of Electroenceph- 
alography (Jasper 1958) and known as the 10-20 
system. The locations of the electrodes on the head 
are adjusted according to the over-all head dimen- 
sions by measuring from the nasion (bridge of the 
nose) to the inion (external protuberance at the 
back of the skull) and by defining electrode posi- 
tions at intervals of 10 or 20 per cent of the total 
distance; measurements from the auditory meatuses 
over the top of the head are similarly fractionated. 
Finally, measurements of the circumference of the 
head enter into the placement of the electrodes. 
Such procedures insure that electroencephalog- 
raphers will use a uniform system of deriving the 
potentials on which to base their reports. 

The number of electrodes placed on the head 
depends upon whether only a survey of representa- 
tive areas is to be made or whether a detailed 
comparison of many areas is to be made in order 
to localize some relatively focal disorder, as in the 
cases of temporal lobe epilepsy, of a local region 
of trauma, or of a brain tumor. 

r From the plug-in box, where electrodes represent- 
ing the various areas converge, a larger, electri- 
cally shielded cable enclosing all of the individually 
insulated wires carries these to the electroenceph- 
alograph. The EEG recording unit consists of volt- 
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age amplifiers and power amplifiers and some type 
of write-out and/or display unit. Since the potential 
differences between any two electrodes on the sur- 
face of the scalp are of the order of millionths of a 
volt and usually range from 2 or 3 microvolts to 
100 microvolts—except in the case of some path- 
ological discharges which may reach 500 microvolts 
—these minute potentials must be amplified at 
least a millionfold in order to be recordable or 
visible on the display unit. The voltage amplifiers 
are responsible for the amplification; the power 
amplifiers, as their name implies, provide an in- 
crease in the power of the signals so that a rel- 
atively insensitive recording unit may be driven by 
the variations in the alternating current. The am- 
plifiers themselves are usually of the resistance- 
capacity coupled type with a time constant suitable 
for amplifying and transmitting the potentials to 
the recording unit without appreciable distortion. 
The recording unit is typically an inkwriting oscillo- 
graph, although it may be a cathode-ray oscillograph 
or an optical oscillograph. The latter types require 
the recording of the deflections of a light beam on 
photographic film or paper. The inkwriting oscillo- 
graph consists of a pen motor which converts the 
electrical alternations from the brain into mechan- 
ical movements of the pen, tracing in the form of 
an arc the variations of potential as they occur. 
These oscillations of the tip of a stylus or pen upon 
paper moving at a uniform rate provide a tracing 
or recording upon the paper, which is a form of 
graph, with time represented on the abscissa and 
voltage on the ordinate, Thus, the frequency, mag- 
nitude or voltage, and the pattern of the waves so 
traced on the paper may be studied and measured 
relative to standards or norms of expectancy estab- 
lished for normal subjects. Abnormal or pathologi- 
cal deviations may be noted and contrasted with 
normal patterns. 

History. Now let us return to the history of 
electrical activity of the brain. Caton (1875), an 
English physician, was the first to publish an ac- 
count of the recording of electrical variations from 
the surface of the exposed brain of the rabbit. His 
findings were confirmed in a study of the dog by 
Russian and Polish investigators (see Brazier 1959), 
during the last quarter of the nineteenth century, 
but little further use of the phenomenon in inter- 
preting the brain’s activities and functions was 
made until Hans Berger (1929), a German neuro- 
psychiatrist, published the first report on the re- 
cording of the human EEG. Since Berger's record- 
ings showed predominantly wavelike oscillations, 
the classical neurophysiologists of this period, being 
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acquainted only with spikelike deflections due to 
transient electrical potentials observed in peripheral 
nerves, looked upon his findings with doubt and 
felt that the waves might result from artifacts 
rather than brain activity.. However, Adrian and 
Matthews (1934), distinguished electrophysiolo- 
gists at Cambridge University, England, verified 
Berger's findings and put their stamp of approval 
upon the work. Thus, for the first time it was rec- 
ognized that at least two types of electrical activity 
exist in the nervous system: (1) when a nerve or 
neuron is excited and transmits an impulse, it is 
accompanied by a sharp, spikelike negative poten- 
tial and some wavelike after-potentials; (2) aggre- 
gates of neurons with their cell bodies and dendritic 
processes, such as are found in the cerebral cortex 
of the brain, give rise to continuous, autonomous 
electrical changes of wavelike nature. 

The spike discharge of the neuron is a discrete 
event and tends to occur only when the neuron is 
excited by impulses reaching it from a receptor or 
another neuron. In contrast the wavelike activity 
of large numbers of neurons closely arrayed struc- 
turally appears not to be dependent upon sensory 
input or excitation. 

Patterns of brain activity. The EEG is com- 
posed principally of this wavelike activity, which 
in the human is continuously present in one form 
or another as long as the brain is alive. Typically, 
with the eyes closed and with the subject or patient 
relaxed and not thinking about anything in par- 
ticular, Berger found, in the adult, that rhythmic 
waves, which he called alpha waves, were emitted 
at a rate of 10 per second. He noted that the fre- 
quency of these waves ranged from about 8 to 12 
per second in different individuals, including older 
children and adults, but that very young children 
had slower alpha waves. We now know that alpha 
waves first appear in regular sequence over the 
sensory areas of the brain at about 3 months of age, 
at a frequency of 3 to 4 waves per second. By 1 year 
of age they have increased to 5 or 6 waves per 
second, and by 10 or 12 years they have reached 
the adult frequency, averaging about 10 per sec- 
ond. Such waves are remarkably constant in fre- 
quency, varying less than 1 cycle per second in a 
given individual, from one time to another—from 
moment to moment, from day to day, or from 
month to month. Anything which increases or de- 
creases metabolic rate will shift the frequency, up 
or down respectively, 1 or 2 cycles per second. For 
example, in hypothyroidism, where metabolic rate 
is lowered, the alpha frequency tends to be 1 or 
more cycles slower than the norm for that indi- 


vidual or the ayerage for a normal group; hyper- 
thyroidism, with increased metabolic rate, tends to 
increase the alpha frequency. The administration 
of thyroxine, a thyroid extract used in the treat- 
ment of hypothyroidism, increases the alpha-wave 
frequency from its low level. Artificially increasing 
the body temperature by diathermy or other means 
usually increases alpha frequency accordingly; also, 
natural fevers tend to increase alpha frequency. 
Some people believe that the alpha rhythm consti- 
tutes a basic time clock in the nervous system, regu- 
lating or modulating other rhythmic processes of 
the brain and body and possibly playing a role in 
our estimation of time [see TIME, article on psy- 
CHOLOGICAL ASPECTS]. 

It is normal to have an alpha rhythm when 
awake, relaxed, and not being stimulated. Also, it 
is normal for the alpha rhythm to be blocked by 
any type of stimulation which attracts the attention 
of the subject and alerts him or arouses him from 
his temporary quiescence or daydreaming. The 
alpha rhythm quickly recovers from this blocking 
and continues with its oscillations until another 
stimulus occurs. If the same, identical stimulus is 
repeated a number of times at more or less regular 
intervals, the subject tends to ignore it, and the 
brain’s response of blocking the alpha rhythm dis- 
appears. This state of adaptation or lack of re- 
sponsiveness is sometimes referred to as habitua- 
tion. As soon as the quality of the stimulus changes 
in either intensity, kind, or another characteristic, 
the blocking of the alpha rhythm will return, as 
it does in the case of any novel stimulus which 
attracts attention [see ATTENTION]. 

The alpha rhythm, or other characteristics of the 
EEG, are not significantly correlated with intelli- 
gence or personality traits. For references to numer- 
ous early studies of this relationship, as well as 
more detailed descriptions of the EEG, see extensive 
reviews by Jasper (1937) and Lindsley (1944). 

Sleep. The electroencephalogram has proven to 
be a useful correlate of psychological conditions. 
In the case of the transition from a waking state 
to sleep, in human subjects as well as lower ani- 
mals, there are a series of fairly well-defined stages. 
There is an alpha rhythm of about 10 waves per 
second in normal, adult humans who are relaxed, 
awake, and free of special sensory stimulation. AS 
drowsiness occurs the alpha rhythm slows slightly, 
decreases slightly in voltage, and shows brief periods 
of subsidence. As the drowsiness increases to the 
point where the subject is not aware of events in 
his environment, slower waves (3 to 8 per second) 
begin to appear, interspersed with declining alpha 
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waves, and as the first true state of sleep occurs, 
there appears in the tracing periodic bursts of 
waves, called sleep spindles, at about 14 per second. 
In the next stage of sleep the spindles disappear 
and niore and slower waves of higher voltage ap- 
pear. Beyond this point, the waves become still 
slower and more prominent, ranging from 1 to 3 
per second; these are called delta waves. At this 
time the state of sleep is deep, and it is more diffi- 
cult to arouse the subject from his slumbers by 
various kinds of sensory stimuli. More recently, 
however, a still deeper stage of sleep has been 
described in which muscles of the limbs, body, and 
neck become more relaxed and the EEG shifts from 
large and very-low-frequency slow waves to fast, 
low-amplitude waves similar to those seen when a 
person is awake and alerted or excited. This state 
of sleep is sometimes called paradoxical sleep, 
because while the subject appears to be sleeping 
deeply and is difficult to arouse, his EEG appears 
to be that of a waking person. Such periods occur 
within an hour after going to sleep and last only a 
few minutes but recur again and again during the 
course of the night's sleep, usually several times 
for durations of 5 to 15 minutes. It was discovered 
by Kleitman ([1939] 1963) and his associates, 
Aserinsky and Dement, that rapid eye movements 
occur during these comparatively brief periods in 
the course of the night’s sleep; more recently these 
periods have been referred to as REM-sleep, that is, 
rapid-eye-movement sleep. If a subject is awakened 
during one of these REM states, he usually reports 
that he has been dreaming, but he is less apt to 
have been dreaming if awakened during the slow- 
wave stage of deep sleep. Consequently, REM-sleep 
is associated with dreaming, and there is some evi- 
dence (Dement 1965) that a person needs a certain 
amount of this type of sleep to remain mentally 
healthy. [See DREAMS and SLEEP.] 

Contrary to widespread belief, fostered by adver- 
tising and commercial agencies selling “learn dur- 
ing sleep” apparatus or programs, no well-controlled 
and validated studies using the EEG to monitor the 
presence and depth of sleep show that learning 
does indeed occur during sleep. Simon and Em- 
mons (1955) have reviewed this subject. 

f Waking and anesthetic states. As we have seen 
in the instance of transitions from wakefulness to 
sleep, variations in the state of consciousness or 
awareness of one’s environment are correlated with 
marked changes in the EEG. When a person is 
alerted, excited, or aroused, EEG activity is low, 
fast, and not particularly rhythmic; when a person 
is awake, relaxed, and not very attentive—as in 
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reverie or daydreaming—there are good rhythmic 
alpha waves; at the onset of true sleep, the alpha 
waves are displaced by slower waves and spindle 
bursts at the rate of 14 per second; in deep, quiet 
sleep without complete relaxation, large, billowy, 
slow waves are the rule; finally, in the most relaxed 
state, but with REM and dreaming, there are low- 
amplitude fast waves again. 

During the anesthetic state, in passing from one 
plane of anesthesia to another, similar sequences 
occur but vary with the type of anesthetic used. In 
general, it is believed that in both sleep and the 
anesthetic state a core structure of the lower brain 
stem, known as the reticular formation, is de- 
pressed or inhibited. The reticular formation can 
be activated by all types of sensory stimulation; in 
fact, it is generally referred to as a nonspecific 
sensory-arousal mechanism, which appears to keep 
the brain awake and alert, and perhaps even atten- 
tive, when properly sensitized to incoming sensory 
stimuli. It may also be affected by neurohumoral 
substances and endocrine secretions and by other 
constituents of the blood stream and cerebrospinal 
fluid. These physiological and biochemical factors 
are important in the functioning of all brain and 
nervous-system cells, and the activity of these 
neurons is reflected in their electrical activity, of 
which the EEG provides one measure. [See PAIN.] 

Neurological disorders. Although in the past 
the EEG has not proved very useful in differenti- 
ating psychiatric patients from normal subjects or 
in distinguishing different personality character- 
istics, it has been a very valuable supplementary 
tool for the neurologist and neurosurgeon. It quickly 
proved its value in connection with a variety of 
neurological disorders. 

Perhaps most outstanding in this respect were 
the epilepsies. The minor attack, or petit mal, has 
a characteristic signature known as the spike-and- 
slow-wave pattern, which recurs about three times 
per second. In the major attack, grand mal, where 
there are convulsions and loss of consciousness, 
the pattern of the EEG begins with increased- 
amplitude fast activity associated with stiffening 
and loss of consciousness, known as the tonic state, 
and shifts to grouped activity as the convulsive 
movements occur; finally, after the series of con- 
vulsions ceases and the EEG quiets to an almost 
isoelectric state, a period of coma and marked re- 
laxation exists, Generally, this state of the EEG and 
behavior lasts only several seconds. Then the EEG 
develops a deep-sleeplike pattern with large, billowy 
waves, and the patient appears to sleep deeply for 
a number of minutes or even a few hours before 
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awakening, usually with return of the EEG to a 
fairly normal status. 

A concussion produced by a blow on the head 
with temporary or prolonged loss of consciousness 
is accompanied by large, slow delta waves. A focal 
injury to the brain, caused by a rupture of a very 
small blood vessel (petechial hemorrhage) or a 
penetrating wound, gives rise to aperiodic spikelike 
discharges in that local region and, thus, helps to 
localize the source of the damage or injury. En- 
cephalitis and other infections which damage tis- 
sue have toxic effects or create pressures which 
usually produce some kind of aberration in the 
EEG. Thus, the neurologist and the neurosurgeon 
find the EEG useful as a supplement or adjunct to 
their other techniques of detecting and localizing 
brain injuries or in diagnosing the source or cause 
of a neurologic disorder, The merit of the EEG is 
that it is a painless and harmless procedure and 
has no aftereffects, as do some neurologic test pro- 
cedures. Almost every major hospital in the coun- 
try and many lesser ones now have some facility 
for EEG recordings and for using these not only as 
a diagnostic tool but also as a tool measuring the 
effects of anesthetics in the operating room or 
measuring the effects of particular drugs or treat- 
ment procedures. 

Sensory processes. The most recent develop- 
ment involving the EEG is the recording of average 
evoked potentials over sensory areas of the brain 
by means of computer summation of responses that 
are time-locked to their stimuli; the stimuli are re- 
peated as many as a hundred times in order to 
build up a reliable average. These responses from 
sensory areas, and even from association areas, are 
very small in comparison to the ongoing EEG ac- 
tivity; their voltage generally is of the order of five 
to ten microvolts, whereas the alpha waves and 
other EEG activity may be several times larger. 
Using these computer-averaged responses, the 
senses of vision, audition, and bodily sensation 
have been studied (see Sensory Evoked Response 
im Man 1964), Thus, human sensory experience 
may be studied by judgmental or psychophysical 
methods and, at the same time, by electrical re- 
cording from the particular area of the brain con- 
cerned. Other sensory-related functions have been 
Studied with interesting results. Haider and his col- 
leagues (1964) demonstrated the usefulness of the 
computer-averaging methods in relation to an ex- 
periment on attention and vigilance maintained 
over time, They found that as attention diminished 
generally, so that a subject no longer detected cer- 
tain dim flashes imbedded in a series of brighter 
flashes with the same percentage of accuracy, the 


magnitude of the evoked response diminished. As 
vigilance waned, so did the voltage of the evoked 
potentials; however, if a subject had a five-minute 
period in which he detected all interspersed sig- 
nals, the magnitude of his evoked potentials for 
that period was as great as at the beginning of the 
task. In another experiment, Spong and his col- 
leagues (1965) demonstrated that by directing the 
attention of a subject to a particular class of sensa- 
tion—vision, for example—and by telling him to 
ignore audition, even though flashes and clicks 
alternated as stimuli throughout, the magnitude of 
the average evoked potential was always greater 
over the visual area when attention was directed 
toward visual stimuli and auditory stimuli were 
ignored; the opposite was true for responses over 
the auditory area. Thus, there is significant evi- 
dence that attention and vigilance have a central 
response which may now be recorded and studied 
in relation to psychological set or instruction. In 
addition, Walter has found by a similar method, 
using direct-current recording, that giving a subject 
an instruction and an “expectancy” leads to a 
build-up of potential until that expectancy is met, 
either correctly or incorrectly, at which time a dis- 
charge occurs (Walter et al. 1964). He calls this a 
“contingent negative variation.” It is a brain proc- 
ess which undoubtedly plays a role in decision mak- 
ing and other higher cognitive functions, and will 
prove to be a valuable probe for getting at problem 
solving, thinking, and other functions of a higher 
level. [See ATTENTION; HEARING; SENSES; SKIN 
SENSES AND KINESTHESIS; VISION.] 


The electroencephalogram has played a very use- 
ful role in the hands of the neurophysiologist and 
neuropharmacologist in the study of the electrical 
activity of the brain. It has proved to be a very 
considerable stimulus to brain study in animals and 
man, because it provided a parameter of measure- 
ment not previously possible. The psychologist and 
neurophysiologist today frequently combine their 
talents in the study of brain and behavior correla- 
tions, and through this work a better understand- 
ing of the mechanisms and organization of the 
brain and nervous system is coming about. Thus, 
the electroencephalogram is an experimental tool 
useful in research on the brain, as well as a clinical 
tool useful in the clinic and hospital. 


Dona p B. LINDSLEY 


[Other relevant material may be found in ATTENTION; 
DREAMS; ELECTROCONVULSIVE SHOCK; HEARING; 
LEARNING, article on NEUROPHYSIOLOGICAL ASPECTS} 
MENTAL DISORDERS, TREATMENT OF, article on SO- 


MATIC TREATMENT; PSYCHOLOGY, article on PHYSIO- 
LOGICAL PSYCHOLOGY; SENSES; SLEEP; VISION; and 
in the biographies of FLOURENS and LASHLEY.) 
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NEUROSIS 


Of the concepts which modern psychology has 
introduced into current vocabulary and thinking, 
“neurosis” and its adjectival form “neurotic” are 
among the most significant. Applied to individual 
behavior the terms have been used so loosely and 
so widely that they have become at times synony- 
mous with morbid, perverse, inexplicable, ab- 
normal, compulsive, and uncontrollable. Since neu- 
rosis is so common and neurotic behavior patterns 
are so varied, there is some justification for the 
overly generalized, although inexact, use of these 
terms. 

The more precise word for the conditions sub- 
sumed under neurosis is “psychoneurosis,” a term 
which is closer in its etymology to the way we 
presently understand the nature of these condi- 
tions. Currently, psychoneurosis (or neurosis) des- 
ignates any of a number of psychic or mental dis- 
orders which are accompanied by no demonstrable 
structural or organic change but which result in 
disorganization of personality and mental func- 
tion, This disorganization, however, must not be so 
severe as to impair reality testing and the use of 
language or to lead to extensive primitivization of 
behavior. These three limiting conditions serve in 
a rough way to distinguish neurosis from psy- 
chosis. Because quantitative assessment of mental 
functioning is involved, this distinction is not al- 
ways easy to make. This is particularly true in the 
so-called “borderline” conditions. [For further dis- 
cussion of this aspect of the problem, see Psy- 
CHOSIS.] 

There is a certain irony of history that in both 
popular and technical usage the shortened form of 
the word—neurosis—has come to replace the more 
accurate form—psychoneurosis. Actually, neurosis 
is the earlier term. It was coined by William Cullen 
in the late eighteenth century, in keeping with his 
concepts of pathology. Earlier theories of psycho- 
pathology were based on the assumption that dis- 
orders of mental functioning were related to dis- 
turbances of the circulation of the blood or to 
noxious elements carried by the blood stream to 
various parts of the body. Cullen, a pioneer neuro- 
pathologist, advanced the idea that a disturbance 
of the function of the nerves is the basis for neu- 
rosis. To this day, however, it has not been pos- 
sible to substantiate this hypothesis. 

A completely different approach to the concept 
of neurosis was initiated, however unwittingly, by 
Cullen’s contemporary, the controversial hypnotist 
Anton Mesmer, Although the supernatural aspects 
of Mesmer’s animal magnetism were soon dis- 
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missed by the scientific community, hypnotism as 
a method for investigating the mind continued to 
hold the interest of many physicians, This trend 
was culminated by the work of Charcot, who used 
hypnotism to demonstrate that hysterical symp- 
toms were genuine, not simulated. According to 
Charcot and his pupil Janet, the essential difficulty 
in neurotic conditions was neurophysiological. For 
various reasons usually determined by heredity, the 
nervous energy of the brain was not strong enough 
to integrate its many functions, Some functions 
became dissociated and pursued an independent, 
autonomous course, When these dissociated ele- 
ments superseded the normal mental organization, 
they became the symptoms of hysteria. [See Hyr- 
Nosis; HYSTERIA; the biographies of CHARCOT; 
JANET; MESMER.] 

Freud also used neurophysiology as his frame of 
reference in his early studies of neuroses, which 
he too pursued by means of hypnosis. Later, he 
discarded hypnosis and substituted for it the tech- 
nique of studying the free associations of his sub- 
jects under the standard conditions of the psy- 
choanalytic situation. From the data obtained by 
this method, Freud concluded that neuroses re- 
sulted from disturbances of the physiology of the 
sexual drive, specifically from inadequate or ab- 
normal discharge of sexual energy, which he desig- 
nated libido, 


Classification 


Neuroses, Freud wrote, could be grouped into 
two major categories according to the origins of 
inadequate discharge of libido, In one group this 
could be traced to unfavorable sexual practices, 
This group of neuroses he called the actual neu- 
roses, Excessive masturbation, for example, dissi- 
pates the libido, causing listlessness, apathy, weak- 
ness, etc.—the clinical picture of psychasthenia. 
Abstinence and coitus interruptus, on the other 
hand, lead to an accumulation of libido, to a pent- 
up state. The libido is ultimately discharged ab- 
normally along subcortical pathways, giving rise to 
apprehensiveness, irritability, tachycardia, perspi- 
ration, and breathlessness—the clinical Picture of 
anxiety neurosis. [See ANXIETY, ] 

In the second group of neuroses, libidinal dis- 
charge is blocked not by physical causes but by 
psychological ones, Accordingly, Freud called this 
group the psychoneuroses. In the psychoneuroses, 
the symptoms represent not somatic but psycholog- 
ical derivatives of the sexual impulse. Because of 
their primitive and disturbing nature, such im- 
pulses are denied access to consciousness and their 
energy is barred from discharge by a process which 


is called repression. Under certain conditions, to 
be discussed below, the energy of the repressed 
sexual drive partially overcomes the barrier of re- 
pression and achieves disguised and distorted ex- 
pression in the form of symptoms. In contradis- 
tinction to the symptoms of actual neuroses, the 
symptoms of psychoneuroses have a mental con- 
tent which, like dreams, can be translated into 
ordinary verbal language once they are properly 
understood. What determines the nature of a par- 
ticular psychoneurotic symptom is what happens to 
the libido. In obsessions, it is displaced from its 
proper context onto some ordinarily insignificant 
thought; in phobias, it is projected onto some sub- 
stitute object, which is then avoided; in hysteria, 
the sexual energy is converted into abnormal inner- 
vations, causing paralyses and sensory disturb- 
ances. [See OBSESSIVE-COMPULSIVE DISORDERS; 
PHOBIAS.] 

Using the same nosological principle, that is, the 
disposition of libido, Freud made.a further division 
of the psychoneuroses, a division which enabled 
him to incorporate the psychoses into his system of 
Psychopathology. He divided the psychoneuroses 
into those conditions in which the libido is vested 
in mental representations of other persons from 
those in which the libido is vested almost exclu- 
sively in representations of the self. 

In the first group he observed that the symptoms 
were substitute expressions of sexual wishes origi- 
nally directed toward the significant figures of one’s 
childhood. During treatment of neurotic patients 
these wishes of childhood are transferred onto the 
therapist. Accordingly, he called these neuroses 
transference neuroses. This group includes the 
more familiar psychoneurotic entities, for exam- 
ple, hysteria, obsessive-compulsive neurosis, and 
phobia, 

The situation is different in case of psychoses. 
Transferences presumably do not take place, and 
the libido remains fixed upon the self, as in the 
very young child. Freud applied the term “narcis- 
sism” to all phenomena in which the self is taken as 
the object of libidinal wishes. He called the psy- 
choses which constitute extreme forms of narcis- 
sism narcissistic neuroses, 

Further study has indicated that this nosology 
is not quite satisfactory. The entire concept of the 
actual neuroses has been challenged. While it is 
ttue that there are neurotic conditions which are 
associated with abnormal sexual practices, it has 
been difficult to establish that these conditions 
necessarily result from such practices or that the 
Symptoms are devoid of psychological content, In 
addition, transferences do take place even in the 


so-called narcissistic neuroses or psychoses. To be 
sure, such transferences may be indiscriminate, 
transient, and volatile, but they often do represent 
expressions of libidinal wishes toward figures from 
childhood foisted onto the therapist or other per- 
sons, Furthermore, other elements, such as con- 
flicts over aggression, the inadequacy of the ego's 
defenses, and a tendency for regressive primitiviza- 
tion of mental functioning, seem to be equally 
important in the development of psychosis. 

Another group of conditions was subsequently 
added to this nosology, namely, the traumatic 
neuroses, Here, neurosis occurs when the organ- 
ism has been exposed to a life-threatening psycho- 
logical stimulus of such intensity as to overwhelm 
the mind's ability to deal with it. As in the case of 
the actual neuroses, the essential etiological ele- 
ment is the magnitude of the stimulus, and, here 
too, the symptoms presumably are without sig- 
nificant psychological content. 

Although this, classification is widely used, it 
does justice neither to the great variety of clinical 
forms which the neuroses assume nor to the multi- 
plicity of factors involved in their etiology. Actually 
it is difficult to classify the neuroses because they 
are very complex, They are not distinct, circum- 
scribed entities like tuberculosis, diabetes, or other 
physical diseases. Any attempt to make a close 
analogy between neurosis and physical disease is 
bound to break down upon meticulous examina- 
tion. Neurosis is part of a developmental process. 
This is true even though many neuroses appear 
suddenly and acutely. The neurotic process con- 
sists of a wide range of responses to the failure of 
the mind to resolve inner conflicts. These responses 
may take the form of symptoms, inhibitions, char- 
acter traits, and repetitive patterns of behavior. All 
of these may justly be considered neurotic because 
they have a common etiology and derivation. 

In the 1930s Freud revised his theory of the 
nature and origin of neurosis in keeping with his 
appreciation of the central position of intrapsychic 
conflict in mental life. According to this newer 
theory, conflict is a regular, normal feature of 
mental life, The mind can be divided into three 
groups of functions according to the role each 
group plays in conflict, One group, the id, consists 
of instinctual drives—sex and aggression and their 
derivative manifestations. Another group, the su- 
perego, is composed of moral demands and self- 
punitive tendencies—the products of training and 
experience. Finally, there is the group of executant 
functions, the ego, which integrates the conflicting 
aims of the id and the superego with due regard to 
the external or “real” world. As the individual de- 
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velops and learns, his ability to resolve intrapsychic 
conflict grows, Mental health depends upon the 
capacity of the ego to effect an acceptable solution 
of the conflicting claims made upon it. 


The genesis of neurosis 


Childhood conflicts. In the genesis of neurosis, 
the vicissitudes of the psychological conflicts of 
childhood are of critical importance, The most sig- 
nificant conflicts involve the wishes of the Oedipal 
phase of development (ages 3 to 6). These wishes, 
uniformly encountered in almost all children in 
civilized societies, are concretely experienced in the 
form of fantasies or fantasy-like thinking, These 
wishes are simple enough: to possess what one de- 
sires and to eliminate what stands in the way of 
gratification. To the immature mind of the child, 
its wishes in themselves may appear threatening. 
In addition, the child fears punishment or retalia- 
tion for these thoughts. Anxiety, an innate re- 
sponse to an unpleasant state, appears as a warn- 
ing affect, signaling the threatening emergence 
into consciousness of some unacceptable wish, to- 
gether with the danger of retaliation associated 
with it. If the ego is unable to master anxiety by 
repressing the emergent drive, an anxiety state of 
traumatic proportions may follow. To prevent this 
and to fend off anxiety, other maneuvers have to 
be instituted. It is at this point in the conflict that 
the neurotic process of inhibition or symptom 
formation begins. 

The particular form which a neurosis assumes 
in a child depends upon the specific measures used 
to ward off anxiety. All children have severe con- 
flicts, and most children develop some kind of 
childhood neurosis. Usually a childhood neurosis 
assumes the form of general apprehensiveness, 
nightmares, phobias, tics, mannerisms, or ritual- 
istic practices, Most of the primary behavior dis- 
orders of children represent disguised forms of 
neurosis from which the element of manifest fear 
has disappeared. Phobia is probably the most fre- 
quent symptom of childhood neurosis, Fears of 
darkness, burglars, intruders, ghosts, and animals 
are so common they are often regarded as part of 
normal development. Tics and ritualistic practices 
are less common and in general represent some- 
what more severe manifestations of childhood 
neurosis. 

The factors which serve to intensify the conflicts 
of childhood and to diminish the ego’s capacity to 
resolve them are many. Some of these factors are 
constitutional or innate, for example, the inherent 
strength of the drives and, to a considerable extent, 
the predisposition to anxiety. The contributing 
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etiological factors which to date have been investi- 
gated more thoroughly are the developmental and 
accidental ones, Most prominent among these are 
overstimulation, seduction, the effects of physical 
illness, and various traumata. How the child dis- 
charges and controls his drive impulses is deter- 
mined to a very large measure by the behavior of 
the parents and earliest teachers, Using gratifica- 
tion and frustration—reward and punishment— 
and serving as models for identification, parents 
and educators set the patterns for how conflicts 
may be mastered. 

With the passing of the Oedipal phase the in- 
tensity of childhood conflicts abates. The conflicts 
do not, however, disappear altogether, and, invari- 
ably, vestiges of the neurotic process may be dis- 
cerned in the mental life of children during the 
period of latency (ages 6 to 12). In some instances, 
elements of the childhood neurosis may persist 
through the period of latency into adult life. More 
often, evidence of the persistent effect of the neu- 
rotic process may be seen during latency in char- 
acter traits, work habits, and sublimations. Not all 
of these are necessarily pathological. [See PERSON- 
ALITY, article on PERSONALITY DEVELOPMENT, | 

Adolescence. The conflicts of the childhood 
period are evoked anew during adolescence as a 
result both of physiological changes and of the 
psychological implications of assuming the adult 
role, Variations of the fantasies which originally 
served as vehicles to express drive derivatives dur- 
ing childhood become the conscious concomitant 
of adolescent masturbation. The guilt over mas- 
turbation derives less from the physical activity 
and more from the unconscious wishes which find 
substitute expression in the masturbation fantasies. 
During the period of adolescence a second attempt 
is made to master conflicts arising from childhood 
wishes. With the successful resolution of these 
conflicts, the individual finds his adult identity in 
his sexual role, moral responsibility, and choice of 
social role and career, Thus, one can see that the 
transformation of the neurotic process and its fail- 
ures is an inherent part of the process of civilizing 
the individual in the broadest sense. [See ADOLES- 
CENCE. ] 

Neurosis in adults. Neurosis develops anew in 
adults when the balance between the pressure of 
the drives and the defensive forces of the ego is 
upset. This balance may be disturbed in many 
ways. There are three types of situations which 
account for neurotogenesis in adults: 

(1) An individual may be unable to cope with 
the additional burden which normal development 
places upon him. The physiological transformation 


of the drives, the responsibilities of marriage, and 
the exigent, competitive, and aggressive require- 
ments of maturity may overtax the capacities of 
the ego, reviving childhood anxieties. 

(2) Disappointment, defeat, loss of love, phys- 
ical illness, and other inevitable consequences of 
the human condition may lead an individual to 
turn from current reality to seek gratification in 
the world of fantasy. In the development of a neu- 
rosis the fantasies to which one turns are usually 
derivatives of the wishes of the Oedipal phase. As 
these wishes are regressively reactivated, the con- 
flicts and anxieties of childhood are revived and 
the process of symptom formation is reinstituted. 

(3) By a combination of circumstances an indi- 
vidual may find himself in adult life in a situation 
which corresponds in its essential features to some 
childhood trauma or some conflict-laden fantasy. 
Current reality is then misperceived in terms of the 
childhood conflict, and the individual responds as 
he did in childhood by forming symptoms, 

The central position of childhood conflicts in neu- 
rosis is an empirical finding. It demonstrates that 
fundamentally neurosis is a developmental disturb- 
ance. Although the neuroses of adults, without fail, 
stem from the neurotic conflicts of childhood, the 
symptoms from which the adult suffers may be 
entirely different from those which were prominent 
during his childhood. It is also important to note 
that the same unconscious conflict may be resolved 
by the ego in many different ways. Some resolu- 
tions represent progressive development, an enrich- 
ment in fact of the adult personality, In the same 
individual, however, pathological transformation of 
the conflict may give rise to symptoms. 

In order to demonstrate the implications of what 
has just been said the following clinical material, 
highly condensed and severely simplified, is offered. 
A woman patient realized during her childhood that 
she enjoyed being an only child. Observing her 
playmates, she felt that the presence of a sibling 
would deprive her of her parents’ love and of food. 
She wanted no more children to be born to her 
parents, and in fantasy she imagined destroying 
potential rivals by devouring them inside her 
mother’s body. She felt guilty over these wishes 
and imagined that the unborn siblings would 
wreak vengeance upon her. She was told that the 
fetus, developing in the mother’s body, lived like 
a fish in a special fluid. She developed a fear of 
going into the ocean and was unable to eat fish of 
any kind. When she heard that there are worms 
which live inside people, she became frightened 
of all worms. This young, unconscious devourer 
suffered from nightmares in which animals threat- 


ened to devour her. During the latency phase she 
became interested in raising tropical fish and from 
this activity developed a deep knowledge of ichthy- 
ology. When young fish were born she took pains 
to remove them from the tank to prevent their be- 
ing devoured by the older fish. She became a 
teacher who was extremely protective of her pupils, 
demanding so little of them that one of her super- 
visors said in criticism that she tended to “feed” 
the pupils too much. For her part she was aware 
that she was afraid her students would disapprove 
of her. As an adult, she still suffered from a fear 
of worms. In addition, she developed claustro- 
phobia, Whenever she found herself in a confined 
space she became extremely anxious. Unconscious- 
ly, she feared that the occupants of the confined 
space would turn upon her and destroy her. She 
avoided marriage because she thought that she 
would die of some complication of pregnancy. 
She avoided any kind of food which could serve 
as a carrier for intestinal parasites. 

In this illustration one can observe the trans- 
formations of the childhood conflict over wishes to 
devour and fears of being devoured in retaliation. 
The fear of worms persisted from childhood into 
adult life. The childhood animal phobia and fear 
of the ocean were replaced in the adult neurosis by 
claustrophobia. The woman had various eating in- 
hibitions throughout life: in childhood an aversion 
to fish was dominant, and in adult life an inhibi- 
tion to eating meat was most prominent. The phe- 
nomena just cited represent pathological deriva- 
tives of the unconscious conflict, namely, neurotic 
inhibitions and symptoms. However, the woman’s 
becoming a teacher and her interest in ichthyology, 
derived from the same conflict, represent sub- 
liminations, that is, activities which enriched her 
life and contributed to adaptation. 

It is evident from what has been said that in 
every individual some remnant of the anxieties of 
the childhood period will find representation in the 
mental life of the adult. To a certain extent, there- 
fore, neurotic trends may be found in every person 
even in the absence of clinically discernible neu- 
rosis. Reliable statistics are very hard to come by 
because the criteria employed for defining the 
presence of neurosis vary from one observer to 
another, It has been estimated by different authori- 
ties that between 30 and 60 per cent of the com- 
plaints which bring patients to the general prac- 
titioner are functional, that is, neurotic in nature. 
Neurosis as a clinical entity becomes significant 
only when it reaches the point of severity, where it 
interferes with the individual's normal functioning 
and adaptation. 


NEUROSIS 165 


Most symptom neuroses are mixed in nature. 
They combine into the clinical picture reactions 
which are typical for several neuroses. A patient 
does not necessarily have either hysteria or com- 
pulsion neurosis. Usually, his illness shows fea- 
tures of both. Furthermore, there are conditions 
appropriately termed neurosis which are without 
symptoms. In such persons their “sickness is their 
soul,” Their history, their very fate, constitutes a 
record of failure to contain neurotic conflict within 
the bounds of symptom formation. Repetitive epi- 
sodes of failure, victimization, self-induced defeat, 
and bizarre personal liaisons replace symptoms. 
Very similar are the severely inhibited personali- 
ties, By denying themselves many of the usual 
sources of gratification and success, they appear to 
be free of anxiety or symptoms. 


Neurosis is a commentary on the ubiquity of 
intrapsychic conflict in civilized life. It is a distor- 
tion of the process of development and adaptation 
that may afflict an individual as he grows up in a 
community which demands renunciation of primi- 
tive antisocial tendencies. Freud felt that the preva- 
lence of neurosis reflects the deep-seated and 
universal opposition to the need to effect the re- 
nunciation which is essential for civilization. 


Jacos A, ARLOW 


[Directly related are the entries ANXIETY; MENTAL 
DISORDERS; MENTAL DISORDERS, TREATMENT OF; 
Psycuosis. Other relevant material may be found 
in CONFLICT, article on PSYCHOLOGICAL ASPECTS; 
DEVELOPMENTAL PSYCHOLOGY; INDIVIDUAL PSY- 
CHOLOGY; NERVOUS SYSTEM, article on STRUCTURE 
AND FUNCTION OF THE BRAIN; PSYCHOANALYSIS; and 
in the biographies of ALEXANDER; FREUD; HORNEY; 
KLEIN.] 
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The terms “neutralism” and “nonalignment” are 
used in the 1960s loosely and interchangeably to 
refer to the desire of a majority of Afro—Asian 
nations to avoid military alliances with either side 
in the cold war. They are associated with the two 
most important international developments of the 
post-World War 11 era: first, the emergence of in- 
dependent nations in Asia and Africa, ending three 
centuries of western European expansion; and sec- 
ond, the emergence of a bipolar conflict between 
the United States and the Soviet Union, each in 
possession of weapons capable of world-wide 
destruction. 

The terms lack the precision of the older notion 
of “neutrality,” which referred to a legal condition 
in which a country refrained from taking sides in a 
war between two or more belligerents. Laws of 
neutrality, which were formed chiefly by treaties 
of the seventeenth and eighteenth centuries but- 
tressed by judicial rulings in the nineteenth cen- 
tury, delineated specific rights and responsibilities 
for both belligerents and neutrals. Neutral States, 
for example, agreed not to aid any of the belliger- 
ents or permit their citizens to do so. Citizens of 
neutral states who aided a belligerent would be 
denied protection by their own government and 
might be punishable by the laws of their own coun- 
try. In return, belligerents were expected to respect 
the commercial activities of neutrals. Attacks by 
belligerents upon neutral shipping often signaled 
the end of neutrality. In the absence, however, of 
an international authority with the power to en- 
force neutrality, the observation of neutral “rights” 
by belligerents depended upon their interests and 
power. The rights of neutrals were largely disre- 
garded during the two world wars, and after World 
War 1 almost no one—barring Switzerland and 
Sweden—viewed neutrality in the legal sense as a 
viable policy for maintaining independence. 

The term “neutrality” signified a policy pursued 
during a State of belligerency. In a specific conflict 
even a major power might choose to be neutral. 
Smaller and weaker Powers sometimes declared 
themselves to be in a state of neutralization, that is, 
they chose to be in a state of perpetual neutrality 
in all wars. Switzerland is the most frequently cited 


example of neutralization, and the Austrian Peace 
Treaty of 1955 provided for the perpetual neutral- 
ization of Austria. Neutralization should be differ- 
entiated from demilitarization, which refers to the 
absence of or limitations on a nation’s capacity to 
engage in war, while neutralization bars participa- 
tion in a state of war. A nation may be demilitar- 
ized but not neutralized (Japan in the 1950s) or 
neutralized but not demilitarized (Austria after 
1955). 

None of the new nations of Asia and Africa, 
with the exception of Laos, declared themselves de- 
militarized or neutralized. Indeed, the leaders of 
the new nations rejected the negative association of 
the term “neutral” and its implication of indiffer- 
ence, isolationism, and a position in world affairs 
which denied the full exercise of their sovereign 
powers. Prime Minister Jawaharlal Nehru of India, 
one of the first spokesmen for the emerging na- 
tions, preferred to speak of “nonalignment” rather 
than neutralism, and President Gamal Abdel Nasser 
of the United Arab Republic used the phrase “posi- 
tive neutrality.” At a meeting in Belgrade in Sep- 
tember 1961 sponsored by Nehru, Nasser, President 
Kwame Nkrumah of Ghana, and President Tito of 
Yugoslavia, the term “nonalignment,” rather than 
“neutralism,” was generally used, and the confer- 
ence was called the Conference of Unaligned States. 
A proposal by Prime Minister Nehru that the tradi- 
tional neutrals, Ireland and Sweden, be invited, 
was rejected by the other sponsors, who thereby 
made clear the distinction in their minds between 
neutralism or nonalignment as a state of noncom- 
mitment in the cold war and neutrality as a legal 
concept. 

Though a desire for greater independence from 
the two great powers has been expressed through- 
out Europe, it is among the newly independent 
nations that neutralism and nonalignment as a 
policy is generally found. The terms are ambiguous, 
So that to label a country “nonaligned” says little 
about either its motivations or the particular man- 
ner in which its neutralism is implemented. Coun- 
tries which retain close links to the French Com- 
munity or the British Commonwealth, as well as 
countries closely associated with the Soviet Union 
or the People’s Republic of China, may label them- 
selves “neutralist” or “nonaligned.” The terms also 
embrace those which are moved by fear of either 
Western or communist expansion, as well as those 
employing it as a shield for their own expansionist. 
policies. To speak of neutralism, therefore, is to 
speak more generally of the foreign policies of the 
newly emergent nations. 

The term “nonalignment” was first popularized 
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by India’s Prime Minister Nehru. Nehru made it 
clear that while a newly independent India would 
be active in the United Nations, retain her mem- 
bership in the Commonwealth of Nations (as a 
republic), and seek to play a “positive” role in in- 
ternational affairs, she would not become “entan- 
gled in any alliances, military or other that might 
drag us into any possible conflict” (Nehru 1950). 
Nehru viewed the struggle between the United 
States and the Soviet Union as rivalry for world 
power, not as an expression of ideological differ- 
ences. It is useful to recall that American and west- 
ern European concern over communist expansion 
took place at a time when Indian and other Asian 
and African nationalists were struggling against 
colonialism or wrestling with the immediate prob- 
lems of independence. Thus the first Asian Rela- 
tions Conference, held in New Delhi in March 
1947, on the eve of Indian independence, focused 
on colonialism and mutual cooperation within Asia. 

But Nehru was very much concerned with the 
possibility of a war between the great powers which 
would inevitably engulf the entire world. He be- 
lieved that India could serve as a channel for com- 
munications among the great powers—which she 
did in the Korean War—and could play a moder- 
ating, if not mediating, role. Within a few years 
after India became independent, the terms “non- 
alignment” and “neutralism” became more widely 
adopted: by countries in Asia and Africa which 
had newly achieved independence, by Egypt after 
a revolution overthrew the traditional monarchy, 
and by Yugoslavia when it declared its independ- 
ence from the Soviet Union. The terms were soon 
used interchangeably, not as analytical concepts 
but as popular terms which actually covered a wide 
range of policies, Still, the terminology is itself of 
political importance, for a line was thereby drawn 
in international conferences between countries 
which subscribed to it and countries which did not. 

Neutralism and nonalignment—for the moment 
we shall put aside the wide range of policies which 
these terms encompass—can be viewed as (1) a 
Strategy for maximizing one’s security in a bipolar 
world; (2) a foreign policy expression of domestic 
political, cultural, and psychological needs; and 
(3) a policy of newly independent countries for 
securing their regional interests. 

Neutralism as strategy. Neutralism has been 
widely explained (and justified) as a strategy which 
can best serve the security interests of the new 
nations in the cold war. The nations which achieved 
independence in Asia and Africa after World 
War 1 have for the most part been militarily, eco- 
nomically, and politically weak. The armies left by 
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colonial rulers were generally small, underequipped, 
and more concerned with internal than external 
security. Almost without exception, the new nations 
were also economically underdeveloped and in need 
of foreign capital and technical assistance to speed 
their economic growth. Moreover, internal political 
differences have often been so great that many new 
nations have tottered between internal wars and 
authoritarian oligarchies. The internal weaknesses 
of the new states provide a standing invitation for 
stronger powers to intervene. 

Many new nations, in an effort to deter foreign 
intervention or to increase their independence from 
the dominant powers, have turned to neutralist 
policies. Nonalignment can be viewed in certain 
circumstances as a strategy of deterrence without 
commitment to a military alliance. An assertion of 
nonalignment may be directed at one bloc with the 
implied warning that active intervention will result 
in closer relations with the other bloc. Nasser's 
positive neutrality for the United Arab Republic, 
for example, was directed at reducing Western in- 
fluence in the Middle East by delicately flirting 
with, but not inviting, Soviet intervention. On the 
other hand, neutralism for Yugoslavia—the only 
self-declared neutralist country in Europe—was 
directed at deterring Soviet interference and dicta- 
tion by seeking American assistance without a mili- 
tary alignment with the West, which might pre- 
cipitate Soviet intervention. This strategy of using 
one or more powers to deter others was successfully 
employed by the United Arab Republic during the 
Suez crisis of 1956, when the U.A.R. turned to both 
the United States and the Soviet Union against 
Britain, France, and Israel. The strategy proved 
less successful in 1962, when India found that the 
Soviet Union was unable to restrain China, as the 
United States had successfully restrained the Brit- 
ish and French in the Suez crisis. 

Other new nations have attempted to deter ex- 
ternal intervention through a more defensive neu- 
tralist posture. While some countries on the periph- 
ery of Soviet or Chinese communist power, such as 
Iran, Turkey, and Thailand (none newly independ- 
ent, incidentally), have sought security through 
military alliances with the West, others have feared 
“provoking” their neighbors. Some new nations 
have thus assumed that independence could best 
be ensured by a posture of independence, if not 
active hostility, toward the West, combined with 
support for Chinese or Soviet policies, a position 
which has sometimes come perilously close to an 
alignment with one or the other of the two major 
communist powers, In the 1950s, for example, sev- 
eral countries of southeast Asia abjured alliances 
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with the West and adopted a neutralist position as 
a means of demonstrating to the People’s Republic 
of China that it was not being encircled by hostile 
pro-Western powers. With the demonstrable growth 
in Chinese military power and expansionist tend- 
encies in the early 1960s, there was a proliferation 
‘of pro-Chinese neutralism throughout southeast 
Asia. The leaders of several smaller countries took 
the position that through a policy of friendly rela- 
tions and even acquiescence to Communist China 
they could avoid internal subversion or external 
attack and a situation in which they might become 
a battlefield for the great powers. Rather than turn 
to the West, it appeared likely that a number of 
smaller states might seek to maintain a measure 
of “independence” (ensuring at least that they were 
not militarily occupied ), even if it meant a degree 
of dependency on Communist China. 

Since virtually all of the new nations are in need 
of financial and technical assistance from abroad, 
nonalignment has also been viewed as a strategy 
for maximizing the flow of foreign aid while mini- 
mizing the restrictions or “strings” attached. Though 
obtaining aid on the most favorable terms has gen- 
erally not been the chief objective of a nonalign- 
ment policy, this has been an important by-product. 
Both the Soviet Union and the United States—and, 
increasingly, countries throughout Europe—have 
used economic aid programs as instruments of 
foreign policy. There has been competition not only 
over the amount of aid but also, more importantly, 
over the sectors to which aid is given. Soviet aid to 
the Indonesian air force, for example, was balanced 
by American aid to the Indonesian army. In certain 
circumstances military aid has often been viewed 
by donors as a more effective device for exerting 
“influence” than other types of aid. 

Nonalignment has often made it possible for the 
recipient country to turn elsewhere if one donor 
attempts to attach “strings” to its aid, Thus, when 
the United States declared it would not assist in 
the construction of the Aswan Dam in Egypt, pre- 
sumably because Egypt had accepted communist 
military equipment, Egypt turned to the Soviet 
Union and received economic aid. Similarly, when 
American Congressmen expressed reluctance to aid 
a proposed public sector steel plant in India, the 
Indian government turned to the Soviet Union for 
assistance. In both instances, however, the United 
States continued to maintain substantial aid pro- 
grams, 
ont ae ae aligned nations have profited 
abe sai oi nonalignment adopted by neigh- 

es since the United States found it 
necessary to give more aid to its allies, thus pro- 


viding economic rewards as compensation for what- 
ever limitations military alliances impose. More- 
over, the volume of aid and its spread to almost 
every newly independent country is in some meas- 
ure a consequence of the concern of both the 
United States and the Soviet Union that in the 
absence of aid the other power might establish a 
new beachhead in a neutralist country. This con- 
sideration has been particularly important in the 
American aid program, [See FOREIGN A1D.] 

Finally, there are other foreign policy advantages 
for those new nations which have adopted non- 
alignment policies. Nonalignment has served to 
enhance the role played by militarily weak powers 
in disputes among the great powers. In both Korea 
and the Congo the nonaligned have exercised influ- 
ence which committed powers could not exert. For 
some 15 years, until India suffered serious reverses 
at the hands of the Chinese in an attack on India’s 
northern borders, India’s influence at interna- 
tional meetings was far greater than her economic 
strength or military capabilities warranted. Non- 
aligned India also argued as an advantage of non- 
alignment that it permits countries to pursue de- 
velopment programs without a heavy commitment 
of funds to arms. In practice, however, some non- 
aligned countries have availed themselves of for- 
eign economic aid while diverting their own internal 
resources for the purchase of foreign arms and the 
expansion of the army. Moreover, it has not been 
uncommon for the nonaligned to turn to both sides 
for military assistance, as India subsequently did 
at the time of the Chinese attack. 

Neutralism and domestic politics. Neutralism 
can also be viewed as a foreign policy instrument 
for satisfying domestic political needs. Almost all 
of the newly independent nations have been faced 
with the problems of (1) establishing a sense of 
national identity and integration in culturally plu- 
ralistic societies, (2) establishing strong central 
authority in societies in which regional and tribal 
authorities have been great, (3) establishing a 
sense of legitimacy over the forms and purposes of 
newly created political institutions, and (4) satisfy- 
ing elite and sometimes popular sentiment for 
demonstrating cultural as well as economic and 
Political independence from the influences of for- 
mer colonial rulers, 

From a domestic viewpoint, neutralism can be 
viewed as a vehicle for achieving these nation- 
building objectives, especially by minimizing inter- 
nal schisms over foreign policy issues and by 
strengthening national pride and loyalty as against 
parochial and primordial attachments, Neutralism 
may, for example, be a useful central position for 
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nationalist leaders against domestic opponents who 
seek closer ties with the former colonial ruler and 
against other domestic critics who seek a more rev- 
olutionary anticolonial, procommunist position. 

Those interpretations of neutralism stress the 
calculated and rational component of the foreign 
policies of new nations. An alternative, but not 
necessarily conflicting, interpretation lays greater 
emphasis on the psychological needs and cultural 
traditions of political elites in the new nations. 
This interpretation stresses neutralism as not sim- 
ply a strategy for the achievement of a country’s 
national interest but an end in itself. The leaders 
of new nations often emphasize the necessity of 
maintaining a position of nonalignment in spite of 
changing circumstances. Thus Prime Minister 
Nehru of India stressed the need for maintaining 
India’s policy of nonalignment after the Chinese 
attack in 1962, not simply as an appropriate strat- 
egy for securing Soviet and American support but 
as a cardinal principle of policy. In this context 
neutralism has come to mean an independent pol- 
icy, while alignment suggests a subordinate posi- 
tion. Neutralism is thus the foreign policy expres- 
sion of domestic nationalism. It is a policy which 
facilitates closer ties with other Afro—Asian states. 
It strengthens respect for and attachment to one’s 
own cultural traditions and one’s homeland. It pro- 
vides a way of giving vent to anticolonial, anti- 
Western sentiments without denying oneself the 
benefits of continued Western assistance. It affords 
a vehicle for satisfying the aspirations of new, 
weak states to exert their influence as a “third 
force” in world affairs. Finally, neutralism may be 
less a calculated move to strengthen independent 
nations than an effort to enhance the domestic 
authority and international prestige of new elites. 

Neutralism and regional interests. Each of the 
Major neutralist powers belongs to a subordinate 
international system. Ghana and Nigeria play im- 
portant roles in sub-Sahara Africa, especially in 
west Africa; Egypt in the Middle East and north 
Africa; India in south Asia; and Indonesia in south- 
east Asia. Whether the variations in nonalignment 
policies pursued by each of these nations is a con- 
sequence of the personal outlooks and ambitions 
of the political leaders who guide these countries, 
or whether they are the result of the peculiar 
regional contexts in which each country operates, 
we shall not know until there are changes in leader- 
ship. In any event, it would be difficult to evaluate 
the meaning of neutralism without seeing each 
practitioner in his regional environment. 

While some neutralists speak of “ending” the 
cold war, Egypt's “positive neutralism” has had as 
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its primary concern the creation of a Pan-Arab 
movement and the establishment of Arab socialism. 
Concretely, this has meant the unification of Arab 
states under the hegemony of modernist Egypt. 
The movement for the unification of Arab states 
precedes, of course, Egypt’s neutralism, It was 
strengthened in March 1945 by the Pact of the 
League of Arab States, which included Egypt, Iraq, 
Lebanon, Saudi Arabia, Syria, Jordan, and Yemen. 
In 1953 Libya joined the league, Sudan in 1956, 
Tunisia and Morocco in 1958, and Kuwait in 1961. 
President Nasser viewed the overthrow of tradi- 
tional monarchies and the expulsion of Western 
influence from the Middle East as essential steps 
toward the achievement of Arab political unifica- 
tion. For Nasser, neutralism was not a policy of 
opposing both sides in the cold war but, rather, of 
using each of the two great powers in an effort to 
further Egyptian goals. Thus, in the early 1960s 
Egypt supported the Soviet position that the West 
should withdraw its foreign bases, but at the same 
time Egypt continued to permit the transit of oil to 
the West. While Egypt aimed to minimize Western 
power in the Middle East and pressed for the 
breakup of the Baghdad Pact, which linked Iran, 
Iraq, Turkey, and Pakistan to Great Britain, Egypt 
also combated Soviet influence in Syria and later 
in Iraq, which did withdraw from the Baghdad 
Pact. Political leaders in Egypt have described pos- 
itive neutrality as the external manifestation of 
their revolutionary policies and as a justification 
for “liberation” movements aimed at the overthrow 
of pro-Western traditional monarchies which stood 
in the way of Arab unification. 

A similar link between neutralism and regional 
aspirations could be found in the policies of Ghana. 
As the first newly independent country in Africa, 
in April 1957 Ghana immediately pressed for the 
establishment of a Pan-African movement. Under 
Ghana's leadership the first conference of inde- 
pendent African states was organized in Accra in 
1958 to press for anticolonial, pro-Pan-African pol- 
icies. Shortly thereafter Ghana formed a federation 
with Mali and Guinea as a step toward African 
unification. The federation secured the support of 
the Patrice Lumumba group in the Congo, but with 
the death of Lumumba the Pan-African movement 
Jost some of its force. Moreover, Ghana failed to 
win the support of Nigeria and the bulk of the 
newly independent former colonies of France. In 
January 1961, Ghana and Egypt, joined by Guinea, 
Mali, Morocco, Syria (in the then United Arab 
Republic), and, later, Algeria, formed a bloc of 
neutralist powers at Casablanca whose neutralism 
stressed Pan-African and Pan-Arab nationalism, 
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advocated eliminating African and Arab ties with 
the West, and favored an Arab-African form of 
socialism. The Casablanca group supported “na- 
tional liberation” movements against those govern- 
ments which opposed the creation of Arab and 
African unions. [See PAN MOVEMENTS.] 

Later in the year Nigeria and 19 other African 
states formed the Monrovia group to reassert the 
doctrine of political independence for each African 
state and to stress a more moderate Pan-African 
ideal built on a program of integrated transport 
and communication facilities, educational and cul- 
tural exchanges, and economic coordination. The 
following year the Monrovia group constituted 
themselves into the Inter-African and Malagasy 
States Organization, with 20 members. Though 
committed to nonalignment, or “independence,” in 
cold war policy, the Monrovia group contained the 
Brazzaville bloc of 12 ex-French colonies which 
maintained extensive economic, political, military, 
and cultural links to France. The “neutralism” of 
the Monrovia and Casablanca blocs thus involved 
quite different sentiments toward the West, toward 
Pan-Africanism and toward African socialism. 
While Pan-Africanism, African socialism, and neu- 
tralism became the ideology of the militant wing 
of African nationalism, interstate African cooper- 
ation, mixed economies, and Eurafricanism had 
become the viewpoint of the moderate Monrovia 
powers, 

Nonetheless, despite these deep differences, the 
ambiguity of the term “nonalignment” provided a 
meeting point for the conference of African states 
in Addis Ababa, Ethiopia, in May 1963. For the 
first time, virtually all the newly independent Afri- 
can states attended (30 out of 33) to find a basis 
for African unity. A charter for the Organization 
of African Unity (OAU) was signed, and later rati- 
fied by all, which affirmed support for nonalign- 
ment. Conference resolutions called for the estab- 
lishment of Africa as a denuclearized zone, the 
removal of military bases from Africa, and the dis- 
entanglement of African countries from foreign 
military pacts. But more significantly, the OAU 
charter declared in effect that African states would 
not interfere in each other’s internal affairs and 
that each member state should have an “inalienable 
tight to independent existence” a reaffirmation of 
the Monrovia Position. The two unanswered ques- 
tions at the close of the Addis Ababa conference 
were (1) whether the militant efforts of the Casa- 
blanca group to unite all of Africa, if necessary 
through national wars of liberation and internal 
subversion and assassination, had come to an end 
or been simply temporarily suspended and (2) 


whether the links of the ex-French colonies to 
Europe through the European Common Market 
signified a more relaxed relationship with the West 
which would positively affect the policies of other 
African states, or was a passing phase which would 
diminish with a rise in anti-European nationalism, 

The process of settling relations with one an- 
other as well as with the West was a greater prob- 
lem for south and southeast Asians in the 1960s 
than for the Africans because of their geographic 
proximity to an expanding communist power. This 
region was the first to achieve independence from 
colonial rule, and at the Colombo Conference in 
Ceylon in 1954 the earliest regional grouping of 
predominantly nonaligned countries was formed, 
Ceylon, India, Burma, and Indonesia, all non- 
aligned, and Pakistan, which was shortly to sign 
the Southeast Asia Collective Defense Treaty and 
the Baghdad Pact, attended. A year later, in April 
1955, the Colombo powers sponsored the famous 
Bandung Conference in Indonesia, which brought 
to one conference hall representatives of almost all 
of the then independent Asian and African nations. 

Like the Middle East, south and southeast Asia 
were soon torn between the aligned and the non- 
aligned. Cambodia and Laos joined the nonaligned, 
while Pakistan joined Thailand and the Philippines 
in the SEATO Pact with Australia, France, Great 
Britain, New Zealand, and the United States. The 
lines did not simply reflect varied assessments of 
Soviet and Chinese threats to the region but were 
also an expression of interstate conflicts among 
neighboring powers. India’s espousal of nonalign- 
ment no doubt encouraged Pakistan to join an alli- 
ance with the United States and Britain. Similarly, 
Cambodia’s pro-Chinese neutralism was engen- 
dered not only by fear of Communist China but 
also by traditional antipathies to Thailand and 
Vietnam, both of which had allied themselves with 
the United States, 

But it is important to emphasize the impact of 
expanding Chinese communist power on the par- 
ticular forms which nonalignment and alignment 
have taken throughout this region. Burma opted 
for a pro-Chinese neutrality and in January 1960 
signed a treaty with Communist China which pre- 
vented Burma from entering into “any military 
alliance directed against the other Contracting 
Party.” Thailand, faced with a “Free Thai” move- 
ment in China, chose to ally itself with the West. 
As we have seen, Pakistan signed treaties with the 
West after India assumed a nonaligned position. 
But when India and China clashed with one an- 
other in the border dispute of 1962, Pakistan moved 
closer to China even while maintaining her alli- 
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ances with the West. As tensions between China 
and India rose, several countries of southeast Asia 
emphasized their neutrality in this dispute. Some 
southeast Asian leaders, fearful that the Chinese 
communists did not really want their nations “neu- 
tral and independent,” privately hoped that a pro- 
Chinese form of nonalignment might serve to 
restrain the Chinese. 

The Chinese attack against India in October 
1962 marked an important turning point for the 
particular form of nonalignment followed by India, 
the largest of the nonaligned nations. Prime Min- 
ister Nehru had taken the position that military 
conflict between the two largest Asian powers was 
unthinkable even though genuine differences ex- 
isted concerning their borders. When the Chinese 
attack began and it became apparent that India’s 
army and air force were ill-prepared, India turned 
to the United States and Great Britain for military 
assistance. India, in her desire to maintain Soviet 
support as a counterweight to China on the Asian 
mainland, also asked the Soviet Union for assist- 
ance and took the position that her nonalignment 
was intact so long as she was willing to accept mili- 
tary assistance from both the United States and the 
Soviet Union. Thus, though no substantive Soviet 
aid was immediately forthcoming, the Indian gov- 
ernment noted that since it was willing to accept 
Soviet aid (and had actually contracted for Soviet 
military assistance in the future), the principle of 
nonalignment had been maintained. But clearly its 
form had undergone important changes. 

Neutralism in the 1960s. The Indian position 
well illustrates several general aspects of nonalign- 
ment policy as practiced in the early 1960s. Non- 
aligned powers were prepared to accept military 
assistance from one or both great powers so long 
as there was no military alliance. Thus Indonesia, 
Ghana, India, Egypt, and Yugoslavia had all ac- 
cepted American or Soviet military aid, and some- 
times both. Second, the quest for peace stressed by 
the nonaligned referred to peace among the cold 
war powers, that is, the absence of nuclear warfare 
or any direct military confrontation between the 
Soviet Union, the People’s Republic of China, the 
United States, and the NATO powers. It did not 
refer to the absence of conflict among Afro—Asian 
powers. In fact, neutralism was sometimes used as 
a justification for undermining neighboring coun- 
tries who were not sufficiently “independent” of 
Europe or the United States. Third, the disarma- 
ment advocated by the nonaligned referred only to 
disarmament among the great powers and not 
among the smaller nations or the newly independ- 
ent. This reaction reflects the unique significance 
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of nuclear weapons as a threat to the whole world. 
But, as we have noted, many nonaligned nations 
have turned to the West or to the Soviet Union for 
military support, sometimes to enhance their own 
capabilities to cope with other Afro—Asian nations. 
Fourth, the term “neutralism” covered such a wide 
range of policies and attitudes that the nonaligned 
did not in fact develop as a truly “third force” bloc 
in world politics, although there was a measure of 
unity in voting in the United Nations on some 
issues. Increasingly, “polycentrism” within the com- 
munist world and disagreements among the West- 
ern powers provided the new nations with a wider 
range of policy choices. If India could obtain Soviet 
and American assistance in her conflict with Com- 
munist China, couldn’t another newly independent 
nation win the support of the United States and the 
Soviet Union against a Western nation? 

If the terms “neutralism” and “nonalignment” 
seemed ambiguous in the 1950s, their meaning 
became, if possible, even more all-embracing in the 
1960s as each country subscribing to this policy 
worked out its own relationship to the great powers. 
Moreover, the U.S. policy of establishing new mili- 
tary alliances in the 1950s had all but disappeared 
by the early 1960s. American attitudes toward neu- 
tralism had changed markedly from the famous 
Dulles statement of 1956 calling neutralism an 
“immoral and short-sighted conception” to the 
Eisenhower statement of 1960 that “we do not 
desire that you [the newly independent nations in 
the UN] should belong to one camp or the other.” 
The earlier fears of many new nations that their 
former colonial rulers and the United States repre- 
sented threats to their independence, culturally and 
economically as well as militarily, seemed to be 
diminishing. Neither for the West nor for the 
newly independent countries was it easy to shift 
from a superior-subordinate relationship, which 
prevailed for decades and in some instances cen- 
turies, to a relationship of equality. For many new 
nations, neutralism was simply a mark of inde- 
pendence from the West, though it became a con- 
venient posture for countries like Yugoslavia, strug- 
gling for independence against the Soviet Union. 
This anti-Western sentiment found its expression 
in a “double standard” which made the neutral 
nations unwilling to judge the communist world 
with the same standards as those applied to the 
United States and its allies. Thus, the nonaligned 
nations meeting in Belgrade in 1961 refused to 
condemn the Soviet breaking of the moratorium on 
nuclear testing but instead passed a resolution 
condemning all nuclear tests, thereby equating the 
United States with the Soviet Union. 
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It was to be expected that some time would pass 
before a more relaxed relationship could be estab- 
lished between the developing areas and the West. 
Insofar as fear, hatred, and especially spite entered 
into this relationship, the danger existed that appro- 
priate policies to meet real, as opposed to imagined, 
threats to independence might not be formulated. 
As more and more nations declared themselves 
neutralist or nonaligned, and as each country in 
practice formulated policies based upon their in- 
ternational interests, domestic needs, psychological 
sentiments, regional relations, and the personal 
ambitions of political leaders, it became clear that 
the terms were analytically meaningless descrip- 
tions of widely differing policies. Finally, as poly- 
centrism increased and as relations between the 
United States and the Soviet Union became less 
threatening, it appeared possible that in time the 
terms themselves might disappear even from pop- 
ular usage. 

MYRON WEINER 


[See also FOREIGN Am; FOREIGN POLICY; INTERNA- 
TIONAL POLITICS. Other relevant material may be 
found in ALLIANCES; DISENGAGEMENT; INTERNA- 
TIONAL RELATIONS; MODERNIZATION. ] 
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Internationally known during his lifetime as one 
of America’s greatest scientists and one of the 
world’s leading astronomers, Simon Newcomb 
(1835-1909) has only recently been recognized as 
the first major American mathematical economist, 
His four books and numerous articles on economics 
represent only a small part of an extraordinarily 
rich and varied intellectual contribution, but, as 
he remarked in his autobiography, economics was 
only his avocation. 

Newcomb's early experiences directly influenced 
his approach to economic and social questions, 
Born in Nova Scotia in humble circumstances, he 
received little formal education, but after a brief 
period as a country schoolmaster he obtained his 
first degree at the Lawrence Scientific School of 
Harvard in 1858, while employed as a computer 
on the Nautical Almanac. From 1861 until his re- 
tirement in 1897, he was professor of mathematics 
in the U.S. Navy and was superintendent of the 
American Ephemeris and the Nautical Almanac 
from 1877. His prodigious lunar investigations 
earned him innumerable awards and distinctions, 
including 17 honorary degrees from leading uni- 
Versities in 10 different countries. He occasionally 
lectured in political economy at Harvard and ex- 
amined in economics at Johns Hopkins, where he 
was professor of mathematics and astronomy, and 
his bibliography includes many works on politics, 
education, theology, psychic research, and fictional 
topics, as well as his contributions to various scien- 
tific disciplines, 

Having overcome his early difficulties by sheer 
ability, will power, and unceasing toil, Newcomb 
took a ruggedly individualistic view of current eco- 
nomic and social problems, A vigorous critic of 
labor unions, he wrote an unsigned review of 
Richard T. Ely’s sympathetic study The Labor 
Movement in America (1886c), denouncing his 
Johns Hopkins colleague as unfit to occupy a uni- 
versity chair. Newcomb’s first book on economics, 
A Critical Examination of Our Financial Policy 
During the Southern Rebellion (1865), attacked 
inconvertible paper money as immoral and inex- 
pedient. This book, like his popular works The 
ABC of Finance (1878) and A Plain Man’s Talk on 
the Labor Question (1886a), was designed to ex- 
Pose the fallacies prevalent among “practical” busi- 
nessmen and the general public and to show that 


elementary economic principles were directly ap- 
plicable to current affairs. But although his lucid 
prose made him an effective expositor, Newcomb’s 
scientific training blinded him to the complexities 
of social processes. 

During the controversy of the mid-1880s, which 
marks the beginning of modern American eco- 
nomics, Newcomb was prominent among those 
who defended the “orthodox,” or “old,” school of 
Ricardian political economy against the attacks of 
the “new,” German-inspired school of historical 
economists. His influence as president of the Po- 
litical Economy Club, a small but important group 
of conservative economists and publicists, was en- 
hanced by his scientific reputation, which lent 
additional authority to his claim that no essential 
methodological difference existed between the “old” 
and the “new” viewpoints. But although the “old” 
school emerged victorious in the 1890s, Newcomb 
became increasingly dissatisfied with the trends of 
academic economics, protesting that the profes- 
sional economists were retreating before the rising 
tide of popular nostrums. 

Newcomb’s enthusiasm for economics began in 
1854, when he rejoiced at finding “human affairs 
treated by scientific method” in J. B. Say’s Political 
Economy. But although he was primarily interested 
in the practical applications of economic principles 
to public policy, twentieth-century economists es- 
teem him as a mathematical theorist. Before he 
published his favorable review of Jevons’ Theory of 
Political Economy in 1872, Newcomb had discussed 
the theory of competitive price with the pragmatist 
philosopher C. S. Peirce, and unpublished manu- 
scripts reveal his familiarity with the concept of 
demand elasticity and the principle of substitution. 
Newcomb may, however, have learned both of these 
ideas from reading Cournot, whereas his Prin- 
ciples of Political Economy (1886b) was strikingly 
original. In Book 4 of the Principles, entitled “The 
Societary Circulation,” he elaborated the crucial 
distinction between “fund” and “flow” concepts, 
tracing the interactions between currency and 
commodity flows. A few pages later he presented 
an “equation of societary circulation,” VR = KP 
(where V = number of dollars; R = average velocity 
of circulation; K = number of transactions; and 
P=scale of prices), which formed the basis of 
Irving Fisher’s well-known “equation of exchange,” 
MV = PT. Newcomb also anticipated Fisher’s pro- 
posal for a “compensated dollar,” the gold content 
of which could be varied to compensate for price 
changes; and his discussion of the interest rate, his 
distinction between prospective and actual demand, 
and his analysis of income and hoarding all repre- 
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sent significant steps toward modern macroeco- 
nomic and monetary analysis. 

Unfortunately, Newcomb’s Principles was too far 
ahead of its time to exert much influence upon the 
economists of his day. Yet despite his disappoint- 
ment, Newcomb’s confidence was unshaken, and 
in 1904 he unsuccessfully endeavored to obtain 
support for an ambitious systematic investigation 
of the “practical” operation of economic causes that 
was somewhat akin to a present-day econometric 
study. Also in that year Newcomb presided at the 
Congress of Arts and Science, a great international 
gathering of scholars at the Louisiana Purchase 
Exposition of 1904 at St. Louis. He remained 
active until his death in 1909. 


A. W. Coats 


[For the historical context of Newcomb’s work, see the 
biographies of Cournot; ELY; for discussion of the 
subsequent development of his ideas, see MONEY, 
article on QUANTITY THEORY; and the biography of 
FISHER, IRVING.] 
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Alfredo Niceforo (1876-1960), Italian criminol- 
ogist, statistician, and sociologist, was born in 
Castiglione di Sicilia. He held academic posts, first 
as a professor of criminology and later as a pro- 
fessor of statistics, at the University of Lausanne, 
the Université Nouvelle in Brussels, and the uni- 
versities of Naples, Turin, Messina, and Rome. 

Empirical research and statistical work. Nice- 
foro began his career as a criminologist of the 
Italian school, closely following Lombroso and 
Enrico Ferri. However, he broadened the theory of 
criminology, asserting that for a comprehensive 
understanding of crime and its dynamics it is not 
enough to study the criminal; an understanding of 
the normal man and his social environment is also 
necessary. As a positivist, Niceforo sought to study 
the individual in all his biological and psychological 
complexity, rather than simply examining a par- 
ticular phenomenon—for example, crime—as it 
impinges on the individual. His early interests in 
criminology prepared the way for his work as one 
of the first Italian social scientists to conduct ex- 
tensive empirical social research (see La “mala 
vita” a Roma, 1898a, describing his collaborative 
work with Scipio Sighele), as well as for his later 
interest in the related fields of biology, anthro- 
pology, demography, social psychology, and soci- 
ology. Indeed, his theory of criminology might be 
called an anthroposociological one, and in 1905 
he published his first major work in social anthro- 
pology, Les classes pauvres. 

In method, not only did Niceforo adhere to the 
positivism of the Italian criminologists but he also 
believed that statistics are necessary to any analy- 
sis of social phenomena. Prominent among those 
from whom he derived his conception of statistics 
were Quetelet and Messedaglia (see 1919, Intro- 
duction), Niceforo held statistics to be an essential 
tool for the discovery of lawful regularities among 
Social phenomena: statistics can help to reveal 
regularities by quantifying, measuring, and classi- 
fying; such regularities form the necessary basis 
for the development of the social sciences. 

s His approach to social phenomena through sta- 
tistics was presented first in La misura della vita 
(1919) and later in an enlarged version of that 


work, Il metodo statistico (1923). In his search for 
regularities, Niceforo applied statistics even to such 
fields as linguistics, art, and literature, He also 
sought to find indices of progress and civilization 
(1921) by first breaking down collective life into 
such components as material, intellectual, and 
moral life, as well as political and social organiza- 
tion, and then presenting what he called a symp- 
tomatologie sociale for each of them. His discus- 
sion of the relationship between “social symptoms” 
and the concepts they are presumed to measure 
showed an early awareness that this relationship 
is only one of probability. 

Concept of egos. Niceforo developed the idea 
that there is a stratification of egos in human psy- 
chology, distinguishing a deep ego from a super- 
ficial one, (This notion first appeared in 1902, in 
his La transformación del delito, and was elabo- 
rated in his 1939 article “La structure du ‘moi’ 
d'après l'école italienne de criminologie.”) The 
deep, unconscious ego reflects primitive, atavistic 
instincts (here Lombroso’s influence is apparent) 
and is the habitat of antisocial, egoistic, and in- 
fantile tendencies. The superior ego, on the other 
hand, reflects what man has learned from society 
in terms of values, norms, rules, thoughts, feelings, 
and attitudes. Partly anticipating some of the de- 
velopments of psychoanalysis, Niceforo analyzed 
the conditions under which and the ways in which 
the deep ego—the egoistical antisociality which is 
in every man and which he called latent delin- 
quency—eludes the curbing action of the superior 
ego and manifests itself. He discussed the masks 
under which it comes to the surface and is able to 
make itself accepted by the individual and by so- 
ciety, and he indicated both the mechanism of 
self-justification, which permits the individual's 
conduct to be dominated by the deep ego, and the 
mechanism of self-consolation, by which the indi- 
vidual covers his disappointment when an attempt 
of the deep ego to come to the surface, however 
masked or transformed, fails or is stifled. (See also 
L’ “io” profondo e le sue maschere, 1949.) 

Residues. Niceforo’s sociology, which was some- 
what similar to that of Comte and Spencer, was 
based on the belief that underneath the apparent 
variability of social fact there exist what he called 
residues. His use of this term is independent of and 
differs in content from Pareto’s [see PARETO, article 
On CONTRIBUTIONS TO SOCIOLOGY]. As early as 1911 
Niceforo used the term to designate the constant 
elements of social structure and activity, primary 
among which is the existence of diversity among 
men. Each man is different from every other, and 
these differences—physical, psychological, etc.— 
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are amenable to precise measurement. Later ( 1953) 
he referred to these residues as general, universal 
categories which are not easy to discover but which 
allow the scientist to examine social life in depth. 
To discover the residues is to discover the funda- 
mental laws which govern the structure and proc- 
esses of human society. 

Toward the end of his career (see 1957) Nice- 
foro listed all the residues he had found in the 
course of his studies. The list contains two residues 
that transcend diversity; the hierarchy of indi- 
viduals, groups, and societies; and the “principle 
of agglutination” of the group, exemplified in the 
totem and in the organization of the free commune, 
the religious system, and the national ideology. 


It was Niceforo’s continuous attempt to approach 
all phenomena of social life with the instrument of 
statistical method that led him to try to understand 
social phenomena not only in their interrelations 
but also in their relations to biological and geo- 
graphical phenomena. As in his early work on 
crime, he maintained that any social fact must be 
considered the result of a combination of physical, 
biological, and social forces. This aspect of his 
work made him a forerunner of those who consider 
sociology an interdisciplinary science. 


FRANCESCO P. CERASE 


[For the historical context of Niceforo’s work, see CRM- 
INOLOGY; PENOLOGY; SOCIOLOGY, article on THE 
EARLY HISTORY OF SOCIAL RESEARCH; and the biog- 
raphies of BECCARIA; LOMBROSO; QUETELET.] 
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NONPARAMETRIC STATISTICS 


The articles under this heading are to be re- 
garded not as a handbook of nonparametric proce- 
dures but as an introduction that stresses principles 
through discussion of some important examples. 
The first article deals mainly with nonparametric 
inferences about measures of location for one and 
two populations. The article on ranking methods 
presents further examples of nonparametric meth- 
ods, all involving ranking of observations. Other 
related topics are treated in the articles on order 
statistics and on runs. 


1. THE FIELD I. Richard Savage 
II. ORDER STATISTICS Bernard G. Greenberg 
m, RUNS Peter G. Moore 


Iv. RANKING METHODS Herbert A, David 


I 
THE FIELD 


Nonparametric, or distribution-free, statistical 
methods are based on explicitly weaker assump- 
tions than such classical parametric procedures as 
Student's t-test, analysis of variance, and standard 
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techniques connected with the correlation coeffi- 
cient. Examples of nonparametric methods are the 
sign test, the Wilcoxon tests, and certain confidence 
interval methods for population quantiles (median, 
quartile, etc. ). 

The basic distinction is that parametric proce- 
dures make demanding, relatively narrow assump- 
tions about the probability distributions from which 
empirical observations arise; the most common 
such distributional assumption is that of normality. 
Nonparametric methods, in contrast, do not make 
specific distributional assumptions of this kind. 

The dividing line is, of course, not a sharp one; 
some parametric methods are so robust against 
errors in distributional assumptions as to be almost 
nonparametric in practice, while most nonpara- 
metric methods are only distribution free for some 
of their characteristics. For example, most non- 
parametric hypothesis tests are distribution free 
under the null hypothesis but distribution depend- 
ent with regard to power [see Errors, article on 
EFFECTS OF ERRORS IN STATISTICAL ASSUMPTIONS]. 

The underlying motivation for the use of non- 
parametric methods is the reluctance to make the 
traditional parametric assumptions, in particular 
(but not only) the assumption of normality, The 
relaxation of assumptions is paid for by decreased 
sharpness of inference when the narrower para- 
metric assumptions are in fact true and by less 
flexibility in mode of inference. An advantage of 
some, but by no means all, nonparametric methods 
is that they are easy and quick to apply; some 
authors call them “rough and ready.” 

Nonparametric procedures are, of course, con- 
cerned with parameters, that is, numerical char- 
acteristics of distributions, but usually with param- 
eters that have desirable invariance Properties under 
modifications in scale of measurement. For exam- 
ple, the median is an important parameter for non- 
Parametric analysis; if X is a random variable with 
median m and if f is a strictly increasing function, 
then the median of F(X) is f(m). 

History. Many of the first Statistical techniques 
were applied only to massive quantities of data, so 
massive that the computed statistics had effectively 
known normal distributions, usually because the 
Statistics were based on sample moments, Later, 
the need for methods appropriate to small samples 
became clear, and the exact distributions of many 
Statistics were derived for Specific assumed forms 
(usually normal) of the underlying distribution. 
This work, begun by William S. Gosset (“Student”) 
and R. A. Fisher, led to the standard statistical 
tests: for instance, t, chi-square, and F, 

Somewhat later, although there was substantial 


overlap, procedures were developed having exact 
properties without the need for special distribu- 
tional assumptions. Procedures were also developed 
that simplified numerical analysis of data; much 
of the motivation for all this work was a desire for 
procedures that could be applied to data given in the 
form of ranks or comparisons. These developments 
often arose from the work of social scientists deal- 
ing with data that clearly did not come from dis- 
tributions of standard form; for example, rank sum 
tests were proposed by G. Deuchler and Leon Fes- 
tinger (psychologists), and one form of analysis 
of variance by ranks was proposed by Milton Fried- 
man (an economist). 

When dealing with data that clearly do not arise 
from a standard distribution, an alternative to the 
use of nonparametric methods is the use of trans- 
formations, that is, application to each observation 
of some common function, chosen so as to make 
the resulting transformed data appear nearly nor- 
mal or nearly following some other standard distri- 
butional form [see STATISTICAL ANALYSIS, SPECIAL 
PROBLEMS OF, article on TRANSFORMATIONS OF 
DATA]. 

In the first section below, the nonparametric 
analysis of experiments generating a single sample 
is examined. Artificial data are used in order to 
concentrate attention on the formal aspects of the 
statistical analysis. The second section is a less de- 
tailed discussion of two sample problems, and the 
last mentions some additional important nonpara- 
metric problems, 


One-sample problems 


A single sample of seven numerical observations 
will be used to illustrate the application of non- 
parametric procedures. Suppose the observations 
are (A) —1.96; (B) —.77; (C) —.59; (D) +1.21; 
(E) +.75; (F) +4.79; and (G) +6.95. These 
data might be the scores achieved by seven people 
on a test, the per cent change in prices of seven 
stocks between the two points of time, the ratings 
of seven communities on some sociological scale, 
and so forth. 

There are several kinds of statistical inferences 
for each of several aspects of the parent popula- 
tion. Also, for each kind of inference about each 
aspect of the population there exist several non- 
parametric techniques. The selection of a kind of 
inference about an aspect by a particular technique 
is guided by the experimenter’s interests; the avail- 
able resources for analysis; the relative costs; the 
basic probability structure of the data; the criteria 
of optimality, such as power of tests or length of 
confidence intervals; and the sensitivity (robust- 
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ness) of the technique when the underlying as- 
sumptions are not satisfied perfectly. 

Point estimation of a mean. Assume the data 
represent a random sample. As a first problem, the 
mean (a location parameter) of the population is 
the aspect of interest; the form of inference de- 
sired is a point estimate, and the technique of 
choice is the sample mean, 


pu (-1.98) + (77) + + (46.95) 


7 = 1.483. 


The following are some justifications for the use 
of ž: (a) If the sample is from a normal popula- 
tion, then x is the maximum likelihood estimator 
of the population mean, @. (b) If the sample is 
from a population with finite mean and finite vari- 
ance, then < is the Gauss—Markov estimator; that 
is, among linear functions of the observations that 
have the mean as expected value, ¢ has smallest 
variance. (c) The least squares value is <; that is, 
% is the value of y that minimizes (—1.96 — y)? + 
+++ + (6.95 — y)?. (This last holds whether or not 
the sample is random.) 

Result a is parametric since a functional form 
is selected for the population; in contrast, b is non- 
parametric. The least squares result, c, is not de- 
pendent on probability considerations. 

Point estimation of a median. The sample 
median, +.75, is sometimes used as a point esti- 
mator of the population mean. (When the sampled 
population is symmetric, the population mean and 
median are equal.) 

For some populations (two-tailed exponential, 
for example) the sample median is the maximum 
likelihood estimator of the population mean. The 
sample median minimizes the mean absolute devia- 
tion, |—1.96—y| + -+- + |6.95—-y]. 

Parametric confidence intervals for a mean. If 
it can be assumed that the data are from a normal 
population (mean 0 and variance g? unknown), to 
form the conventional two-sided confidence interval 
with confidence level 1— one first computes % 
and s? (the sample variance, with divisor n — k 
where n is sample size) and then forms the inter- 
val % + tys,/28/ni, where ty-10/2 is the upper a/2 
quantile of the t-distribution with n — 1 degrees of 
freedom. For the present data, ¢ = 1.483, s? = 10.44, 
and the conventional 95 per cent confidence inter- 
val is (—1,508, 4.474). 

Nonparametric confidence intervals for a median. 
In a sample of size n let Xa) < Xe) < +++ < Xim be 
the ordered observations, the order statistics. (It 
will be assumed throughout that no ties occur be- 
tween observations, that is, that the observations 
come from a continuous population.) If Med is the 


Le 


population median, then the probability of an ob- 
servation’s being less (greater) than Med is 4. The 
probability that all the observations are less (great- 
er) than Med is 2-". The event xu, < Med < X(n-i+1) 
occurs when at least i of the observations are less 
than Med and at least i of the observations are 
greater than Med. Hence, xi, < Med < %(n-i1) has 
the same probability as does obtaining at least i 
heads and at least i tails in n tosses of a fair coin. 
From the binomial distribution, one obtains 


Pr(x,i) < Med < X(n-in1)) 
i-l 


n n 
FO (a 
i 70 

In words, (Xi), X(n-is1)) is a confidence interval for 
Med with confidence level given by the above 
formula; for example, (Xa), Xom) is a confidence 
interval for Med with confidence level 1 — 2". 
Thus, for the present data with i= 1, (—1.96, 
6.95) is a confidence interval with confidence 
level = 63%, = .984 and (—.77, 4.79) has confi- 
dence level = % = .875. 

Confidence intervals for the mean of a normal 
population are available at any desired confidence 
level. For the above nonparametric procedure, par- 
ticularly for very small sample sizes, there are 
sharp restrictions on the available confidence levels. 
The restriction occurs because nonparametric pro- 
cedures are often based on random variables with 
discrete distributions even though the underlying 
random variables have continuous distributions. 

Comparisons of confidence intervals. The con- 
fidence statement appears more definitive the 
shorter the confidence interval. One relevant cri- 
terion is the expected value of the squared length 
of the confidence interval. The value of expected 
squared length for the confidence interval based on 
the t-distribution is 4n-'ot2_,,.. If the normality 
assumption is dropped, then the t-distribution con- 
fidence interval will, in general, no longer have the 
desired confidence level, although its expected 
squared length remains the same. (For some spe- 
cific nonnormal assumptions—uniform distribu- 
tions, for example—confidence intervals at the 
desired confidence level may be obtained with 
much smaller expected squared lengths.) The ex- 
pected squared length of the order statistic confi- 
dence interval, assuming only symmetry of the 
distribution, is 2[Ex?,, — E(%()%(n-i;))]- If normality 
actually holds, the confidence interval based on the 
t-distribution has smaller expected squared length: 
with n = 7 the ratio of expected squared lengths is 
.796 for confidence level .984 and .691 for con- 
fidence level .875. A general approximate result is 
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that to obtain equal expected squared lengths one 
needs a sample size for the order statistics interval 
of about 4 m = 1,57 times the sample size for the 
t-distribution interval. 

Tests of hypotheses about a mean. Aside from 
the estimation of a parameter, one often wishes to 
test hypotheses about it. If the data are from a 
normal population, then the best procedure, uni- 
formly most powerful unbiased of similar tests, is 
based on t= (%—,)n'/s, where 0, is the hypo- 
thetical value for the population mean. Consider 
testing the null hypothesis 6, = —.5 on the basis of 
the specific sample of seven; the resulting value 
of the t-statistic is 1.21, which is statistically sig- 
nificant at the .1575 level ( two-sided). For these 
data and this test statistic the null hypothesis 
would be rejected if the required significance level 
exceeds .1575, but if the significance level is 
smaller, the null hypothesis would not be rejected. 
The power of this test depends on the quantity 
(0 — 0,)?/o*, where @ is the population mean [see 
HYPOTHESIS TESTING). 

Sign test for a median. Perhaps the simplest 
nonparametric test comparable to the t-test is the 
sign test. It is easily applied, which makes it useful 
for the preliminary analysis of data and for the 
analysis of data of passing interest. The null hy- 
pothesis for the sign test specifies the population 
median, Med,. In the example, the null hypothesis 
is that the median is —.5. The test statistic is the 
number of observations greater than Med,. In the 
example, the value of the statistic is 4. (The term 
“sign test” arises because one counts the number of 
differences, observation minus Med,, that are pos- 
itive.) If one rejects the null hypothesis whenever 
the number of positive signs is less than i or greater 
than n—i,i<n/2, the Significance level will be 


i-t 


$O 


In this example the possible significance levels are 
0 (i=0), 4%, (i=1), % (i=2), %4 (i=3), 
1 (i= 4). For these data, the sample significance 
level is 1. The sign test leads to exact levels of 
significance when the observations are independent 
and each has probability 4 of exceeding Med,. It is 
not necessary to have each observation from the 
same population. One needs only to compare each 
observation with the hypothetical median. The test 
can be applied even when quantitative measure- 
ments cannot be made, as long as the signs of the 
comparisons are available, 

The power of the sign test depends on P, the 
probability that an observation exceeds Med,. When 


p #3, the power of the sign test approaches 1 as 
the sample size increases. Tests having this prop- 
erty are said to be consistent. All reasonable tests, 
such as those discussed below, will have such con- 
sistency properties. When p + }, the power of the 
sign test is always greater than the significance 
level. Tests having this property are said to be 
unbiased, 

The sample significance level can be found from 
a binomial table with p = 4, and the power from a 
general binomial table. If z is the number of pos: 
itive results in an experiment with moderately 
large n, one can approximate the significance level 
by computing ť = (2z — n)/n? and referring t to 
a normal table. A somewhat better approximation 


can be obtained by replacing z with z + 1 in com- 
puting t’; this is called a continuity correction and 
often arises when a discrete distribution is approx- 


imated by a continuous distribution. 

The sign test and the nonparametric confidence 
intervals described above are related in the same 
manner as the t-test and confidence intervals based 
on the t-distribution. 

The sign procedure is easily put into a sequential 
form; experiments in this form often save money 
and time [see SEQUENTIAL ANALYSIS]. 

Conventionalized data—signed ranks. One can 
think of the sign statistic as the result of replacing 
the observations by certain conventional numbers, 
1 for a positive difference and 0 otherwise, and 
then proceeding with the analysis of the modified 
data. A more interesting example is to replace the 
observations by their Signed ranks, that is, to re- 
place the observation whose difference from Med 
is smallest in absolute value by +1 or —1, as that 
difference is positive or negative, etc. Thus for the 
present data with Med, = —.5, the signed ranks are 
—4, —2, —1, +5, +3, +6, and +7. 

Signed-rank test or Wilcoxon test for a median. 
The one-sample signed-rank Wilcoxon statistic, W, 
is the sum of the positive signed ranks. In the ex- 
ample, W = 21. The exact null distribution of W 
can be found when the observations are mutually 
independent and it is equally likely that the jth 
smallest absolute value has a negative or positive 
sign, for example, when the observations come 
from populations that are symmetrical about the 
median specified by the null hypothesis. 

When the null hypothesis is true, the probability 
that the Wilcoxon statistic is exactly equal to w is 
found by counting the number of possible samples 
that yield w as the value of the statistic and then 
dividing the count by 2”. When n = 7, the largest 
value for W, 28, is yielded by the one sample hav- 
ing all positive ranks. In that case Pr(W = 28) = 
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2-7, The distribution of W under the null hypothe- 
sis is symmetric around the quantity n(n + 1)/4, 
so that Pr(W = w) = Pr(W = 3n(n + 1) — w). Thus 
for n= 7, Pr(W = 21) = Pr(W = 7). A value of 7 
for W can be obtained when the positive ranks are 
(7), (1,8), (2,5), (3,4), (1,2,4)—each parenthesis 
represents a sample. Thus Pr(W = 21) = Pr(W = 
7) = 5/2". By enumeration, the probability that 
W > 21 or W <7 is 3%o5 = .281. The present data 
are therefore statistically significant at the .281 
level when using the Wilcoxon test (two-sided). 
The sign test has about n/2 possible significance 
levels, and the Wilcoxon test has about n(n + 1)/4 
possible significance levels. For small samples, 
tables of the exact null distribution of W are avail- 
able, Under the null hypothesis the mean of W is 
given by n(n + 1)/4 and the variance of W is given 
by n(n+1)(2n+1)/24. The standardized vari- 
able that is derived from W by setting t = (W — 
EW)/(var W)? has approximately a normal distri- 
bution. Most statements about the power of the W 
test are based on very small samples, Monte Carlo 
sampling results, or large-sample theory. For small 
samples, W is easy to compute. As the sample size 
increases, ranking the data becomes more difficult 
than computing the t-statistic. 

W can be computed by making certain compari- 
sons, without detailed measurements. Denote by 
m (n’) the number of negative (positive) observa- 
tions, Let u, «++, U» be the values of the positive 
and v;, ***, Um be the absolute values of the neg- 
ative observations. Let Ai; = A(u;,v;) be equal to 1 
if u; > u; and equal to 0 otherwise. Define 


S=Di LD; Ay. 


W=S4+n(n'+1)/2. 


If m’ > 0 and n’ > 0, the statistic S/m’n' is an un- 
biased estimator of the probability that a positive 
observation will be larger than the absolute value 
of a negative observation. Hence S is of independ- 
ent interest. The relationship between S and W 
involves the random quantity n’, so that inferences 
drawn from S and W need not be the same. 
Wilcoxon statistic confidence intervals. Given 
a population distribution symmetric about the 
median, the Wilcoxon test generates confidence 
intervals for the population median, Med. In the 
example, the significance level 1% = .0781 corre- 
sponds to rejecting the null hypothesis when 
W>25 or W<3. Thus the corresponding confi- 
dence interval at confidence level 1 — 1%23 = 9219 
consists of values of Med'that will make 4 < W < 24. 
An examination of the original data (the signed 
ranks are not now sufficient) yields the interval 


Then 
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(—1.28, 4.08). An examination of some trial values 
of Med will help in understanding this result. Thus 
if Med = 4.2, one sees that F has rank 1, G has 
rank 2, no other observation has positive rank, and 
W =3, which means this null hypothesis would be 
rejected, If Med = 4, then F has rank 1, G has rank 
3, no other observation has positive rank, and 
W =4, which means this null hypothesis would be 
accepted. 

Permutation test for a location parameter. The 
final test considered here for a location parameter 
(like the median or mean) is the so-called permu- 
tation test on the original data. Under the null 
hypothesis, the observations are mutually independ- 
ent and come from populations symmetric about 
the median Med,. This includes both the cases 
where the signed ranks are the basic observations 
and the cases where the signs are the basic observa- 
tions (scoring +1 for a positive observation and 
—1 for a negative observation). Given the absolute 
values of the observations minus Med,, under the 
null hypothesis there are 2” equally likely possible 
assignments of signs to the absolute values. The 
nonparametric statistic to be considered is the total, 
T, of the positive deviations from Med,. One works 
with the conditional distribution of T, given the 
absolute values of the observed differences. Using 
Med, = —.5, for the present data T = 15.70. There 
are 13 configurations of the signs that will give a 
value of T at least this large and another 13 con- 
figurations that will give a value of 17.52 — 15.70 = 
1.82 or smaller, the latter being the lower tail of 
the symmetrical distribution of T. Thus the signifi- 
cance level of the permutation test on the present 
data is 2649, = .2031. With the test, each multiple 
of 14, is a possible significance level. The compu- 
tations for this procedure are prohibitive in cost if 
the sample size is not very small. A partial solution 
is to use some of the assignments of signs (say a 
random sample) to estimate the conditional dis- 
tribution of T. The ordinary t-statistic and T are 
monotone functions of each other; thus a signifi- 
cance test based on t is equivalent to one based on 
T. The distribution of t for the permutation test is 
approximately normal for large values of n. Actu- 
ally, the randomization that occurs in the design 
of experiments yields the nonparametric structure 
discussed here. 

The power of the permutation test is approxi- 
mately the same as that of the t-test when the data 
are from a normal population. 

Finally, one can construct confidence intervals 
for Med, using the permutation procedure. The 
confidence interval with level 12%2,=.955 is 
(—1.107, 4.376) for the above example. 
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Tests for a location parameter compared. All 
the tests are consistent; for a particular alternative 
and large sample size, each will have power near 1. 
To compare the large sample power of the tests, a 
sequence of alternatives approaching the hypothe- 
sized null value, Med,, is usually introduced. Let 
Medy be the alternative at the Nth step of this 
sequence, and assume that Med, — Med, = c/N?. 
where c is a constant. As N increases, the alterna- 
tive hypothesis approaches the null hypothesis and 
larger sample sizes will be required to obtain a 
desited power for a specified level of significance. 
The efficiency of test 11 compared to test 1 is defined 
as the ratio Eru (N) = m(N)/nu(N), where n, and 
My are the sample sizes yielding the desired power 
for test 1 and test 1, For large N, Ey(N) is usually 
almost independent of test size, power, and the 
constant c; that is, as N grows, Ern (N) usually has 
a limit, Ern, called the Pitman efficiency of test 11 
compared to test 1. The Pitman efficiency will de- 
pend on the tests under consideration and the 
family of possible distributions. Consider the 
two experiments, I and 11, to test the hypothesized 
null value, Med,. They have sample sizes m(N) 
and mi(N) and costs per observation y; and yn. 
Then the cost of experiment 1 divided by the 
cost of experiment m is y1(N)/yunn(N) = 
(Yı/Yu)Ern. When this ratio is > 1 (<1), experi- 
ment II costs less (more) than experiment 1 and the 
two experiments have the same power functions. 
The y associated with the sign test can be much 
smaller than the y's associated with the Wilcoxon 
and permutation tests. 

For normal alternatives, where the parameter is 
the mean, one has 


Et-test, permutation = 1, 
Ee-test, witeoxon test = 3/m = .955, 
Ee-test, sign test = 2/7 = .637. 


Tolerance intervals. Tolerance intervals resem- 
ble confidence intervals, but the former permit an 
interpretation in terms of hypothetical future ob- 
servations. One nonparametric tolerance interval 
method, based on the order statistics of a random 
sample of size n, provides the assertion that at least 
90 per cent (say) of the underlying population lies 
in the interval from xu, to *X(n-is2). This assertion 
is made in the confidence sense; that is, its prob- 
ability of being correct (before the observations are 
taken) is, say, .95. The relationship between i, the 
confidence level (here :95), and the coverage level 
(here .90) is in terms of the incomplete beta func- 


tion. Special tables and graphs are available (Walsh 
1962-1965). 


Another form of nonparametric tolerance inter- 
val interprets the interval from xu, to Xineinay a8 
giving a random interval whose probability under 
the underlying distribution is approximately given 
by (n — 2i + 1)/(n +1). In this context the word 
“approximately” is used to mean the following; 
the probability measure of the random interval 
[Xos Xin] is itself random; its expectation is 
(n-2i+1)/(n+1). The probability (before any of 
the observations are made) that a future observa- 
tion will lie in a tolerance interval of this form is 
(n-2i+1)/(n+1). 

For the data set, with i = 1, the interval [-1.96, 
6.95] is obtained. This interval is expected (in the 
above sense) to contain [(7—24-1 )/(7+1)] = 75 
per cent of the underlying population. 

Other one-sample tests. Goodness of fit proce- 
dures are used to test the hypothesis that the data 
came from a particular population or to form con- 
fidence belts for the whole distribution function. 
[These are discussed in GOODNESS oF FIT.] 

Tests of the randomness of a sample (that is, of 
the assumption that the observations are inde- 
pendent and drawn from the same distribution) 
can be carried out by counting the number of runs 
above and below the sample median, [Such tech- 
niques are discussed in NONPARAMETRIC STATIS- 
TICS, article on RUNS.] 

The statistic based on the number of runs, the 
Wald—Wolfowitz statistic, was originally proposed 
as a test of goodness of fit. It is not, however, 
recommended for this purpose, because other, more 
powerful tests are available. 

Tied observations. The discussion so far has 
presumed that no pairs of observations have the 
same value and that no observation equals Meds. 
In practice such ties will occur. If ties are not ex- 
tensive their consequence will be negligible. If 
exact results are required, the analysis can in prin- 
ciple be carried out conditionally on the observed 
pattern of ties. 


Two-sample problems 


Experiments involving two samples allow com- 
parisons between the sampled populations without 
the necessity of using matched pairs of observa- 
tions. Comparisons arise naturally when consider- 
ing the relative advantages of two experimental 
conditions or two treatments, for example, a treat- 
ment and a control, or when absolute standards 
are unknown or are not available. 

Most of the one-sample procedures have two- 
sample analogues (the exceptions are tests of ran- 
domness and tolerance intervals). With two sam- 
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ples, central interest has been focused on the 
difference between location parameters of the two 
populations. To be specific, let x,,---,%m be the 
observed values in the first sample and y1, +++, Yn 
be the observed values in the second sample. Let 
M: and M, be the corresponding location param- 
eters with A= M, — Mz. 

Estimation of difference of two means. As a 
point estimator of A when M, and M, represent the 
population means, one often uses the differences 
in the sample averages, 7 —%. This is the maxi- 
mum likelihood estimator of A when all of the ob- 
servations are independent and come from normal 
populations; it is also the Gauss—Markovy estimator 
whenever the populations have finite variances, 
and it is the least squares estimator. With the nor- 
mal assumption, and also assuming equal vari- 
ances, confidence intervals for A can be obtained, 
utilizing the t-distribution. As in the normal case, 
nonparametric confidence intervals for A will not 
have the prescribed confidence level unless the two 
populations differ in location parameter only. 

Brown—Mood procedures—two medians. The 
analogue of the confidence procedure based on 
signs, the Brown—Mood procedure, is constructed 
in the following manner: Let wy, *`*, Wmin be all 
of the observations arranged in increasing order, 
that is, w, is the smallest observation in both sam- 
ples; w is the second smallest observation in both 
samples; etc. Denote by w* the median of the com- 
bined sample. (It will be assumed that m +n is 
odd.) Let m* be the number of w’s greater than 
w*, from the x-population. When the two popu- 
lations are the same, m* has a hypergeometric 
distribution and a nonparametric test of the hy- 
pothesis that the two populations are the same, spe- 
cifically that A = 0, is based on the distribution of 
m’. To obtain confidence intervals, replace the 
x-sample with x} =x; +A (i= 1, +: , m); form w’, 
the analogue of the w sequence; compute the 
median of the w’ sequence and call it w'*; compute 
the number of observations from the x’ sequence 
above w” and call it m”; see if one would accept 
the null hypothesis of no difference between the x’ 
and y populations; if one accepts the null hypothe- 
sis, then A is in the confidence interval, and if one 
rejects the null hypothesis, then A is not in the 
confidence interval. 

Wilcoxon two-sample procedure. The analogue 
to the one-sample Wilcoxon procedure is to assign 
ranks (a set of conventional numbers) to the w’s, 
that is, w, is given rank 1, w, is given rank 2, ete. 
The test statistic is the sum of the ranks of those 
w’s from the x-population. When the two popula- 


181 


tions are identical, the distribution of this test sta- 
tistic will not depend on the underlying common 
distribution. The test based on this statistic is called 
the Wilcoxon test or Mann—Whitney test. 

The Wilcoxon test can be used if it is possible to 
compare each observation from the x-population 
with each observation from the y-population. The 
Mann-Whitney version of the Wilcoxon statistic (a 
linear function of the Wilcoxon statistic) is the 
number of times an observation from the y-popula- 
tion exceeds an observation from the x-population. 
When this number is divided by mn, it becomes an 
unbiased estimator of Pr(Y >X), the probability 
that a randomly selected y will be larger than a 
randomly selected x. This parameter has many in- 
terpretations and uses, for instance, if stresses and 
strains are brought together by random selection, 
it is the probability that the stress will exceed the 
strain (the system will function). 

Permutation procedure. The permutation pro- 
cedure is based on the conditional distribution of 
the sum of the observations in the x-sample given 
the w sequence, when each selection of m of the 
values from the w sequence is considered equally 


likely. There are le) such possible selections. 


The sum of the observations in the x-sample is an 
increasing function of the usual t-statistic. The im- 
portance of the permutation procedure is that it 
often can be made to mimic the optimal procedures 
for the parametric situation while retaining exact 
nonparametric properties. 

Comparisons of scale parameters. To compare 
spread or scale parameters, one can rank the ob- 
servations in terms of their distances from w°, the 
combined sample median. The sum of ranks corre- 
sponding to the x-sample is a useful statistic [see 
NONPARAMETRIC STATISTICS, article on RANKING 
METHODS]. 

When the null hypothesis does not include the 
assumption that both populations have the same 
median, the observations in each sample can be 
replaced by their deviations from their medians, 
and then the w-sequence can be formed from the 
deviations in both samples. This ranking procedure 
will not be exactly nonparametric, but it yields re- 
sults with significance and confidence levels near 
the nominal levels. 


Other problems 

Invariance, sufficiency, and completeness. Such 
concepts as invariant statistics, sufficient statistics, 
and complete statistics arise in nonparametric the- 
ory. In the two-sample problem, the ranks of the 
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first sample considered as an entity have these 
properties when only comparisons are obtainable 
between the observations of the x-samples and 
y-samples or if the measurement scheme is unique 
only up to monotone transformations, that is, if 
there is no compelling reason why particular scales 
of measurement should be used. For the two-sample 
problem, confidence intervals based on ranks are 
invariant under monotone transformations. If a 
nonparametric confidence interval has been formed 
for the difference between location parameters, the 
interval with end points cubed will be the con- 
fidence interval that would have been obtained if 
the original observations had been replaced by their 
cubes. Maximum likelihood estimation procedures 
have a similar property. 

Multivariate analysis. Although correlation tech- 
niques have not been described in this article, 
Spearman's rank correlation procedure acted as a 
stimulus for much research in nonparametric 
analysis. Problems such as partial correlation and 
multiple correlation have not received adequate non- 
Parametric treatment. For these more complicated 
problems, the most useful available techniques are 
associated with the analysis of multidimensional 
contingency tables. Multivariate tolerance sets are 
also available. [See MULTIVARIATE ANALYSIS, arti- 
cles on CORRELATION; NONPARAMETRIC STATISTICS, 
article on RANKING METHODS; STATISTICS, DESCRIP- 
TIVE, article on ASSOCIATION, | 

Semantic confusions. In the nonparametric area 
a number of semantic confusions and multiple 
uses of the same term have arisen, For example, 
“rank correlation” has been used by some to refer 
to the Spearman rank correlation coefficient, by 
others to refer to both the Spearman coefficient and 
to the so-called Kendall tau (another measure of 
bivariate association that is analogous to the Mann- 
Whitney statistic), and by still others to mean any 
correlationlike coefficient that depends only on 
order relationships. 

Sometimes the same procedure, described by two 
superficially different but really equivalent statis- 
tics, is erroneously regarded as two separate pro- 
cedures. An example is confusion between the 
Mann-Whitney and the Wilcoxon forms of the 
same test, 

Sometimes writers fail to recognize that a word 
(for example, “runs”) describes a variety of differ- 
ent objects. 

Analysis of variance analogues. A substantial 
body of nonparametric procedures has been pro- 
posed for the analysis of experiments involving sev- 
eral populations, Analogues have been devised for 
the simplest standard analysis of variance pro- 


cedures. One such analogue involves the ranking 
of all of the measurements in the several samples 
and then performing an analysis of variance of 
rank data (Kruskal-Wallis procedure ). Another 
procedure involves each of several judges ranking 
all populations (the Friedman-Kendall technique), 
Decision theory. Decision problems other than 
estimation and testing have not received extensive 
nonparametric attention. One result, however, is 
the Mosteller procedure of deciding whether or not 
one of several populations has shifted, say to the 
right. It can be thought of as a multiple-decision 
problem, since if one rejects the null hypothesis 
(that the populations are the same) it is natural to 
decide that the “largest” sample comes from the 

“largest” population. 
I. RICHARD SAVAGE 


[Other relevant material may be found in Errors, arti- 
cle on EFFECTS OF ERRORS IN STATISTICAL ASSUMP- 
TIONS; PROBABILITY. Information on the probability 
distributions discussed in this article may be found 
in DISTRIBUTIONS, STATISTICAL.) 
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ORDER STATISTICS 


Order statistics is a branch of statistics that con- 
siders the rank of an observation in a sample as 
well as its algebraic magnitude. Applications arise 
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in all parts of statistics, from broadly distribution- 
free (nonparametric) problems to those in which a 
specific form for the parent population is assumed. 

Order statistics methods are particularly useful 
when a complete sample is unavailable. For exam- 
ple, suppose a biologist is studying treated animals 
and observes the time of survival from treatment 
to death. He obtains his observations automatically 
as order statistics and might end the experiment 
after a fixed time has elapsed or after a certain 
number (or proportion) of the animals have died. 
In such cases estimation of, and testing of hypoth- 
eses about, parameters describing survival time can 
be successfully handled by order statistics. 

Early literature in the field was on the use of 
order statistics for complete samples, usually of 
small size. Interest in order statistics can be traced 
back to the use of the median and the range as 
estimators of location and scale, This method of 
estimating location and scale has been generalized 
into linear functions of order statistics for use in 
censored small samples and as easily computed, 
but somewhat inefficient, estimators in complete 
samples, [See STATISTICS, DESCRIPTIVE, article on 
LOCATION AND DISPERSION.] 

Applications of order statistics also include meth- 
ods of studying extreme observations in a sample. 
This theory of extreme values is helpful in the 
analysis of the statistical aspects of rainfall and 
floods, in fatigue testing, in the analysis of injury 
experience of industrial personnel, and in the 
analysis of oldest ages among survivors. Order 
Statistics are of fundamental importance in screen- 
ing the extreme individuals in a sample either in 
a specific selection procedure or in judging whether 
or not these extreme observations are contaminated 
and should be rejected as outliers [see SCREENING 
AND SELECTION; STATISTICAL ANALYSIS, SPECIAL 
PROBLEMS OF, article on OUTLIERS]. The range, the 
“Studentized range,” and other functions of order 
Statistics have been incorporated into analysis of 
variance itself [see LINEAR HYPOTHESES, article on 
MULTIPLE COMPARISONS]. 

There were early and isolated investigations con- 
cerning order statistics, such as computations of 
the relative efficiency of the median versus the 
mean in samples from the normal distribution (for 
example, Encke as early as 1834), but the first 
systematic development of sampling theory in order 
Statistics occurred in 1902, when Pearson consid- 
ered the “Galton difference problem.” He found the 
expected yalue of the difference between the rth 
and (r + 1)st order statistics in a sample of size n. 
Daniell (1920) derived the expected value of the 
rth order statistic and of products of order statis- 
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tics. He also considered linear estimators for meas- 
ures of location and scale in the normal distribution. 
In 1921, Bortkiewicz considered the distribution 
theory of the sample range, and in 1925, L. H. C. 
Tippett found the mean value of the sample range 
and tabulated the cumulative distribution function 
(cdf) of the largest order statistic for a sample 
(n < 1,000) from the normal distribution. 


Definitions and distribution theory 


Suppose X is a continuous random variable with 
cdf F(x) and probability density function f(x). 
(For discrete random variables the work of Soliman 
Hasan Abdel-Aty in 1954 and Irving W. Burr in 
1955 may be helpful; see the guide at the beginning 
of the bibliography.) If the elements of a random 
sample, X,, X,,°*+, Xn, are rearranged in order of 
ascending magnitude such that 


Xo <1 <Xm, 


then X;,, is called the rth order statistic in the 
sample of n. Arrangement from least to greatest is 
always possible, since the probability that two or 
more X’s are equal is zero. In actual samples ties 
may occur because of rounding or because of in- 
sensitive measuring devices; there are special rules 
designed to handle such cases for each procedure 
(see, for example, Kendall 1948, chapter 3). 
Exact distribution of X,,,._ The probability ele- 
ment, 6(x;,)) dx,,,, for the rth order statistic is 


n! 
G=1)(n=r) 
[E(X E EE En) dX. 


The heuristic argument for the above expression 
can be described in terms of Figure 1, showing the 
x-axis. To say that the rth order statistic lies in a 
small interval, (Xe), Xin + 4%»), is to say that 
r — 1 unordered observations are less than Xn, that 
n — r observations are greater than Xi) + dx, and 
that one observation is in the small interval. These 
probabilities correspond to the three factors above 
just to the left of dxXır) . The factorial factor at their 
left allows for rearrangements of the unordered 
observations. 


1 
observation| 


n=r 
observations 


r-1 
observations 


Xy Xin FAK ry 


Figure 1 
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Similar expressions exist for the joint distribu- 
tion of the ith and jth order statistics, i < j. This 
can lead to the cdf of the sample range and other 
useful measures. 

Limit distributions. Limit distributions will be 
discussed first for extreme order statistics and then 
for proportionately defined order statistics. 

Extreme order statistics. Limiting (large sam- 
ple, asymptotic) distributions for the extreme order 
statistics—the largest and the smallest—have been 
much studied because of their applicability to the 
analysis of floods, strength of materials, and so on. 
For brevity, only the largest order statistic, X,,), 
will be considered here. 

The large sample behavior of X,,, clearly depends 
on the behavior of F(x) for large values of x, and 
since X will in general become large and highly 
variable as n grows, an appropriate kind of center- 
ing and scaling must be introduced. 

Perhaps the most important case is that in which 
F(x) approaches unity at an exponential rate when 
x grows but in which F(x) never attains the value 
unity. More precisely, the case considered is that 
in which, for some a> 0, [1 — F(x)]/e“* has the 
limit zero as x grows but in which 1 — F(x) never 
actually attains the value zero. If a centering se- 
quence, {u,}, is then defined by 


F(u) = (n—1)/n, 
and a scaling sequence, {a,}, is defined by 
a, = nf (un), 


where f is the probability density function associ- 
ated with F, then the basic result is that Y = 
(Xn) — Un) has as its limit distribution the double 
exponential, with cumulative distribution function 
exp (—e-"), 

There are two other kinds of such limit distribu- 
tions, depending on the structure of F for large x; 
a detailed discussion was given by E. J. Gumbel 
in 1958. One of these is, in effect, the distribution 
studied by W. Weibull in 1949 to investigate the 
breaking strengths of metals. 

The three kinds of limit distributions are inter- 
transformable by changes of variable. Since the 
limit distributions are relatively simple, large sam- 
ple estimation is relatively easy; in fact, graphical 
methods have been worked out and were explained 
by Gumbel; an exposition of procedural steps is 
given in Botts (1957). 

Proportionately defined order statistics. A sec- 
ond kind of limit distribution is for the rth order 
Statistic where one thinks of r as growing propor- 
tionally to n. For any number À, 0<A<1, the 


d-quantile of the population is defined as that value 
of x, č, such that 


F6) = [f(x) dx =a. 


For instance, if \=.5, x) is the median, or the 
50th percentile (a percentile is a quantile expressed 
on the base of 100). A A-quantile, like the median, 
may sometimes be indeterminate in an interval 
where f(x) = 0. In such a case every £ in the inter- 
val that satisfies F(é,) =A can be taken as the 


A-quantile. 
The A-quantile of a sample is defined by 
Ximo if nì is not an integer, 
"E where [nd] is the 
n = 


largest integer < na, 
if nà is an integer; this 
is an arbitrary defi- 
nition for the in- 
determinate case, 
since z} can be any 
value in the interval 
(Xinay, Xima): 


N. V. Smirnoff showed in 1935 that if r/n =A 
as n > æ, and if f(x) is continuous and positive at 
the d-quantile, &, then V/n(X,,, — g) is asymp- 
totically normal with mean zero and variance 
A(1 — A)/If(é)]?. This result in the special case 
of the median (A = 4) was studied by Encke (1834). 

The joint limit distribution of two order statistics 
was also given by Smirnoff. Frederick Mosteller in 
1946 extended this to a set of k order statistics 
with a normal k-variate limit distribution. 

The normalized sample quantile given by trans- 
forming to Vm (z, — &) has the same asymptotic 
distribution as V/n(X;.,—£), discussed above. 
For sampling from a normal distribution with 
parameters u and ø, the median of a sample (Zs) 
of size n is asymptotically normal with mean x and 
standard deviation «\/7/2n. The sample average 
has mean y and standard deviation o/ vn, so the 
ratio of standard deviations is V 7/2. In other words, 
for normal sampling it takes about 100 V/z/2 pet 
cent more observations to obtain the same precision 
with the sample median as with the sample average. 
For samples from other distributions, this relation- 
ship may be reversed. 


{Xena + Xenan} 


Nonparametric procedures for quantiles 
Although the distribution of the Xa), the order 
Statistics, depends upon the underlying distribution 
with cumulative distribution function F, it is im- 
portant to observe that so long as F is continuous, 


ma 


NONPARAMETRIC STATISTICS: Order Statistics 


the distribution of the F(X,;,) does not depend upon 
F. A great many nonparametric procedures are 
based on this fact, and some of them will now be 
briefly described. (See Mood & Graybill [1950] 
1963.) 

Confidence limits for quantiles. A simple con- 
fidence interval for the quantile at confidence level 
a is obtained by choosing integers O<r<s<n 
such that 


K(r,n-—7r+1) -h(s,n—s+1)=1-a, 


where I is the incomplete beta distribution [see 
DISTRIBUTIONS, STATISTICAL, article on SPECIAL 
CONTINUOUS DISTRIBUTIONS]. The consequent con- 
fidence interval is just Xon < ëa S Xo» . If the order 
statistics are symmetrically placed (s =n — r +1), 
further simplification is achieved. If, further, the 
quantile is the median, the interval is particularly 
easy to use; its testing analogue is called the median 
test. Binomial tail summations may equivalently 
be used for the incomplete beta distribution values. 

An example. Given a sample of 30 observations 
from any continuous distribution, 1 — æ symmetric 
confidence intervals for the population median are 
as shown in Table 1. 


Table 1 — Symmetric confidence intervals for the popu- 
lation median, based on a sample of 30 
observations 


Order statistics Confidence coefficient 


Xia» and Xas 995 
Xero) and Xan 95 
Xan and Xiao» 90 


For the quartiles (À = .25, .50, .75), with n going 
from 2 to 1,000, and for several values of a, sec- 
tion 12.3 of Owen (1962) provides convenient 
tables. Noether (1949) gives an elementary dis- 
cussion of these procedures and the ones for tol- 
erance intervals. 

Tolerance intervals. Nonparametric tolerance 
intervals also may be based on the concept of choos- 
ing integers determined by the incomplete beta 
distribution. A tolerance interval says that at least 
1008 per cent of the probability distribution de- 
scribed by F is contained in a random interval, so 
chosen that the statement is true with probability 
1 — a. If the ends of the interval are X;,) and X, 
with r<s <n, and if £ is given, then a may be 
computed in terms of the incomplete beta distribu- 
tion; similarly, if œ is given, 8 may be computed. 
This was discussed further by R. B. Murphy in 1948. 

Furthermore, many nonparametric tests of hy- 
Potheses are based squarely on the above funda- 
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mental property of the F(X,;,). [Many of these 
tests are discussed elsewhere in NONPARAMETRIC 
STATISTICS; see also GOODNESS OF FIT.] 


Linear systematic statistics 


Mean and standard deviation. A linear system- 
atic statistic is a linear combination of the sample 
order statistics. This article deals with the use of 
linear systematic statistics to estimate location 
parameters (j) and scale parameters (a) from a 
random sample with underlying distribution of 
known form up to unknown p and/or ø. The dis- 
tributions for which linear systematic statistics 
have been considered include the normal, exponen- 
tial, rectangular, gamma, logistic, and extreme- 
value distributions and others of only theoretical 
interest, such as the right triangular. The choice 
of the coefficients of a linear systematic statistic 
should be optimal in some sense—for example, in 
terms of bias, sampling variance, and computa- 
tional convenience. 

The expectation of X,,, can be expressed as a 
linear function of u and ø, and the variance and 
covariance of the X,,, can be computed up to a 
scalar constant. Using generalized least squares, 
E. H. Lloyd in 1952 showed how to find the mini- 
mum variance unbiased estimators among all linear 
combinations. Linear combinations can also be con- 
structed if only a portion of the sample order sta- 
tistics are used, either from necessity or for con- 
venience (the use of only certain order statistics 
corresponds to the requirement that certain coeffi- 
cients of the linear combination be zero). The 
following general notation for linear systematic 
statistics will be used: 


n 
w= Dak, 


n 
o= LaxXw, 
jet 
where the a’s are the coefficients. 

For the normal distribution Daniel Teichroew in 
1956 calculated with ten-decimal precision the ex- 
pected values of the order statistics and of the 
products of pairs of order statistics for samples of 
size 20 and under. These were used by Sarhan and 
Greenberg in 1956 to compute variances and co- 
variances of the order statistics and to derive tables 
of optimal coefficients a,; and si. 

Censored samples. For complete samples this 
procedure does not represent much of an achieve- 
ment, although the loss in efficiency in comparison 
to optimal unbiased minimum variance estimators 
is often negligible. When some of the sample ob- 
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servations are censored, however, the tables become 
almost indispensable. [See STATISTICAL ANALYSIS, 
SPECIAL PROBLEMS OF, article on TRUNCATION AND 
CENSORSHIP.] Observations may be censored, usu- 
ally at the extremes, because the errors of measure- 
ment at the extremes are greater than those in the 
central portion of the distribution or because it is 
difficult and/or costly to determine their exact mag- 
nitude. If censorship is by a fixed point on the 
abscissa (regardless of whether this point is known 
to the investigator), the censorship is referred to as 
Type 1. If a certain predetermined percentage of 
the observations is censored, the censorship is re- 
ferred to as trimming or as Type 1 censoring. 
(F. N. David and Norman L. Johnson in 1956 
used the terms “type B” and “type A” to denote 
Type 1 and Type 1 censoring. Another sort of cen- 
soring is known as Winsorization, discussed below 
in relation to the rejection of outlying observations, ) 
Censorship and trimming may be on one or both 
ends of the ordered sample, 

If r, and r, observations are censored from the 
left-hand and right-hand sides, respectively, esti- 
mates of the parameters can be based upon the 
values of the n—r,—r, observations remaining 
after censoring, together with the corresponding co- 
efficients a, and a’,. Tables of the coefficients for 
the best linear estimators under Type 11 censoring 
are given by Sarhan and Greenberg (1962). They 
also include tables of variances (and covariances ) 
of the estimators and their efficiencies relative to 
the best linear estimators using complete samples, 
These tables are valuable because such a use of 
order statistics gives the best possible estimation 
among the class of linear Statistics for samples up 
to size 20. Another use of the tables is for the rapid 
appraisal of patterns of relative information in 
each order statistic, 

An example. Students made measurements 
(shown in Table 2) of strontium-90 concentration 
in samples of a test milk in which 9.22 micromicro- 
curies per liter was the known correct value. One 


Table 2 — Measurements of strontium-90 concentration 


Ordered observations a, 23, 
jet 0 o 
2a E7 1884 ~.4034 
3 82 1036 —1074 
a BA 1040 ~.0616 
ee £104) —.0201 
6 98 £1041 .0201 
7 99 -1040 0616 
8. 10.5 1036 1074 
% 113 1884 4034 

10 ie o o 


observation was censored at each side because of 
suspected unreliability. The resulting estimates are 
= 9.27 (with variance .10430?) and o* =2.05 
(with variance -08240*). If the two censored ob- 
servations had been trimmed from one side rather 
than symmetrically, the efficiency in esti mating the 
mean decreases from 95,9 per cent to 93.0 per cent, 
but that for estimating the standard deviation in- 
creases slightly, from 69.9 per cent to 70.9 per cent, 

Other linear systematic statistics. In samples 
larger than size 20, the calculations for optimal 
linear systematic statistics are cumbersome, and 
tables of coefficients are not available. Alternative 
nearly best and nearly unbiased estimators were 
Proposed by A. K. Gupta in 1952, Sarhan and 
Greenberg in 1956, Gunnar Blom in 1958, and 
Sarndal in 1962. These alternatives have coef- 
ficients that are easier to compute than a’, and a',. 


Estimators from less than available sample 


The linear systematic statistics discussed above 
were based on all the order statistics. or on all 
those available after censorship. Optimality was 
Stressed in terms of minimum variance, The pres- 
ent section considers simpler linear systematic sta- 
tistics based on less than all available order statis- 
tics. They are simpler in the sense that many of the 
Coefficients are zero and/or the nonzero coefficients 
take only a few, convenient values. 

Small samples. Simplified estimators in small 
samples will be considered for several different 
distributions. 

Normal distribution. Wilfrid J. Dixon in 1957 
Suggested simple estimators of the mean of a nor- 
mal distribution that are highly efficient for small 
samples. Two of these were: 

(1) The mean of two optimally selected order 
Statistics; 

(2) Xan = (/(n—2)) Xa, that is, the 
mean of all order statistics except the largest and 
smallest, 

Estimator (1) is, more specifically, the average 
of the order statistics with indices ([.27n] + 1) and 
([.73n] + 1), where [k] is the largest integer in k. 
Its asymptotic efficiency, relative to the arithmetic 
mean of the sample, is 81.0 per cent, and in small 
samples its efficiency, measured in the same way, 
is never below this figure and rapidly approaches 
100 per cent as n declines toward 2. (Asymptotic 
estimation is further discussed below.) When n is 
a multiple of 100, computation of the 27th and 
73d percentiles runs into the same problem of 
indeterminacy as does computation of the median 
of an even-numbered sample. As explained in the 
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definition of quantiles above, an arbitrary working 
rule for the 27th percentile is 3[X( orn) + Xion]. 

The 27th and 73d percentiles as used here are 
relevant to the problem of optimum groupings dis- 
cussed below for large samples in the univariate 
and bivariate normal distributions. Their applica- 
tion to a problem in educational psychology was 
originally pointed out by Kelley (1939), seeking to 
select the upper and lower group of persons with 
a normally distributed score for the validation of 
test items. He recommended selection of 27 per 
cent at each extreme to be reasonably certain that 
the groups were different in respect to the trait. 
Cureton (1957), using the same method, and 
David R. Cox in 1957 noted that one-third at each 
extreme is optimal when the distribution is rec- 
tangular. 

The second estimator has higher efficiency than 
the first for n>5, and its efficiency asymptoti- 
cally approaches unity. Its chief advantage comes 
about when there is a possible outlier (discussed 
below). 

Dixon in 1957 also considered linear estimators 
of the standard deviation for the normal distribu- 
tion. These estimators are better than, although re- 
lated to, the sample range. One such estimator is 
based upon the sum of that set of subranges 
Way = Xin-ir) — Xu that gives minimum variance. 
For a sample of size 10, for example, the unbiased 
estimator of the standard deviation based upon the 
range is .325W,,, and has an efficiency of 85.0 
per cent relative to the unbiased complete sample 
standard deviation. Dixon’s improved estimator is 
.1968(W,,) +W), and its efficiency is 96.4 per 
cent. 

Exponential distributions. The one-parameter 
exponential distribution, with density 

f(x) = 1 evs, x>0,0>0, 
is most useful in life-testing situations, where each 
observation is automatically an ordered waiting 
time to failure or death. Benjamin Epstein and 
Milton Sobel in 1953 showed that the maximum 
likelihood estimator of o based only on the first 
r order statistics is 


o = Xo t Xt + Xin + (N= T)Xin 
n= = 


and that 27ré,,,/o is distributed as chi-square with 
2r degrees of freedom. 

This estimator is a multiple of a simple weighted 
average of the first r uncensored observations. For 
other conditions of censoring, the best linear es- 
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timator o* described by Sarhan and Greenberg in 
1957 may be used. Other simple estimators include 
those of Harter (1961) and Kulldorff (1963a). 

For the two-parameter exponential distribution 
(with unknown left end point of range, a) Epstein 
and Sobel in 1954 considered best linear unbiased 
estimators a° and g° based again upon the first r 
order statistics. These are 


wu = Xe = Key = 1 = Xie = (=F Xon 
mr—1) 
and 


ot = OM DX + Xa + Key H Xe + (nar 1)Xe 
r=1 4 


Still further simplified estimators for both the 
one-parameter and two-parameter exponential dis- 
tributions have been considered; they use the 
optimal k out of n sample order statistics. Tables 
and instructions for the use of these can be found 
in Sarhan, Greenberg, and Ogawa (1963), Kull- 
dorff (1963b), and Saleh and Ali (1966). 

Large samples. In a large sample, unbiased 
estimators of location and scale with high asymp- 
totic efficiency may be derived by selecting k suit- 
ably spaced order statistics, where k is considerably 
less than n. The spacing of the quantiles à, 
A2, ***, A, which produces an estimator with maxi- 
mum efficiency is termed an optimum spacing. 

The problem of optimum spacing of sample 
quantiles for purposes of estimation is related to 
asymptotically optimum grouping of observations 
for convenience in exposition or for purposes of 
contrast. Cox in 1957 considered optimum group- 
ing for the normal distribution and obtained the 
same set of quantiles as Ogawa found in 1951 for 
optimum spacings. Kulldorff in 1958 derived op- 
timal groupings for the exponential distribution, 
and the results were identical with those sample 
quantiles derived by Sarhan, Greenberg, and Ogawa 
(1963) for optimum spacings. The optimum spac- 
ing for estimation in the rectangular distribution, 
however, is obtained from the extreme observa- 
tions, whereas optimal grouping requires equal 
frequencies. 

Normal distribution. For the mean, Mosteller 
in 1946 considered the estimator 


i k=1,2,0r3, 
i LSN, <i <M Sn. 


The integers n; are determined by k fixed numbers, 
Mst, Ap Such that O<, <- <À <1, and 
ni = [nai] +1. For k=3, for example, Mosteller 
obtained A, = .1826, A» = .5000, and A; = .8174, with 
asymptotic efficiency equal to 87.9 per cent. A dif- 
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ferent set of quantiles and a weighted average of 
them was selected by Ogawa in 1951 to minimize 
asymptotic variance, Tables for Ogawa’s estimators 
are given in Sarhan and Greenberg (1962). 
Estimation of both mean and standard deviation 
is a more complex problem; it has been solved only 
for symmetric spacing with k = 2, in which case 
the quantiles selected are \, = .134 and A, = .866, 
Exponential distribution. For the one-param- 
eter exponential distribution, the estimators of the 


form 
k 


= x b; X inp 
have been obtained with high maximum asymptotic 
relative efficiency for k = 1(1)15 by Sarhan, Green- 
berg, and Ogawa (1963). Kulldorff (1963a) has 
also done this, with greater arithmetic precision. 
As an example, for k = 3, 


o° = 4477X tsen + 2266X t.s29n)e1) 
+ .0776X ¢¢.scssni619 


has asymptotic relative efficiency .89. 

Bivariate normal distribution. An estimator of 
the correlation coefficient, p, was devised by Mostel- 
ler in 1946 using the ranks of 2n < N observations. 
The procedure is to order the N observations on the 
x coordinate and to distribute the n largest and n 
smallest of them into an upper and a lower set 
based upon the y coordinate, The dividing line be- 
tween the upper and lower sets is the median of 
the 2n observations on the y coordinate when the 
means and variances are unknown. Using the num- 
ber of cases in the resultant four corners, an esti- 
mate of p can be obtained with the aid of a graph 
provided by Mosteller for varying levels of n/N. 
The optimal value of n/N is approximately 27 per 
cent, in which case the method has an efficiency 
slightly in excess of 50 per cent in comparison to 
the corresponding Pearson correlation coefficient. 
This method has merit where data are on punch 
cards and machines can be used for rapid sorting. 
By using the upper and lower 27 per cent of a 
sample, a similar adaptation can be made when 
fitting straight lines by regression, especially when 
both variables are subject to error (see Cureton 
1966). 


Outlying observations 


An observation with an abnormally large resid- 
ual (deviation from its estimated expectation) is 
called an outlier; it can arise either because of 
large inherent variability or because of a spurious 
measurement. A rule to reject outliers should be 
considered an insurance policy and not really a 


test of significance, The first attempt to develop a 
rejection criterion was suggested by C. S. Peirce in 
1852 while he was studying observations from 
astronomy. 

C. P. Winsor in 1941 proposed a procedure that 
now bears his name: a suspected outlier should 
not be rejected completely, but its original value 
should be replaced by the nearest value of an ob- 
servation that is not suspect. For the normal dis- 
tribution the symmetrically Winsorized mean is 
somewhat similar to the second Dixon estimator 
discussed above (the mean of all order Statistics 
except the largest and smallest), but the former 
shows only a small loss of efficiency and is more 
stable than the latter (see Tukey 1962). The Dixon 
estimator was proposed for a different purpose but 
has been used when an outlying observation was 
suspect. In evaluating the utility of either of these 
procedures, or of any other, one must consider the 
probability of falsely rejecting a valid observation 
as well as the bias caused by retaining a spurious 
item. The usefulness of rejection criteria should be 
measured in terms of the residual error variance. 
[These problems are discussed in more detail in 
STATISTICAL ANALYSIS, SPECIAL PROBLEMS OF, 
article on OUTLIERS.] A few examples will be given 
here to illustrate rejection rules when it is sus- 
pected that the observation has an error in location 
and when the underlying distribution is normal 
with mean p and variance o°. 

Case of ø known, To test whether X, is an 
outlier when is known, use can be made of 


B,=Am=K, 
ea 


whereas when « is unknown, one can use 


ewe 
: oe 

or 
B, = Xm = Xow | 
E o 


For detecting an outlier, the performance of B, is 
better than that of B, (and better than those of 
others not listed here), although B, is easier to 
compute, Tables for using B, can be found in Pear- 
son and Hartley (1954, vol. 1, 1966 edition). 

Case of o unknown. If an independent esti- 
mator of o is available from outside the sample 
in question, one can substitute it in B, to obtain a 
modified criterion. Tables of upper percentage 
points for this externally Studentized extreme devi- 
ate from the sample mean were prepared by David 
in 1956, 

When no estimator of o is available except from 
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the observed sample, two of the appropriate tests 
are 


Xm- R * 
B, = —“4— > where = aa 
s n ia 
and 
-Xm Xan 
E Xoca 


An example. Given five observations, 23, 29, 
31, 44, and 63, is the largest observation a spurious 
one? 


63 — 38 25 
B= = 1.75, 
Vi0l6e/5 203.2 

63-44 i lon 


t= Gye a 

From the tables prepared by Frank E. Grubbs in 
1950 for B, with n = 5, the 90th upper percentage 
point is 1.791, and the 95th upper percentage point 
is 1.869. Thus, the value of 63 gives a result above 
the 10 per cent level and would not be rejected by 
B, at ordinary levels. From the table prepared by 
Dixon in 1951 for Tı, the sample significance level 
is again seen to be between 10 per cent and 20 per 
cent, so 63 would again not usually be rejected, 
and the suspect value would be retained. 

Multiple outliers. Tests can also be constructed 
for multiple outliers, and the efficiency of various 
procedures can be compared. Excellent reviews of 
this field were given by Anscombe (1960) and by 
Dixon in 1962. 


Tests of significance 


Counterparts to the standard tests of significance 
can be derived by substituting the median, mid- 
Tange, or quasi midrange for the mean and by 
using the range or subranges in lieu of the sample 
standard deviation. Such tests will usually be lower 
in power, but in small samples the differences may 
be negligible. 

Hypotheses on location. The test criterion for 
the difference between the location measure of a 
sample and a hypothetical population value based 
upon substitution of the range in the standard 
t-test, published by Joseph F. Daly in 1946 and by 
E. Lord in 1947, is 


An example. Walsh (1949) poses the problem 
of testing whether the mean (< = 1.05) of the fol- 
lowing sample of 10 observations differs signifi- 
cantly from the hypothesized population mean of 0. 
Ordered observations are —1.2, —1.1, —.2, .1, .7, 
1.3, 1.8, 2.0, 3.4, and 3.7. 


1.05 —0 
(3.7) — (-1.2) 49 
As tabulated by Lord, the critical values for a 
two-tailed test are 10 per cent = .186 and 5 per 
cent = .230, and therefore the hypothetical mean 
would not be rejected at the 5 per cent level. 
Difference between means of two samples. Lord 
in 1947 devised the following test criterion for the 
difference between the means of two samples of 
size n from a normal population: 


|\X-Y| 4 
Xin Xw + Yom Yo 


An example. The problem is to test whether 
two samples, shown in Table 3, have a common 


= 105 Z 214. 


G= 


G, = 


Table 3 
X-sample Y-sample 
27.6 43.3 
35.5 48.7 
45.0 53.6 
46.7 56.5 
47.6 63.9 


population mean, assuming a normal distribution 
and equal but unknown variances. These data give 
ž = 40.5, J = 53.2, and 


Mt? 40.5 — 53.2| 

* = 747.6 — 27.6) + (63.9 — 43.3) 
me oe 12:7 
= 20.0 + 20.6 


The critical value for 5 per cent is .307; for 1 per 
cent it is .448. The null hypothesis of equality of 
the means is rejected at the 5 per cent level. 

Equivalent tests were devised by J. Edward Jack- 
son and Eleanor L. Ross in 1955 without the 
restriction that the two sample sizes be the same. 

Tests on variances. One-sample, two-sample, 
and k-sample tests of variances and analysis of 
variance can be performed with the use of ranges 
and subranges. A good reference on this subject is 
Chapter 7 by David in Sarhan and Greenberg 
(1962). 


G 


= .313. 


BERNARD G, GREENBERG 
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m 
RUNS 


A “run” is a sequence of events of one type oc- 
curring together in some ordered sequence of 
events. For example, if a coin is spun 20 times and 
the results are heads (H) or tails (T) in the fol- 
lowing order: 


HHTTTHTHHHHTTTHTTHTT, 
there are 10 runs in the sequence, as indicated by 
the lines shown. If the tossings occur at random, 
it is clearly more reasonable to expect the number 


of runs occurring in the above configuration than 
those in the configuration 


HHHHHHHHHTTTTTTTT 


which has the same number of heads and tails but 
only two runs. In dealing with runs as a statistical 
Phenomenon, two general cases can be distin- 
guished. In the first, the number of heads and the 
number of tails are fixed and interest is centered 
on the distribution of the number of runs, given 
the number of heads and tails. In the second, the 
number of heads and the number of tails them- 
selves are random variables, with only the total 
number of spins fixed. The distribution of the num- 
ber of runs in this case will be different. 

A study of runs can assist one in deciding upon 
the randomness or nonrandomness of temporal ar- 
rangements of observations. Runs can also be used 
to form certain nonparametric or distribution-free 
tests of hypotheses that are usually tested in other 
ways. Examples of these uses are given below. 

Simple runs. Interest in runs can be traced 
back to Abraham de Moivre’s Doctrine of Chances, 
first published in 1781. Whitworth (1867) devotes 
some space to runs, and Karl Pearson (1897) dis- 
cusses in an interesting manner the runs of colors 
and numbers occurring in roulette plays at Monte 
Carlo. His analysis is, in fact, faulty in certain re- 
Spects, a point that was picked up by Mood (1940). 
(Some errors in Mood’s own paper have been given 
by Krishna Iyer [1948].) In an unusual article on 
runs, Solterer (1941) examines the folklore which 
holds that accidents or tragedies occur in triplets 
by analyzing the dates of death of 597 Jesuit 
Priests in the United States for the period 1900- 
1939. Using a simple run test, he shows that some 
grouping of deaths does, in fact, occur. Wallis and 
Moore (1941a; 1943) give further illustrations of 


les 
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the uses to which runs have been put. Many writers 
treat the following basic problem: 

Given r, elements of one kind and r, elements of 
a second kind, with the r= r, +r, elements ar- 
ranged on a line at random, what is the probability 
distribution of the total number of runs? The 
general answer is a bit complex in expression, but 
the first two moments of d, the number of runs, 
may be simply written out: 


(1) Expected value of d= E(d) = 20" 41, 
and 

Variance of d 
(2) 2r.r.(2r72— T), 


= var (d) = o°(d) = AmI 
Furthermore, for large values of r, and r» the dis- 
tribution of d tends to normality. This provides a 
ready and straightforward means of carrying out 
significance tests for the random arrangement of 
the r elements. 

The power of the test will be good either when 
elements form rather too few groups—that is, when 
there is strong positive association of the elements 
—or when there are too many groups because of 
a disposition for the elements to alternate. The 
test may be applied to quantitative observations by 
dichotomizing them as above or below the sample 
median (or some other convenient quantile), as in 
the following example. 

Example—lake levels. The data for this ex- 
ample are given in Table 1 and consist of the 
highest monthly mean level of Lake Michigan- 
Huron for each of the 96 years from 1860 to 1955 
inclusive, Inspection of the figures shows that the 
median height was 581.3 feet. Each level below 
this figure is marked B, and each height equal to 
or above this figure is marked A. Thus, for these 
data, r= 96, r, = 49, and r; = 47, A dichotomiza- 
tion just a bit off the sample median (to avoid 
boundary problems) has been used. Although a 
dichotomization near the sample median (as op- 
posed to some theoretical cutting point) was used 
to define the categories A and B, the null distri- 
bution of d is unaffected. The number of runs is 
counted and found to be 15. From (1) and (2) 
it is found that E(d) = 49 and o(d) = 4.87. Test- 
ing the observed number of runs against the ex- 
pected, the quantity 


15+0.5-49 __ 
ASI ie othe He 


is referred to the unit normal distribution. (The 
0.5 above is the so-called continuity correction; 
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see Wallis & Roberts 1956, pp. 372-375.) The 
probability of observing a value of —6.87 or smaller 
by chance, if the arrangement of A’s and B’s is 
random, is well below 0.00001, and hence it can 
be concluded that the observations are not ran- 
domly ordered. In other words, the lake has tended 
to be high for several years at a time and then 
low for several years. In practical terms, this means 
that an estimate of next year’s highest level will 
usually be closer to this year’s level than to the 
96-year average. 

Example—anthropological interval sift. The an- 
thropological interval sift technique is described in 
a paper by Naroll and D’Andrade (1963). Traits 
often diffuse between neighboring cultures, form- 
ing clusters of neighbors with like traits. A sifting 
method seeks to sift out a cross-cultural sample, 
so that, ideally, from each geographical cluster 
of neighbors with like traits only one example will 
be considered in the cross-cultural sample, The 
object of the test is to see whether the sift has been 
successful in removing the correlation effects of 
neighboring societies with like traits. 

The method is applied to narrow strips of the 
globe 600 nautical miles wide and thousands of 
miles long. From a random start, equal intervals 
are marked off along the length of the strip every 
so many miles. The first society encountered after 
each mark is included in the sample. The interval 
chosen must be large enough to ensure that neigh- 
boring members of a single diffusion patch are not 
included any more frequently than would be pro- 
duced in a random geographic distribution. Every- 
thing depends upon the interval chosen, and a run 
test is used to decide whether the interval used is 
suitable. 

For this purpose the 40 ordered societies ob- 
tained in the sample were dichotomized on each 
of four characteristics, For example, the first di- 
chotomization was according to whether or not the 
society had a residence rule of a type theoretically 
associated with bilateral or bilineal kinship sys- 
tems (bilocal, neolocal, uxorineolocal, or uxoribi- 
local), Although the values of the r; in this example 
may seem to be random, it is reasonable to regard 
them as fixed and to apply the appropriate tests 
given the 7;. Hence, for each of the four charac- 
teristics looked at, the run test was applied, with 
the results shown in Table 2. Because of the rela- 
tionships between the four characteristics, strong 
dependences exist among them; this is the reason 
for the identity, or near identity, of the results for 
characteristics 1 and 3, and for 2 and 4. Thus the 
four tests are appreciably dependent in the statis- 
tical sense. 
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Table 1 — Lake Michigan—Huron, highest monthly mean level for each calendar year, 1860-1955 


Level Two 

Year in feet" _categories* Change‘ 
1860 583.3 A 

1861 583.5 A + 
1862 583.2 A - 
1863 582.6 A - 
1864 582.2 A - 
1865 582.1 A - 
1866 581.7 A = 
1867 582.2 A + 
1868 581.6 A -= 
1869 582.1 A + 
1870 582.7 A + 
1871 582.8 A + 
1872 581.5 A - 
1873 582.2 A + 
1874 582.3 A + 
1875 582.1 A - 
1876 583.6 A + 
1877 582.7 A - 
1878 582.5 A - 
1879 581.5 A - 
1880 582.1 A + 
1881 582.2 A + 
1882 582.6 A + 
1883 583.3 A + 
1884 583.1 A = 
1885 583.3 A + 
1886 583.7 A + 
1887 582.9 A - 
1888 582.3 A - 
1889 581.8 A - 
1890 581.6 A - 
1891 580.9 B - 
1892 581.0 8 + 
1893 581.3 A + 
1894 581.4 A + 
1895 580.2 B - 
1896 580.0 B - 
1897 580.85 B + 
1898 580.83 B a 
1899 581.1 B + 
1900 580.7 B a 
1901 581.1 B + 
1902 580.83 B - 
1903 580.82 B "a 
1904 581.5 A + 
1905 581.6 A + 
1906 581.5 A = 
1907 581.6 A + 
1908 581.8 A + 
1909 581.1 B = 


a. Data for certain years are shown to two decimals to avoid ties, 
b. 81.3 feet or more, 

ss than 581.3 feet. 

€. This column will be used in a later example. 


In all four cases the observed number of runs 
was slightly below the expected number. Applica- 
tion of the test described above showed, how- 
ever, that there was no statistically significant 
evidence of overclustering and, hence, no statisti- 


Level Two 
Yeor categories” Change® 
1910 580.5 8 - 
1911 580.0 B -= 
1912 580.7 B + 
1913 581.3 A + 
1914 580.7 B - 
1915 580.0 B - 
1916 581.1 B + 
1917 581.87 A + 
1918 581.91 A + 
1919 581.3 A - 
1920 581.0 B - 
1921 580.5 B - 
1922 580.6 B + 
1923 579.8 B - 
1924 579.6 8 = 
1925 578.49 B -= 
1926 578.49 B o 
1927 579.6 B + 
1928 580.6 B + 
1929 582.3 A + 
1930 581.2 8 - 
1931 579.1 8 - 
1932 578.6 8 ~ 
1933 578.7 8 + 
1934 578.0 B ~ 
1935 578.6 B + 
1936 578.7 B + 
1937 578.6 8 E 
1938 579.7 8 + 
1939 580.0 B + 
1940 579.3 B m 
1941 579.0 B = 
1942 580.2 B + 
1943 581.5 A + 
1944 580.8 B = 
1945 581.00 B + 
1946 580.96 B - 
1947 581.1 8 + 
1948 580.8 B - 
1949 579.7 B - 
1950 580.0 B + 
1951 581.6 A + 
1952 582.7 A n 
1953 582.1 A = 
1954 581.7 A = 
1955 581.5 A co 


Source: Wallis & Roberts 1956, p. 566. 


cally significant evidence of diffusion. Thus the 
sift had been successful in removing the statisti- 
cally significant effects of diffusion. Of course, 
considerations of power would be necessary for a 
full analysis. 
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Table 2 
Sample significance 
Characteristic d n r level* 
1 9 34 6 0.137 
2 19 18 22 0.337 
3 9 34 6 0.137 
4 19 22 18 0.337 
* Probability of obtaining the observed number of runs or a more extreme 
result, If the two kinds of sign were arranged at random. 


Source: Noroll & D’Andrade 1963, p. 1061. 
Reproduced by permission of the 
American Anthropological Association. 


Runs with more than two categories. The fore- 
going can be extended to the situation where the 
r elements are subdivided into more than two types 
of element, say k types, so that there are r, ele- 
ments of type 1, r+ of type 2, and so on up to Tk 
of type k where }),r; = r. When the r elements are 
arranged at random, the first two moments of the 
number of runs, d, are now 


k 
(3) E(d) =r-4)% 74(t)—1), 
and p 
var (d) = o°(d) 


Mo renon A Fh 
=r- 1) r(r-1) r(r=1) 


where F, = Dyrj(ry— 1), Fs = Xr; — 1)(r; — 2). 
These formulas, provided that r and the r; are all 
reasonably large, permit tests of significance for 
random groupings to be carried out. 
Example—runs with multiple types. The ex- 
ample of runs with multiple types presented here 
is based on the work of E. S. Pearson and is quoted 
in David and Barton (1957). The data relate to 
the falls in the prices of shares on the London 
Stock Exchange during the Suez crisis period, 
November 6, 1956-December 8, 1956, inclusive. 
Five types of industrial activity were considered: 
A, insurance; B, breweries and distilleries; C, elec- 
trical and radio; D, motor and aircraft; E, oil. The 
closing prices as given in The Times (of London) 
for 18 businesses of each type were taken, and 
Table 3 shows, for each day, the type of industrial 
activity for which the greatest number of the 18 
showed a fall in price from the previous day. In 
the few cases where there were equal numbers 


Table 3 
Nov. 6—A Nov, 13—B Nov. 20—E Nov.27—8 Dec. 4—C 
Nov. 7—A Nov. 14—C Nov.21—C Nov.28—E Dec. 5—C 
Nov. 8—D Nov.15—C Nov.22—E Nov.29—A Dec. 6—D 
Nov. 9=D Nov. 16—C Nov.23—E Nov.30—E Dec. 7—C 


Nov. 10—A Nov. 17—E Nov.24—E Dec. I-E Dec. 8—8 
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for two types, that type which also showed the 
fewer rises in price was taken. For the data shown 
in Table 3, r = 25 with r, = 4, Ts = 3, tc = 7, To = 3, 
Te = 8. There are 16 runs (d). From (3) and (4) 
above, the mean and standard deviation of d, given 
a random grouping of the letters, are E(d) = 20.12, 
o(d) = 1.24. The probability of getting the ob- 
served value of d or a lower one by chance if the 
grouping is random is therefore found by referring 
the quantity 


16 + 0.5 — 20.12 _ 
re e 


to a unit normal distribution. The appropriate prob- 
ability is 0.001, and it is therefore concluded that 
the test is picking out the fact that during the 
Suez crisis there was some persistence from day 
to day in the way in which different classes of 
shares were affected. 

Runs in a circle. A modification of the type 
of run described above occurs when the line is bent 
into a circle so that there is no formal starting and 
ending point. If each configuration around the 
circle is equally probable, then it is possible to 
find the distribution of the total number of runs, d 
(see, for example, David & Barton 1962, p. 94, or 
Stevens 1939). The first two moments are 


Tifi 
(5) E(a) = 2. 
(6) var (d) = 0°(d) = aee Se D 


For small r, say, up to 20, the exact probabilities 
need to be enumerated to carry out significance 
tests (see, for example, Walsh 1962-1965), but 
for larger values of r the distribution can be as- 
sumed to be normal with the appropriate parame- 
ters. It should also be noted that in this analysis 
it is assumed that the circle cannot be turned over, 
as would be the case in considering the number 
of runs among the beads threaded on a bracelet. 

Random totals. In the foregoing analyses it is 
assumed that the total composition of the sequence 
is known, which is to imply that the r; are known 
quantities. The probability distribution and mo- 
ments are thus conditional upon r; being known. 
It is possible to take the argument a stage further 
and to consider the situation if the r; are obtained 
as a result of some sort of sampling and are thus 
themselves random variables. For purposes of illus- 
tration, suppose that there are two kinds of ele- 
ments and that the sampling is of a binomial form 
with the chance of an element being of the first 
kind as p and the chance of its being of a second 
kind as q (where p +q = 1), the elements being 
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independent of one another. Then it can be shown 
that 


(7) E(d) =1+2(r—1)pq 
and 
var (d) = o?(d) 
= pq(4r — 6) — p?q?(127r — 20). 


A test based on the above moments might be used 
in a quality control situation where a previously 
estimated proportion, p, of articles are expected to 
have some characteristic. Consecutive items are 
examined and the number of runs, d, found in r 
items are counted in order to see whether the 
machine concerned is producing a random order- 
ing of quality or not [see QUALITY CONTROL, STA- 
TISTICAL]. 

Runs up and down. Given a sequence of num- 
bers, a sign, either + or —, can be attached to each 
number other than the first, the sign being + if 
the current number is above the previous number 
numerically or — if it is below. Using this sequence 
of +’s and —’s, one can count the number of runs 
(d’). If the original sequence of numbers was such 
that long sequences of roughly equal numbers 
were placed together, the number of runs as de- 
fined here would be large, since there would be 
a considerable element of chance as to whether 
two adjacent numbers were in ascending or de- 
scending order of magnitude, If there is, however, 
a gentle rise followed by a gentle fall in the original 
sequence, perhaps repeated several times, then 
the number of runs will be much smaller. Hence, 
the number of runs calculated in this manner is 
a measure of persistence of the trend. The prob- 
ability distribution is a complicated one (details 
are given in David & Barton 1962, p. 154), but 
the first two moments of it can be evaluated fairly 
readily and give 


(8) 


(9) E(d’) = an = 1 
and 
(10) var (d) = o(a) = 182, 


where n is the total number of original observa- 
tions, giving n— 1 signs in the sequence. Again 
it may be shown that for large values of n, the 
distribution tends to normality. 

Example—runs up and down. The data in 
Table 1 is used again, only in this example the 
aim is to see whether the directions of movement 
in water level tend to persist. It would be Possible 
for high and low values to cluster together simply 


as a result of a few large changes, with changes 
of direction within the clusters varying as for in- 
dependent observations. 

To carry out a suitable test of the hypothesis 
that there is no persistence in the directions of 
the measurements, a sign was inserted against the 
maximum level for each year except the first. The 
sign is + if the level was higher than for the pre- 
ceding year, — if it was lower, 0 if it was the same. 
(In this case the total number of runs, d’, would 
be the same whether the 0 was considered as + or 
as —.) The longest movements in one direction 
were the two five-year declines from 1861 to 1866 
and from 1886 to 1891. There were two four-year 
movements, the rise from 1879 to 1883 and the 
decline from 1922 to 1926. 

The total number of runs, d’, was 48. On the 
hypothesis that the + and — signs were arranged 
at random, the distribution of d’ is approxi- 
mately normal with moments from (9) and (10): 
E(d') = 63.67 and o(d’) =4.10. Testing the ob- 
served against the expected value of d' the quantity 


48 + 0.5 — 63,67 _ 
4,10 ease: 


is referred to a unit normal distribution, A one- 
tailed probability is required, since the alternative 
hypothesis is the one-sided one that there will be 
fewer runs than the null hypothesis indicates. The 
probability of a value of —3.7 or smaller arising 
by chance if there were no real persistence of 
movement in the same direction would be less 
than 0.001. Thus, the lake evidently has a tend- 
ency to move consecutively in the same direction 
more often than would be the case with independ- 
ent observations. The clustering of high and low 
values, therefore, is not due (at least not exclu- 
sively) to a few large changes but in some part 
to cumulative movements up and down. 

Test for two populations. Let x, x2, °-*, Xn, be 
an ordered sample from a population with proba- 
bility density f(x), and let y,, Yas "tts Yn, be a 
second ordered sample from the same population. 
Let the two samples be combined and arranged 
in order of magnitude; thus, for example, one 
might have 


Xi SY L Y2 L< Xa LYs LXa LX LEs LY Lt 


To test the null hypothesis that the two samples 
come from the same population, let the two 
samples be combined as above, define a run as a 
sequence of letters of the same kind bounded by 
letters of the other kind, and count the total num- 
ber of runs, d. 

Thus, in the above example the first element 
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forms a run, the second and third a run, the fourth 
a run, the fifth a run, the sixth to eighth a run, 
and so on. The first and last elements are bounded 
only on one side for this purpose so that if only 
the nine elements given here were involved, there 
would be six runs. 

It is apparent that if the two samples are from 
the same population, the x’s and y’s will ordinarily 
be well mixed and d will be large. If the two popu- 
lations are widely separated so that their ranges 
do not overlap at all, the value of d will be only 
two and, in general, differences between the popu- 
lations will tend to produce low values of d. Thus, 
the two populations may have the same mean or 
median, but if the x population is relatively con- 
centrated compared to the y population, there will 
be a long y run at each end of the combined sample 
and there will thus tend to be a low value of d. 

The test is performed by observing the total 
number of runs in the combined sample, accept- 
ing the null hypothesis if d is greater than some 
specified number d,, rejecting the null hypothesis 
otherwise. To do this, the first two moments of the 
null distribution of d, given in (1) and (2) above, 
may be used, This run test, originated by Wald 
and Wolfowitz (1940), is sensitive to differences 
in both shape and location between the two popu- 
lations and is described in more detail in Mood 
(1950), 

Example—two-sample test. This example of 
the two-sample test is taken from Moore (1959). 
Two random samples, each of 14 observations, 
were available, and it was desired to test whether 
they could be considered to have come from the 
same population or not. 


Sample A: 9.1, 9.5, 10.0, 10.4, 10.7, 
9.0, 9.5,/\ 122, ~ 118, — 12.0; 
10:2; TA WC a 


Sample B: 9.4, 11.3, 10.9, 9.4, 9.2, 
8.5, 8.9,.. MAS did, 9.3, 
9.8, 9.6, 8.0, 9.7 


Arranging these in numerically ascending order 
gave the following (where sample B is shown in 
italics); 
8.0, 85, 89, 9.0, 9.1, 9.2, 9.3, 
9.4, 9.4, 9.5, 9.5, 9.6, °9.7, 9.8, 
10.0, 10.2, 10.4, 10.7, 10.9, 11.1, 11.2, 
11.3,~ 1714, 11.5, 11.6, 11.8, 12.0, 12.2 


A straightforward count shows that there are 10 
runs; hence, d=10. From (1) and (2), with 
Tı=r,=14 and r=28, E(d) =15 and var (d) = 


6.74. Testing the null hypothesis that there is no 
difference between the two samples, the quantity 


10+0.5—15 _ 
v6.74 


is referred to the unit normal distribution. The 
probability of that value, or a smaller value, aris- 
ing by chance if there were no difference between 
the two samples is 0.045—that is, the difference 
is significant at the 5 per cent level but not at the 
1 per cent level—and thus some doubt is thrown 
on the hypothesis that the two samples come from 
the same population. A standard two-sample t-test 
for samples from normal populations (see, for 
example, Moore & Edwards 1965, pp. 9-10) gives 
a similar indication. 

Other run tests. Various other procedures based 
on runs have been proposed, a few of which are 
mentioned here. Length of runs as a criterion 
is discussed by Takashima (1955). The use of the 
longest run up or down (that is, numerically ris- 
ing or falling) is discussed by Olmstead (1946), 
who gives a table to facilitate the use of his test. 
Mosteller (1941) gives a test on the use of long 
runs of observations above or below the median 
and discusses its use in the interpretation of 
quality control charts. Mann (1945) discusses a 
test for randomness based on the number of runs 
up and down. In testing goodness of fit, runs of 
signs of cell deviations from the null hypothesis 
have been studied (see, for example, Walsh 1962- 
1965, vol. 1, pp. 450-451, 463-465). 

An extension of runlike concepts is used by 
Campbell, Kruskal, and Wallace (1966) to study 
the distribution of adjacencies between Negro and 
white students seating themselves in a classroom. 

The power of run tests has been investigated by 
a number of writers. For example, David (1947) 
studies the power of grouping tests for randomness. 
Bateman (1948) investigates the power of the 
longest run test. Levene (1952) studies the power 
of tests for randomness based on runs up and 
down. The power of the two-sample tests above 
depends upon the kind of alternative that is being 
considered. In instances where the alternative can 
be precisely specified, there may be other tests, not 
primarily based on runs, that have better power. 

Ties. Except in one of the above examples, 
where the course of action was obvious, the possi- 
bility of ties, that is, a pair of observations being 
precisely equal, has been ignored. Hence, observa- 
tions could always be ordered uniquely without ties 
and the various statistics uniquely defined. How- 
ever, in practice, observations are measured only 
to a few significant figures and ties will therefore 


—1.73 
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sometimes occur. The simplest way to deal with 
this possibility in the situations discussed here is 
to assume that the tied observations are ordered at 
random. The appropriate tests are then carried out 
as before. Discussions of the problems of ties are 
given by Kendall (1948) and Kruskal (1952). 


PETER G. MOORE 
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Iv 
RANKING METHODS 


A ranking is an ordering of individuals or ob- 
jects according to some characteristic of interest. 
If there are n objects, it is natural to assign to 
them the ranks 1, 2, ---, n, with 1 assigned to the 
object ranked highest. In this article it is, however, 
more convenient to employ the opposite and com- 
pletely equivalent convention of assigning rank 1 
to the lowest and rank n to the highest object. 
Ranking methods are concerned with statistics 
(rank-order statistics) constructed from the ranks, 
usually in random samples of observations. An 
ordinal scale of measurement clearly suffices for 
the calculation of such statistics, but ranking 
methods are also frequently used even when mean- 
ingfully numerical measurements are available. In 
cases when such measurements are available the 
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ordered observations may be denoted by xa, < 
Xa) St < Xm, Where xa (i= 1, 2, «+, n) is the 
ith order statistic. Only the rank i of xa) occurs 
jn rank-order statistics; use of Xn itself leads to 
order statistics [see NONPARAMETRIC STATISTICS, 
article on ORDER STATISTICS]. 

Objects which are neighbors in the ranking al- 
ways differ in rank by 1, although it may reason- 
ably be objected that the actual difference between 
neighboring objects is often greater near the ends 
of a ranking than in the middle. This difficulty can 
be overcome by the use of scores, a score being a 
suitably chosen function of the corresponding rank. 
However, the objection turns out to have less force 
than it might appear to have, since in many im- 
portant cases scoring has been found to be little 
more efficient than the simpler ranking. 

It is clear that an unambiguous ranking is not 
possible when ties are present. Strictly speaking, 
ranking methods therefore require the assumption 
of an underlying continuous distribution so as to 
ensure zero probability for the occurrence of ties. 
In applications, ties will nevertheless insist on ap- 
pearing. Commonly, each of the tied values is given 
the average rank of the tied group, a technique 
which leaves the sum of the ranks unchanged but 
reduces their variability, The effect is negligible if 
the number of ties is small, but in any case simple 
corrections can be applied to many of the pro- 
cedures developed on the assumption of no ties. 

Ranking methods are potentially useful in all 
fields of experimentation where measurements can 
be made on at least an ordinal scale. Much of the 
motivation for the development of the subject has 
in fact stemmed from the social sciences. To date, 
testing of hypotheses (for instance, identity of two 
or more populations) has been most fully explored. 
Point and interval estimation are sometimes pos- 
sible, but usually the order statistics are needed in 
addition to the ranks. Ranks are also beginning to 
be used in sequential analysis and multiple-decision 
procedures, Ranking methods are not as flexible as 
parametric procedures developed on the assump- 
tion of normality, but they have a much greater 
range of validity, since they do not generally re- 
quire knowledge of the underlying distribution 
other than its continuity (but see, for example, 
Estimation,” below). Most ranking methods there- 
fore fall under the wider heading of nonparametric 
Statistics and play a major role in that subject. 

When numerical measurements are replaced by 
their ranks, the question arises of how much “in- 
formation” is thereby lost. Perhaps contrary to 
intuition, the loss in efficiency or power of the 
Standard rank tests compared to the best corre- 


sponding parametric tests is usually quite small. 
The loss hinges on several factors: the particular 
test used, the nature of the alternative to the null 
hypothesis under test, and the sample size. Since 
the best parametric test in a given situation de- 
pends on the often uncertain form of the underly- 
ing distribution, one should also compare the per- 
formance of the two tests for other distributional 
forms which may reasonably occur. Whereas the 
significance level of the rank test remains quite 
unchanged, that of the parametric test may be 
seriously upset. Even when the parametric test is 
not too sensitive in this respect it can easily be- 
come inferior in power to the rank test. A more 
obvious point in favor of ranking methods is that 
they are much less affected by spurious or wild 
observations. 

The following account begins with two-sample 
problems and then turns to several-sample situa- 
tions and rank correlation. [Nonparametric one- 
sample procedures, including ranking methods, are 
considered in NONPARAMETRIC STATISTICS, article 
on THE FIELD.] The emphasis is on standard 
tests, but some estimation procedures are described, 
and reference is made to several multiple-decision 
procedures. 


Two-sample problems 

Let Xy, Xa, °°, Xm and Yi, Y2,°''> Yn be inde- 
pendent observations made respectively on the con- 
tinuous random variables X with density f(x) and 
Y with density g(y). Consider the important ques- 
tion: Do the x's differ significantly from the y's; 
that is, do the observations throw serious doubt on 
the null hypothesis that f = g? The statistical test 
most appropriate for answering this question will 
depend on the alternative the experimenter has in 
mind, Often the y's are expected, on the whole, to 
be larger than the v's; that is, if p=Pr(Y>X), 
the null hypothesis Hs: f = g, for which p = 4, is 
tested against the alternative, H,: p> }. A special 
case of this alternative occurs when g(y) is merely 
f(x) shifted to the right, so that the two popula- 
tions differ in location only. Other alternatives of 
interest are ,H: p < 4 and the two-sided alternative 
H.: p#4. 

Note that H, specifies identity of f and g, not 
merely that the distributions have the same median 
(see Pratt 1964) or the same spread, 

Wilcoxon’s two-sample test. Wilcoxon’s two- 
sample test enables one to test H, without knowl- 
edge of the common (under the null hypothesis ) 
functional form of f and g. Arrange the m +n 
observations (w;) in combined ascending order of 
magnitude giving w, <w: <S SW. The Wil- 
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coxon (1945) statistic is the sum, R+, of the ranks 
of those w’s that came from f(x). (For an equiva- 
lent counting procedure see Mann & Whitney 
1947.) If H, is true, each observation has expected 
rank given by [1+2+-:++(m+n)]/(m+n) 
=}(m +n + 1), so that the expected value of R, is 
4m(m +n +1). When H, is the alternative under 
consideration, values of R, much smaller than this 
lead to the rejection of H, in favor of H,. For a 
two-sided test, H, is rejected when R, differs from 
E(R,) by too much in either direction. Actual sig- 
nificance points are given in many sources, for ex- 
ample, by Siegel and Tukey (1960) for m <n < 20. 
Outside this range a normal approximation, pos- 
sibly with continuity correction, is adequate unless 
m and n differ greatly. Simply treat 


z= [2R; -m(m +n + 1)]/ Vīmn(m +n +1) 


as approximately a unit normal deviate. 

An example. The amount of aggression attrib- 
uted to characters in a film by nine members of 
each of two populations resulted in the following 
scores (Siegel & Tukey 1960): 


x: 25 51419 01715 8 8 
ys 1216 61313 3101011 


These are combined into the w-series, with their 
origin noted as in Table 1. (The modified ranks 
will be discussed below.) Here R, = 1+3+5+6+ 
13+14+16+17+18=93 and E(R,) =3-9-19= 
85.5. Because R, is this close to E(R.), there 
is evidently no reason for rejecting H, . Tables show 
that for a two-sided test R, would have to be as 
small as 62 or, by symmetry, as large as 109 to be 
statistically significant at the 5 per cent level. 


Table 1 — Aggression scores and rankings 


MODIFIED 
SCORE SAMPLE RANK RANK 
0 x 1 1 
3 y 3 4 
5 x 3 5 
6 y 4 8 
8 x 5 9 
8 x ó 12 
10 y 7 13 
10 y 8 16 
un y 9 17 
12 y 10 18 
13 y 1 15 
13 y 12 14 
14 x 13 “n 
15 x 14 10 
16 y 15 7 
17 x 16 6 
19 x 17 3 
25 x 18 2 


Test of relative spread. Although the foregoing 
example was used to illustrate Wilcoxon's test for 
differences in location, the alternative of interest 
was, in fact, that the two populations differ in vari- 
ability. An appropriate test can be made by assign- 
ing in place of the usual ranks the modified ranks 
shown in the last row of Table 1, which are ar- 
ranged so that low ranks are applied to extreme 
observations—both high and low—whereas high 
ranks are applied to the more central observations. 
Clearly, sufficiently low or high values of the sum 
R, of modified ranks will again lead to rejection of 
H,. As before, there are three alternatives—say, 
H;, ,H’, H., where H; specifies that the x’s are 
more spread out than the y's, etc. Precisely the 
same significance points apply as for the ordinary 
Wilcoxon test, since the ranks have merely been 
reallocated to produce a test statistic sensitive to 
the alternative under consideration. (Indeed, other 
alternatives, H”, can be tested by an assignment of 
ranks which tends to make R” small or lar, 
is true. As usual, the alternative should be specified 
prior to the experiment.) 

Here R}=1+5+9+12+11+10+6+4+3+2 
= 59, which is significant at the 5 per cent (1 per 
cent) level against H;,(H‘). 

Unless the two populations have the same me- 
dian or can be changed so that they do, interpreta- 
tion of this test may be difficult. 


Several-sample problems 


One-way classification. Wilcoxon’s procedure can 
be generalized to produce a test of the identity of 
several populations. Given the N observations Xy 
($= 1,2, +++, hk f=1,2,+++,01; En =N), ar- 
range them in common ascending order. Let Ri 
denote the sum of the ranks of those n; observations 
originating from the ith population. If R; = R/ni, 
it is clear that large variations in these mean ranks 
will cast suspicion on the null hypothesis of 
equality of all k populations. A suitably standard- 
ized weighted sum-of-squares test statistic suggests 
itself. Kruskal and Wallis (1952) show that except 
for very small n;, 


pa N-15 mihi -aN +1) 
(1) N (N? — 1)/12 
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is distributed (under the null hypothesis of identity 
of the populations) approximately as chi-square 
with k—1 degrees of freedom. The last form of 
(1) is a convenient computing formula. In the cen- 
tral form it may be noted that }(N + 1) is the 
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mean and (N? — 1)/12 the variance of a randomly 
chosen rank. In fact, the procedure is essentially a 
one-way analysis of variance performed on the 
ranks as variables. 

Two-way classification. Friedman (1937) was 
concerned with data giving the standard deviations 
of expenditures on n = 14 different categories of 
products at k = 7 income levels. The problem was 
to determine whether the standard deviations dif- 
fered significantly over income levels. The “prob- 
lem of n rankings” considered by Kendall and 
Babington Smith (1939) can be handled in the 
same way and the procedure will be illustrated 
with such data. The n = 4 observers were asked to 
rank k =6 objects, with the results shown in 
Table 2. An approximate test of equality among 
the objects is obtained by referring 


H nlf — (k +1) 
(k: — 1)/12 


Tr 
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to tables of chi-square with k — 1 degrees of free- 
dom. A measure of agreement between observers 
is given by the coefficient of concordance, denoted 
by W = x2/[n(k — 1)], defined so as to have range 
(0,1), with W=1 corresponding to complete 
agreement. (In the x? expression r; =T;/n.) 

In this example x? = 32/7 = 4.57, which is 
clearly not significant, and W = 0.23. 

In cases where x? is significantly large, that may 
be because one or more of the objects are “outliers” 
—that is, come from a population different from 
the bulk of the objects. Tables for the detection of 
such outliers have been provided by Doornbos and 
Prins (1958) and by Thompson and Willke (1963) 
[see STATISTICAL ANALYSIS, SPECIAL PROBLEMS OF, 
article on OUTLIERS]. 

J Incomplete rankings. In a two-way classifica- 
tion, it is often impracticable to have each observer 
tank all the objects at one time. For example, if 
the objects are foods to be tasted, the block size 
is best restricted to 2 or 3, but each judge may be 
asked to perform several such tastings with inter- 
vening rest periods. By means of suitable balanced 


Table 2 — Results of four observers ranking six objects 


OBJECT 
A 8 c D E F 
P 5 4 1 6 3 2 
O A 2 3 1 5 6 4 
R 4 1 6 3 2 5 
s 4 3 2 5 1 6 
Tofalar oro a s15 Ai aA N | 17 
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incomplete block designs it is still possible to treat 
this case by very similar methods [see EXPERI- 
MENTAL DESIGN; see also Durbin 1951]. 

The method of paired comparisons. Formally, 
the method of paired comparisons is a special case 
of incomplete ranking where the block size is 2. 
Each of the observers expresses a preference for 
one of the objects in every pair he judges. Ordi- 
narily all possible n(*) pairwise comparisons are 
made, but fractional designs are also available. 

The method has long interested psychologists 
(for “objects” read “stimuli”) and can be traced 
back to G. T. Fechner. It received fresh impetus 
from the work of L. L. Thurstone (1927), who 
supposed, in effect, that a particular observer's 
response to object A; can be characterized by a 
normal variable, Y;, with true mean V;. The prob- 
ability that A; is preferred to A; (A; > A;) in direct 
comparison is then Pr(Y; > Y;). Thurstone distin- 
guished five cases, of which the simplest (case v) 
assumes that (a) observer differences may be 
ignored and that (b) the Y; are independent (ac- 
tually, equal correlation suffices) normal variates 
with common variance. In this situation it is 
easy to estimate the V; and to test their equality 
(see Mosteller 1951; see also SCALING). 

Distributions other than normal have also been 
proposed for the Y;. Postulating a sech? density for 
Y; — Y;, an assumption close to that of normality, 
Bradley and Terry (1952) arrive at the model 


Pr(Y: > Y;) = m/m +m), m 20, am = 1; 


in other words, the odds on A; > A; are m; to 7;- 
The z; can be estimated by maximum likelihood. 
This approach can be extended to comparisons in 
triples and to multiple-choice situations (Luce 
1959). 

Implicit in these various models is the assump- 
tion that one-dimensional scaling of responses is 
appropriate. No such assumption is needed in the 
combinatorial approach of Kendall and Babington 
Smith (1940), who count the number of circu- 
lar triads—that is, the number of times A; > Aj, 
A; > Ax, and yet A,> Ai. A sufficiently low count 
leads to rejection of the null hypothesis, Ho, of 
equality among the objects. The following pro- 
cedure (David 1963) is completely equivalent for 
a single observer and provides a very simple gen- 
eral test. Let a; be the total number of times that 
A; is preferred to other objects. Then, under Ho, 


a Šla; — 4n(k — 1)}?/(4nk) 


is distributed approximately as chi-square with 
k— 1 degrees of freedom. Large values of T lead 
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to rejection of H., in which case it is possible 
to make more detailed statements about differ- 
ences between the objects. The necessary multiple- 
decision procedures are analogous to those devel- 
oped by Tukey and Scheffé for the separation of 
treatment means in the analysis of variance [see 
LINEAR HYPOTHESES, article on MULTIPLE COM- 
PARISONS]. 


Rank correlation 


For n pairs of observations (x; , y;) the product- 
moment coefficient of correlation 


pe = 2G =) 
(Lai = z) — HPF 
provides a convenient index (—1<r<1) of the 
extent to which, as x increases, y increases (r > 0) 
or decreases (r <0) linearly [see MULTIVARIATE 
ANALYSIS, articles on CORRELATION]. 

Spearman's p. If ris applied to the ranks rather 
than to the observations, the result is Spearman's 
coefficient of rank correlation, more readily com- 
puted from 

6d? 


a Cosby 


where d; is the difference in rank between y; and x;. 
Here p = 1 corresponds to complete agreement and 
p= -1 to complete reversal in the two sets of 
ranks, 

An example. Consider the rankings of observers 
P and Q in Table 2. There, d; = 3, 1, 0, 1, —3, —2, 
so that 

6(24) 
6(6?—1) 
indicating only slight agreement between P and Q. 
In fact, a value of Ed? that is 4 or smaller or 66 
or larger would be needed for rejection of the null 
hypothesis, H,, of independent rankings at a 5 per 
cent level of significance (Owen 1962). For n > 12 
an approximate test of H, can be made by treating 
pyn— 1 as unit normal. 

Estimation. If the underlying distribution of 
(x,y) is bivariate normal, with correlation coeffi- 
cient p, an estimate of p, which is unbiased in 
large samples is given by 


To = 2 sin rp = 2 sin (30p)?. 


In the example, r, = 0.33. 

Test of trend. Given the observations Yis Yo, 
“**, Yn, an investigator may wonder whether some 
positional effect or trend is present. For example, 
if y; is the reaction time of a subject to the ith 
presentation of a stimulus, fatigue could tend to 
increase the later observations. A possible test of 


p=1- = 0.31, 


randomness against such an alternative in this 
one-sample situation is to compare the ranking of 
the y; with the natural ordering 1, 2,:--,n, by 
means of p (with d; =rank y;—i). Clearly, a 
strong upward trend would lead to large values of p, 
Jonckheere (1954) treats the corresponding k- 
sample problem. 

Other rank correlation coefficients. The coeffi- 
cient p is a member of a class of correlation coeffi- 
cients which includes r, Kendall's 7, and the Fisher- 
Yates coefficient, ry, obtained by replacing x; and 
yi in r by their normal scores. The last three quan- 
tities are further discussed by Fieller and his asso- 
ciates (1957). In addition to the above applica- 
tions, p, 7, and ry may be used to test the equality 
of the true correlation coefficients in two bivariate 
normal populations and to investigate by means 
of partial rank correlation coefficients whether 
agreement between two rankings might be due to 
some extraneous factor (for example, Goodman 
1959). For interpretations of various rank correla- 
tion coefficients see Kruskal (1958). 


HERBERT A. DAVID 
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NONPARTISANSHIP 


The term “nonpartisanship” has been used in at 
least four different ways, all related to efforts at 
reducing or eliminating party influence in the polit- 
ical process. Political scientists are interested in 
Studying the subject because nonpartisanship in 
any of its forms alters the relative balance of power 
of various segments of society as well as the func- 
tions of various social and political institutions. 
The different meanings of the term include the 
following: 

(1) Bipartisanship. Journalists, public officials, 
and citizens sometimes say “nonpartisan” when a 
Political scientist would say “bipartisan.” They may 
do this, for example, in referring to a commission 
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when the law does not permit the membership to 
consist of members of only one political party, or 
to a particular legislative issue which may have 
been approached without close regard to party 
lines. “Nonpartisanship” in this sense is sometimes 
praised in the United States as “rising above petty 
party concerns” and is viewed by many citizens as 
a preferred approach to public policy making. 

(2) Monopartisanship. The writings of V. O. 
Key, Jr., and other political scientists refer to the 
“quasi-nonpartisanship” of one-party politics which 
is common in the United States, especially in the 
South. In this sense, the term emphasizes the lack 
of a clearly labeled opposition group in a political- 
action situation. 

(3) An elections system based on a particular 
form of the ballot. In many nations, labels indicat- 
ing the party memberships of candidates are not 
placed on the ballot. Nonetheless, elections in these 
political systems are regarded as party matters, the 
ballot is normally short, no primary election is 
used, and the party affiliation of candidates is 
easily determined and widely publicized. In these 
countries, nonpartisanship is not a fanuiliar con- 
cept. In the United States, however, where the bal- 
lot normally contains a great many names and 
party identifications have traditionally been printed 
on the ballot, a “nonpartisan” election commonly 
refers to an election in which party labels are not 
printed on the ballot. 

When the nonpartisan ballot is used, the general 
election is usually preceded by a “primary” election. 
The latter is actually an elimination contest de- 
signed to reduce the number of candidates to twice 
the number of positions to be filled. In some cases, 
the law provides that if any candidate receives 
more than 50 per cent of the vote in the primary, 
he is declared elected and no general election takes 
place for that office. Sometimes no primary is pro- 
vided for, and the winner is the candidate receiving 
a plurality of the vote in the general election. 
Many procedures are essentially the same as those 
for partisan elections, including the procedure by 
which persons secure a place on the ballot. Names 
are usually listed in alphabetical order in both the 
primary and general elections, sometimes with a 
rotation system of the sort often used for partisan 
primary elections. 

‘The nonpartisan ballot is generally used in at- 
tempts to eliminate or at least weaken political 
party influence. In actual practice, however, politi- 
cal parties may be involved in nonpartisan elec- 
tions in the following ways: First, parties may 
dominate the selection of candidates, and cam- 
paign strategies may be popularly viewed as those 
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of the parties. This is often the case, for example, 
in Chicago aldermanic elections. Second, parties 
may be only one of the groups recruiting and sup- 
porting candidates. In Cincinnati, for example, the 
City Charter Committee and independent candi- 
dates commonly oppose national party candidates. 
Third, local political-action groups may be active, 
but national political party organizations may be 
excluded by custom. This was the pattern for many 
years in Detroit, Dallas, San Francisco, and many 
other cities. 

There is, however, one pattern in which all can- 
didates are self-starters, plan their own campaigns, 
and arrange their own finances. In these cases, 
found in many small towns, there are no political- 
action organizations beyond those built around the 
individual candidate. This pattern may, therefore, 
be considered truly nonpartisan. 

(4) A political ideology. In democratic nations, 
the political party is commonly seen as a device 
for mobilizing support for the filling of public 
office. Parties often have interest or class bases 
and are viewed as logical, necessary, and proper. 
In the United States, however, the political party 
has traditionally been viewed with suspicion. This 
suspicion has been especially prevalent among 
members of the middle classes, but Americans of 
all categories have long believed it is desirable to 
“vote for the man” rather than the party. 

The scope and history of nonpartisan elections. 
After World War u, the bland political style of 
American suburbia reflected the traditional Amer- 
ican preference for nonparty politics. The middle 
classes concentrated on rational discussion of is- 
sues and the consideration of “objective” data. They 
sought a “public interest” and believed this should 
be served, irrespective of party. They wanted deci- 
sions to be based upon the recommendations of 
experts, the avoidance of rowdy campaigning of 
the sort associated with nineteenth-century Amer- 
ican politics, and a reasoned and informative ap- 
peal to voters. Nonaggressive, dignified contests 
were the goal, for they reflected middle-class pref- 
erences for order, respectability, dignity, and 
“responsible” (as distinguished from demagogic or 
“crowd-pleasing” ) behavior. This ideology was man- 
ifested in nonpartisan local and judicial ballots, in 
social approval of split-ticket voting, in political 
amateurs as candidates for high office, in heavy 
dependence upon a professionally qualified civil 
service, and in minimizing the role of the political 
party in the democratic process. 

The picture of the development of nonpartisan 
elections fades away gradually as its chronology is 


traced backward, In some cities and school districts 
of the nineteenth century, a “citizens’ caucus” was 
sometimes held to nominate candidates. All per- 
sons eligible to vote were welcome and the candi- 
dates then ran for office without party label, or 
under some innocuous title, such as the “Citizens’ 
Ticket.” Municipal reformers, beginning about 1890, 
sought to separate local from state and national 
elections and to separate issues and candidates, 
Gradually cities and other local units in many 
states began to have elections with no party desig- 
nation on the ballot, following the pattern of the 
original Australian ballot, which was first used in 
the United States in Louisville in 1888. In the dec- 
ade following 1910, many states established the 
nonpartisan ballot for municipal, judicial, and 
schoo] elections. California, Minnesota, and North 
Dakota extended it to the county; and Minnesota, 
in 1913, and Nebraska, in 1935, put elections for 
the state legislature on the nonpartisan ballot. In 
the 1960s, probably more than one-half of the 
elective offices in the United States were filled using 
a nonpartisan ballot, including at least 85 per cent 
of all school elections and 65 per cent of municipal 
elections. The ballot form was especially popular 
in the suburbs of metropolitan areas and, above all, 
in cities using the council—manager form of gov- 
ernment. 


Nonpartisanship is useful as a concept in politi- 
cal-science research, providing careful distinctions 
are made among the various types and definitions, 
It is an important field for study because of its 
extensive use in local elections and widespread 
commitment to it as an ideology. 

Recent research on nonpartisanship has cen- 
tered on the comparison of partisan and nonparti- 
san voting patterns, the question of whether or not 
nonpartisanship contains a conservative bias, the 
interaction patterns between partisan and nonpat- 
tisan office holders, and the degree to which parti- 
san and nonpartisan elections are insulated from 
one another. 

The popularity of nonpartisanship remains un- 
diminished among most governmental reformers in 
the United States, but it seems to be a uniquely 
American ideology. Elsewhere, the political party 
is considered an integral part of the political proc- 
ess and frank allegiance to a party is acceptable 
and even desirable. 

CHARLES R. ADRIAN 


[See also BIPARTISANSHIP; ELECTIONS; PARTIES, POLITI- 
CAL; and POLITICAL PARTICIPATION.] 
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NORDENSKIOLD, ERLAND 


Nils Erland Herbert Nordenskiöld (1877-1932), 
Swedish ethnologist, came from an aristocratic fam- 
ily that had a tradition of scientific interests. His 
father was the first man to navigate the Northeast 
Passage; his brother Gustav made valuable con- 
tributions to archeology through his excavations in 
the Mesa Verde area of the southwestern United 
States; and his cousin Otto was a distinguished 
geographer and Antarctic explorer. 

Nordenskiöld began his scientific career as a 
zoologist and as such visited Patagonia in 1899 and 
Argentina and Bolivia in 1901-1902. He soon be- 
came more interested in anthropology, however, 
and in 1904-1905 went to Peru and Bolivia to do 
archeological research in the highlands among the 
little-known Indians of the Madre de Dios area. On 
two other expeditions he studied the Indians of the 
Gran Chaco and those of the forests in the border 
regions between Bolivia and Brazil. 

In 1913 he was appointed chief curator of the 
ethnographical department of the Goteborg Mu- 
seum, and in 1916 he received an honorary PH.D. 
from the University of Goteborg. He was appointed 
professor of American and comparative ethnology 
at that university in 1924. Nordenskiöld also taught 
at the University of California in 1926, as a visiting 
professor. He did not give up field research entirely, 
however, and in 1927 returned to South America, 
this time to the Cuna and Choco tribes of Panama 
and Colombia. But his health had been impaired 
by his travels, and he died in 1932. 

Nordenskiöld was primarily interested in South 
American ethnology and archeology from a histori- 
cal point of view. Besides writing semipopular nar- 
Tatives of his travels, he was the author of many 
Scientific articles and books, of which the ten-volume 
Series entitled Comparative Ethnographical Studies 
(1918-1938) is the most important. 

This work benefited from Nordenskiéld’s famil- 
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iarity with even the rarest and most inaccessible 
books of the early Spanish and Portuguese literature 
on South America, as well as from his correspond- 
ingly thorough study of the available museum mate- 
rial. This background enabled him to map the dis- 
tribution of a great number of elements in South 
American culture. In his mapping methods he was 
a pioneer, substituting a system of dot signs for the 
earlier, crude, planar indications. The advantage of 
these dot signs was that they enabled him to present 
a much clearer picture of so complicated an area 
as South America. Although his objective was pri- 
marily the reconstruction of cultural history and 
his method to a large extent was based upon the 
geographical distribution of cultural elements, he 
was extremely skeptical of the German and Austrian 
Kulturkreis theory, not only on account of the often 
deficient knowledge of its devotees but in particular 
because he did not believe in universal Kulturkreise. 
On the contrary, he always held that South Amer- 
ican problems should be solved on the basis of 
South American facts alone. 

The first three volumes of Comparative Ethno- 
graphical Studies contain a detailed analysis of the 
material culture of various tribes in the Gran Chaco 
and of the Moxos in Bolivia. In these volumes 
Nordenskiöld attempted to show the relation of the 
natural environment to other influences that might 
have been brought to bear on a culture pattern. He 
concluded, for example, that the culture of the 
tribes of the rain forests was less affected by the 
Incas than that of the tribes of the more accessible 
Chaco. He demonstrated convincingly that the basic 
culture of the Chaco tribes must have been very 
similar to that of the Patagonians and Ona: several 
later elements are due to influence from the north 
and east, but this influence has essentially modified 
their life only in one respect—the introduction of 
manioc cultivation and, perhaps, of some other 
plants. The Andean influence is restricted to a num- 
ber of “articles of luxury.” A number of elements 
reveal the connection of the various tribes that had 
migrated to the borderland of the Chaco (the Chiri- 
guano and the Chané) with their kinsmen in the 
tropical area, and there is also evidence of a con- 
siderable influence from the Andes—whereas they 
have adopted very little from their Chaco neighbors. 
Among the Moxos practically no vestiges of the 
basic Chaco culture are left, and there are but few 
elements of Peruvian origin. However, several cul- 
tural elements would seem to derive from the 
northern Andean area of Colombia. 

In Volume 4, Nordenskiöld attempted to demon- 
strate that New World cultural development par- 
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alleled that of the Old World, in that the Copper 
Age had preceded the Bronze Age. Volume 5 con- 
tains his conclusions on various routes of cultural 
development, which he traced by using the distribu- 
tion of certain post-Columbian loan words. Volume 6 
studies ancient Peruvian astronomical knowledge 
and the use of Peruvian knot-records, throwing new 
light on both subjects. Volume 7 deals with the 
remarkable picture writing of the Cuna; and vol- 
umes 8 and 9 deal, to a great extent, with the inde- 
pendent Indian inventions, some of which are still 
in use. Nordenskiölds deep admiration for the 
Indians is particularly apparent in these last three 
volumes, Details in his work may need correction, 
but there is no doubt that the principal results will 
remain. 

In Nordenskiéld’s last volume his skepticism 
toward the Kulturkreis theory is particularly evi- 
dent. After having pointed out a considerable num- 
ber of cultural parallels between North America 
and southern South America, he proceeded to dis- 
cuss the so-called Oceanic elements in South Amer- 
ica and the origins of the ancient civilizations of 
Peru and Mesoamerica. He arrived at the conclu- 
sion that insofar as there was any influence from 
Oceania, it must have been in the exceedingly re- 
mote past and that the high American civilizations 
were built up of intensified Indian cultures. 

Among his purely archeological papers it may 
be well to single out for particular mention his 
description of urn burials and mounds in Bolivia 
(1912); a paper on the eastern distribution of the 
Tiahuanaco culture (1917); and a profusely illus- 
trated survey of the archeology of the Amazon area 
(1930). Nordenskiöld was also an outstanding 
museum director. Under his leadership, the Gote- 
borg Museum amassed a very fine collection dealing 
with South American ethnology and archeology. 
His university teaching has deeply influenced Scan- 
dinavian anthropology, particularly in Sweden and 
Denmark. 

KAJ BIRKET-SMITH 


[See DIFFUSION, article on CULTURAL DIFFUSION, and 
the biographies of Mérraux and SCHMIDT.] 
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1. THE Concert OF NORMS 
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Robin M. Williams, Jr. 
Jack P. Gibbs 


1 
THE CONCEPT OF NORMS 


A norm is a rule, standard, or pattern for action 
(from the Latin norma, a carpenter's square or 
rule). Social norms are rules for conduct. The 
norms are the standards by reference to which be- 
havior is judged and approved or disapproved, A 
norm in this sense is not a statistical average of 
actual behavior but rather a cultural (shared) 
definition of desirable behavior. 

To the extent that a particular social norm actu- 
ally is effective, one will be able, of course, to ob- 
serve a marked actual regularity of social acts in 
recurrent situations of a particular kind. Thus 
there will be more or less standardized ways in 
which people are seen to behave when conducting 
trade, engaging in religious worship, or playing 
games. A sheer uniformity in behavior, however, 
does not necessarily mean that a norm is involved. 
The uniformity may simply represent such sepa- 
rate individual reactions to a common stimulus as 
fleeing from fire. Nevertheless, the great majority 
of important social interactions are guided in part 
by norms. 

Related concepts. The history of the concept 
of norm goes back as far as men have recorded 
their ideas of proper conduct. Although the term 
has been fairly common in philosophical analyses 
of ethics, in social science use of the specific word 
“norm” has become extensive only in recent times 
(the Encyclopaedia of the Social Sciences, pub- 
lished in the 1930s, did not even include the term). 
Closely related concepts, however, have long been 
current in the discourse of the social sciences: for 
example, convention, etiquette, law, custom, folk- 
ways, and mores. 

Custom. The very broad term “custom” fre- 
quently has been used to refer to the whole body 
of those shared behavior patterns in a particular 
group or society that are regarded as the traditional 
and established ways of the people, To speak of a 


“customary” way of acting is to suggest that the 
conduct in question has come down from the past 
and is not regarded solely as a matter of immediate 
expediency. Customs are more than mere aggre- 
gates of individual habits; a custom is always 
tacitly supported by social approval. Negative 
social sanctions, ranging from ridicule to extreme 
punishment, may be applied when there is devia- 
tion from custom. Of course, sanctions may not 
occur when the deviation is as slight as failure to 
take a siesta in the tropics; but they will certainly 
occur when it is as great as marrying a forbidden 
clan-sister. Thus, the actual enduring regularities 
in social behavior, accepted by and approved by 
most members of a group or society, may be called 
customs. The standards used for judging con- 
formity are the norms for the customary behavior. 

Folkways. The concept of folkways was devel- 
oped by the American sociologist William Graham 
Sumner and by his followers. Sumner considered 
folkways to be group habits or customs which 
originated in the repetition of adjustive acts by 
individuals responding to a similar social or indi- 
vidual need. The folkways, thus, are concrete pat- 
terns of behavior; they have usually been inter- 
preted as the outcome of largely unreflective 
trial-and-error adjustments. 

Mores. Customs that are regarded by general 
agreement as highly important and obligatory, as 
evidenced by strong sentiments against deviation 
and by severe punishments for violation, are 
usually called mores. When certain folkways be- 
come well-established and are supported by the 
belief that they are proper, right, and indispen- 
sable, they become mores. They are ordinarily 
thought to be supported by diffuse common agree- 
ment. Examples of mores include incest taboos and 
rules against in-group murder, rape, cannibalism, 
and other practices generally regarded as especially 
heinous. Some definitions of mores include as a de- 
fining criterion the presence of some overt philo- 
sophical justification of these customs as indis- 
pensable to social welfare. 
: Characteristics of norms. Social pressures aris- 
ing from group acceptance of norms have definite 
consequences for behavior. When, for example, at 
a given time an individual is confronted with in- 
compatible norms held by different individuals or 
groups that are important to him, his behavior is 
likely to show signs of psychological conflict or 
tension. Unanimity of support for norms within a 
group or society powerfully checks tendencies of in- 
dividuals toward deviance (Asch 1952, chapter 16). 

Norms are learned by individuals in social inter- 
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course with others—that is, in the process of 
socialization. By definition, then, norms are shared 
by two or more individuals (Sherif 1936). Some 
norms are particular to quite small groups, such as 
a husband-wife pair or a clique of friends, while 
other norms may be held by a large collectivity that 
is one of several existing in one nation (for ex- 
ample, Muslims and Christians in Lebanon). Still 
other norms may be shared by most adult mem- 
bers of a nation or of an entire civilization (for 
example, those pertaining to monogamous mar- 
riage). 

Norms are always to some extent both general- 
ized and generalizable. They may refer to all hu- 
man beings at all times and in all places, or they 
may refer only to a specific category of person in a 
specific type of situation. A norm calls for “right 
action” and implies a generalizable reason for the 
rightness of the indicated conduct. Ultimately this 
propriety or rightness traces back to some standard 
of value that is taken without further justification 
as valid by the individual or group in question. 

Norms thus are more than an idiosyncratic ex- 
pression of the wants or desires of a particular per- 
son. Even quite specific norms imply a basis for 
assent by someone other than the norm sender him- 
self (Rommetveit 1953, pp. 11-42). At the same 
time, norms actually can guide conduct only if they 
prescribe or proscribe identifiable courses of action; 
therefore norms are more specific and socially im- 
perative than values or ideals. For example, “hon- 
esty” is a generalized value (a “conception of de- 
sirability”), but it is still found socially necessary 
to have specific rules for concrete situations such 
as students’ behavior on examinations or the finan- 
cial responsibilities of banking officials. 

Norms may arise in relation to any aspect of 
human activity and experience that comes to be 
regarded as of any importance or consequence. 
There are norms for perceiving, feeling, thinking, 
judging, evaluating, and acting. Thus, there are 
cognitive and logical norms for carrying on scien- 
tific investigations, aesthetic norms for judging 
music or sculpture, and norms of grammar and 
style for writing in a particular language. A re- 
ligious believer relates himself to supernatural or 
transcendent entities in accordance with certain 
norms. Convention, etiquette, fads, and fashions 
are all regulated by norms. Finally, there are the 
norms of moral conduct which guide direct social 
interaction. The term “social norm” may refer to 
any of these, but most commonly is used to desig- 
nate rules for social interaction. 

Norms are exceedingly diverse not only in their 
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objects but also in respect to their important prop- 
erties in different societies and in different his- 
torical periods. Some norms are widely known, 
accepted, and followed, whereas others are char- 
acterized by low consensus and only partial con- 
formity. Some norms are learned early in life, 
through identification with parents or other pri- 
mary agents of socialization; others are acquired in 
later life through secondary social relations, To 
understand any particular norm it is important to 
know whether it has been “internalized” so as 
to become part of the conscience or self-ideal 
of the individuals in question; if so, there will be 
much conformity even if there is no external sur- 
veillance of conduct or punishment for deviant 
behavior. Norms not thus internalized can be en- 
forced only through external rewards and penal- 
ties. Those norms that are primarily enforced 
through punishment and threat contrast with 
standards that are maintained by a flow of positive 
social rewards such as wealth, prestige, or social 
approval (see Rommetveit 1953; Parsons 1951; 
Thibaut & Kelley 1959). Great variations exist in 
consistency of enforcement, source of authority, 
degree of allowable variation in conformity, extent 
of deviance, and type of enforcing agency (Morris 
1956), 

Norms and social needs. What is it that makes 
a norm normative? How does a norm acquire the 
obligatory quality which distinguishes it from a 
simple habit or preference? How does it occur that 
aman accepts the legitimacy of a criterion for con- 
duct that results in his disadvantage in the immedi- 
ate situation of action? (cf, Sorokin 1947, chapter 
4, on law norms). The answer seems to be that 
norms arise to meet recurrent problems. They tend 
to be initiated or proposed by someone who finds 
an immediate agreement to be in some Way ad- 
vantageous. They persist when, on the basis of the 
basic values and beliefs accepted in a society, ac- 
ceptance of norms “cuts costs” (Thibaut & Kelley 
1959; Homans 1961). Of course, the types of ad- 
vantages so obtained exhibit great qualitative 
diversity. 

Types of norms. There are several intrinsically 
different kinds of norms that vary according to the 
locus of human activity and experience to which 
they are applied, 

Technical norms have to do with effective means 
for the attainment of specific goals (Pepper 1960). 
They define effective action in dealing with phys- 
ical and biological as well as social elements in 
immediate situations. When the Satisfaction of in- 
dividual desire for pleasure and affection is the 
goal we may call such norms hedonic. When the 


individual must choose between different personal 
purposes and satisfactions, his choices may be 
guided by personality-integrative norms. For ex. 
ample, “eat, drink and be merry” may be countered 
by “moderation in all things.” Likewise, long-term 
goals may generate considerations affecting im- 
mediate gratifications. 

Hedonic or technical norms and the norms of 
personality integration, however, are insufficient 
for guiding social interaction, which involves the 
interests and claims of many persons and groups. 
Hence, distinctively social rules emerge, specifying 
rights, duties, disabilities, and privileges. Further- 
more, most if not all cultures contain norms which 
regulate what are considered to be man’s relations 
with the superhuman and supernatural. 

The most definite, strongly held, and socially 
obligatory norms regulating conduct may be called 
institutional norms. In any group or society there 
are certain norms that are widely known and ac- 
cepted, supported by legitimate authority, and 
incorporated into the individual's social conscience 
early in life. However, there is no sharp line be- 
tween institutional and noninstitutional norms. 
The degree of institutionalization can be repre- 
sented statistically as a profile exhibiting more or 
less of each one of the characteristics just reviewed. 

To the extent that there is consensus concerning 
the norms regulating conduct, a pragmatic basis 
is available for integration among the persons and 
other units making up social systems (Parsons 
1951). Choice among various norms regulating 
interaction may in turn be decided with reference 
to generalized norms that transcend the particular 
context and shade over by degrees into cultural 
values. It is generally thought that a cultural sys- 
tem generates complex pressures toward consist- 
ency among its beliefs, values, and norms. A cen- 
tral empirical problem in the sociological study of 
norms is to ascertain whether norms combine into 
systems, and, if so, what conditions and reasons 
govern these combinations. 

Thus, there are six main foci of normative regu- 
lation of human conduct: the internal functioning 
of the individual personality; relations with the 
world of physical and biological phenomena ex- 
ternal to the actor; interpersonal and intergroup 
relations within a particular society; intersocietal re- 
lations; intracultural integration; and man’s rela- 
tions with supernatural entities and events. 

Conformity. The greater the agreement upon a 
norm among the members of a social system, the 
greater in general will be the pressures for con- 
formity to that norm. Some norms acquire great 
force by sheer unanimity; for example, in a society 


in which nearly everyone speaks only Chinese, one 
must speak Chinese to be understood. Conformity 
is rendered more likely if the would-be deviant has 
no alternate group to support his deviance or to 
which he could escape to avoid penalties (Thibaut 
& Kelley 1959, chapter 10). If compliance with 
norms is induced primarily by punishment, much 
surveillance is required; conformity induced by 
rewards, by intrinsic satisfactions, or by consensus 
does not need to involve detailed monitoring and 
supervision. When persons (or collectivities) are 
highly interdependent within a closed social bound- 
ary, the implicit threat of withdrawal of reciprocity 
is an effective sanction against nonconformity. 

In ordinary interpersonal relationships, con- 
formity to every detail of every specific norm is 
not rigidly required; rather, sanctions tend to fo- 
cus upon the total role relationship (Goode 1960, 
p. 252). When consensus is low, personal mobility 
high, and individuals carry on many of their ac- 
tivities in nonoverlapping social networks, both the 
demand for conformity and the actual degree of 
conformity tend to be low. 

Conformity in inner belief or in affective in- 
tensity and depth of commitment is more difficult 
to observe than overt behavior. Therefore, any type 
of “thought control” is difficult to maintain. On the 
other hand, any expression of disbelief in a norm 
(or normative system) and any overt evidence of 
lack of emotional commitment is likely to be espe- 
cially threatening to those who are committed to 
enforcing the norm. The sinner can always repent; 
the heretic challenges the very existence of the 
norm (cf. Goode 1960, p. 257). 

; Norms and freedom. Consideration of institu- 
tionalized norms raises questions about the mean- 
ing of “freedom” in human affairs; a necessary first 
Step in examining this concept is to draw a distinc- 
tion between psychological and structural freedom 
in social systems. Psychologically, a man is “free” 
if he can do whatever he wishes to do; if his desires 
are more limited than the possibilities for action 
that are open to him, he will believe and feel him- 
self to be free. It is possible for him to have this 
subjective freedom even though a detached ob- 
server with a more extended knowledge of the cir- 
cumstances might regard him as closely con- 
strained by institutionalized norms and sanctions. 
This kind of freedom is not, in short, descriptive of 
any clearly identifiable feature of social structure. 

Different social structures vary widely in the ex- 
tent to which they enforce a monolithic system of 
norms. Pluralistic societies contain a variety of 
relatively autonomous norm-creating and norm- 
enforcing groups, associations, and governmental 
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units. Maximum structural freedom exists when 
the individual has many alternative choices be- 
tween groups with varying norms and varying 
types and degrees of demanded conformity. 

Origins of norms. Much research will be re- 
quired to establish a clear understanding of the 
conditions and processes associated with the emer- 
gence and acceptance of norms. Some sources of 
norms are known. Many rules for behavior seem 
to arise as a direct consequence of recurring prob- 
lems that are found to be socially costly. Examples 
of such conditions of high “cost” (in terms of frus- 
tration, deprivation, and conflict) include: per- 
sistent failures of coordination in important group 
tasks, mutual misunderstandings, chronic inter- 
ferences in achieving consensus, low predictability, 
recurring interpersonal conflicts, and incompatible 
claims to legitimate authority. Persons in weak 
positions, fearing exploitation by the stronger, 
often favor predictable norms and sanctions. Per- 
sons in positions of authority and power often de- 
sire reliable conformity that is not dependent upon 
continual surveillance and coercion; they may 
therefore favor establishment of norms even if 
their own caprices are thereby curbed. 

A demand for norms is likely to arise from per- 
sons who find their interactions confusing or 
vaguely defined; for this reason, unstructured situ- 
ations often create a pressure for the development of 
new norms. Enduring social conflicts, when not of 
too great an intensity, also generate new norms, de- 
veloped out of negotiation, compromise, mediation, 
and related social processes. Another major source 
of new norms lies in collective reactions to shared 
“strain” experienced in relation to old norms. 

Every assertion of a demand or the communica- 
tion of an expectation from one person to another 
contains some, however slight, normative element, 
even if the norm is not stated. To communicate an 
expectation to another person about his own be- 
havior is to implicitly claim the right to influence 
his conduct. Acquiescence in the expectation by the 
receiver establishes a shared commitment. Such 
shared expectations form the initial basis for de- 
velopment of a behavior standard for later recur- 
rent situations of the same kind. 

Needed research. Contrary to a priori theories 
which would deny the presence of normative ele- 
ments in man’s behavior, modern social science 
finds a central place for the concept of norms in its 
analysis of social systems (Parsons 1951, pp. 
3-45). But we need to know more about the conse- 
quences of partial consensus and the relation of 
consensus to social cohesion. Under what circum- 
stances do old norms cease to be effective because 
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of gradual alienation, loss of support and concern? 
When does such “withering away” lead to anomie 
(normlessness), and when and how does it pre- 
pare the way for new norms? In addition, we need 
rigorous study of the sources of rebellion, deviance, 
and creative reformulation of norms. Such re- 
search will add to knowledge of those conditions 
under which self-realization is possible in an in- 
creasingly complex and interdependent world. 


RoBIN M. WILLIAMS, Jr. 


[See also Conrority; PUNISHMENT; REFERENCE 
GROUPS; SANCTIONS; SOCIAL CONTROL; VALUES.] 
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LLI 
THE STUDY OF NORMS 


No concept is invoked more often by social scien- 
tists in explanations of human behavior than 
“norm.” However, despite the plethora of “norma- 
tive” explanations, the conceptual treatment of 
norms remains unsatisfactory, No particular ge- 
neric definition of norms is widely accepted in the 


social sciences, and consensus is lacking as to the 
differentiation of types of norms. Moreover, for 
the most part normative explanations are ad hoc— 
that is, they do not stem from systematic compara- 
tive research. In fact, given the state of the concept 
and the paucity of systematic studies of norms in 
general (as opposed to particular norms, such as 
the incest taboo), normative explanations of be- 
havior are often dubious. The alleged prescription 
or proscription may be merely assumed to exist, 
and the conceptual and methodological issues in- 
herent in any such explanation are rarely discussed, 


Issues and questions 


Any normative explanation presupposes the ex- 
istence of norms, and the adequacy of the explana- 
tion therefore depends partially on the validity of 
this presupposition, At first glance the only problem 
might appear to be that of perfecting techniques 
for identifying norms, but techniques alone cannot 
resolve the conceptual issue of how to define norms 
in general. Most definitions emphasize that a norm 
necessarily represents a high degree of consensus 
in a social unit as to what the conduct of members 
ought to be. But, if so, what is the “normative” 
status of laws that govern a country under military 
occupation or, more broadly, the status of all un- 
popular laws? Are we to conclude that such laws 
are not, after all, norms? It is far better to recog- 
nize that, by definition, some but not all norms 
represent collective evaluations (that is, consensus 
as to what conduct ought to be). Other norms 
represent nothing more or less than collective ex- 
pectations, meaning consensus as to what the be- 
havior of members of a social unit will be. Further, 
an evaluation or expectation (collective or noncol- 
lective) of conduct may or may not be associated 
with attempts to prevent, revenge, or rectify con- 
trary behavior; and such attempts may or may not 
be made only by persons in a particular status 
through means that may or may not include the 
use of force. 

The point is that a truly generic definition of 
norms must encompass a wide range of definitional 
attributes. If this is borne in mind, it enables one 
to distinguish types of norms, such as customs, 
mores, rules, laws (Gibbs 1965). Equally important, 
if it is recognized that what is a definitional attri- 
bute for one type of norm is a contingent attribute 
for another type, then what was once a sterile con- 
ceptual issue gives rise to some significant empirical 
questions on norms in general. For example, why 
are some laws more consistent with collective 
evaluations than others? How much do societies 
differ with regard to the consistency between legal 


and extralegal norms? Why do they differ? Why 
do some collective evaluations and collective expec- 
tations diverge, as when members of a social unit 
expect other members to drink alcoholic beverages 
but have no definite collective evaluation of the 
act? Why are some but not all collective evalua- 
tions backed by force? How much and why do so- 
cial units differ as to the use of coercion to revenge 
or prevent violations of norms? 

Social scientists are not completely ignorant in 
the face of such questions as these. Some notable 
studies have been conducted in jurisprudence, so- 
cial anthropology, and sociology (Diamond 1935; 
Durkheim 1893; Hoebel 1954; Sumner 1906; 
Weber 1922); but demonstrably valid answers are 
wanting. Indeed, for the most part, truly systematic 
research on normative phenomena is now largely 
limited to the study of small groups (see Rommet- 
veit [1953] 1955 for references); and comparative 
studies at the macroscopic level, such as that con- 
ducted by Richard D, Schwartz and James C. Miller 
(1964), are now rarer than in the heyday of evolu- 
tionary theory in the social sciences. 

Legal norms. Regardless of how the conceptual 
issue is resolved, the empirical identification of 
norms presents some complex technical problems. 
Consider laws as a case in point. Analytical defini- 
tions (e.g., Weber [1922] 1954, p. 5) stress the 
point that a legal norm is characterized by a “high 
probability” that an attempt will be made, by means 
which may include the use of force, to prevent, 
revenge, or rectify contrary behavior. However, 
these definitions do not specify a numerical cri- 
terion for a “high probability,” and any criterion 
(for example, 50 per cent) would be arbitrary. 
Moreover, it is difficult to identify coercion as an 
attribute of laws as such. The frequency with which 
force is employed may not be relevant because vio- 
lence in the enforcement of laws may be rare when 
there is a high degree of consistency between legal 
and extralegal norms. Additional complications 
stem from the fact that force does not play the 
same role in all types of laws (for example, in 
criminal as opposed to tort law). Finally, one could 
argue cogently that a law is identified not only by 
a high probability of enforcement but also by a low 
probability of retaliation as a reaction to enforce- 
ment; yet, currently there is no defensible socio- 
logical rationale for stating exactly how probable 
retaliation may be. 

The pressing problem in the identification of 
legal norms is that of moving from general defini- 
tions to specific empirical criteria that are applica- 
ble in all societies. It is most unlikely that this 
problem can be solved without extensive research 
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of a special kind on laws in widely diverse societies. 
While it is possible to identify some norms in a 
given society as “legal” and others as “extralegal” 
by virtually any definition, we know very little 
about the related frequency of enforcement, use of 
force, and retaliation for enforcement. Such knowl- 
edge could provide a basis for achieving the ulti- 
mate goal—specification of empirically applicable 
criteria for identifying legal norms. 

Extralegal norms. The technical problems posed 
in identifying extralegal norms are different from 
the case of laws but no less difficult. On the whole, 
the problems fall into three classes—the seman- 
tical, the dimensional, and the statistical, Seman- 
tical problems stem from the fact that most defini- 
tions treat norms as prevalent beliefs as to what 
conduct ought to be. Accordingly, any study of such 
beliefs (e.g., Stouffer 1955) is an investigation of 
norms. The crucial technical problem in conducting 
such a study is that “normative” opinions can be 
solicited in a variety of ways. For example, con- 
sider the alleged proscription in the United States 
of sexual relations outside of wedlock. How can 
the existence of this norm be demonstrated? One 
approach would be to ask a representative sample 
of the American public a question worded along the 
following lines: Should a man and a woman who 
are not husband and wife have sexual relations 
with each other? Observe, however, that the ques- 
tion could be phrased in several other ways. For 
that matter, an entirely different kind of question 
could be posed, such as whether or not the person 
being questioned would commit the act, or what 
the person thinks should happen when the act is 
committed (i.e., the “reactive norm” as described 
in Clark & Gibbs 1965). One could argue, of course, 
that the results would be substantially the same 
regardless of the kinds of questions; but research 
to date does not provide a basis for such a gen- 
eralization, let alone a conclusion as to the best 
way to solicit opinions about norms. 

Apart from the choice of words in formulating 
normative questions, there is the problem of struc- 
turing the solicitation of opinions in such a way as 
to avoid ambiguity, To illustrate, consider the fol- 
lowing question as it might be posed in a survey 
of norms in the United States: Should Negroes and 
whites be permitted to have sexual relations with 
each other? Some respondents probably would be 
confused and/or give an answer that is misleading, 
because nothing is said in the question about 
whether or not the couples are married. Thus, a 
respondent of liberal convictions may assume that 
the question refers to racial intermarriage and 
answer Yes, while another may assume that extra- 
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marital relations are the point of the question and 
answer in the negative. Indeed, two respondents 
could agree in their answers even though their in- 
terpretations of the question differ. The general 
point is that most prescriptions or proscriptions of 
conduct are relative to statuses or situations, and 
related opinions can be solicited in a meaningful 
way only by possessing prior knowledge of the 
norms involved, The solution may require inter- 
views in depth, as in John Ladd’s study of Navajo 
ethics (1957), before normative questions can be 
formulated in such a way as to elicit answers that 
can be easily classified and analyzed in quantitative 
terms. Without systematic classification and quan- 
titative analysis, we have, as in the case of most 
anthropological studies, only the investigator’s in- 
tuitive assessment of the norms in a social unit. 

One dimensional problem in the identification of 
norms is that statements as to what conduct ought 
to be do not necessarily reveal the intensity of 
norms. By way of illustration, consider opinions 
in the United States concerning two acts—(1) a 
bride not assuming the surname of her husband 
and (2) a man engaging in sexual relations with 
his mother. Now in all probability, the overwhelm- 
ing majority of Americans would state that both 
acts “ought not be”; but it is equally probable that 
the sense of “oughtness” is much stronger for the 
latter act, and statements merely as to what ought 
to be would not reveal this differential intensity. 
However, social scientists have not perfected a 
standard technique for assessing not only opinions 
as to what conduct should be but also the intensity 
of such opinions. 

The viability of norms is much more difficult to 
treat than the dimension of intensity. It can be 
argued that norms are something more than mere 
statements of opinion. The point is that a person 
may state emphatically what conduct ought to be 
and yet be ostensibly indifferent to contrary be- 
havior. Stated otherwise, a person may manifest 
no signs of true commitment to his normative 
opinions; and, so the argument goes, if a large pro- 
portion of the members of a social unit do not ex- 
hibit such a commitment, the related opinion is 
not a norm, regardless of its prevalence, There is 
merit in the argument, but it raises some very dif- 
ficult conceptual and technical problems. 

One can take the position that a norm is viable 
if and only if a certain degree of conformity to it is 
realized. But this position is tantamount to assert- 
ing that norms are to be inferred from actual be- 
havior. They can be so defined, but the result is a 
tautology when one then employs the concept to 
explain behavior. The alternative is to focus on the 


characteristics of reaction to contrary behavior as a 
test of the norm’s viability. Put simply, if the reac- 
tion is typically negative, then and only then is 
the related opinion a norm, The technical problem 
in this instance is that it is difficult to specify what 
constitutes a negative reaction (for example, must 
it be overt? ), and little is gained by declaring that 
the behavior must be subject to a sanction, because 
the term is no more precise than the notion of a 
negative reaction. The conceptual problem is that 
certain types of norms (customs in particular) are 
often defined without reference to negative reac- 
tions to contrary behavior. 

The statistical problem. Whatever the solution 
to the semantical and dimension problems, the 
identification of norms requires the formulation of 
statistical criteria. Despite the superorganic quality 
ascribed to norms, opposing normative opinions 
may prevail among members of a social unit, and 
negative reactions to contrary behavior are seldom 
either certain or uniform. Thus, most Americans 
probably believe that first cousins should not mar- 
ry, but it is improbable that all instances of such 
marriage provoke a negative reaction, certainly not 
the same type of negative reaction. Now the ab- 
sence of normative consensus and/or uniformity 
in reaction to contrary behavior does not imply that 
there are no norms, but it does pose a problem in 
the identification of norms. Some statistical cri- 
terion must be formulated, and the formulation is 
most difficult. To be sure, one can assert that at 
least 50 per cent of the members must subscribe to 
a particular opinion as to what conduct ought to be 
before that opinion is a norm, and the same pro- 
portion can be used as a standard to assess the 
viability of a norm in terms of reaction to contrary 
behavior; but the number obviously is arbitrary in 
both cases. Accordingly, in the absence of a ration- 
ale for statistical standards, there is a real need for 
studies of normative phenomena from a purely 
methodological perspective in order to explore 
alternative ways of identifying norms. 

Even though most sociological investigations 
deal with norms directly or indirectly, systematic 
attention to the problems of conducting research 
on normative phenomena is rare (Zelditch 1962 is 
an isolated exception), The few existing techniques 
for the analysis of norms are most applicable to 
small groups (e.g., Jackson 1965) or to the study 
of normative phenomena in organizations (€.8+ 
Organizational Stress 1964), Nevertheless, there 
are no corresponding techniques at the macro- 
Scopic or societal level. Conceivably, the methods 
developed by F. H. Allport (see Katz & Schanck 
1938) could be applied at any level, but they pre- 


suppose the inference of norms from actual be- 
havior—which is contrary to typical sociological 
usage—and do not provide a basis for identifying 
all of the commonly recognized types of norms. 


Conformity and deviance 


Even if the demonstrability of norms is granted, 
all normative explanations are debatable insofar 
as they ignore deviant behavior. Thus, to illustrate, 
while reference to the prescription of monogamy 
provides a facile explanation as to why an adult 
American male may confine his sexual relations to 
one woman, bigamous and adulterous husbands 
cast doubt on the explanation, whether applied to a 
particular case or to a statistical construct labeled 
“the average American,” The point is that a norma- 
tive explanation of behavior presupposes not only 
the existence of a norm but also conformity to it. 
Accordingly, the explanation is never complete or 
adequate without reference to a theory of con- 
formity. 

It follows that theories and research on deviant 
behavior are directly relevant to the study of norms. 
Indeed, speculation as to the causes of deviance 
poses still another central normative question and 
one that often underlies conceptual issues. For ex- 
ample, analytical definitions of a law stress the role 
of coercion as a legal element, and they have been 
criticized on the ground that the use of force does 
not explain either conformity or the subjective 
aspects of legal obligation (see especially Hart 
1961). Such criticism may be erroneous, because 
it calls for an answer to an empirical question (that 
is, why persons do or do not conform to laws) 
from what is nothing more than a definition. 

While investigations of deviance are relevant to 
the study of norms, it does not follow that such 
fields as criminology have or will provide answers 
to all normative questions concerning conformity. 
Theories of deviant behavior typically take norms 
as given and regard certain characteristics of indi- 
viduals, such as their personality traits, or certain 
social conditions, such as the rate of unemploy- 
ment, as possible etiological factors in deviation; 
but a purely normative perspective focuses on the 
characteristics of the norms (see, for example, 
Mizruchi & Perrucci 1962). In brief, the central 
question from a purely normative point of view is 
not why some individuals violate norms or why the 
rate of deviant behavior varies from place to place 
and time to time but, rather, why the amount of 
conformity varies from one norm to the next in the 
same social unit. 

The maintenance of norms. The prevalence of 
normative explanations in sociology and anthro- 
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pology reflects the fact that there are only two 
fundamental questions for these fields: Why do 
persons conform to norms? And, for any given 
social unit, why are the norms what they are? 

No general theory provides answers for the above 
questions, but there are at least two schools of 
thought. Functionalists from Durkheim to Parsons 
have viewed norms as reflecting consensus and a 
common value system. Correlatively, norms form 
or tend to form an integrated system, that is, inter 
alia, a minimization of conflict among and over 
norms. In opposition, the “conflict” school of soci- 
ology (the Marxian variety in particular) views 
norms as primarily reflecting the historical or 
present power of one division of a society over the 
others. As such, the normative system is not gen- 
erated by consensus and common values; indeed, 
at least one type of norm—law—is interpreted as 
reflecting dissensus and divergent values. Given the 
irreconcilable views of the two schools, it is not 
surprising that conflict sociologists and functional- 
ists have opposing ideas on the fundamental ques- 
tions in the study of norms. 

The question of conformity is the central issue 
in attempting to explain the maintenance of 
norms. Observe, however, that norms remain via- 
ble not because persons conform to them, but 
rather persons conform to them because they are 
viable. In any event, the difference between the 
conflict and the functionalist schools on the sub- 
ject of conformity (or more generally the main- 
tenance of norms) is striking. Coercion and sanc- 
tions as bases of social order are emphasized much 
more by the conflict than the functionalist school, 
as E. A. Hoebel (1954), for instance, has pointed 
out in his critique of Malinowski’s conception of 
law. For the functionalist school the crucial factors 
in maintaining social order are the effectiveness of 
socialization, and the rewards obtained through 
reciprocity as it operates in conformity to norms 
(Gouldner 1960). 

As far as the maintenance of norms is con- 
cerned, the truth seems to lie somewhere between 
the extreme positions of the conflict and the func- 
tionalist schools. But the debate between the two 
schools has not generated the kind of research 
necessary to resolve the issue. In all societies at 
least some norms are supported by coercion; there- 
fore, the question turns largely on differences 
among societies and other social units as to the 
proportion and kinds of norms that are subject to 
organized sanctions. General surveys (Hoebel 1954; 
Diamond 1935) have indicated that such differ- 
ences are considerable, and additional research 
could confirm the suggestion that maintenance 
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mechanisms vary from norm to norm and from one 
social unit to the next. The goal of theory construc- 
tion would then become that of specifying the con- 
ditions under which formal sanctions, internaliza- 
tion of norms, reciprocity, etc., will be most 
prevalent. 


The origin of norms 


In the nineteenth and early twentieth centuries, 
the sociologists and anthropologists who tried to 
discover the origins of norms were concerned 
mainly with widely diffused beliefs and doctrines 
of long standing, such as monotheism. Since the 
relevant historical evidence was usually lacking, 
most of the proffered “explanations” were neces- 
sarily speculative or, at the least, one-sided; and 
with the growth of field research, together with 
increasing theoretical sophistication, this type of 
inquiry fell into disrepute. Contemporary social 
scientists are aware that the origins of norms need 
to be explained, but they are not usually content to 
focus on a single institution to the exclusion of all 
others, nor do they see any need to delve into the 
remote past in order to explain what is happening 
now. Instead, they tend to concentrate first on 
identifying the norms of a given social unit and 
only then to ask, in terms precluding a purely his- 
torical explanation, why, for this type of social 
unit, the norms are what they are. 

The functionalist and conflict schools differ in 
their response to this kind of question, but the dif- 
ference cannot be summarized neatly. For one 
thing, the functionalist school is largely indifferent 
to the question. Functional analysis is prone to 
take a norm as “given” and then show how it con- 
tributes to the functioning of the larger social sys- 
tem; thus it explains the proscription of incest by 
asserting that the incest taboo “functions” to pre- 
vent the socially disruptive sexual exploitation of 
daughters by fathers. In a sense this perspective 
does explain the origin of norms, but only insofar 
as one assumes that a social system has certain 
“needs” and that norms inevitably develop to meet 
these needs. However plausible such an explana- 
tion may appear, it does not account for normative 
differences among societies. These differences can- 
not be explained in strictly functional terms with- 
out further assuming that all social systems do not 
have the same needs. But this assumption runs 
contrary to the emphasis in functional theory on 
universal needs, In any event, the functionalists 
have not identified the distinctive needs of various 
types of social systems, much less the norms which 
supposedly meet these needs. Finally, if different 
norms can meet the same need, then the function- 


alist perspective cannot generate predictions as to 
what norms or types of norms will prevail in a 
given type of social unit. 

The position of the conflict school on the origin 
of norms is also subject to criticism on both sub- 
stantive and methodological grounds. Insofar as 
this school can be said to accept any general prin- 
ciple, it tends to argue that norms represent means 
by which one social class or caste achieves the 
domination and/or exploitation of others. But, con- 
sidered in substantive terms, it is most difficult to 
see how some norms (for example, the mother-in- 
law taboo or certain dietary prescriptions) have 
any connection whatever with domination or ex- 
ploitation. 

The methodological problem in dealing with the 
conflict interpretation is that it does not generate 
any specific propositions about norms and norma- 
tive differences among social units, Given the un- 
disputed fact that norms vary from one social unit 
to the next, it follows, if we use the terminology of 
conflict theory, either that domination and ex- 
ploitation necessarily assume various forms or that 
domination and exploitation are essentially the 
same in all societies but can be maintained by dif- 
ferent norms. But in either event, the members of 
the conflict school have not specified in a system- 
atic manner how domination and exploitation cre- 
ate differences in the norms of societies. The same 
criticism also applies to the Marxian variety of con- 
flict sociology, which goes beyond the facts of 
domination and exploitation in asserting that the 
material forces of production are the ultimate 
causes of economic relations, which in turn de- 
termine all other social and cultural characteristics 
of a society, including its normative system. How- 
ever, the Marxian view of norms is significant in 
one particular respect. Whereas functionalists tend 
to take norms and values as given and treat them 
as the basic determinants of behavior, the Marx- 
ists view them as, at most, only links between the 
material forces of production and the postulated 
effects of the forces. Since the Marxist view goes 
beyond norms and eschews normative determinism 
(see Blake & Davis 1964 on this particular issue), 
it is a much more general theory than that achieved 
by the functionalists. Nonetheless, neither Marx 
nor his followers have cast the theory in terms 
which enable one to predict systematically the 
attributes of a normative system from knowledge 
of the material forces of production. 


Jack P. GIBBS 


[Directly related are the entries Conr ict, article on 
SOCIAL ASPECTS; DEVIANT BEHAVIOR; FUNCTIONAL 


ANALYSIS; SOCIAL CONTROL; VALUES. Other relevant 
material may be found in INTERACTION, article on 
SOCIAL EXCHANGE; LEGAL SYSTEMS, article on COM- 
PARATIVE LAW AND LEGAL SYSTEMS; MARXIST SO- 
CIOLOGY.] 
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NORTH, DUDLEY 


Sir Dudley North (1641-1691), British mer- 
chant, civil servant, and economist, was the third 
of five sons of the fourth Baron Guilford. He owes 
his place in the history of economic thought to his 
essay Discourses Upon Trade, published in 1691. 
The author's reputation as an economist was not 
firmly established until 1824, when the essay was 
rediscovered and accorded favorable notice in John 
McCulloch’s A Discourse on the Rise, Progress, 
Peculiar Objects, and Importance of Political Econ- 
omy (1824). However, the main ideas of the work 
were earlier available in summary form in the biog- 
raphy of Sir Dudley published by his brother Roger 
(North 1744). The biography throws light on the 
possible extent of the collaboration between Sir 
Dudley and Roger in the composition of the Dis- 
courses, and there is good reason to believe that 
the preface to the Discourses, the concluding para- 
graph of the second discourse, and also the final 
paragraph of the postscript are, in fact, the work 
of Roger. 

Sir Dudley's business experience in the Levant 
merchant trades and his later appointments as 
commissioner of the customs and commissioner of 
the treasury afforded him an insight into the func- 
tioning of the economic system and formed the 
basis of his relatively advanced economic analysis. 
His well-known argument for free trade principles 
is to be seen as only a part of the analytical appa- 
ratus of the Discourses. North was able to escape 
from the restrictive, regulatory assumptions of the 
earlier mercantilism because he was not concerned 
in quite the same way as other authors of the time 
with the problems of international money and bul- 
lion movements. Instead, his primary aim was to 
exhibit something of the structure of the flow of 
trade and money in the economic system, both 
national and international. He argued that it was 
unnecessary and fruitless for the government to 
concern itself directly with the availability of bul- 
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lion and metallic money. There was an “ebbing and 
flowing of money,” North argued, an alternation 
between coining, melting, and recoining of bullion 
(“the buckets work alternately”) in terms of which 
a freely functioning monetary economic system 
would always be assured of a money supply ade- 
quate to the needs of trade, and in terms of which 
redundant money supplies would be melted and 
reabsorbed into hoards or exported. This significant 
empirical and analytical assumption enabled North 
to concentrate on the nature and significance of 
monetary circulation in the economy, rather than 
solely on money as such, In this lay his primary 
analytical contribution. But although he empha- 
sized the importance of circulation, he failed to 
push his analysis to the point of showing the full 
effects of the balancing process on variations in 
commodity prices and income levels. 

With John Locke, North argued against the pro- 
posals, recently advanced by Sir Thomas Culpeper 
and Sir Josiah Child, for a compulsory lowering of 
the legal maximum rate of interest. Such an act 
would lead to various kinds of legal avoidance, to 
the hoarding rather than the lending of funds, and 
to an artificially high black market rate of interest. 
A low interest rate was, for North, an effect rather 
than a cause of prosperous economic conditions. 
The interest rate would be low, he argued, when in- 
comes and merchants’ savings out of incomes were 
high. Thus his interest analysis was largely di- 
rected toward explaining the supply side of the 
loan markets, focusing on the supply of savings or 
“stock.” The demand side of the loan markets was 
composed mainly of persons wishing to borrow for 
consumption expenditure, Interest rates on loans 
would differ, depending on the degree of risk in the 
loan. 

North's insistence on the significance of mone- 
tary circulation led him to a relatively advanced 
understanding of the importance of monetary de- 
mand. For healthy business activity it was not 
enough that the nation should possess money. It 
was necessary also that the nation should spend 
money. In discussing the state of the currency he 
argued that clipped money should be recoined at 
the expense of the holder, and that taxes should 
be payable only in full-valued money or the intrin- 
sic equivalent thereof. But while North intended 
that this policy force the general acceptance and 
circulation of intrinsically full-valued coins, he 
recognized that clipped money would continue to 
circulate at its face value so long as it was univer- 
sally acceptable, a notion that, together with other 
references to the use of money substitutes, pointed 
in the direction of the eighteenth-century anti- 


metallists. In the course of the recoinage argument 
North drew attention to what became known sub- 
sequently as the velocity of circulation of money, 
In their methodological perspective and the breadth 
of their monetary analysis, Sir Dudley's Discourses 
are an important anticipation of the best work in 
telated areas in the following hundred years. 


DOUGLAS VICKERS 


[See also Economic THOUGHT, article on MERCANTILIST 
THOUGHT; Money.) 
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Theory of 


NUCLEAR WAR 


Nuclear war has but a brief history. In 1945 two 
low-yield weapons were used against Japanese 
cities, resulting in the surrender of Japan to the 
Allies. Nuclear deterrence has had a richer history, 
but much of this history has little relevance to 
matters of current importance. This article will 
therefore focus on the present and prospective con- 
Sequences of the existence of nuclear weapons. 


Some continuities with the past 


As context for this discussion, it is important to 
point out the many factors in the international 
situation that remain unchanged since 1945, First 
—and perhaps most important—in spite of inter- 
national legal institutions that are effective and 
valuable in a wide range of ordinary situations, 


. crises continue to arise in which each nation be- 


comes the judge of its own cause and of the meth- 
ods and intensity with which it will seek to advance 
its interests. Thus, international conflicts have 
always been prone to “escalation” (a “qualitative 
increase in the intensity or level of conflict or vio- 


lence, perhaps occurring inadvertently, perhaps 
resulting from a conscious competition in risk tak- 
ing or even from a deliberate increase in the use 
of force). I emphasize the pervasiveness of the 
escalation concept, which implies that in most con- 
flicts there will be force that remains unused. 
Nuclear weapons have increased the amount of 
force that tends to remain unused but are not 
themselves responsible for the possibility that low 
levels of violence will escalate to medium levels or 
even very large-scale destruction; that possibility 
has almost always been present. 

Moreover, it is still most unlikely that a thermo- 
nuclear war would mean the “end of history,” not 
because it is technologically impossible to end hu- 
man history by the use of nuclear explosives but 
simply because, at this writing (i.e., in 1967), even 
the superpowers have not procured the kinds of 
weapons systems that could—in a realistic situa- 
tion—bring about such a result; nor are they likely 
to procure such systems. Even if a thermonuclear 
war were actually fought today by the existing 
powers, and even if it were fought with the utmost 
ferocity and lack of control, such a war would be 
likely to end with the Southern Hemisphere and 
large portions of Asia largely undamaged. Nor 
would the survivors in the Northern Hemisphere 
be likely “to envy the dead.” Recuperation is ex- 
tremely likely, and, although it is true that accord- 
ing to the best scientific estimates, the postwar 
environment might be more hostile to human life 
for many years, objective studies also indicate that 
this environment would not be so hostile as to pre- 
clude, at least in the long run, decent and useful 
lives for the survivors and their descendants. In- 
deed, what evidence there is suggests that relatively 
normal and happy lives would not be impossible 
even under the harsh conditions that might prevail 
after a nuclear war, in spite of the personal and 
social traumas that would have been experienced. 

Of course, to say that a nation can survive a 
thermonuclear war is not to say that all the prob- 
lems—military, social, political, economic, and 
medical—that would arise from a thermonuclear 
war can be handled. Would the social organism 
fall apart completely—that is, die in some sense— 
as a result of the tremendous shock it would re- 
ceive from a large thermonuclear war? Obviously 
there is no way of being certain. However, insofar 
as there are historical examples to study (and 
some of them are close to thermonuclear wars in 
intensity, e.g., the devastation of Germany and the 
Soviet Union in World War 11), they provide evi- 
dence that people can and do rise to the occasion 
(Kahn 1962). 
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The age-old role of bargaining in war (e.g., work- 
ing on the enemy's mind rather than on his body 
or resources) may be increased rather than de- 
creased both before and after a nuclear attack. 
Deterrence is, in effect, a form of bargaining and 
could occur not only before but also during and 
after a war (including a war caused by a failure of 
deterrence )—i.e., even if deterrence fails and war 
occurs, each side is likely to want to withhold some 
weapons in order to be able to threaten the enemy 
and may be anxious to settle the war while the 
other side still has not used its withheld weapons. 
The possibility of technological, tactical, or stra- 
tegic surprise also still exists. Despite current beliefs 
in stability, “Pearl Harbors” and “Munichs” remain 
possible (i.e., deterrence may not necessarily favor 
the defensive or status quo side); thus, those who 
take risks may find that their gamble has not 
failed. Finally, even though countries will be less 
likely to risk war, in some ways surprises and “un- 
expected” victories are more likely in a nuclear war 
era (i.e., war could occur as a result of correct cal- 
culations as well as by miscalculation). 

While mutual destruction and Pyrrhic victories 
can occur, such possibilities have been with us at 
least as long as recorded history, and the fact that 
they could occur days or even hours after the start 
of hostilities, rather than after weeks, months, or 
years, does not necessarily change the basic issues 
and situations that have arisen in the past. Thus, 
history may repeat itself in the thermonuclear era, 
which could experience successful wars and ag- 
gressions as well as mutually destructive ones. 


Differences in technology and doctrine 


The most important way in which the thermo- 
nuclear era differs from previous eras is that the 
changes in the available technology have, by them- 
selves, been large enough to change the strategic 
situation and the character of international rela- 
tions. This has resulted both in posing very differ- 
ent strategic issues and in changing the answers to 
old issues. Thus, in this period U.S. strategic weap- 
onry has changed (or is changing) from B-50 
bombers to B-36 bombers, to B-47s, and finally to 
B-52s; and from Atlas and Titan intercontinental 
ballistic missiles to early-model Minuteman and 
Polaris, and now to Minuteman-2s and Polaris A-3s. 
These changes have produced enormous differ- 
ences in the effectiveness of the forces. Even the 
change from Minuteman-1 to Minuteman-2 can by 
itself increase the effectiveness of a force by a 
large factor (doubling—or perhaps redoubling sev- 
eral times—the size of a nation’s missile forces), 
to the extent of altering the outcome of a war. 
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However, since all these changes have occurred 
without being tested in battle, one can question 
whether their full significance has even been ab- 
sorbed intellectually, much less in plans and pro- 
grams. 

Indeed, it is clear that in many respects the two 
superpowers, and certainly the other nations, have 
failed to understand these rapidly changing inter- 
actions between technology and doctrine. We can 
(with some violence to the subtleties, and viewing 
the subject from the viewpoint of the U.S. govern- 
ment) consider the two postwar decades as four 
five-year eras characterized as follows: 

1946-1950: Early fission and bomber era: entry 
of Soviet Union into nuclear club. 

1951-1955: Early thermonuclear weapons; max- 
imum “apparent” U.S. superiority; simple massive- 
retaliation theories; entry of United Kingdom into 
nuclear club. 

1956-1960: Early missile-middle thermonuclear 
weapons era; intense debates about deterrence and 
tactical nuclear war; continuation of (and prob- 
ably increased but less apparent) U.S. superiority, 

1961-1965: Middle missile era; two-day deter- 
rence; controlled-war doctrine; entry of France and 
China into nuclear club. 

In the early nuclear bomber era, the strategic 
balance was one-sided, for the Soviets did not test 
their first weapon until February 1949. However, 
the United States had not produced very many 
nuclear bombs, and it is doubtful that the U.S. 
strategic forces could have had anything like the 
decisive impact on a war that was popularly sup- 
posed—indeed, these forces might have done less 
damage to a Soviet war effort than the Germans 
did in World War 1 by invading and occupying 
Soviet territory. 

By the early 1950s the United States had sub- 
stantially increased its forces, but the Russians had 
scarcely begun to procure intercontinental bombers. 
They had a rather large force of medium bombers 
of the TU-4 (similar to the B-29) and Badger (like 
the B-47) types, but it now seems clear that both 
of these aircraft were designed and procured for 
European rather than intercontinental missions— 
although at the time no one in the United States 
or Europe seemed to realize this. Although thermo- 
nuclear weapons had been tested, the military 
stockpiles consisted almost wholly of kiloton bombs. 

Despite slow means of delivery (bombers) and 
relatively low-yield warheads, both U.S. and Soviet 
forces were almost incredibly vulnerable to sur- 
prise attack. At the beginning of the period all U.S, 
strategic forces were located at a dozen bases. 
Hours, perhaps days, would have been required to 


evacuate them and days, perhaps weeks, of warn- 
ing would have been necessary for them to be able 
to mount effective combat operations. Nuclear 
weapons were stored in a relatively vulnerable con. 
figuration (at first in one building and then in 
two). In the early part of this period, almost no 
one seems to have understood the subtleties of the 
problem of vulnerability. Active air defense was 
deployed to protect cities and nuclear research 
centers like Oak Ridge, Tennessee, and Hanford, 
Washington. Strategic Air Command bases were 
left unguarded, on the theory that no one would 
waste nuclear weapons on military bases. By the 
middle and end of the period, senior officers in the 
U.S. Air Force understood vulnerability somewhat 
but did not believe (it now seems correctly) that 
the Soviets had much actual operational strategic 
capability for a surprise attack. 

By the late 1950s, third- and fourth-generation 
nuclear weapons had been procured by the United 
States and a large Spectrum of such weapons was 
available to the U.S. forces, from small (“suitcase”) 
to multimegaton bombs, U.S. military planners 
and decision makers began to think of thermo- 
nuclear weapons as relatively inexpensive, but the 
Soviets still did not. However, the problems of vul- 
nerability were still inadequately appreciated; in 
fact, by the end of the period there was much dis- 
cussion of the existence of a “missile gap.” While 
the U.S. government conceded the existence of 
such a gap—and in fact was responsible for dis- 
seminating the estimates which gave it plausibility 
—it simultaneously argued there was no “deter- 
rence gap,” since the five hundred missiles which 
U.S. intelligence attributed to the Soviet Union 
would have been unable to do as much damage as 
the two thousand bombers that the United States 
Possessed. Congressional testimony and other docu- 
ments disclose that almost none of the top civilian 
officials and relatively few scholars and journalists 
understood that these five hundred Soviet missiles, 
if they existed, could probably have destroyed the 
two thousand American bombers on the ground 
in a surprise attack (but see Wohlstetter 1959; 
Kahn 1960). 

By the early 1960s the United States, at least, 
was well into the missile era, and almost every- 
body interested in such problems understood the 
distinction between “first-strike” and “second-strike’ 
tactics, forces, and postures. However, according to 
the 1963 testimony of Defense Secretary Robert 
McNamara, the Soviets had not yet hardened and 
dispersed their missile forces, although it was €x- 
pected that they would do so by the late 1960s. In 
the early 1960s some of the doctrinal lags of the 


late 1950s were revealed. For example, it was dis- 
closed that the more important half of the U.S. 
Semi-automatic Ground Environment Air Defense 
System, the part designed to control the air battle 
in defense of strategic centers on the American 
east coast, west coast, and Canadian border, was 
located on Strategic Air Command (SAC) bases 
and thus was almost certain to be destroyed or dis- 
abled in any war in which Soviet missiles were suc- 
cessfully launched at these SAC bases. Similar 
mistakes in both installations and weapons systems 
occurred elsewhere. As a result, there was tremen- 
dous concern about vulnerability and emphasis on 
such problems as “reciprocal fear of surprise attack” 
(expressed in terms of “gun duel” models of a 
strategic confrontation in which the side which gets 
off a “shot” first may escape all retaliation). 

By the end of this period the United States had 
begun to digest not only the preattack implications 
of two-way deterrence but also the possibility of 
intrawar or postattack deterrence, and therefore 
the need for restraint in the threat and use of 
force even after hostilities have begun. This re- 
sulted in the so-called controlled-response doctrine 
(Kahn 1960, pp. 171-175) and such policy state- 
ments as the one by Secretary McNamara that 
“principal military objectives in the event of a 
nuclear war stemming from a major attack on the 
Alliance, should be the destruction of the enemy’s 
military forces, not of his civilian population” 
(New York Times, June 17, 1962, pp. 26-27) and 
the statement of President Johnson in his defense 
Message to the 89th Congress that “our military 
forces must be so organized and directed that they 
can be used in a measured, controlled, and deliber- 
ate way as a versatile instrument to support our 
foreign policy” (1965, p. 825). 

Despite these statements, however, it becomes 
quite clear from other statements that both Secre- 
tary McNamara and President Johnson remained 
doubtful about the feasibility, likelihood, or even 
possibility of a controlled response in a major war. 
Moreover, there is little indication that the United 
States has thoroughly organized its forces (not to 
Speak of the NATO forces) around these concepts. 

The change from kiloton to megaton weapons in 
the early 1950s was, in some ways, as significant 
as the change from high-explosive to kiloton weap- 
ons in the mid-1940s. Until megaton weapons be- 
came available, it was unlikely that the U.S. Stra- 
tegic Air Command could have done as much 
physical damage to the Soviet Union’s war effort 
4s was actually accomplished by the German army 
in World War 1. In the mid-1960s, in the multi- 
megaton era, potential destruction from a thermo- 
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nuclear war is almost incredibly large. The duration 
of an “all-out” war against city targets is now likely 
to be closer to a minimum of 30 minutes than to 
a maximum of 30 days. Analysts no longer ask 
“What is destroyed?” but “What is left?” What has 
also become apparent, loose rhetoric to the con- 
trary, is that thermonuclear wars may come in 
many sizes and shapes. The analysts must, conse- 
quently, examine and estimate a large number of 
effects in a wide range of situations. 

First, prewar tactics and strategy, which may 
make an enormous difference to the outcome, must 
be studied. Then the immediate effects of blast 
and “prompt” gamma and thermal radiation must 
be examined, as well as the subsequent effects of 
primary and secondary fires and of fallout radia- 
tion during the first week or two. If the war has 
involved widespread targeting of civilian areas, 
there are problems of human survival, of radioac- 
tivity, and of damage to the physical environment, 
and problems of economic and social disorganiza- 
tion. These require consideration of immediate re- 
pair and “patch-up,” and economic and social reor- 
ganization and recuperation (problems which so 
far have hardly been dealt with, despite many mil- 
lions of dollars spent on research in these areas). 
Even in a controlled thermonuclear war the likely 
rate of economic and social recuperation (assum- 
ing the recuperation efforts themselves have been 
successfully launched) may be very different from 
those which would follow even a very destructive 
conventional war. There are also long-term en- 
vironmental problems, including the medical after- 
effects of radiation on exposed survivors. Finally, 
account must be taken of the genetic effects of 
radiation and long-range changes in the physical 
environment resulting from widespread radiation 
damage to plants and animals, large-scale fires, 
floods, possibly genetic changes in flora and fauna, 
and even weather changes. 

Many of these questions are unprecedented; all 
are complex; and almost all tend to arouse “open- 
ended” fears. But in some ways the appearance of 
total uncertainty on these issues is misleading, 
since the results of thermonuclear war, particularly 
the destruction of enemy weapons and, to some 
degree, cities, as well as many physical aftereffects, 
are easier to calculate than are the results of the 
clash of two land armies fighting a conventional 
war. However, the lack of experience in such wars, 
and the absolute necessity for making and relying 
on decisions and analyses made in advance, means 
that, from a decision-making point of view, the 
situation is much more complex and uncertain. 
Also, while one can often make relatively precise 
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estimates about the probability that something will 
be destroyed by some specific mechanism, it is 
often impossible to make precise statements that 
something has survived all the ways in which it 
could be destroyed. 


Deterrence versus denial (or defense) 


One result of the potential increase in destruc- 
tiveness combined with confusing complexities and 
uncertainties has been a major change in attitudes 
toward war. Such phrases as “absolute weapon” 
(doomsday machine), “balance of terror,” “live to- 
gether or die together,” and “war is unthinkable” 
(or impossible or obsolete) illustrate widely held 
and widely expressed attitudes and beliefs, Not 
only is such total destruction hard to visualize, but 
it leads to the feeling that the deterrence of war, 
or even of the threat of war, is a simple and logical 
consequence of the existence of nuclear weapons. 

The total destruction or “mutual homicide” in- 
terpretation of thermonuclear war has other com- 
forting aspects. If it be granted that each side can 
utterly and reliably destroy the other, then expen- 
sive preparations to reduce casualties, lessen dam- 
age, and facilitate postwar recuperation are useless. 
Can we not spare ourselves the financial burden of 
such preparations? This logic has sometimes been 
carried further, to argue that if modern weapons 
are so enormously destructive, then only a few 
are needed to deter the enemy. War can be deterred 
with much smaller forces than in the past; and we 
certainly don’t need larger ones. Most influential 
of all of these arguments is that if destruction is 
always total and automatic, we need not spend 
time and energy worrying about details, compari- 
son of risk, etc. 

All of the above has led to a well-articulated and 
explicit dependence on deterrence—on dissuasion 
through terror—and the belief that the major role 
of a nuclear force is that of a deterrent and bar- 
gaining tool. As President John F. Kennedy said on 
March 28, 1961, in his special message to the Con- 
gress on the defense budget: 


The primary purpose of our arms is peace, not war— 
to make certain that they will never have to be used— 
to deter all wars, general or limited, nuclear or con- 
ventional, large or small—to convince all potential 
aggressors that any attack would be futile—to provide 
backing for diplomatic settlement of disputes—to in- 
sure the adequacy of our bargaining power for an end 
to the arms race. (New York Times, March 29, 1961, 
pp. 16, 17) 


Whether based on objective capabilities or “re- 
solve,” the concept and use of deterrence are not 


new. However, today there is almost total empha- 
sis on mutual terror and destruction—that is, the 
countervailing power does not emphasize its ability 
to negate the acts of the aggressive or active power, 
or to destroy or block his forces militarily, as in 
the past, but emphasizes instead its ability to harm 
the population, resources, or property of the op- 
posing power. An explicit distinction is made today 
between deterrence and what is sometimes called 
denial (or defense), which is the physical preven- 
tion or alleviation of an action (as opposed to a 
“psychological” prevention, based mainly upon the 
threat of pain or destruction of other values). 
The relationship between deterrence and denial— 
the various ways in which they can reinforce or 
conflict with each other—has become the subject 
of much discussion (e.g., Kaufmann 1956; Kahn 
1960; 1962; Schelling 1960; 1966). 

Deterrence is, of course, a complicated relation- 
ship, and in trying to analyze deterrence situations, 
one may elaborate on Raymond Aron’s well-known 
questions by asking: . . . Who deters, influences, 
coerces or blocks whom from what actions (alterna- 
tives), by what threats and counteractions in what 
situations and contexts, in the face of what counter- 
threats and counter-counteractions? . . . and why 
does he do it? ... 

One can group the italicized variables into three 
groups: (1) political (who, whom, and why); 
(2) scenario (alternatives, situations, and context); 
and (3) military (actions, threats [or counterac- 
tions], and counterthreats [or counter-counterac- 
tions]), While most discussions of thermonuclear 
war generally emphasize only one or another of 
the above sets of variables, all three sets must be 
considered in an integrated way. (Only two works 
seem to have attempted this: Kissinger 1957; 
Strausz-Hupé et al. 1959.) Failure to do so con- 
stitutes a major source of misunderstandings about 
deterrence. 2 

It is important to distinguish between what is 
sometimes called “passive” and “active” deterrence. 
“Passive” deterrence refers to a situation in which 
a nation has been so provoked (perhaps by a direct 
attack on its population) that the response is al- 
most automatic. “Active” deterrence applies to 4 
situation in which the “proper” response to prov% 
cation is clearly further escalation (perhaps ones 
ally has been struck). In this situation it takes an 
act of will to respond. Rather than being an auto- 
matic act of revenge, the response may start a 
sequence leading to one’s own destruction. 

It is also a mistake to treat deterrence as an 
either/or situation, instead of considering degrees 


of deterrence. In most situations one could usefully 
distinguish at least six levels of deterrence between 
the United States and the Soviet Union (Kahn 
1965, pp. 277-280): 

(1) Minimum (or low-level): (a) deterrence 
by uncertainty (including the use of dire but unbe- 
lievable threats as a declaratory policy but pre- 
emptive or preventive accommodation tactics as 
an action policy); (b) deterrence by threshold; 
(c) deterrence by taboo. 

(2) Workable: reliable ability to threaten, say, 
1-10 million casualties or unreliable ability to 
threaten, say, 10-50 million casualties. 

(3) Adequate: reliable ability to threaten, say, 
10-50 million casualties or unreliable ability to 
threaten, say, 50-100 million casualties. 

(4) Reliable: reliable ability to threaten, say, 
50-100 million casualties. 

(5) Approaching absolute: reliable ability to 
threaten, say, 100-200 million casualties. 

(6) Near absolute (or stark): reliable ability to 
threaten overkill by a factor greater than 2, so that 
no miscalculation or wishful thinking could con- 
fuse the potential attacker. 

Although the above scale leaves out most of the 
subtleties associated with, for example, credibility 
issues, it correctly implies that there is a broad 
range of circumstances in which even a minimum 
deterrent might work and that there are circum- 
stances in which the most stark deterrent might 
fail. The scale also suggests that the question 
“How much deterrence does a nation need?” can- 
not be answered simply by “As much as possible.” 
Rather, a range of scenarios, asking the who- 
whoni-why kind of question not discussed here, 
must be examined, and, most important of all, the 
degree of assurance necessary for various situations 
must be weighed against the various costs of going 
higher on the scale of deterrence. 

Deterrence clearly involves dissuasion by use of 
threat of varying degrees and kinds of terror, which 
in turn assumes that the deterree (and even the 
deterrer) possesses some degree of rationality— 
but usually very little: about as much as is demon- 
Strated by a child who has learned not to burn 
himself or climb out of windows. Nevertheless, the 
emphasis on terror and rationality raises much 
apprehension, particularly among members of the 
Peace groups but also among decision makers and 
analysts generally. This is not only because subtle 
and misleading situations can arise but also be- 
Cause many people feel, often reasoning from the 
Psychology of the neurotic and psychologically 
disturbed, that threats and terror may attract rather 
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than repel and that rationality is thus a weak reed 
on which to rest our hopes for preventing thermo- 
nuclear war. One can, however, fully share these 
apprehensions and still feel that deterrence is, for 
the time being, the safest and most moral alterna- 
tive available. But while apprehensions are often 
exaggerated, even the most optimistic may still 
wish some protection against the failure of deter- 
rence and may work to improve and eventually to 
reform the deterrent system. [See DETERRENCE.] 


Changes in prospect 

By the late 1960s at least five or six countries 
should have reached India’s position in 1965, i.e., 
that of having completed almost all research nec- 
essary to assemble and test a nuclear device but 
having stopped short of developing a “working” 
model (a matter of perhaps less than a year and 
less than a $1 million cost) because of governmen- 
tal reluctance to authorize this final step. This 
situation may become typical, and it would not sur- 
prise most scholars if there were no new entries 
into the “nuclear club” during the next decade. As 
of 1966, there have been no public entries into the 
nuclear race for more than ten years: China and 
France made and announced their initial decisions 
to become nuclear powers before this period. 

In terms of delivery systems, the late 1960s will 
be the early missile era, in that most nations will 
not have achieved the kind of capabilities that the 
United States will possess. The United States, in 
turn, will probably have entered the mature missile 
era, since at least some U.S. missiles will be cheap, 
reliable, and relatively small in size; will possess 
great range, good accuracy, and reasonable payload 
capability; and will be capable of complicated or 
sophisticated tactical performance, 

The cost of a simple Minuteman missile system 
is likely to be something between $1 and $2 million 
a year per missile to buy and maintain, as long as 
there is reasonable access to production and oper- 
ating techniques comparable to those of the United 
States. These costs are likely to apply to missile 
systems with at least hundreds but not necessarily 
thousands of missiles. Thus, on the basis of U.S. 
costs, any nation will be able to have, say, a 500- 
missile Minuteman-type strategic force for a budget 
of between $500 million and $1 billion a year. 

There are many who now believe that the kind 
of revolutionary technological changes that have 
occurred since 1945 will, for practical purposes, 
have come to a stop. This belief is usually based 
on various versions of overkill theories and the 
conviction that nuclear “stalemates” are not sus- 
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ceptible to breakthroughs in technology. The argu- 
ment runs as follows: 


From 1965 on, or soon afterward, both the United 
States and the Soviet Union will be able to inflict 
unacceptable damage on each other under all circum- 
stances, using more or less existing, or moderately 
improved, systems. Thus, if one nation or the other 
provides a new way to damage or destroy its potential 
opponent's civilian society, the deterrent equation is 
not affected. For this reason the many possible tech- 
nological improvements in offensive power (against 
civilians) will not affect international relations. The 
same will probably be true in the unlikely event that 
there are major improvements in defense (or counter- 
force) capability. The current stalemate is at a high 
enough level so that each nation will still be able to 
inflict unacceptable damage in retaliation, even if its 
capability is cut by a large factor. This will be true 
practically, even if it is not true theoretically, because 
if one side does invent and procure some breakthrough, 
it cannot rely on the system’s working properly (per- 
fectly?) the first time it is put to the test and will 
therefore be “deterred by uncertainty,” and deterred 
almost as effectively as if the deterrence rested on 
objective capabilities and calculations. Therefore, at 
least in the area of central war (i.e., nuclear war 
involving the homelands of the major powers), there 
is little or no interest in examining technology with 
the same intensity that was necessary in the last two 
decades, 


Even though it ignores many important issues, 
this argument is persuasive and may prove to be 
practically correct as far as the politics of U.S.— 
Soviet confrontations are concerned—although 
both countries will still work at improving their 
forces, and such improvements might make a 
dramatic difference in the outcome if deterrence 
ever failed. In any case, as Figure 1 shows, budgets 
for central war forces have been declining over the 
years (in the late 1950s the total budget was prob- 
ably close to $15 billion). The data suggest either 
that thermonuclear war is getting much less expen- 
sive or that the United States is becoming satisfied 
with much less capability (actually, both effects 
are occurring ). 

The argument leaves open the question of a 
confrontation between one of these central super- 
powers and an nth country or a confrontation 
between nth countries. In these situations many of 
the points made above lose much of their force, 
Although such lesser confrontations do not seem 
to raise the question of the future of the entire 
world as immediately and intensely as U.S.Soviet 
ones, the most significant technological arms race 
will probably go on “outside” the direct confronta- 
tion of the two superpowers (unless, of course, the 
superpowers go in for elaborate arms control or 


active and passive defense, in which case technol- 
ogy will again make a great difference). There will 
in any case be a widespread distribution of long- 
range missile technology using both solid and 
liquid fuels. Missile technology will proliferate more 
rapidly than weapons technology because of lesser 
secrecy, due, in part, to the much greater applica- 
tion of missile technology to normal peacetime re- 
search and engineering. 

The strategic situation will probably be very 
much affected by whether or not the United States 
and/or the Soviet Union allocate sufficient re- 
sources to active and passive defenses to keep 
ahead of the later entries into the nuclear club. 
If either spends, say, $5-$10 billion a year in this 
area, they will probably preserve an important, 
perhaps overwhelming, strategic asymmetry. If 
they do not do this, and the likelihood is that they 


BILLIONS 
OF 
DOLLARS 

12 


n 


10 


1962 1963 1964 1965 1966 1967 1968 1969 1970 
FISCAL YEAR 
Figure 1 — U.S. budgets for central war forces* 


* Includes procurement, maintenance, and operation of strategic offense and 
active and passive defense of the United Stotes, but no R & D programs: 


Source: Based on U.S. Department of Defense figures 


will not, the situation will be closer to that illus- 
trated by an example drawn from the American 
West, where the six-shooter was the great equal- 
izer, and there will be a lesser tendency to differen- 
tiate between superpowers and great powers. 

If there is widespread proliferation, the situation 
will doubtless be quite dangerous; but despite the 
obvious dangers, it is not likely that a war between 
two small or medium powers, or even large powers, 
would trigger an orgy of destruction in which 
every power joins. In a world of nuclear prolifera- 
tion the concept of controlled response will be 
firmly fixed in everybodys mind, and it is incon- 
ceivable that many, if indeed any, nations will 
want to enlarge a nuclear war in which they are 
not directly and vitally involved—and perhaps not 
even then. It may even be that a nuclear war may 
occur between two smaller powers in which both 
sides are wiped out, or even in which one side 
“wins” but at a cost which results in a sobering, 
Not to say chilling, effect on all. Most dangerous of 
all, one power may win clearly and easily, and 
succeed in keeping its winnings. This might be a 
great spur both to further proliferation and to risk 
taking by some of the more aggressive nuclear 
powers. 

By the late 1970s, in short, the technological and 
economic possibilities for proliferation will be such 
that unless there are extreme implicit or explicit 
constraints against it, a widespread or even explo- 
sive proliferation can be expected. By the 1980s 
we will be either in the postnuclear era or in what 
may be thought of as the mature nuclear era. That 
is, we will be living either in a world in which large 
numbers of powers have nuclear weapons or in a 
world in which nuclear weapons have been effec- 
tively controlled. 


Attitudes toward thermonuclear war 


Parallel to the acceleration of the advances in 
Modern technology in the twentieth century, there 
has been a growing feeling that technological leaps 
forward (rather than morality) have made war un- 
acceptable. However, another reason why war is 
“unacceptable” should also be noted: the existence 
of a “peace of satisfaction” (Aron 1962). To illus- 
trate this phrase, consider Latin America today, 
where for the last two decades there has been 
almost no major threat of war. The explanation 
obviously has nothing to do with nuclear weapons. 
Yet if these nations possessed such weapons, almost 
everyone would ascribe the relative peace of the 
Period to the existence and effectiveness of nuclear 
deterrence. Similarly, most of the large nations in 
the world today are deterred—but even more, they 
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are defensive, prudent, and in large part supporters 
of the status quo. Nevertheless, nuclear weapons 
do pose the possibility of mutual destruction and 
thus raise the question of war's practicality or use- 
fulness as a social institution in an entirely differ- 
ent way than does “conventional” war. The answers 
to this question reflect at least three common atti- 
tudes toward nuclear weapons: 

(1) An unqualified rejection, which also includes 
a rejection of deterrence—i.e., it is argued that 
since deterrence can fail, we must move to a situa- 
tion in which the weapons themselves no longer 
exist. This condition would doubtless be desirable, 
but it is difficult to see how it is likely to be achieved 
without much more peaceful evolution—or crises 
or wars—than seems likely in the immediate fu- 
ture. It seems unlikely that the threat of nuclear 
war will be “settled” through peaceful evolution 
and international cooperation in the next decade or 
so, although these may help facilitate the “eventual 
settlement.” 

(2) A more common attitude of qualified rejec- 
tion of nuclear weapons as useful for any purpose 
except to deter their use by others. All the various 
kinds of finite deterrence strategies and basic deter- 
rence strategies express this attitude, and little or 
no preparation is advocated for alleviating the con- 
sequences if deterrence fails. 

(3) A qualified acceptance of nuclear weapons 
or resignation to their continued existence and 
possible use. Acknowledging all of the points made 
earlier on ways in which the world situation has 
not changed, some of these “acceptors” believe in 
a self-defeating prophecy—i.e., that deterrence will 
be much more likely to fail if too many people 
believe that “war is unthinkable.” Some of this 
group also argue that even during a war there 
might be deterrence of many actions and thus some 
degree of control. Few of this group would argue 
that such intrawar deterrence can be relied on. All 
believe, however, that the likelihood of maintain- 
ing control, even after nuclear weapons have been 
used, is so high that attempts should be made to 
exploit it. Finally, there are those who wish also 
to support a bargaining position, or to hedge, 
against a loss of control, by having some active 
and passive defense and some offensive counter- 
force ability. 

Whatever the attitude about the feasibility of 
various solutions, hardly anyone considers the 
arms race or large-scale war useful in international 
relations. President Kennedy expressed, in two 
aphorisms, what seems to be the consensus of man- 
kind. The first was in his UN address of September 
25, 1961: “The weapons of war must be abolished 
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before they abolish us”; the second, in his budget 
message of January 18, 1962, was that we must 
“retain for ourselves a choice other than a nuclear 
holocaust or retreat.” The first statement counsels 
us to change the current international system; the 
second, to maneuver successfully within it. The 
dilemma of the nuclear age is usually posed by argu- 
ing that these two imperatives are both valid and 
mutually exclusive. And indeed, although it may 
prove impossible to change the current system until 
after it has failed conspicuously and perhaps dis- 
astrously, the second imperative could also help to 
deal with the first one. 

In any case, we seem, for the time being, to be 
succeeding in avoiding both disastrous appeasement 
and withdrawal or escalation to catastrophic levels. 
To hope that this success can be perpetuated is to 
hope for more than just the accurate and detailed 
awareness of the need to maintain military and po- 
litical strength while limiting the new potential for 
unprecedentedly rapid, long-range, and widespread 
destructiveness. These efforts, of course, are more 
likely to avert eruption of a cataclysmic war or ap- 
peasement only if there is understanding of all the 
dangers and options, In addition, increased aware- 
ness of all the issues may motivate sufficient inter- 
national reform and simultaneously constrain all 
national decision makers to exhibit a consistently 
high degree of caution, reasonableness, and re- 
straint in their dealings with one another, whether 
in peace or in limited war. The issue is, of course, 
still open. 

HERMAN KAHN 


(See also DETERRENCE; DISARMAMENT; LIMITED WAR; 
TECHNOLOGY, article on TECHNOLOGY AND INTERNA- 
TIONAL RELATIONS. Guides to other relevant material 
may be found under INTERNATIONAL RELATIONS; 
Wanr.] 
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A study of the state of nuptiality in the world 
and of its trends requires a preliminary examina- 
tion of the types of conjugal unions and of the 
circumstances through which they are established. 

Monogamous unions are by far the most fre- 
quent; but we must devote special attention to the 
polygamous unions, which are particularly impor- 
tant in African populations. 

When marriages are the occasion of legal pro- 
cedures, as in Western countries, they lend them- 
selves to demographic observation in the most 
favorable way, and they permit a systematic regis- 
tration by the agencies responsible for the collection 
of statistical data. But in some cases legally recog- 
nized unions may be the occasion simply of reli- 
gious ceremonies; or they may be the result merely 
of traditional processes characteristic of certain 
ethnic groups (customary marriages). 

Finally some unions which have existed without 
any celebration (free unions) may sometimes pre- 
sent enough stability and generality to deserve the 
demographer’s attention; consensual unions in 
Latin America are an instance. 
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Some demographic considerations. Various con- 
straints originating in demographic mechanisms 
are most important in influencing the way couples 
are formed within a population. First we shall con- 
sider the marriages between single persons in a 
monogamous society. 

From a strictly numerical point of view, it is not 
always possible to match precisely the men and 
women in the usual marrying ages because of the 
frequent imbalance between the sexes. If the choice 
of a spouse is limited further to a small social, 
religious, or geographical group, random fluctua- 
tions may cause a sizable imbalance, so that a 
large number of persons may never marry. In this 
respect, the increasing mobility of people in the 
developed societies of today has enlarged the 
groups within which unions are established; so- 
cially isolated groups, or isolates, as demographers 
call them, are disappearing. This is probably one 
of the causes of the rise in nuptiality witnessed in 
certain countries; in other instances the increased 
mobility has mainly resulted in a considerable re- 
duction of the number of consanguineous mar- 
riages. 

Let us now examine what factors are capable of 
creating an imbalance of the sexes at the ages 
where marriage most frequently occurs when the 
population is large and random fluctuations are not 
operative. First, there is a surplus of male births 
(105 boys per 100 girls on the average). This 
surplus decreases with age as a result of excess 
male mortality and is finally resolved into a surplus 
of women. To visualize the effect of mortality de- 
cline on the sex ratio at the ages when people 
marry, it is possible to compute the number of 
men aged 27 years per women aged 23 years in a 
stationary population; for an expectation of life of 
30 years, men are outnumbered (97 per 100 
women), but they are the majority (104 per 100 
women) when expectation of life reaches 73.9 
years (United Nations 1956). Most populations are 
growing, and this, when the expectation of life 
remains at a low level, increases the shortage of 
men still further. Thus, if we assume a population 
growth rate of 3 per cent a year—now common in 
underdeveloped countries—there would be a defi- 
nite shortage even if the mean length of life 
reached 73.9 years (in this particular case, there 
would be approximately 92 men aged 27 years per 
100 women aged 23 years). 

However, one of the most important sources of 
imbalance between the numbers of men and wom- 
en eligible for marriage is to be found in migra- 
tion, since movements of persons, and most of all 
movements across boundaries, mostly involve men. 
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As a result, countries of emigration usually experi- 
ence a deficit of young men, while countries of 
immigration have a surplus. 

Finally, there are irregularities in the age dis- 
tribution caused by exceptionally heavy war losses 
and by birth deficits (from wars and economic 
crises); such irregularities may interfere with mar- 
riage in certain cohorts of males and females be- 
cause of the resulting imbalance between the sexes. 

So far, in order to simplify, we have dealt only 
with marriages between single persons, but it is 
obvious that the frequency of marriages in a popu- 
lation is also a function of the relative frequency 
of dissolution of marriages (by widowhood or di- 
vorce) and of ensuing remarriages. The situation 
is even more complex when there are polygamous 
unions. These require a special discussion. 

Polygyny. Although polygamous marriages are 
expressly allowed by Muslim law, which recognizes 
the right of polygyny up to four wives, these mar- 
riages do not appear to have great demographic 
importance in the Muslim countries along the 
Mediterranean and in Iran. The political develop- 
ments of recent years can only strengthen this 
trend; for instance, polygyny is now outlawed in 
Tunisia and discredited in several other countries 
(including Iran and Egypt). 

It is mostly among the populations of sub- 
Saharan Africa that polygyny has a notable impor- 
tance. It is characterized not by a small number of 
husbands with a great many wives each, but bya 
large number of husbands with only a few wives 
each (most frequently two wives per husband). 
However, the number of women living in a popu- 
lation sets a limit to polygyny, and it is essentially 
practiced by elderly men who had at first been 
monogamous. Usually a man does not wed several 
wives at the same time, but successively at various 
points in time. This is demonstrated by the study 
of the average number of wives according to the 
age of their husband (see Table 1). A number of 


Table 1 — Polygyny in west Africa: selected areas 


AVERAGE NUMBER OF WIVES 


Age of Senegal 
husband Malit Senega Valley’. ‘Tago! 
15-19 1.00 1.00 1.00 1.06 
20-29 1.05 1.04 1.02 1.21 
30-39 1.17 1.13 1.12 1.46 
40-49 1.33 1.27 1.30 1.63 
50-59 1.38 1.36 1.44 171 
a. Niger Valley only. 
b. Dakar only. 


© Senegalese and Mauritanian sides of the valley. 
d. Kabrai region only. 
Source: Adapted from Blanc 1959, p. 23 
in the 1962 edition. 


these wives of polygynists are actually “inherited” 
widows, but it is not possible to find out what pro- 
portion of the polygynous unions these second or 
subsequent marriages represent. 

Finally, in those populations where polygyny is 
very widespread, almost every woman has been 
married at least once, and her first marriage has 
been as a rule very precocious. Never-married men 
represent a higher proportion, Fast-growing towns 
in particular have a substantial excess of men, 
many of whom are therefore prevented from mar- 
rying. 

Consensual unions. Consensual, or “free,” un- 
ions play a variable part in different human socie- 
ties. Their importance in Latin America and their 
characteristic features in that region deserve dis- 
cussion. 

According to Mortara (1963), a consensual 
union is not an introduction or a conclusion to 
marriage but rather a substitute for it. Accordingly, 
a first pregnancy resulting from it will not encour- 
age the couple to get married; if they do marry, it 
is usually after long and fertile cohabitation. The 
frequency of consensual unions (which, according 
to some, are the posthumous legacy of the sexual 
promiscuity characterizing a slave society) is diffi- 
cult to measure with precision because of omis- 
sions in the censuses. It is interesting to note that 
in Haiti, for instance, for every 100 married cou- 
ples, 300 couples were enumerated as living in 
consensual union; in Guatemala, the corresponding 
number of consensual unions was 213, in Panama 
120, and in El Salvador, Honduras, and the Do- 
minican Republic almost 100. In Nicaragua, Cuba, 
Jamaica, Trinidad, Venezuela, Peru, Paraguay, and 
(probably) Brazil, the number ranged between 50 
and 90. 

Monogamous legal marriage. What prevails in 
most human societies is monogamous marriage 
solemnized by a civil or religious ceremony giving 
to the union its legal character. The relative uni- 
formity of the institutions permits comparison be- 
tween a wide range of societies. By first comparing 
the proportion of never-married at various ages, it 
is possible to judge whether first marriages are 
more or less frequent, as well as how precocious 
they are (see Table 2). 

As a rule, Western countries have the least in- 
tensive nuptiality (i.e., the highest proportion of 
single persons aged 45 to 49 years—a proportion 
that is very close to the proportion of those who 
never marry at all, if we disregard the effects of 
mortality). There are, however, significant varia- 
tions among countries; in countries of emigration, 
for instance, women who never marry are more 


Table 2 — Per cent single in various countries: selected 
ages and years 


AGE AND SEX 
20-24 25-29 45-49 
Country Year ME M F M F 

Algeria’ 1948 68 23 37 10 Sa? 
Ceylon 1946 80 29 43 12 8 3 
England and Wales 1951 76 52 35 22 10 15 
Formosa 1930 52 15 19, 34 4 0 
France 1954 82 57 38 23 11 10 
Germany 1939 91 64 51 31 7 15 
India 1931 35.35 14 2 es | 
Ireland 1951 95 82 77 54 32 26 
Japan 1951 83 55 34 15 seh} 
Korea 1930 3 «2 10 1 1 0 
Portugal 1950 84 65 44 35 12 17 
Spain 1940 94 79 63 44 9 14 
Sweden 1950 84 60 49 26 16 18 
U.S.S.R. 1926 51 28 18 9 3 4 
U.S.A. 1950 59 32 24 13 9 38 
a. Muslim populotion only. 

b. Age 40-49. 


Source: For Ceylon, Formosa, Germany, India, Japan, 
Korea, Spain, and t e Hajnal 
1953; data for remaining compiled 
from national census statistics. 


numerous than men. Cases in point are Germany, 
England and Wales, Spain, and Portugal. The 
never-married are numerous in Sweden and still 
more numerous in Ireland, where as an additional 
factor, marriages take place particularly late in life. 
According to the 1926 census results, the Soviet 
Union had intensive and early nuptiality; judging 
from recently published crude marriage rates, this 
does not seem to have changed. 

Non-Western countries. In the rest of the world, 
culturally very different from the Western coun- 
tries, the intensity of nuptiality is very great, In 
several countries (India, Taiwan, Korea) nearly all 
women marry at least once. There is a need here 
for careful study of the respective traditions of 
these countries: the laws that govern family forma- 
tion, the function of marriage, and the concept of 
the family are all found to be widely different from 
those in the West (Goode 1963). 

India is probably the Asian country for which 
Studies of nuptiality have been the most compre- 
hensive and exhaustive. An Indian marriage must 
be preceded by an engagement which has a binding 
character comparable to that of marriage itself in 
a Western country. Indeed, this is what Indian 
Studies of nuptiality identify as the beginning of 
Marriage; in the case of women it constitutes a 
une child marriage, their mean age at marriage 
being around 14.5 years (according to the 1951 
census), with some variation by caste and religion 
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(Agarwala 1957). Nuptiality in India is also af- 
fected by a prejudice against the remarriage of 
widows. However the practice of remarriage is 
tending to spread, particularly in the lower castes 
(Dandekar 1963). 

Nuptiality in Western history. The evolution of 
nuptiality can be treated only in the West, by which 
we mean Europe and the English-speaking coun- 
tries outside Europe, since in many of these coun- 
tries accurate records have been kept since more or 
less remote times. As a rule, nuptiality is a phe- 
nomenon that does not permit long-term fluctua- 
tions of an importance comparable to those which 
affect other demographic phenomena, such as fer- 
tility and mortality. In the absence of mortality, 
between 70 per cent and 95 per cent of any male 
or female cohort end up marrying; the various 
Western countries have at all times been within 
that range. 

Sizable variations may nevertheless occur in the 
distribution of ages at marriage, for males as well 
as for females. There are long-term fluctuations 
(usually in the direction of earlier marriage), and 
“accidents” attributable to wars and economic 
crises. On the whole, it is the marriages that finally 
take place after having been postponed which pro- 
duce the most obvious disturbances in the yearly 
trend of nuptiality. 

We must also take into account the frequency of 
marriage dissolutions by widowhood and divorce, 
and of ensuing remarriages; taken together, these 
two factors regulate the total number of remar- 
riages. The number of divorces is rising continu- 
ously in most countries, but the mortality decline 
has considerably reduced the number of widowed 
men and women for the largest part of adult life. 
The combined effect of these trends on nuptiality 
is not easy to ascertain. In a different connection, 
it is not impossible that the fact that divorce is 
granted easily in many countries is an encourage- 
ment to marry, since legal dissolution appears to 
be a possible way out of an unhappy union (Goode 
1956). 

The available historical series indicate that the 
intensity of nuptiality for single persons has not 
varied much during the last 100 to 150 years. Now- 
adays, however, marriages are demonstrably taking 
place at an earlier age. In France, where cohort 
analysis has been done since the first half of the 
nineteenth century, the mean age at first marriage 
has moved from 28.7 years to 25.9 years for the 
men, and from 26.1 to 23.1 years for the women, in 
a little over 100 years. During the same period, the 
proportion of single persons has kept very close to 
10 per cent, the maximum range of variation being 
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less than two points on either side (Chasteland & 
Pressat 1962). 

Any estimates of nuptiality at earlier times must 
rely on more partial statistical information based 
on monographs. The evidence indicates that nup- 
tiality in rural France during the eighteenth cen- 
tury had approximately the same characteristics 
(intensity and precocity) as 100 years later. Thus, 
in that country, the changes in nuptiality have been 
restricted to the most recent period, and they have 
consisted mainly in an earlier age at first marriage 
(by about 3 years during the last 100 years). 

Despite this over-all stability, certain social 
classes have reacted in different ways as a result 
of peculiar circumstances. A good example is the 
nuptiality crisis in Geneva’s middle class during 
the eighteenth century under the pressure of popu- 
lation growth and the ensuing difficulty of making 
an independent livelihood. Permanent celibacy, 
which concerned less than 10 per cent in each 
cohort before 1600, was the state of 25 to 30 per 
cent of men and women around 1750; at the same 
time the mean age at first marriage had risen by 
five years to reach 32 years among the men, and 
27 years among the women (Henry 1956). 

Other interesting examples of significant move- 
ments of nuptiality can be found in the recent his- 
tory of some countries of northern Europe (Sweden, 
Norway, Ireland), the case of Ireland being the most 
noteworthy. It appears that a reaction to the great 
famine of 1840, and the considerable movement of 
emigration which followed, caused the present pat- 
tern of late and infrequent nuptiality among men 
and women. 

The new fact in the history of Western nuptiality 
is the rise in the annual frequency of marriages in 
a number of countries (the Scandinavian countries, 
England and Wales, the United States). This rise 
is probably related to an increased precocity of 
marriage. 

Very accurate cohort analysis has led in the 
United States (Ryder 1963) to the finding that in 
50 years the already mentioned trend toward earlier 
marriage (for the 1929-1930 cohort, the age at 
first marriage is 21.4 years for females and 24.2 
years for males) has been paralleled by a higher 
frequency of first marriages, which involve now 
96 per cent of the women and 92 per cent of the 
men. 
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[See also COHORT ANALYSIS; FAMILY, article on pisor- 
GANIZATION AND DISSOLUTION; FERTILITY; MAR- 
RIAGE.] 
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Although obesity, which indicates excessive ac- 
cumulation of fat tissue with a corresponding in- 
crease in size, has been the focus of medical 
interest since antiquity, it has not been well 
defined, Moderate stores of fat are desirable and 
indicate normal nutrition. Since only an excess of 
the physiologically desirable is regarded as abnor- 
mal, a quantitative, even arbitrary, element enters 
into the definition. A clinician of the last century 
(Ebstein 1882) divided obesity into three stages: 
the enviable, the comical, and the pitiable. In con- 
temporary American society no stage would be 
considered “enviable.” Even mild degrees of excess 
weight, or those that indicate abundant normal 
growth, are condemned. 

During the past half century, obesity has custom- 
arily been equated with overweight in relation 
to an “average” figure for height, age, and sex. 
Since in Western society weight commonly in- 
creases with age, it was considered a further refine- 
ment to refer to the weight of the young adult 
person as the “ideal.” Life insurance companies, 
which need a statistical approach for quick evalua- 
tion of a large number of people, have branded any 
excess weight as undesirable. Clinically, it is doubt- 
ful whether such a statistical concept of “normal” 
or “ideal” weight is valid. A recent examination of 
the total population of a certain area in the south 
of Sweden, for example, revealed that as much as 
70 per cent of the female population over 40 years 
of age were 10 kilograms “overweight” and 10 per 
cent were overweight by 30 kilograms or more 
(Bjurulf & Lindgren 1964). 

Many other objections have been raised against 
a rigid statistical approach. It has been criticized 


as being based on the implicit assumption that all 
body weight is equivalent and that overweight is a 
measure of relative fatness. Keys and Brozek 
(1953), who first gathered extensive evidence 
against the blind use of statistics, particularly in 
regard to moderately overweight persons, consid- 
ered some judicious pinches of various skin folds 
a more scientific way of estimating fatness. Refine- 
ments of this biological approach include the use 
of skin-fold calipers, soft-tissue roentgenograms, 
and estimates of the total body water through 
densitometry. These newer methods aid in obtain- 
ing more accurate measurements of the body fat 
in relation to the lean tissue (von Döbeln 1964). 

There is increasing awareness that clinical obes- 
ity does not represent just an extreme on the nor- 
mal distribution curve for weight, but that it is a 
symptom of true abnormalities, an expression of 
a variety of underlying clinical disturbances (Mac- 
Bryde 1964). There is increasing recognition that 
the term refers to a variety of decidedly different 
clinical pictures. Obesity that develops early in life, 
in infancy or before puberty, has an entirely dif- 
ferent course and significance from the gradual 
increase in weight in middle life (Bruch 1955). 

Entirely different problems are encountered in 
(1) those whose weight is stable or who have 
arrived at the stationary stage of obesity and 
(2) those who are in the active phase of progres- 
sively increasing weight or who, with constant 
fluctuation, show dramatic weight losses during 
which the fat seems to melt away, only to be re- 
gained even more rapidly. The clinical picture and 
the pathological processes are different again in 
superobese people who achieve a tremendous size, 
with weight in excess of 300 pounds—reaching up 
to 700 or 800. At the present stage of our knowl- 
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edge, or rather ignorance, it is not possible to differ- 
entiate between these different forms of obesity 
on the basis of any laboratory test; thus far it can 
be done only through evaluation of the whole devel- 
opment and reaction pattern (Bruch 1957a). 

Economic and sociological factors. Regardless 
of the physiological and psychological aspects to be 
discussed later, external factors influence the oc- 
currence of obesity to a conspicuous degree. In the 
United States, obesity has been called the number 
one health problem. Other affluent countries, too, 
express concern with overnutrition as a not alto- 
gether desirable by-product of increased and im- 
proved food consumption and labor-saving devices. 

Human history has been determined to a large 
extent by the search for sufficient food. Malnutri- 
tion is still prevalent in underdeveloped countries 
with inadequate nutritional resources, and starva- 
tion is a specter that even today threatens large 
sections of mankind. Such abundance of food for 
large population groups is historically very recent, 
and geographically it is limited to a few overprivi- 
leged countries. 

Yet the condition of obesity has been familiar 
to man since the dawn of history. The oldest rep- 
resentation of the human form, the “Venus of 
Wilendorf,” shows marked obesity, and so do other 
paleontological figurines. The taste for fat women 
persisted into the Neolithic period. It is not known 
whether these “Venuses” are realistic representa- 
tions or whether they represent an artistic ideal, a 
dream of abundance and fertility. 

In primitive society, a yearning for fatness per- 
sists. Cloete (1953, p. 119) describes the attitude 
of the typical South African who works for the 
white man: “. . . and what is his heart's desire? 
Fat above all things. To be fat himself, to have a 
fat wife and children and fat cattle. This is the 
native’s dream, the Biblical dream of plenty in a 
starving land.” 

A similar primitive desire for fatness was con- 
spicuous during the 1930s among the mothers of 
a large group of obese children who were studied 
in New York City (Bruch 1940). Many of the 
women had been poor immigrants who had suf- 
fered hunger during their early lives. They did not 
understand why anyone should object to a child’s 
being big and fat, which to them indicated success 
and freedom from want. 

Even in the United States, traditionally the coun- 
try of abundance, nutritional deficiency diseases 
were the object of concern fifty years ago; today 
they have become quite rare. In the last fifty years, 
American children, both of native and foreign-born 
parents, have become 6-8 per cent taller and 12— 


15 per cent heavier than their parents, and matura- 
tion and puberty occur at an earlier age. The same 
observation is being made in Japan, where the new 
generation, under the influence of improved nutri- 
tion, is outgrowing the short stature of the parents, 

Adequate food supply is thus a prerequisite for 
optimum weight and physical development. Yet 
under the conditions of abundance, only a certain 
number of people become obese. A rich literature 
exists on the occurrence of obesity in different 
societies and eras, and on outstanding individuals 
who have been fat. Fascinating as these reports are, 
they have the value only of anecdotal contributions 
and are based on hearsay tales, old chronicles, and 
speculation (Bruch 1957a; Clauser & Spranger 
1957). 

In spite of the great public interest in obesity, 
systematic inquiries into its sociological aspects 
are conspicuously rare. Public health figures indi- 
cate a change in the weight of adult Americans: 
since 1912 the average weight of women has de- 
creased by 5 pounds, whereas the average man is 
5 pounds heavier. A detailed evaluation of a large 
unselected population sample—1,600 adults living 
in a central residential area of New York City—led 
to the discovery of a striking relationship between 
obesity and social class (Moore et al. 1962). Obes- 
ity occurred seven times more frequently among 
women of the lowest socioeconomic level than 
among those of the highest level; among men the 
same relationship existed, although to a lesser 
degree. As other studies also noted, obesity was 
found to increase with age. 

Similar observations are reported in the south 
of Sweden, where the weight of women in the upper 
classes is lower and shows less variability than 
that of women of poorer economic background. 
However, men in the lower sociological group in 
Sweden do not tend to be overweight (Andersen & 
Esmann 1957). 

A recent German study gives a different picture 
(Pflanz 1963). Information on social factors was 
obtained for 10,000 patients who were seen during 
one year at the medical clinic of the University of 
Giessen. A weight excess of 15 kilograms, indicat- 
ing undisputed obesity, was observed in a sample 
of about 1,000 patients. This study revealed that 
the sociological factors associated with obesity 
were quite different in men and women. Conspicu- 
ously often, obesity occurred in men who had been 
only children. No such difference between the sexes 
was observed in those who came from a large sib- 
ling group. Single and divorced women and those 
living in cities were less obese than married and 
widowed women and those living in rural sur- 


youndings. Analysis according to social class re- 
vealed the greatest differences. In the group of 
independent farmers and small-business people, 
obesity was observed at approximately the same 
frequency in men and women. In the highest and 
lowest social class, the sex distributions went in 
opposite directions. Obesity was more frequent in 
women of the lowest class and in men of the high- 
est class. It seems that under the special conditions 
of German culture, obesity in men has an entirely 
different sociological significance from that in 
women; in men it seems to add to their sense of 
power and prestige. 

Physiological factors. The question of what 
causes obesity has challenged physicians since 
antiquity. Two lines of reasoning can be recognized 
throughout the medical literature: one suggests 
that obesity is due to some innate inherited factors 
and is therefore an unalterable fate; the other 
suggests that it is the result of deplorable personal 
habits—overeating and inactivity—that are mor- 
ally condemned as greed and laziness. Scientific 
understanding has been delayed by the tendency 
to consider heredity and environmental factors as 
mutually exclusive. They are not only not opposed 
but interact and influence each other in many ways 
(Stern 1949). This clinical deduction now finds 
support from genetics, namely, from recent evi- 
dence that in the cells of higher organisms a large 
part of the genetic material remains inactive and 
that the level of activity of a gene depends on en- 
vironmental factors. Changes in diet and hormonal 
secretion may serve to activate certain genes. They 
in turn are influenced by living conditions, climate, 
and the emotional state of the organism. 

The case for heredity is easily made in regard 
to obesity. A high family incidence is reported by 
many different investigators (Bruch 1957a; Daven- 
port 1923). Of course, it is just as easy to demon- 
strate that these families indulge in rich meals and 
tend to avoid physical activities. 

Other evidence of inherent factors is the preva- 
lence of certain constitutional types. In a group of 
180 obese adolescent girls, there was an expected 
high proportion of endomorphic types, but also a 
high incidence of mesomorphic types (Seltzer & 
Mayer 1964). A group of 250 obese boys and girls 
tended to be taller and to mature earlier than their 
age peers of normal weight (Bruch 1939a). An 
anthropological study of adult obese women showed 
the prevalence of a certain body type, namely, 
juvenile proportions that are associated with early 
maturity [Angel 1949; see also PSYCHOLOGY, article 
On CONSTITUTIONAL PSYCHOLOGY]. 

Micromorphologic study shows that fat tissue 
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varies greatly in size and number of cells (Hirsch 
& Goldrick 1964). It appears that the size of the 
cell varies with nutritional factors, whereas the 
number of cells is determined by endogenous, prob- 
ably genetic, factors. Individuals with the same 
thickness of subcutaneous fat may have different 
morphologic tissue structure. Coronary sclerosis 
appears to be associated with large fat cells but not 
with a high cell count (Bjurulf 1964). 

Powerful support for the genetic view came from 
the discovery of certain strains of mice that grew 
fat under ordinary laboratory conditions. As many 
as 15 types of obesity, with different hereditary, 
metabolic, regulatory, eating, and activity disturb- 
ances, have been recognized in the mouse (Mayer 
1963). It is questionable whether any of them has 
an equivalent in human obesity. The great merit 
of these extensive studies is the clarity with which 
they demonstrate the way multiple factors interact 
and how etiological and pathogenetic factors can 
and need to be separated. 

Clinical investigations reflect this stimulus in 
the greater diversification of problems that are be- 
ing studied. “Metabolism,” a generation ago, re- 
ferred to the balance between caloric intake and 
output. Metabolic studies today are directed toward 
clarifying the transformation of various foodstuffs 
into the metabolites that are deposited in the tis- 
sues. There is increasing evidence that there are 
forms of human obesity that are characterized by 
the increased formation of fatty acids from glu- 
cose and other foodstuffs and their decreased re- 
lease from fat deposits. Various pathways of the 
transfer and synthesis of fatty acids have been 
clarified (Elovson 1964; Gordon 1964). It has been 
recognized that the fat tissues themselves play a 
role in this process and that hormones enter into 
the delicate balance between lipogenesis and fat 
mobilization in various ways (Dole & Hirsch 1960). 
Once lipogenesis is increased, it cannot be reversed 
by simple dieting. The fat person's claim, “Every- 
thing I eat turns into fat,” and his sad experience 
that the painfully lost weight is readily regained, 
now has some scientific support. Other metabolic 
work deals with disturbances in the carbohydrate 
metabolisms, with differences in the glucostatic 
regulations (Mayer 1953). 

‘Another factor leading to disordered weight is 
found in disturbances in the regulatory mecha- 
nisms. Since the beginning of the century, it has 
been recognized that obesity may develop after 
midbrain lesions, in the presence of tumors, or 
following encephalitis (Bruch 1939b). Extensive 
animal studies with experimentally produced mi- 
croscopic lesions in the hypothalamus and other 
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midbrain regions have elucidated the importance 
of different cell groups in influencing appetite and 
the experience of satiation, and thus weight regu- 
lation. Here again, newer studies emphasize the 
complexity of the problem, namely, that not only 
the eating function is affected by these operations 
but also spontaneous activity and motivation (Ken- 
nedy 1964), Proven anatomical lesions are rare 
in humans, and there has been speculation about 
possible functional weakness of the regulatory 
“centers,” some aspects of which will be discussed 
in the section on psychological factors. 

Furthermore, animal experimentation has also 
demonstrated that environmental factors alone can 
result in severe obesity. If young rats are allowed 
food only two hours per day, they become markedly 
obese, with increased lipid synthesis in the adipose 
tissue, in contrast to litter mates who have free ac- 
cess to food and grow into normal rats (Hollifield & 
Parson 1962), The gain in weight continues, with 
voracious eating, when the restricted rats are re- 
stored to normal feeding. However, the cycle is 
interrupted, with corresponding metabolic changes, 
after a period of fasting. 

Psychological aspects. Throughout the ages, pop- 
ular opinion has attributed definite, although con- 
tradictory, character traits to fat people that reflect 
changing concepts of obesity. Attention has been 
focused on the inherent temperament of the fat 
person, or on the consequences of being fat and 
the insults of a derogatory social climate. The con- 
tribution of psychological factors to the develop- 
ment of obesity has been recognized only recently. 

Systematic psychological studies have led to bet- 
ter understanding and clinical management of 
obese people, and they have also revealed the com- 
plexity of the problem. Psychiatric observations 
first emphasized the need for differentiating among 
several types of human obesity and recognized 
overeating and inactivity as manifestations of un- 
derlying emotional and personality disturbances 
(Bruch 1957a). 

The psychiatric problems of obesity are far from 
uniform (Bruch 1957b), It is not possible to speak 
of the psychodynamics of one basic personality 
type or even to describe one single psychological 
feature as characteristic of all obese people. Obesity 
may be associated with every conceivable psychi- 
atric disorder, with neurosis as well as psychosis. 
Psychological problems can be recognized under 
two different conditions. In some patients with 
many signs of disturbed behavior and adaption, 
obesity is intrinsically interwoven with their whole 
development; others become obese as a reaction 
to some traumatic event. It is in the latter form 
that the significance of psychological factors was 


first recognized. Obesity seems to serve as an equiv- 
alent for a depressive reaction in people who in 
other respects have the capacity for adequate func- 
tioning. 

The total picture is much more disturbed in the 
developmental obesity that occurs in children and 
adolescents who grow up in families in which they 
are treated as objects—precious or hated posses- 
sions that are overstuffed with food and shielded 
from, or not trusted with, the ordinary tasks of 
life. In many respects their behavior and psycho- 
logical reactions resemble those of the preschizo- 
phrenic stage. Such patients may become overtly 
schizophrenic when the demands of life become 
too threatening. Enforced reducing at this time 
may precipitate the manifest psychosis. Overeating 
and inactivity appear as central symptoms in the 
patient's whole development and often reflect se- 
vere intrafamilial pathology. Such patients parallel 
preschizophrenic development in their general im- 
maturity, social withdrawal, and body image dis- 
turbances (Bruch 1958), 

Psychological and psychiatric studies of obesity 
have become so numerous that it is impossible 
even to enumerate them in a brief review. Various 
investigators have focused on different traits in an 
effort at classification, Stunkard (1959a), for in- 
stance, differentiates various forms of obesity ac- 
cording to eating patterns. He has also studied 
the extent and psychodynamic aspects of inactivity 
(Dorris & Stunkard 1957). 

Psychoanalytic thinking has played an important 
role in the psychological study of obesity and has 
helped to clarify the conflicts and motivations that 
underlie overeating, Food is symbolically equated 
with an insatiable desire for unattainable love, but 
also with inhibited destructive impulses. Food in- 
take may reflect self-indulgence, the desire for 
sexual gratification, or, the wish to be pregnant; 
it also may represent punishment of forbidden im- 
pulses. Preoccupation with food may appear as 
helpless, dependent clinging to parents or as hostile 
rejection of them. 

In recent years a skeptical note can be recog- 
nized in psychiatric writings. The question has 
been raised whether the observations made on 
selected patients undergoing extensive psychoanal- 
ysis could and should be generalized to apply to the 
enormous group of obese patients not so exten- 
sively studied and observed. There is also need to 
differentiate between the psychological forces that 
lead to obesity and the psychological suffering 
resulting from it. Monello and Mayer (1963) ob- 
served that the psychological responses of a group 
of obese adolescents were similar to those of a 
minority group. They also noted that the obese 


child came from a nonunified family with poor 
sociability and was afraid to leave home. In a 
sociological study in New York City, obese persons 
gave more pathological responses on nine measures 
of mental health (Moore et al. 1962). For three of 
these measures—immaturity, suspiciousness, and 
rigidity—the results were statistically significant. 
These observations suggest that serious emotional 
disturbances may be present in obese people other 
than those who were actually studied. 

There has also been a decided change in the 
direction and emphasis of psychiatric inquiry. In- 
stead of focusing exclusively on the symbolic mean- 
ing of the disturbed behavior, a more basic the- 
oretical question has been raised: How does it 
happen that a body function becomes capable of 
being misused in the service of motivational con- 
flicts and disturbed interpersonal relations? It has 
been recognized that deficits in perception and 
conceptualization are important factors in the psy- 
chopathology of obese patients. Inability to identify 
correctly hunger and satiation is conspicuous 
among many other indications of falsified aware- 
ness of bodily needs, emotional states, and inter- 
personal situations. Patients with such disturbances 
have difficulty in drawing correct conceptual con- 
clusions about sensations and impulses arising 
from within and differentiating them from those 
impinging from the outside [Bruch 1961; see also 
Bopy IMAGE]. 

This failure to develop an integrated and differ- 
entiated concept of a bodily self and psychological 
identity could be related to a deficit in earliest 
learning experiences, namely, the absence of con- 
firmation of child-initiated impulses. The impor- 
tance of disturbed family influences had been rec- 
ognized much earlier (Bruch & Touraine 1940), 
but as long as the abnormal interaction was re- 
lated entirely to emotional experiences, this basic 
disturbance in conceptual awareness remained 
unclear. The importance of conceptual aware- 
ness has been demonstrated experimentally. When 
measured amounts of food were introduced di- 
rectly into the stomach, marked individual differ- 
ences were observed in the accuracy of recognizing 
whether, and how much, food had been given. 
Some healthy, normal subjects were consistently 
accurate. Obese patients were significantly less 
accurate. Independently, Stunkard (1959b) ob- 
served that obese patients frequently failed to feel 
hungry in the presence of stomach contractions or 
to recognize such contractions. 


A comprehensive review in a limited space of 
Ongoing research on obesity offers severe, even 
unsurmountable difficulties. These very difficulties 
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signify a desirable and much needed development. 
They indicate that extensive research on this topic 
is being performed by investigators in many dif- 
ferent disciplines. The multitude of new findings 
may appear confusing and contradictory; yet there 
is slowly developing a convergence of opinion that, 
contrary to old concepts, obesity is not a uniform 
or simple condition but a symptom of multiple 
interaction of various factors, only a few of which 
have thus far been clearly recognized. 

This whole line of work is still very much in the 
beginning stages, but it promises a better under- 
standing of the problem through integrating find- 
ings in various fields of research. From the psy- 
chiatrist’s point of view, patients suffering from 
obesity go through life with inadequate guideposts 
for orienting themselves about their own function- 
ing and their relationships with others. They develop 
inaccurate behavior that becomes more and more 
inappropriate as, with their increasing age, the 
demands of life become more and more complex. 


HILDE BRUCH 


[See also Foon, article on CONSUMPTION PATTERNS; 
PSYCHOSOMATIC ILLNESS.] 
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SOCIAL OBSERVATION D SOCIAL CASE STUDIES 


The term “case study” comes from the tradition 
of medical and psychological research, where it 
refers to a detailed analysis of an individual case 
that explicates the dynamics and pathology of a 
given disease; the method supposes that one can 
properly acquire knowledge of the phenomenon 
from intensive exploration of a single case. Adapted 
from the medical tradition, the case study has be- 
come one of the major modes of social science 
analysis. 

The case studied in social science is typically not 
an individual but an organization or community. 
Case studies have been done of such widely varying 
phenomena as industrial towns (E. Hughes 1943), 
urban neighborhoods (Gans 1962), factories (Dal- 
ton 1959), mental hospitals (Goffman 1961), and 
the interconnections of slums, politics, and rackets 
(Whyte 1943). Case studies of individuals are, of 
course, also undertaken by social scientists, espe- 
cially in the form of the life history; but such 
studies, although often made by an earlier genera- 
tion of sociologists and psychologists (Thomas & 
Znaniecki 1918-1920; Shaw 1930; Conwell 1937) 
are now relatively rare (but see H. Hughes 1961 
and Williamson 1965). 

The social scientist making a case study of a 
community or organization typically makes use of 
the method of participant observation in one of its 
many variations, often in connection with other, 
more structured methods such as interviewing. 
Observation gives access to a wide range of data, 
including kinds of data whose existence the investi- 
gator may not have anticipated at the time he 
began his study, and thus is a method well suited 
to the purposes of the case study. 


Aims of the case study 


The case study usually has a double purpose. On 
the one hand, it attempts to arrive at a compre- 
hensive understanding of the group under study: 
who are its members? what are their stable and re- 
curring modes of activity and interaction? how are 
they related to one another and how is the group 
related to the rest of the world? At the same time, 
the case study also attempts to develop more gen- 
eral theoretical statements about regularities in 
social structure and process. 

Because it aims to understand all of the group’s 
behavior, the case study cannot be designed single- 
mindedly to test general propositions. In contrast 
to the laboratory experiment, which is designed to 
test one or a few Closely related propositions as 
rigorously and precisely as possible, the case study 
must be prepared to deal with a great variety of 
descriptive and theoretical problems. The various 
phenomena uncovered by the investigator's ob- 
servations must all be incorporated into his account 
of the group and then be given theoretical relevance. 

So stated, the aims of the case study can scarcely 
be realized; it is utopian to suppose that one can 
see, describe, and find the theoretical relevance of 
everything. Investigators typically end up by focus- 
ing on a few problems that appear to be of major 
importance in the group studied—problems that 
touch on many aspects of the group’s life and 
structure. Thus a community study (E. Hughes 
1943) may come to focus on the problems of indus- 
trialization and cultural contact, or a study of an 
urban neighborhood may focus on the relation be- 
tween ethnicity and social class (Gans 1962). 

The comprehensive goal of the case study, how- 
ever, even though it is not reached, has important 
and useful consequences. It prepares the investi- 
gator to deal with unexpected findings and, indeed, 
requires him to reorient his study in the light of 
such developments. It forces him to consider, how- 
ever crudely, the multiple interrelations of the par- 
ticular phenomena he observes. And it saves him 
from making assumptions that may turn out to be 
incorrect about matters that are relevant, though 
tangential, to his main concerns. This is because a 
case study will nearly always provide some facts 
to guide those assumptions, while studies with 
more limited data-gathering procedures are forced 
to assume what the observer making a case study 
can check on. 

The aims of the case study and the kinds of 
problems it ordinarily poses for study suggest par- 
ticular techniques of data gathering and analysis. 
After describing these, we will consider the uses, 
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scientific and otherwise, which may be made of ob- 
servational case studies. 


Techniques of observation 


In gathering data, the participant—observer en- 
gages in a number of different activities. One can 
distinguish several possible modes of proceeding, 
depending on the degree to which one is a partici- 
pant as well as an observer (Gold 1958). At one 
extreme, the observer may not participate at all, 
as when he hides behind a one-way screen in an 
experimental room; at the other, he may be a full- 
fledged participant, living in the community under 
study or holding a full-time job in the organization 
he studies and subject to the same life chances as 
any other member of the group. The particular 
techniques he uses are shaped by the demands of 
playing these different roles; a hidden observer 
cannot openly interview other participants, while 
a known observer may find that certain group 
secrets are systematically kept from him. 

The observer places himself in the life of the 
community so that he can see, over a period of 
time, what people ordinarily do as they go about 
their daily round of activity. He records his obser- 
vations as soon as possible after making them. He 
notes the kinds of people who interact with one 
another, the content and consequences of the inter- 
action, and how it is talked about and evaluated by 
the participants and others after the event. He 
tries to record this material as completely as pos- 
sible by means of detailed accounts of actions, 
maps of the location of people as they act (Whyte 
1943), and, of course, verbatim transcriptions of 
conversation, 

The problem of bias. The observer has the 
problem of trying to avoid seeing only those things 
which accord with his explicit or implicit hypotheses 
(Zelditch 1962). This kind of bias can occur in 
several ways. The observer, interacting with those 
he studies on a long-term basis, comes to know 
them as fellow human beings as well as research 
subjects; thus, he can hardly help acquiring feel- 
ings of friendship, loyalty, and obligation, which 
may make him wish to protect some members of 
the group by not seeing those events which would 
render them liable to criticism. Some persons or 
factions may see his research as dangerous and try 
to keep him from seeing certain aspects of group 
activity (Dalton 1959). Finally, he may feel that 
certain events are so distasteful or personally dan- 
gerous (for example, the activities of homosexual 
networks or violent gang conflict) that he is un- 
willing or afraid to remain close enough to the 
participants to see what actually happens. 
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Bias can be avoided by carefully rendering a 
complete account of all events observed; by seek- 
ing to cover all varieties of events by some kind of 
primitive sampling device (making observations at 
different times of the day or year, deliberately 
seeking out members of different groups in the 
community or organization, and so forth); and by 
formulating tentative hypotheses as the field work 
proceeds and then deliberately searching for nega- 
tive cases (Lindesmith 1947), These topics are 
more fully treated in the next section of this article, 

Types of data. The observer is especially alert 
for incidents of anything defined as conflict or 
“trouble” by the community or organization being 
studied. Such incidents enable him most quickly 
to discover the expectations that guide interaction: 
when expectations are violated, trouble follows. 
By seeing what kinds of actions produce conflict, 
the observer can infer the existence of implicit ex- 
pectations, which then become part of his analytic 
model of the group under study. 

He is also alert to nuances of language, such as 
special meanings given to ordinary words, for these 
signal the existence of situations, events, and per- 
sons the members of the group think distinctive 
enough to merit being singled out linguistically and 
thus give a clue to the characteristic problems and 
responses of the group. By inquiring into the mean- 
ing and usage of an unusual term, by investigating 
instances of its use and seeing when it applies and 
when it does not, he adds to his analytic model 
(Becker & Geer 1957). 

The observer does not confine himself to obser- 
vation alone. He may also interview members of 
the group, either alone or in groups. In the first 
case, he can inquire into the social background and 
earlier experiences of a participant as well as into 
his private opinions about current affairs. In the 
latter, he is in effect “tapping” the ordinary kinds 
of communications current in a group, seeing what 
members will say when in the company of other 
members, The difference between private opinion 
and public communication may provide important 
clues to group norms (Gorden 1952), 

The observer will also find it useful to collect 
documents and statistics (minutes of meetings, 
annual reports, budgets, newspaper clippings) gen- 
erated by the community or organization. These 
can furnish useful historical background, neces- 
sary documentation of the conditions of action for 
a group (as in a codified set of rules), or a con- 
venient record of events for analysis (as, for in- 
stance, when a college newspaper reports the 
marriages of students, specifying their position in 
the campus social structure). In every case, the 


observer must inquire carefully into how the docu- 
ments he works with are created: by whom, follow- 
ing what procedures, and for what purposes? For 
it is clear that documents cannot be taken at face 
value but must be interpreted in the light of such 
considerations (Kitsuse & Cicourel 1963). 

The observer may also create his own statistics 
for the solution of particular problems. Thus, one 
may observe the number of times people in an 
office ask one another for advice (Blau 1955), or 
one may keep accurate records of one’s own piece- 
work production in a machine shop, to be used as 
an indication of what is possible for the average 
group member (Roy 1952). 


Techniques of analysis 


It is a truism to say that procedures of analysis 
and proof take their form from the problem one is 
trying to solve. It is more important to indicate the 
variety of problems typically encountered in analy- 
sis of observational material and the means by 
which they may be solved. 

Observational materials, since they are usually 
gathered over a long period of time, can be ana- 
lyzed sequentially, That is, analysis need not await 
completion of data gathering but can go on con- 
currently with it; results of early analyses may be 
used to direct further data-gathering operations. 
Different problems arise at different stages of the 
research. 

Choice of problem. In the beginning the re- 
searcher may not be sure what problem is most 
deserving of study in the community or organiza- 
tion he is working in; he devotes his first analytic 
efforts to uncovering worthwhile problems and 
hypotheses that will prove most useful in attacking 
them (Geer 1964), Researchers frequently dis- 
cover that the problem they set out to study is not 
as important as, or cannot be studied except in the 
context of, some other problem they had not antici- 
pated studying. Thus, Vidich and Bensman (1958) 
found that the problem of the relationships between 
the rural communities and the various agencies 
and institutions of American mass society that 
affect rural life could be understood only if one 
also investigated how the community and its mem- 
bers were able to function in spite of the fact that 
their immediate social environment demonstrably 
negated their basic beliefs, 

In selecting problems, hypotheses, and concepts, 
the investigator works from concrete findings made 
early in the research. Typically, he discovers that a 
given event has occurred, perhaps only once, and 
asks what the significance of such an event might 
be. It may be an incident of conflict or the kind of 


linguistic nuance already referred to. Whatever it 
is, the investigator must first ascertain that the 
event actually is what it seems to be and then trace 
out its possible theoretical implications. The first 
problem requires him to consider whether people 
may have been consciously or unconsciously deceiv- 
ing him; this can be checked by an assessment of 
whether the event that arouses his curiosity is one 
that was concocted for his benefit or whether it 
would have occurred in the same way even if he 
had not been there. For instance, a statement vol- 
unteered by an informant who does not know what 
the observer is after may be given more weight 
than one that has been influenced by the observer's 
leading questions. Similarly, an event that occurs 
in an ordinary institutional context, subject to all 
the constraints of that context, can be given more 
weight than one that occurs without being observed 
by other members of the group. 

The observer then traces the possible theoretical 
implications of his finding by considering what 
class of events it might be representative of, utiliz- 
ing such theory as is available about that class of 
events to deduce further propositions. For instance, 
if one hears a worker in a service profession cate- 
gorize members of his clientele, he may apply the 
proposition that such a categorization will be based 
on the problems clients of various kinds pose for 
the worker in trying to realize his occupational 
goals. (Teachers, for example, distinguish pupils 
according to how hard they are to teach and disci- 
pline; doctors distinguish patients according to how 
easily they can be cured, whether they pay on time, 
and so on.) Working from this, the observer begins 
to look for the basic problems implied by the set of 
categories and the way the problems impinge on 
workers at different career stages. Obviously, a 
large number of theories may be applied to discrete 
observations in order to draw out their implications 
and use them to direct further observations. 

Quasi-statistical method. At a later stage, the 
observer, having decided, at least provisionally, 
what he will study in the situation at hand and 
what theoretical apparatus he will use, is con- 
cerned with whether his initial findings hold for 
the entire community or organization. His data will 
usually not, unless expressly gathered for the pur- 
pose, be sufficiently systematic to be amenable to 
statistical manipulation. But he can generate what 
have been called “quasi statistics” (Barton & Lazars- 
feld 1955), that is, such imprecisely sampled and 
enumerated figures as his data contain. Such data 
are often quite adequate for the points he wishes 
to make. 

In particular, quasi statistics may allow the in- 
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vestigator to dispose of certain troublesome null 
hypotheses. A simple frequency count of the num- 
ber of times a given phenomenon appears may 
make untenable the null hypothesis that the phe- 
nomenon is infrequent. A comparison of the num- 
ber of such instances with the number of negative 
cases—instances in which some alternative phe- 
nomenon that would not be predicted by his theory 
appears—may make possible a stronger conclusion, 
especially if the theory was developed early enough 
in the observational period to allow a systematic 
search for negative cases. Similarly, an inspection 
of the range of situations covered by the investi- 
gator’s data may allow him to negate the hypothesis 
that his conclusion is restricted to only a few situa- 
tions, time periods, or types of people in the 
organization or community. 

The technical problem in creating quasi statistics 
lies in making sure that one has in fact inspected 
all the relevant cases. A number of workers have 
devised schemes for doing this. (A representative 
scheme is described in Becker & Geer 1960.) The 
common feature of these schemes is the reduction 
of the body of data by making an abstract of the 
field notes that have been accumulated, breaking 
them down into small units, and classifying each 
unit under all the analytic categories to which it 
might be relevant. When the investigator desires 
to analyze all the material on a given point, he sorts 
his units (which may be reproduced on keysort 
cards, for convenience), takes out those items 
which are irrelevant, and frames a conclusion that 
takes account of all the relevant evidence remaining. 

One of the greatest faults in most observational 
case studies has been their failure to make explicit 
the quasi-statistical basis of their conclusions. 
Even though the investigator uses faulty sampling 
and enumeration procedures, his evidence may 
nevertheless be sufficient to warrant the conclusions 
he draws if he explicitly states what the evidence 
is and shows how his conclusions are related to it. 
In particular, the conclusions may appear extreme- 
ly plausible (Pélya 1954) if they are supported by 
several kinds of evidence at once. Thus, the con- 
clusion that medical students make use of a per- 
spective based on the values of clinical experience 
and medical responsibility gains great plausibility 
when it is shown not only that use of the perspec- 
tive is frequent and appears in a wide range of 
situations but also that students’ characterizations 
of patients depend heavily on the same criteria 
(Becker et al, 1961, pp. 338-340). 

Construction of models. As a result of the early 
stages of analysis, the researcher acquires a num- 
ber of limited models of parts of the organization 
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or community, propositions which describe one 
kind of interaction between one pair of statuses in 
one kind of situation. The final stage of a case 
study consists in a progressive refinement of these 
part-models (accomplished by continual checking 
against evidence already available in the field notes 
or newly gathered in the field) and their integra- 
tion into a model of the entire organization or com- 
munity. The model provides answers to the theo- 
retical questions of the study and shows the 
contribution of each part of the analyzed structure 
to the explanation of the phenomenon in question. 
Models of the community or organization that 
result from case studies are not to be confused with 
mathematical models. Rather, they have the same 
relation to the group studied that the natural his- 
tory of a process (such as the race-relations cycle 
or the process of becoming a drug addict) has to 
any specific set of events said to embody it. In a 
natural-history analysis of a process we strip away 
the historical uniqueness of a number of instances 
of the same phenomenon, leaving as our result the 
generic steps in the process—those steps that 
would always occur if the same result were to be 
found. Similarly, in a case study of a social struc- 
ture we strip away what is historically unique and 
concentrate on the generic properties of the group, 
viewed as an example of a particular kind of struc- 
ture. Relations between the essential characteristics 
of that kind of structure are stated by verbal gener- 
alizations, For instance, one might study a prison 
or school with a view to discovering what the char- 
acteristic statuses and forms of interaction are in 
an institution in which one class of participants is 
present involuntarily. The result would be a model 
that might also be applied to other institutions hav- 
ing that characteristic, such as mental hospitals. 
The problem of reliability. The reliability of 
such an analysis is sometimes questioned in an 
equivocal way that plays on the meaning of “relia- 
bility.” The question is put thus: would another 
observer produce, with the same analysis, the same 
total model, were he to repeat the study? The 
answer is of course that he would—but only if he 
used the same theoretical framework and became 
interested in the same general problems, for neither 
the theoretical framework nor the major problem 
chosen for study is inherent in the group studied. 
Nevertheless, given the same basic framework— 
for instance, a Sociology based on conceptions of 
social structure, culture, and symbolic interaction 
—the same fundamental parts of the group studied 
would be found in a second study, even if the 
major problems chosen for study were quite dif- 
ferent. For instance, one might study a medical 


school to discover how the students are changed by 
their experience in it; this would be a problem in 
the theory of adult socialization. Or one might, 
with equal justice, choose to use the medical school 
as the arena for a study of how specialists cooper- 
ate in a common task, a problem in the “politics” 
of complex organizations. In either case, a com- 
plete study would necessarily describe the same 
basic relationships among students, among faculty, 
between students and faculty, between both and 
patients, and so on. Admittedly, the theoretical use 
to which the analysis was to be put would shape 
the kind of structural model built, and a model 
built for one purpose might slight or ignore im- 
portant elements in the other; but the two could 
be combined, so that neither would contain any 
element denied in the other. 


The use of observational studies 


Every case study allows us to make generaliza- 
tions about the relations of the various phenomena 
Studied. But, as has often been pointed out, one 
case is after all only one case. Suppose that some 
of the most important factors involved in under- 
standing the particular theoretical problems posed 
by the case are so invariant in it that we are un- 
aware of their importance, How is one to discover 
their importance? 

The problem can be handled (or can in principle 
be handled) by gathering a large number of cases 
and “partialing out” the effects of various influ- 
ences. In any case, the problem is not a real one 
if we take a long-term view of the development of 
theory. Each study can develop the role of a dif- 
ferent set of conditions or variables as these are 
found to vary in the setting of the study, Over a 
series of studies, the comparison of variations in 
conditions and consequences can provide a highly 
differentiated theory of the phenomenon under 
study. As a simple example, a community study 
might locate six social classes in a community. 
A later study, in a somewhat different community, 
discloses only five, the upper class failing to divide 
between “old” and “new” wealth; comparison of 
the two may show variations in the histories Or 
ecological positions of the communities that might 
account for the difference, and the hypothesis can 
be checked out in yet another study. 

Comparative analysis: an example. To take 
another example, some studies of prisons (Sykes 
1958; Cressey 1961) have revealed elaborate or- 
ganizations of inmates around matters of depriva- 
tion; wherever inmates were deprived of something 
—material possessions, sex, autonomy—they de- 
veloped social units and practices designed to deal 


with the deprivation as best they could under 
prison conditions. Because these early studies were 
all of men’s prisons, they could not discover what 
a later study of a women’s prison revealed: that 
the informal organization of the prison varied 
according to the kind of people recruited, because 
deprivations differ according to what it is one val- 
ues and, therefore, misses when deprived of it. 
Women apparently set far less store by autonomy 
than men, do not miss it, and do not develop a 
sub rosa government; they are, however, very de- 
pendent on intimate affectional ties, miss their 
families intensely, and develop homosexual liaisons 
as their form of informal organization (Ward & 
Kassebaum 1965). Other studies might show the 
influence of age, region, and other factors on the 
organization of prison life. A series of comparisons, 
based on variations in the phenomenon, show the 
influence of each factor; each succeeding study can 
be built on the contributions of its predecessors. 

Developing theory by comparative analysis is 
necessarily a protracted process. Comparative find- 
ings take many years to establish, for each study, 
by itself, may take several years and, for maximum 
effect, studies should be built on one another rather 
than being done simultaneously. The result can be 
a detailed understanding of the operation of a large 
number of factors and conditions as they interact 
to produce different results. 

One useful strategy is to state the findings of 
each study as universal propositions, even though 
it is obvious that they are provisional. By doing so, 
the investigator makes it possible to identify excep- 
tions to his propositions and to proceed most effi- 
ciently with fruitful comparisons (Lindesmith 
1947), 

Practical use of research findings. The observa- 
tional study of an institution or community can be 
(and often is) used by various people in various 
ways, depending on their position in, or relation to, 
the group and their interest in its functioning. It 
does not differ in this from other kinds of research, 
but it does differ, typically, in the range and 
number of variables considered and the distance 
beneath the surface of events that the research 
Teaches, Studies are often undertaken with the 
Subsidiary—if not primary—purpose of providing 
guidance to administrators and others who may 
wish to intervene in the organization or community 
in order to change some condition thought to be 
inefficient, distasteful, or inimical to group welfare. 
The observational study is useful in identifying and 
Specifying such problems and in finding their ori- 
gins and consequences at various levels and in 
Various parts of the group. 
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Clues to intervention. The observational study 
also makes it possible to go beyond the problem as 
originally conceived by those group members who 
wanted help and to discover other problems that, 
from some viewpoint other than theirs, require or 
warrant intervention. For instance, the officers of 
an autocratically governed trade union may not 
think the absence of organizational democracy a 
problem, but some of the members or an outside 
observer might take a different view. The farther 
beneath the surface the study penetrates, the more 
likely it is to discover problems that have not been 
labeled as such by the leaders of the group. 

Whatever the problems identified, the wide range 
of the case study makes it likely that it will contain 
hints or suggestions as to the crucial points of possi- 
ble intervention. Many studies diagnose the “causes” 
of a problem and yet are not useful for social action, 
because the causes discovered are not accessible to 
manipulation by the people involved. Thus, even 
though the generalization that the cause of teen-age 
vandalism lies in the early childhood experiences 
of the vandal might be true, knowing this is of little 
value. It is more useful to know, through close ob- 
servation, that (as might be the case) vandalism 
takes place more frequently in unlit and unwatched 
places or becomes more frequent as the certainty 
of apprehension declines, for these matters are 
more subject to remedial action by police and other 
agents of social control. 


Ethical problems of the researcher 


The published report of an observational study 
may be used, either by members or outsiders, to 
embarrass or even endanger the organization or 
community studied or, at least, its leaders. Every 
group maintains fictions about itself—they may, 
perhaps, be necessary for the continued existence 
of the group—which present it as better in some 
ways than unprejudiced research will reveal it to 
be. A town may feel that its government is more 
broadly representative than it is; a hospital may 
think its treatment of patients more successful 
than it is. A case study is bound to reveal the dis- 
crepancy between the operating reality and the 
image believed in and presented to the rest of the 
world by members. When the results of the study 
are published, the discrepancy is publicly attested 
to in a way that members cannot ignore. Their 
enemies may make use of the opportunity to 
embarrass or attack them. The members may ask 
that the findings be withheld or may attempt to 
coerce the researcher into suppressing them. 

The investigator therefore faces an ethical di- 
lemma. Science requires frank and unfettered re- 
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porting, and the matters group members complain 
of may be important aspects of group functioning, 
whose suppression would emasculate the report 
and strip it of scientific significance. On the other 
hand, the investigator surely has some obligation 
not to bring harm to those who have allowed them- 
selves to be studied; he may, indeed, have promised 
them that they will not be harmed. In making the 
promise, he may have meant merely that he would 
not expose any individual to ridicule or retribution 
—Most sociologists probably regard this as a fixed 
ethical principle—but he now finds that he is being 
asked to respect the same niceties in the case of 
a group. 

The solution to the dilemma depends in part on 
the investigator’s own ethical commitments. How- 
ever, he can avoid some of the difficulties inherent 
in the research relationship by striking a clear 
bargain with those he studies before he begins 
his work, taking care to alert them to the full range 
of unpleasant possibilities they may be exposing 
themselves to, He can also attempt to educate those 
most likely to take offense at the final report, ex- 
plaining to them as the study proceeds what its 
consequences are likely to be and helping them to 
find a workable way of living with the published 
study, 

Howarp S. BECKER 


[See also FIELD work; INTERVIEWING. ] 
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Lii 
PARTICIPANT OBSERVATION 


The phrase “participant observation” is reserved 
for those forms of research in which the investi- 
gator devotes himself to attaining some kind of 
membership in or close attachment to an alien or 
exotic group that he wishes to study: for the 
anthropologist this may be a small community 
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within a foreign society; for the sociologist it may 
be a street gang, a class of medical students, or 
the inmates of a hospital, insane asylum, or prison. 

Where observation as a technique suggests de- 
tachment, participant observation implies involve- 
ment and thereby socialization or, more frequently, 
resocialization. Participant observation is both logi- 
cally and temporally prior to the more formal and 
ritualized techniques of investigation of social 
activities. 

In becoming a member of an alien group, the 
student submits to some degree of resocialization. 
He may learn another language or argot or new 
forms of etiquette and morality; he plays various 
novel roles, assumes obligations, and is granted 
privileges. If talented and successful, he will in 
time become capable of thinking and acting with 
the perspective and basic assumptions belonging 
to two quite different groups—the one in which he 
was reared and, to some degree, the one he is 
studying. At times he will also be able to assume 
a critical objectivity peripheral to both. What he 
eventually produces out of this tension will depend 
on his sophistication and training and on his 
ability to recognize what he has experienced and 
learned and to communicate this in terms that will 
illumine significant areas of the social sciences. 

Participant observation is a lengthy process; 
many (and sometimes most) of the important 
decisions and insights of the field worker come 
during the earlier and often painful process of 
social initiation and not during the later stages of 
collecting and analyzing data. Every honest field 
worker proceeds toward knowledge along a thorny 
path filled with errors and misconceptions; flexi- 
bility, humor, patience, and integrity can be more 
important than specific methods of recording data. 
Field situations are becoming increasingly complex 
as more and more groups and factions strive des- 
Perately to find an advantageous place in their 
vision of the new world (Burridge 1960; Maquet 
1964; Wax et al. 1964). 

Participant observation is essential to almost all 
branches of the social sciences that depend to any 
degree on understanding or meaning (Vidich 
1955). Little can be accomplished in appraising or 
analyzing data if neither the language in which 
they are expressed nor the point of view they 
reflect are understood by the analysts. Scholars 
Sometimes overlook the fact that the efficiency of 
Scheduled questionnaires and sample surveys de- 
Pends not so much on elegant statistics as on a 
Sample of respondents whose homogeneity is fur- 
ther accentuated by the very structure of the in- 
quiry (Riesman & Benney 1956). Like Moliére’s 
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bourgeois, these investigators have been partici- 
pating and observing all their lives in middle-class 
or mass society; unlike that enterprising person, 
some have not yet become aware of this fact. 
Although participant observation is only one of 
the many instruments or approaches to the study 
of social activities, it is sometimes the only possible 
approach (Evans-Pritchard 1940; Goffman 1961). 

The works of Peter Freuchen, Jaime de Angulo, 
Meyer Fortes, James West (pseud.), Howard S. 
Becker, and also those of Whyte (1943), Turnbull 
(1961), and Goffman (1961), frequently reveal 
problems and situations where participant obser- 
vation has proved a most illuminating and efficient 
method, Each in its own manner provides unique 
pictures of an alien and unfamiliar world view: 
the perspective, the social dynamics, or the central 
and sometimes secret moralities of a culture. Field 
research of the future will see a further elaboration 
of participant observation in studies of situations 
embracing several groups or aggregates, each of 
which has its particular and distinct interests, 
biases, and, sometimes, cultures. This area was 
called to the attention of social scientists by 
Hughes and Hughes (1952), and it has been ex- 
plored by several scholars (see Bibliography). 

A considerable amount of pedagogic emphasis 
is still being placed on the so-called dangers of 
bias. Thus, a recently published textbook on socio- 
logical research by Riley (1963) lists a biased 
viewpoint as one of the limitations of participant 
observation. However, it is precisely the “bias” of 
the participants that the researcher wishes to be- 
come capable of assuming and understanding. 
The observer who establishes himself and remains 
in a role external to the group being studied is not 
so much unbiased as incompetent or unenterpris- 
ing. The disciplined and reliable researcher is not 
a technical virtuoso, a machine, or a man without 
a viewpoint, but rather a trained person who enters 
the field with the expectation that he will be 
obliged to do many things in spite of his personal 
preferences, prejudices, or inclinations. The train- 
ing of the participant—observer and the techniques 
he employs are designed to assist him in experi- 
encing the significant events and recording the full 
range of data, despite the idiosyncrasies of his 
background. In scientific research generally, the 
problem is not simply one of eliminating error or 
bias but of assessing its nature, degree, and cause. 
No technique of research is free from error, and 
perhaps one virtue of participant observation is 
that the kind of data it yields allows the biases, 
inadequacies, and predilections of the researcher 
to be clearly perceived. The bias and error of sam- 
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ple surveys and other quantitative procedures are 
sometimes more subtle and more troublesome for 
science because the over-all perspective reflects the 
attitudes of management and authority maintained 
around the conference table. Such investigative 
techniques frequently lack insight into the activi- 
ties, attitudes, and basic axioms of the population 
to be surveyed. 

Malinowski performed the archetypical study in 
participant observation, living in a Trobriand vil- 
lage for almost three years and striving to learn and 
describe all aspects of the society and the native’s 
“vision of his world.” He seems also to have been 
the first social scientist to perceive and commu- 
nicate to his students that this kind of experience 
and procedure—with its unparalleled opportunities 
for observing the complexity and intricacy of tribal 
life—constituted a new instrument for social scien- 
tific research. Indeed, it may be that the develop- 
ment and efflorescence of the school of British 
social anthropology rested as much on the vistas 
opened by the appreciation of this new instrument 
as on the conceptual clarity later introduced by 
Radcliffe-Brown. 

The self-conscious adoption of participant obser- 
vation by sociologists stems from the perception 
that ethnographic techniques are extremely effec- 
tive in the investigation either of the less obvious 
aspects of one’s own society (Hughes 1960) or of 
the study of the various religious, ethnic, or status 
groups inhabiting a particular community, city, 
or nation. Thus, in 1883 we find Beatrice Webb 
setting out to see for herself what the British work- 
ing class was like, just as some sixty years later 
we see Whyte (1943) insinuating himself into a 
street corner gang. On the other hand, an ardent 
and vocal appreciation of the singular virtues of 
the method appeared in North America only re- 
cently, apparently by serendipity. Among a cluster 
of Chicago graduate students engaged in specific 
studies of occupations or professions were several 
who were already resocialized members of exotic 
groups. Encouraged by E. C. Hughes to make use 
of their fortuitously acquired data, they were so 
impressed by the powers and potentialities of the 
method that, like Malinowski, they became en- 
thusiastic proselytizers (Becker & Geer 1960). Si- 
multaneous and subsequent investigations in which 
these and other young social scientists entered into 
and observed unusual or neglected groups have 
attracted considerable attention. 

Despite a theoretical orientation that generally 
emphasizes empirical rather than social observa- 
tion, some American anthropologists have used 
participant observation with notable skill (Lowie, 


Herskovits, Oliver LaFarge, Dorothy Eggan, and 
Margaret Mead, to name only a few). Like any 
other social research technique, participant obser- 
vation must be flexibly adapted to the procedures 
and values of the group being studied. It is not 
to be naively confused with the notion of “living 
like a native.” 

RosaLiz HANKEY Wax 


(Directly related are the entries CULTURE, article on 
CULTURAL RELATIVISM; FIELD WORK. Other relevant 
material may be found in ANTHROPOLOGY, article 
on CULTURAL ANTHROPOLOGY; ERRORS, article on 
NONSAMPLING ERRORS; and in the biography of 
MALINowskKI.] 
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OBSESSIVE-COMPULSIVE DISORDERS 


A descriptive definition of the obsessive-com- 
pulsive disorders was provided by K. Schneider 
(1939): an obsession exists whenever a person 
cannot exclude thoughts from consciousness and, 
although he distinguishes them as unreasonable 
or without basis, is mastered by them (see Ingram 
1961). 

Obsessional phenomena range from trivial every- 
day acts, such as knocking on wood, stepping on 
lines, special ritualistic procedures for dress- 
ing, toiletry, or going to bed, and repetitions of 
such ordinary precautionary measures as turning 
off the gas or light, to well-organized behavioral 
and intrapsychic systems that dominate the person 
in the obsessional neuroses or character disorders. 

In terms of a dynamic definition, an obsessional 
disorder is one of the reaction types of psychoneu- 
tosis in which the individual suffers from the need 
to perform either logically unnecessary ritual- 
istic acts (compulsions) or experiences repugnant 
thoughts (obsessions), The element that distin- 
guishes these disorders from similar ones often 
found in conjunction with them is a drive to repeti- 
tion that is irresistible even in the face of strong 
Conscious suppressive efforts and that is senseless 
or inappropriate, and only imperfectly adjustive 
in reducing tension and anxiety. Often present in 
these disorders are various grades of doubt, inde- 
cision, ambivalence, guilt, magical thinking and 
Superstition, sadistic and masochistic tendencies, 
Tumination about order and disorder, right and 
oe cleanliness and soiling, love and hate; all 

lese elements contribute to the conflict, usually 
ae being understood or subject to control by 
Weta vidual even though he thinks them silly, 
ee’ ae or humiliating (Cameron 1963; 
1957), ; Freud 1905; 1917; Jones 1953- 
ae neurotic reactions are not totally deter- 
Slatcy’ by hereditary (genetic) mechanisms, as 

TS studies on identical twins demonstrates 
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(see Shields 1962); however, Tienari (1963) and 
Woodruff and Pitts (1964) believe the question 
remains open. [See MENTAL DISORDERS, article on 
GENETIC ASPECTS.] 

Historical background. Because the obsessive— 
compulsive disorders occupy such a wide spectrum 
of behavioral manifestations in the spheres of 
thinking, acting, and feeling, definitive identifica- 
tion of crucial criteria is precluded. The terms “ob- 
sessional neuroses” and “obsessional illness” be- 
came common in Great Britain, while “compulsion 
neuroses” was used more frequently in the United 
States, but all these terms were derived from the 
German terms psychische Zwangsvorstellung (in- 
troduced by Krafft-Ebing in 1867), Zwangsneurose, 
Zwangserscheinung, and Zwangsaffection (see 
Loewenfeld 1904). Adolf Meyer introduced the 
more inclusive term “obsessive-ruminative tension 
states” as a category in his dynamic classifica- 
tion of reaction types (Collected Papers, vol. 3, 
p. 297). Other designations used since 1941 are ob- 
sessive or compulsive behaviors, defenses, mecha- 
nisms, systems, conditions, or reactions. The term 
“psychasthenic” was popularized in France after 
Pierre Janet used it in his classic descriptions to 
cover a wide range of repetitive, anxious behavior 
in the ideational, emotional, and motor areas, but 
this term is little used in English-speaking coun- 
tries (Janet & Raymond 1903). Schneider's defi- 
nition is derived from the work of Westphal 
(see Loewenfeld 1904), with its negation of any 
emotional or affective basis for these conditions. 
A. Lewis stressed the subjective feeling of resist- 
ance rather than the recognition of senselessness 
(see Ingram 1961). The French, following Morel, 
insisted on the emotional foundation of obsessions 
and distinguished them from phobias, or obses- 
sions consisting of fears [see Puosias]. Most Eng- 
lish-speaking writers did not pursue these lines of 
thought because there was increasing recognition 
that these distinctions between thinking (obses- 
sions) and feeling and acting (compulsions) were 
artificial; upon close inspection over time, all three 
were found together. 

Pavlov related repetitive actions such as human 
obsessions and compulsions to the pathological 
conditions of the cerebral cortex: mechanisms that 
would normally obliterate an impulse failed to do 
so, with the result that persistent repetitive activity 
occurred or, in some cases, lesions produced patho- 
logical isolation and fixation of ideas (Wortis 
1950). Particularly during the last ten years of his 
life Pavlov wrote papers on neuroses and psychoses. 
His thesis of “second order” activity (speaking, 
thinking, abstract ability) gave sanction to psycho- 
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therapy as we know it (because it uses words ) which 
thus received approval from the Soviet government. 
[See the biography of Paviov; see also Seredina 
1955.] 

Biological, evolutionary process concepts. Cur- 
rent American writers on obsessive-compulsive dis- 
orders on the whole adopt Freud’s view that behav- 
ioral syndromes are processes rather than discrete 
entities. However, careful physical examinations 
must be performed to determine the presence of 
neurological lesions, since D. Denny-Brown (1952; 
Denny-Brown & Banker 1954; Association for Re- 
search . . . 1958) and H. Kliiver and P. C. Bucy 
(1937) have shown that obsessive, catatonic, and 
oral behavior may be considered as release phe- 
nomena; O. R. Langworthy (1964) has reviewed 
the clinical importance of this work. 

Obsessional processes may be thought of as or- 
ganized psychological systems, operating to help 
maintain internal and external equilibria for the 
survival of the person. This view is based on a 
broad evolutionary perspective: man is a member 
of the biological world, and is both a product and 
an architect of evolutionary biological and cultural 
development. As a result of the work of Freud, 
Eugen Bleuler, and Adolf Meyer, psychiatrists have 
learned to study man as a biological organism 
whose developmental history is intimately related 
to his motivation and behavior. This permits the 
reordering of massive volumes of meticulous de- 
scription into more comprehensible families of be- 
havior aligned according to principles. Develop- 
mental periods—autoerotic, narcissistic, oral, anal, 
phallic, latency, genital—and specific phenomena 
of the periods can be grouped in relation to the 
critical periods of behavioral change—Oedipal, 
pubertal, mature (Freud 1905). Other functional 
principles, such as an ascending order of complex- 
ity; the clustering of defense mechanisms, such as 
regression, displacement, isolation, reaction forma- 
tion, undoing, and self-punishment; and other re- 
current patterns—all can be used as bases for 
organizing overt behavior. Formally these patterns 
are those of the classic theoretical descriptions; in- 
formally they may refer to the characteristic ways 
in which a person may use time, money, or clothes 
and other possessions, exert power, or choose an 
occupation, leisure and hobbies, or any other activ- 
ity (Fenichel 1945; Freud 1905; 1917; Jones 1953-— 
1957). 

Domestication (socialization) of the basic hered- 
itary components is required in order for a person 
to achieve sufficient maturity to live at the cultural 
level required by his peers. This maturity is not a 
stable, enduring state but must be maintained by 


numerous adaptive devices such as are found in 
the physiological system for the regulation of 
oxygen supply, blood pressure, or pH levels. The 
writings of Frank Beach (1958) offer a sophisti- 
cated introduction into the problems of design and 
interpretation inherent in the subtle interactions 
between physiological and psychosocial systems, 
[See DEVELOPMENTAL PSYCHOLOGY; HOMEOSTASIS; 
PERSONALITY; SOCIALIZATION. ] 

Dynamics of obsessional disorders. There is no 
wholly satisfactory classification of the underlying 
dynamics of obsessional disorders. Until the ap- 
pearance of Freud's monograph “Three Essays on 
the Theory of Sexuality” (1905), the descriptions 
of the obsessional, sado—masochistic, and related 
fetishistic or perverse symptom patterns were a 
collection of unrelated, often bizarre, but very puz- 
zling exhibits in the museum of human behavior. 
Freud's brilliant insight that these behaviors were 
intelligible in the developmental process, and were 
akin to similar processes in biology and psychology, 
made possible an examination of them in natural- 
istic terms. The general principles that have been 
established for such other neurotic reactions as 
anxiety states, phobias, conversions, and dissociative 
reactions also apply here. Psychiatric and psycho- 
analytic textbooks usually list subgroups of the 
obsessional disorders according to the presenting 
symptom or the major defense processes in the 
conflict between instinct and defense. These action 
patterns are immensely helpful to the clinician in 
the intricate task of describing and observing the 
progress made by the patient. Because the symp- 
toms usually reflect ambivalent motives and exhibit 
opposing emotions simultaneously (obedience and 
rebellion) or in biphasic fashion (behaving alter- 
nately as a rebellious child and a stern parent), 
it is not a simple task to chart their course. Under- 
standing the motivations, such as the child’s love 
for, and fear of, parents, makes the transforma- 
tions much more intelligible. Such paradoxical be- 
havior as not being able to brush one’s teeth for 
obsessional reasons and then slapping and scold- 
ing oneself for this failure, or excessive and ritual- 
istic handwashing, or concern for external cleanli- 
ness occurring together with soiled underwear, or 
time-consuming ritualistic bathroom or bedroom 
behavior are seen as logical sequences in terms of 
the patient's personality organization. The patient is 
re-enacting fragments of earlier behavior, when the 
angry, frustrated child’s thinking was dominated 
by the talion principle and the rules of word magic, 
and before the more realistic logical rules of adult 
life were established. This interplay can also be 
seen daily in normal equivalents. 
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Norman Cameron (1963) has attempted a three- 
level classification which, with some justification, 
represents ascending levels of complexity of evasive 
maneuvers each with its more prominent behav- 
joral characteristics. A fourth category is appended 
which does not fit into the scale because much of 
the emotional component of the obsessional reac- 
tions has been repressed. 

(1) The first level consists of regression, dis- 
placement, and isolation. These are unconscious 
defensive techniques used in an attempt to relieve 
intolerable tension and anxiety. Isolation is related 
to denial and ego-splitting and may be a variant of 
the same general process. Regression and displace- 
ment are used when adult sexual or aggressive 
problems are shifted to the infantile scene where 
the patient feels more competent to handle them, 
or to change the focus from a crucial conflict to a 
less threatening sphere, such as reckless driving, 
body-contact sports, or a chess game, rather than 
to direct assault. Isolation can be used to fragment 
opposing elements of a conflict so that they will 
be less threatening or to steer away the fearful 
emotional components from the obsessional de- 
fense systems. This is aimed at preventing the con- 
frontation of opposing attitudes, such as I love 
(hate) my father, mother, boss, country. The re- 
sults are similar to those seen in the dissociative 
reactions (hysterias) of denial and splitting of the 
ego. [See HysTERIA.] 

(2) The second level deals with the overt use of 
one neurotic maneuver as a countermeasure in an 
attempt to offset the effects of some other neurotic 
maneuver. This technique is present to some ex- 
tent in most obsessive-compulsive reactions. In 
addition to the first-level defenses, which are used 
when repression is deficient, the countermeasures 
are often necessary to help prevent further disin- 
tegration of the ego because of excessive tension 
and anxiety. The patient will invent rituals to avoid 
unpleasant thoughts. These may take many forms, 
such as thinking about something else or indulging 
in behavior to ward off the feared thoughts. This is 
Particularly evident in the patient with excessive, 
crippling overconcern with germs, viruses, broken 
glass, knives, poison, gas, feces in the toilet, or- 
ganic fertilizer, or other substances that might be 
lethal Weapons against those who are closest to the 
Patient and who are simultaneously hated and 
loved, 

Hs ae oe third level involves more complex de- 
cane easures : reaction formation, ritualistic 
oing, and sado—masochistic self-punishment. A 


h: A P aye 
asling compulsion utilizes all these proc- 
S, 
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(4) A fourth level involves phenomena such 
as obsessive doubt and rumination, where many 
of the emotional components of the obsessive—com- 
pulsive reactions have been repressed. This results 
in what Cameron calls “caricatures of magical, 
scientific, artistic and religious practise.” Especially 
in highly cultivated people, remnants of supersti- 
tions and the rigid retention of antiquated and 
useless practices in commerce, law, medicine, sci- 
ence, and philosophy are present. In such patients, 
apparently profound questions about the nature 
and meaning of life, death, immortality, time, 
space, eternity, sin, sex and reproduction, justice, 
and God may be displaced symptoms related to 
primitive childhood perplexities about their own 
identity and relations to members of the family. 
Clergymen have long since learned to separate the 
genuine seeker after truth from an obsessive 
doubter, who is referred to a psychiatrist (1963, 
pp. 373-411). 

Etiology. Freud viewed the neuroses, including 
obsessional disorders, as processes consisting of 
adjustive defense mechanisms to protect the per- 
son’s conscious and moral life-activities from the 
onslaughts of the basic internal drive states, e.g., 
libido, instincts (Fenichel 1945; Freud 1905; 1917; 
Jones 1953-1957). Neonates can be observed to 
differ in their innate ability to contro] these im- 
pulses. (Inherent in this evolutionary view, when 
one considers the potentials for the emergence of 
new properties, qualities, and configurations—as in 
our embryologic development from a single ferti- 
lized egg to an extraordinary complex organization 
in which the central nervous system alone has 
about 10 elements—is the probability of defects 
in all the human systems during the tumultuous 
growth period. ) 

Obsessional disorders are similar to other neu- 
rotic symptoms in being compromise formations 
between the libidinal wish and the repressing mech- 
anisms that form a part of the organized ego. Typ- 
ically, these libidinal wishes belong to childhood, 
and then and now are members of the cluster of 
early partial components making up the adult drive 
(instinct) and are therefore easy to overlook. 
Symptoms provide tension reduction (but do not 
lead to mature behavior) at two levels: (1) the 
primary or internal relief experienced by the ego 
faced with some painful situation, even though the 
symptoms represent a flight into a disordered, un- 
comfortable state, and (2) the secondary, external 
gain which derives from advantages in power, 
prestige, or material things. External stress may 
also play a vital part in the precipitation and con- 
tinuance of an obsessional reaction, but seldom is 
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the reaction a simple function of such things as 
overwork, unreasonable expectations, domestic dis- 
cord, underutilization or overutilization of special 
talents, or lack of rewards or recognition. Precipi- 
tating causes and the background causes are multi- 
determined. Altered value systems concerning 
spouse or children, work associates, and the pieties 
that provide hope or faith may be crucial as power- 
ful initiators of regressive defenses against a real 
or fancied stress. 

Relation to other reactions. An estimated max- 
imal prevalence of obsessional reactions in the gen- 
eral Caucasian population is 5/10,000 or 0.05 per 
cent (Woodruff & Pitts 1964). However, the num- 
ber of persons with character disorders is probably 
greater. When there is a blending of active obses- 
sional symptoms into habitual attitudes or patterns 
of behavior, we speak of obsessional character, 

Phobias, the avoidance of situations that create 
anxiety, although dynamically differentiated from 
obsessional disorders, also serve the ego as one 
means of defending itself against anxiety. The 
transition of a phobia into a compulsion is readily 
noticeable. A phobia develops when obsessional 
persons displace and project a fear of an internal 
conflict into a fear of an external object or situa- 
tion and then compulsively avoid it. [See PHOBIAS.] 

Depressive episodes as responses to increased 
stress are common in obsessive persons during 
periods of illness since many of the same defense 
systems are involved, namely, the activation of 
early childhood conflicts and the severe, archaic 
superego acting upon an ambivalent ego. [See DE- 
PRESSIVE DISORDERS. ] 

Although there have been continued efforts to 
link the obsessional reactions to schizophrenia, the 
genetic evidence is unconvincing (Roth 1960; 
Shields 1962; Tienari 1963), Some authors state 
on clinical grounds that a psychosis appearing in 
the course of an obsessional illness is probably 
schizophrenic, but most psychiatrists do not take 
the position that all nonorganic delusional psy- 
choses are schizophrenic. [See SCHIZOPHRENIA, | 

Prognosis and treatment. Most obsessional re- 
actions do not come to the attention of physicians 
or psychiatrists. The more severe cases usually ap- 
pear early in life and are missed by the family or 
the physician at a time at which continuous inter- 
vention is indicated, and thus they run a course 
of remissions and exacerbations. Psychotic epi- 
sodes are relatively rare in the series of studies 
published in English-speaking countries as com- 
pared with those published in Germany and 
Sweden. More than 70 per cent of the severe cases 
in one English series were sufficiently well to lead 


fairly normal lives (Roth 1960). Improvement 
usually occurs regardless of the method of treat- 
ment. There are no reliable figures from a suffi- 
ciently large number of cases to warrant dogmatic 
conclusions about the value of any one treatment 
method compared with others (Cremerius 1962; 
Fenichel 1945; Gottschalk & Auerbach 1965; 
Ingram 1961). Psychotherapy undoubtedly is more 
effective in the early stages of acute cases and may 
be prophylactic, Suicide is not common, being re- 
lated more to the depressive reactions, and the 
incidence reported varies with the country 
(Cremerius 1962; Roth 1960). 

Much skill is required to deal with well-devel- 
oped obsessional systems, so that cither formal 
psychoanalytic therapy or therapies based on psy- 
choanalytic principles probably offer the best op- 
portunity for recovery, for abatement of the acute 
symptoms, and for helping the patient free his en- 
ergies for character maturation. Rehabilitative and 
supportive therapies, including use of spas, accom- 
panied by rituals such as colonic flushing, often 
have symbolic meanings that fit into the patient's 
obsessional system and thus bring partial relief 
from symptoms, Leucotomy and lobotomy may 
help very advanced cases, but the evidence is not 
uniform, and the procedure is rarely used in the 
United States. Electric shock may be used in a 
limited fashion for patients with severe obsessional 
pathology who are psychotically immobilized as 
a result of their indecision, ambivalent rage, and 
such practices as incessant handwashing. There 
are successes reported, but there are also disap- 
pointments. Insulin shock therapy has not been help- 
ful in the advanced cases. [See MENTAL DISORDERS, 
TREATMENT OF, article on SOMATIC TREATMENT.] 

Obsessional disorders in other cultures. It is 
appropriate to mention in an international encyclo- 
pedia the world-wide distribution of obsessional 
practices. Although many individual papers have 
appeared since Freud’s Totem and Taboo in 1913 
and T. Reik’s Ritual in 1931, there are no compre- 
hensive surveys; however, Erikson’s study of the 
Yurok Indians (1950) and Mead’s of the Pacific 
cultures (1964) are excellent guides, 


Henry W. BROSIN 


[See also DEFENSE MECHANISMS. Other relevant mate- 
rial may be found in ANXIETY; MENTAL DISORDERS; 
Neurosis; PHOBIAS; PSYCHOANALYSIS; RELIGION.) 
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Occupations may be defined as relatively con- 
tinuous patterns of activities that provide workers 
a livelihood and define their general social status. 
Occupations emerge whenever division of labor is 
associated with a monetary economy and labor and 
commodity markets. Functional specializations 
within the family, tribe, and other units that are 
mediated primarily by ascribed relationships are 
not generally regarded as occupations. For exam- 
ple, the peasant, wife, slave, serf, shaman, and 
chief in isolated and economically self-sufficient 
localities are status rather than occupational desig- 
nations. 

The distinction between occupations and sta- 
tuses is difficult to make in concrete situations. 
Within a given society the same functional special- 
izations may be performed by people who have 
different status and market relationships; for ex- 
ample, wives, slaves, and artisans may all manu- 
facture cloth, part or all of which may be either 
used at home or sold in the market place. There- 
fore, occupations should be conceived as constructs 
against which particular situations may be com- 
pared. 

Status and occupational societies. Occupations 
existed in ancient cities, medieval towns, and es- 
tate societies. However, only a minority of the 
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adult population in those societies participated in 
labor and community markets. In contemporary 
urban and industrial societies the great majority 
of adults at one time or another hold occupations 
which tend to be distinct from their other social 
statuses. Such societies usually have attained a 
high level of technological development, a highly 
differentiated division of labor, a highly developed 
exchange economy based on cash, and relatively 
free labor markets, These characteristics describe 
occupational societies. 

Societies where the social status of the person 
determines his work are called status societies. 
Castes and estates represent functional specializa- 
tions in status societies. 

Medieval guilds and occupational corporations 
in early industrial societies also tended to be domi- 
nated by the status principle. Even in contempo- 
rary industrial societies some occupations, espe- 
cially some professions, have estatelike attributes. 
For example, military, religious, legal, and medical 
bureaucracies tend to regulate the total life of both 
the worker and his family. For the dominant seg- 
ment of the labor force, however, the occupational 
role tends to be differentiated from other statuses; 
it is freely contracted and subject to market condi- 
tions. In occupational societies the child, the unem- 
ployed, sick, aged, or retired person, and the full- 
time housewife do not hold occupations. The 
general social position of each tends to be defined 
either by his past occupation or by that of the head 
of the family. 

The historical shift in the West from status to 
occupational societies was largely the result of per- 
sistent application of scientific knowledge to pro- 
duction and other economic organization. The 
persistent pattern of change in the technology of 
production was accompanied by equally persistent 
changes in the social organization of production, 
distribution, and related economic activities, Since 
occupational routines are related to given techno- 
logical processes and to given work organization, 
changes in the organization of either had conse- 
quences for the structuring not only of occupations 
themselves but also for related activities of workers 
and their organizations. All societies have some 
degree of functional interdependence; hence, basic 
changes in any one-of their institutions produce 
change in other parts of the society. Thus, the 
growth of scientific knowledge, the changes in 
technology and work organization, the growth of 
markets, the rise of industrial and commercial 
centers, and other societal changes affected the 
composition of the occupational structure and the 
behavior of workers both within and outside 
the plant (Polanyi 1944), 


Occupations in societies with relatively low levels 
of technological differentiation tend to be charac- 
terized by stable relationships, both at work and in 
the broader community, Thus, crafts, professions, 
and trades of earlier days were lifetime commit- 
ments, and they regulated work behavior, relations 
with colleagues, and life in the community. In 
highly differentiated occupational societies these 
stabilities are weakened. The mechanization and 
specialization of operations are so extensive that 
they render many occupational titles meaningless; 
make many occupations obsolete; reduce attach- 
ments to occupations; and make occupational re- 
training necessary. Moreover, temporary, intermit- 
tent, and multiple jobholding tends to grow, as do 
unemployment and residential mobility. Social 
mobility is also accelerated as the income, prestige, 
and social power of occupations change. New oc- 
cupational associations arise to deal with these 
disturbing influences. When they are incapable of 
solving their problems, the community and state 
intercede. 


Nature of the occupational structure 


The occupational structure may be thought of 
as a series of more or less permanently related 
occupational families that are hierarchically ar- 
ranged according to complexity of skills. Typical 
classifications used today include professional, 
technical, managerial, clerical, sales, skilled, semi- 
skilled, and unskilled. These, in turn, may be end- 
lessly subdivided into particular occupations. The 
industrial structure, on the other hand, represents 
broad economic activities or areas, such as agricul- 
ture, manufacturing, transportation, communica- 
tion, government, trade, and services of various 
kinds. 

Comparison of occupational structures. Since 
there is no universally accepted taxonomy of occu- 
pations and industries, a comparison of different 
societies is hazardous. However, whenever there is 
rational, systematic, and instrumental application 
of scientific knowledge to the production and dis- 
tribution of goods and services, occupational struc- 
tures arise that have distinctive features at differ- 
ent stages of economic development (Kerr et al. 
1960, pp. 33-46). This close relationship between 
occupational composition and industrial structure 
has been generally found in countries experiencing 
an industrial revolution. In short, a characteristic 
division of labor seems to be associated with the 
extent of industrialization. 

Yet there are difficulties in analyzing the occu- 
pational structures of societies in different periods 
and in comparing different societies. Difficulties 
stem from several unresolved problems: disagree- 


ment on definitions of specific occupations, lack of 
consensus on the bases for classifying occupations, 
and differences in social policies regarding the 
occupational structures. In simple economic sys- 
tems it is relatively easy to demarcate various types 
of product and labor markets. With increasing divi- 
sion of labor and trade, market boundaries become 
blurred. Questions therefore originate concerning 
which geographic unit is most useful for studying 
the occupational structure. Since nations collect 
the most extensive occupational data, comparisons 
can be made only between national economic sys- 
tems, which may not represent the most useful unit 
of analysis. Additional difficulties are encountered 
not only because national systems of occupational 
titles and classifications do not agree but also be- 
cause they are changed continuously. Occupational 
classifications are made to fit administrative needs, 
and scant attention is paid to criteria for identify- 
ing and classifying occupational families. 

Several attempts have been made to arrive at a 
rational classification that would receive universal 
acceptance. The most widely adopted has been 
that of Alba M. Edwards (U.S. Bureau of the 
Census 1943). He ranked occupational families 
according to their education and income to arrive 
at a socioeconomic classification, This classifica- 
tion was used by the Census Bureau in 1940 and 
was subsequently adopted by various countries, 
including Canada, the Federal Republic of Ger- 
many, France, Italy, Japan, Sweden, and Panama. 

Prestige rating of occupations. Sociologists have 
also attempted to classify occupations according 
to their general standing or prestige. Unfortu- 
nately, most of their research has used students 
as subjects or adults who did not accurately repre- 
sent the total working population. A carefully 
drawn scale was prepared in 1947 by Cecil C. 
North and Paul K. Hatt (see Reiss et al. 1961, 
pp: 54-57). They asked a quota sample of the 
adult population of the United States to evaluate 
90 occupations. Reiss (Reiss et al. 1961) and 
others carefully examined the representativeness 
of occupations used in the scale, as well as its 
Statistical attributes. They found that although the 
Scale was unevenly representative of various oc- 
Cupations in the labor force, it was sensitive to 
Socioeconomic gradations among the occupations. 
seas concluded that neither the occupations in the 
cale nor the general occupational structure con- 
Stitutes a single ordinal scale. 

; Undoubtedly, different segments of the popula- 
tion valued the income and education associated 
ins various occupations differently and, there- 
Ore, ranked certain occupations differently. This 
dissensus probably reflected the amorphous and 
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dynamic social class system of the United States. 
Despite the disagreements, however, there was gen- 
eral consensus on the prestige ranking of occupa- 
tions, even though different criteria may have been 
invoked in the ranking. Otis Dudley Duncan and 
Albert J. Reiss, Jr. (see Reiss et al. 1961) pre- 
pared an index of socioeconomic status for the 425 
occupations in the detailed classification used by 
the U.S. Census Bureau and transformed the in- 
dexes into comparable North—Hatt prestige-scale 
scores. This operation has increased the utility of 
both scales in research. 

Cross-cultural comparisons. A comparison of 
the prestige ratings of various occupations in dif- 
ferent industrial societies reveals a general consist- 
ency of results (Inkeles & Rossi 1956). However, 
there are disagreements over the interpretation of 
the findings. Functional theorists have suggested 
that occupations vary in their importance to a so- 
ciety and that the more important ones tend to 
be scarce because they require more skill and 
preparation (Davis 1948, pp. 368-369 ). Therefore, 
people in these occupations tend to be accorded 
more prestige, income, and social influence. Ac- 
ceptance of this theory in whole or in part has led 
some scholars to compare changes in occupational 
structures of various countries and to interpret oc- 
cupational mobility according to local functional 
requirements (Lipset & Bendix 1959). 

‘A detailed comparison of occupational prestige 
and other rewards in different countries reveals 
certain inconsistencies that challenge functional 
theory. Each society tends to assume that its occu- 
pational structure represents a hierarchy of skills 
socially legitimized by appropriate economic, pres- 
tige, and other rewards. This position has been 
attacked on theoretical grounds (Tumin 1953) and 
still awaits satisfactory empirical validation. Never- 
theless, one may safely conclude that the occupa- 
tional structure is not a uniformly graded ordinal 
scale representing a single, functional, economic, 
prestige, or influence hierarchy. The lack of hier- 
archical convergences arises from interacting and 
sometimes contradictory technological and social 
forces concerned with allocating occupations and 
their associated reward systems. Changes in occu- 
pational rankings reflect not only occupational 
fashions but also changes in the stratification posi- 
tion of various interest groups in the community. 
Highly diversified societies probably exhibit greater 
internal segmentation, less agreement on the ob- 
jective and subjective criteria for ranking occupa- 
tions, and more pervasive occupational politics. 

Manpower planning. In highly diversified so- 
cieties the occupational structure does not auto- 
matically regulate itself to supply adequate num- 
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bers of workers for particular occupations, to meet 
market demands, or to fulfill specific community 
needs. Nor does it adjust rewards in accordance 
with objectively derived criteria. Consequently, 
various groups try to shape the occupational struc- 
ture to meet their own interests or those of the 
State. Such efforts may result from various short- 
comings in the occupational structure: failure of 
educational and other agencies to provide a suffi- 
cient number of qualified personnel to particular 
industries (steel, agriculture, science); failure to 
meet occupational quotas set by planning authori- 
ties during emergencies (war, reconversion, fam- 
ine); inability to employ labor surpluses arising 
from technological or market changes (unemploy- 
ment and underemployment); inability to meet 
political crises originating from demands to pro- 
vide equal occupational Opportunities for sup- 
pressed classes or disadvantaged ethnic groups; in- 
ability to overcome the grip of highly organized 
occupations on the recruitment and training of 
workers. Attempts to change the occupational 
Structure are designed not only to meet the im- 
perfections in the labor market but also to attack 
the social problems associated with economic 
change. 

Pressures to change or plan the occupational 
structure have intensified with the increasing divi- 
sion of labor all over the world as occupational 
groups or the market fails to respond to economic 
and social changes quickly enough to recruit, train, 
and place workers. Three social systems are typi- 
cally involved in occupational planning: the state, 
its educational institutions, and occupational asso- 
ciations. The state has traditionally operated 
through the schools to meet its manpower goals. 
With increasing industrialization, the state in- 
creases its pressure on school authorities to meet 
changing manpower needs. School authorities gen- 
erally fail to respond quickly enough, because they 
place higher value on other educational goals and 
because they are institutionally incapable of resolv- 
ing all problems of the occupational structure. Con- 
sequently the state sometimes bypasses the school 
system by establishing independent agencies to 
train or retrain workers, providing placement serv- 
ices, and subsidizing certain industries to train 
workers. 

Occupational associations also attempt to regu- 
late the school system (sometimes in conflict with 
the government) in order to control entry, training, 
placement, and retraining of workers. Since occu- 
pational groups sometimes have a monopoly of 
skills and knowledge, they are in a position to 
influence control over educational and govern- 


mental programs. Where unsuccessful, they tend 
to establish independent training and placement 
agencies. 

Whoever controls manpower planning—school, 
government, or occupational associations—the Te- 
sults of their activities are similar everywhere. 
Training requirements are being increased; the 
retraining of adult workers is increasing; the scope 
of concern for the worker is being broadened to 
include his leisure, politics, mental health, and 
community participation; and the concern for the 
worker’s welfare is being extended into retirement, 


Industrialization and occupational trends 


An analysis of both industrialization and mod- 
ernization of economies reveals a pattern of change 
in the demographic, industrial, and occupational 
composition of labor forces, The persistent applica- 
tion of scientific knowledge to improve production 
and work organization has required that workers 
be more highly educated and thus has delayed their 
entry into the labor market. The spread of univer- 
salistic norms in hiring practices has increased the 
proportion of women, both single and married, in 
the paid labor force and has extended their stay 
in it. Although their early entry is typically re- 
stricted to marginal occupations or occupations 
similar to their preindustrial roles, the general 
movement has been to equalize opportunities and 
to provide equal income and status benefits. With 
increasing urbanization the age of workers in the 
labor force rises, and retirement age is lowered. 

When industrialization begins, the proportion of 
workers employed in manufacturing and urban 
services rises, and the proportion in agriculture 
declines, not only because factory workers are re- 
cruited from rural areas but because the mechani- 
zation of agriculture permits workers to be released. 
Eventually manufacturing may employ more work- 
ers than any other economic sector. Then, as man- 
ufacturing becomes more efficient, it needs rela- 
tively fewer workers to produce and more workers 
to service the economy in transportation, commu- 
nication, trade, clerical, financial, governmental, 
professional, and related areas. In mature indus- 
trial economies these “service industries” eventu- 
ally employ the majority of urban workers (Woy- 
tinsky & Woytinsky 1953, pp. 117, 425). 

Evolution of occupational structures. Changes 
in the occupational structure accompany the 
changes that come with industrialization. The pro- 
portion of farmers decreases, as does the propor- 
tion of unskilled workers in manufacturing and the 
services. During the process of mechanization the 
percentage of semiskilled manual workers grows 


rapidly, while that of skilled workers rises slowly. 
After this pattern reaches a peak, the proportion 
of office workers and salesclerks multiplies dramat- 
ically, as does that of professional, technical, man- 
agerial, and administrative personnel. The per- 
centage of service workers at all levels of skill also 
increases sharply. The expansion of the white-collar 
sector results from many forces, the most impor- 
tant of which is the need for more highly trained 
workers to handle the problems of an increasingly 
complex and interdependent economy. 

These industrial and occupational changes ac- 
celerate the geographic mobility of workers, in- 
crease the rates of job change within and among 
industries, and quicken the mixture of socially and 
culturally heterogeneous populations in the work 
world, Such dynamics reflect not only a broadening 
and diversification of product and service markets 
in response to ubiquitous changes in consumption 
styles but also the growth in size and complexity 
of economic and social organizations. 

Undoubtedly one of the major causes of change 
in the occupational structure is the relentless divi- 
sion of labor, which, on the one hand, lowers skill 
levels by accelerating specialization and, on the 
other, demands more technically qualified special- 
ists not only in production but in the necessary 
Supporting services of research, planning, person- 
nel development, market analysis, and industrial 
relations. The very concept “division of labor” may 
be misleading because it implies a process of con- 
tinuous job simplification, whereas, in fact, the 
diversification of labor occurs at all skill levels and 
often calls for higher skills. 

Since the patterns of occupational changes in 
the industrial countries of the West have been 
Strikingly similar, one might conclude that all 
industrializing nations would undergo similar evo- 
lutionary patterns. They do not, for three important 
reasons, First, the technology and industrial organ- 
ization that is exported from economically devel- 
Oped countries to less industrialized countries is 
highly advanced; thus, receiving nations have no 
need to recapitulate the industrial changes of the 
exporting countries. Second, the organizational 
Structure necessary for industrialism is also ex- 
Ported; thus, the independent evolution of such a 
Structure is not required. This structure includes 
the necessary governmental apparatus for indus- 
trialism, such as employment services, educational 
Systems, social security agencies, collective bar- 
gaining laws, regulatory agencies, and other 
Paraphernalia of industrial society, such as occu- 
pational associations, learned societies, and ad- 
ministrative and professional services. Third, the 
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demographic, social, and cultural milieus of newly 
industrializing nations differ; consequently, unique 
adaptations and inventions must occur in every 
society (Kerr et al. 1960). 


Consequences of changing structures 


Focusing only on the evolution of occupational 
structures masks the changes that occur in specific 
occupations. Many occupations are altered or elim- 
inated, and new ones arise in response to both 
technological innovations and reorganizations of 
work systems. Continuous organizational and tech- 
nological revolutions force changes both in occu- 
pational reward systems and in social relations in 
and out of work, which are contingent on occupa- 
tional status (Hughes 1958, pp. 11-22). Tradi- 
tionally, both technological and organizational 
innovations have been aimed at manual workers, 
often damaging their economic and social welfare. 
More recently, automation and rationalization of 
work organization have been applied at higher 
occupational levels, affecting successively skilled 
workers, office personnel, technical and profes- 
sional personnel, and most recently, the managers 
and administrators themselves. 

Inconsistencies. Since innovations have not 
been uniformly and consistently applied across the 
occupational structure, ambiguities and inconsis- 
tencies have been created in the reward and rela- 
tional systems within both work organizations and 
organizations tangent to them. In the United States 
an income hierarchy still roughly parallels the 
prestige hierarchy of broad occupational families, 
even after age and income are controlled (Miller 
1955, pp. 50-55). Ranking from high to low are 
professionals, managers, clerks, craftsmen, opera- 
tives, service workers, and laborers. Income differ- 
entials among these categories may have narrowed 
in the past twenty-five years. Nevertheless, impor- 
tant inconsistencies exist between the prestige and 
income of occupational families in the United 
States. The median annual income of salaried man- 
agers is higher than that of salaried professionals; 
the income of craftsmen is higher than that of 
clerical and sales personnel; and women operatives 
receive incomes equal to or higher than those of 
female proprietors and salesclerks. 

When comparisons by sex are made, the anom- 
alies are even greater. For example, female pro- 
fessionals, managers, proprietors, and white-collar 
workers all receive median annual incomes lower 
than the median for all semiskilled workers. In- 
consistencies between the prestige and income 
levels of specific occupations are exacerbated when 
their social power is considered. Thus, occupations 
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with strong unions generally have higher incomes 
and status than weakly organized occupations. 

By comparing studies in various countries, 
Davies (1952) showed that incumbents of differ- 
ent occupations tend to agree on the prestige rank- 
ing of broad occupational families. Yet recent 
evidence suggests that such consensus declines 
when the traditional alignment of income and 
influence with occupations is upset. Such imbal- 
ances typically appear in communities undergoing 
industrial and occupational changes. Traditional 
stratification arrangements based upon local own- 
ership of industry are undermined by the invasion 
of absentee or governmentally owned industries, 
which have altered not only the local occupational 
structure but also its wage and prestige markets. 
Moreover, as organized labor succeeds in raising 
the relative incomes of manual workers, these oc- 
cupations become more powerful politically than 
many white-collar occupations. 

Resistance to change. Alterations in traditional 
occupational stratification systems, whatever their 
origin, are resisted by occupational groups, first by 
informal means and later by formal collective bar- 
gaining. In its early stages, collective action tends 
to be protectionist; when successful, it aggressively 
seeks advantages over competing occupational 
groups. In large bureaucratic settings, formal bar- 
gaining is initiated by manual workers, while in- 
formal bargaining is practiced by white-collar and 
technical workers. As technological and organiza- 
tional innovations are introduced in higher eche- 
lons, even professional and managerial employees 
begin to formalize the collective bargaining process. 

Where free labor markets exist, collective bar- 
gaining institutions of different occupational groups 
grow at an uneven pace. However, persistent un- 
resolved conflicts among occupational associations 
everywhere lead to restrictive legislation and direct 
governmental involvement in settling disputes and 
planning industrial-relations policies. Some govern- 
ments grant certain occupational associations 
quasi-legal powers to resolve disputes; some may 
prohibit formal collective bargaining altogether, 
assuming direct administration of the occupational 
reward systems. 


Occupational mobility 

Occupational mobility may be discussed by ex- 
amining both the mobility of the occupations them- 
selves and individual movement from one occupa- 
tion to another. Mobility may involve changes in 
locus, function, income, prestige, power, inde- 
pendence, or other occupational attributes. Most 
studies have focused on prestige changes, assum- 


ing incorrectly that other attributes change in the 
same degree and direction. 

Group mobility. Although both occupations 
and individuals may exhibit mobility, there has 
been relatively little systematic research of the 
former. The downward mobility of barbers has 
accompanied the rising status of the physician. 
The upward mobility of nurses has been accom- 
plished by the sloughing off of dirty work to new 
occupations lower in the medical hierarchy (Hughes 
1958, pp. 42-55). Obviously occupations differ in 
capacity to determine either their rank or their 
social fate. In highly industrialized societies, occu- 
pations that legitimize the social order, have high 
political power, coordinate other occupations, or 
require long technical training have the greatest 
prestige and income and evidence the least down- 
ward mobility. Those occupations most exposed to 
changes in technology and in popular consumption 
patterns experience the greatest mobility. 

While changes in income and power can be 
directly measured, changes in occupational pres- 
tige depend upon alterations in audience evalua- 
tions. Research shows that the prestige level of oc- 
cupations is strikingly resistant to change (Hodge 
et al. 1964). However, changes in the size of an 
occupational group, its market demand, independ- 
ence, and financial rewards affect how it is evalu- 
ated. Thus, sales and office clerks have lost prestige 
as millions have entered these occupations, and 
atomic physicists have gained social honor because 
of their short supply. Small businessmen have lost 
status with the growth of corporate management, 
and some professions have lost prestige as they 
have lost their independence and become bureauc- 
ratized. Conversely, the social standing of man- 
agers of big businesses and large governmental 
agencies has increased relative to that of proprie- 
tors of small business. 

Studies of individual career mobility. Vertical 
occupational mobility increases in a dynamic econ- 
omy. It may be measured in the space of the indi- 
vidual’s work life or over a period of two or more 
generations. Research on occupational mobility 
within the life span reveals that the majority of 
workers do not experience orderly and regular up- 
ward occupational movement. Most occupational 
mobility takes place within the first ten years of 
work life, and the worker generally finds his regu- 
lar niche at this time. 

Mobility studies have been made both for broad 
occupational categories and for specific occupa 
tions. Research results for broad categories até 
consistent in societies with free labor markets. 
Miller and Form (1951, pp. 541-765) have de- 
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scribed patterns of occupational mobility for those 
in the labor force ten or more years. Professionals 
exhibit the most orderly patterns, for they begin 
their work life on the professional level. Proprietors 
and managers generally move in a rapid and or- 
derly way to their positions, achieving long tenure 
on the job. Skilled manual and clerical workers 
change jobs relatively frequently in the first part 
of their work lives but exhibit great stability later 
on, Domestic and personal service workers and 
semiskilled and unskilled workers change jobs dur- 
ing their entire work life and rarely find security 
or stability; they do not have careers in the tradi- 
tional sense. 

Studies have also been made of mobility patterns 
within broad occupational families. For example, 
Reiss (1955) found that various types of profes- 
sionals (established, new, semiprofessional, would- 
be, and marginal) have different patterns of 
mobility. The degree of instability, risk, and depend- 
ency increases as one moves from the established 
to the marginal professions, reflecting differences 
not only in their market and organizational condi- 
tions but also in their social origins. 

Studies of intergenerational mobility. Some 
studies have pointed to the importance of the strati- 
fication position of the family for setting the occu- 
pational mobility patterns of the children. Children 
of manual workers usually neglect job planning at 
school, where they receive less and poorer voca- 
tional guidance than do children of white-collar 
workers; upon leaving school, they take any work 
offered them and move haphazardly from one job 
to another. By comparison, children of white-collar 
workers are more likely to plan their careers, Te- 
ceive better guidance in school, and exploit the 
informal occupational contacts that their families 
and friends have. In short, they live in a different 
Opportunity structure. 

The study of intergenerational mobility focuses 
directly on the extent to which children inherit the 
occupational level of their parents. In an open Op- 
portunity structure, assuming equality in ability 
and aspiration, the incumbents of all occupational 
levels should have fathers who represent the entire 
Occupational structure. 

In the United States the traditional ideology of 
an open class system has emphasized upward occu- 
pational mobility. The availability of free land and 
expanding industry allegedly guaranteed ample 
Opportunity to all. The great depression called this 
ideology into question, however, and scholars be- 
gan to measure the amount of occupational mo- 
bility that actually was occurring. Davidson and 
Anderson (1937) surveyed San Jose, California, 
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and found that one-half to two-thirds of the work- 
ers at any given occupational level had fathers 
whose occupations were at the same or adjacent 
levels. That occupational inheritance was the rule 
rather than the exception was then demonstrated 
in a spate of studies of business elites, rural elites, 
people listed in Who's Who, millionaires, and 
finally a national sample. The conclusion drawn 
from these studies was that mobility was decreas- 
ing and class lines were hardening. 

Although the methodological crudeness of these 
studies casts shadows on the validity of some of 
their findings, subsequent research generally sup- 
ported the conclusion that a completely open occu- 
pational structure does not exist. However, there is 
disagreement about the degree to which it is 
closed. Many studies of communities in the United 
States, as well as three national cross-sectional sur- 
veys, have documented that from one-half to two- 
thirds of adult males are in the occupational levels 
of their fathers or in the immediately adjacent 
levels (Jackson & Crockett 1964). These studies 
also show that most mobility occurs within the 
manual-work sector and within the white-collar 
sectors, and relatively little occurs between the 
sectors. 

Studies in western European countries, Aus- 
tralia, Japan, and the Soviet Union confirm these 
findings. Although these studies do not agree on 
whether or not vertical occupational mobility is 
increasing, they do suggest that economic pros- 
perity and depression are at least partially respon- 
sible for changing rates of vertical occupational 
movement. 

Early studies did not take into account the ris- 
ing prestige level of the entire occupational struc- 
ture as increasing proportions of workers moved 
into the white-collar, professional, and managerial 
occupations. Rogoff and Goldhamer devised a 
formula that takes into account changes in the 
occupational structure, while measuring individual 
mobility (Rogoff 1953). Applying this formula, 
Rogoff found no changes in the degree of vertical 
occupational movement in Indianapolis from 1912 
to 1941. A comparable study for Rome, Italy, 
extending over almost the same period, provided 
almost identical results (Lehner 1954). Using the 
same formula, Warner and Abegglen (1955) 
found that an increasing proportion of the nation’s 
big business leaders were being recruited from 
lower occupational strata. 

Comparisons of mobility rates. There has long 
been speculation on the amount of occupational 
mobility in different countries. Lipset and Bendix 
(1959) compared results of studies of the occu- 
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pational origins of representative populations of 
the United States, France, Germany, Great Britain, 
Italy, Finland, the Netherlands, Sweden, and 
Japan. Although the methodologies of these studies 
were not precisely comparable, they permitted com- 
parisons of occupational origins of farmers, manual 
workers, and white-collar workers. The data do not 
support the conclusion that occupational mobility 
rates of the United States are characteristically 
higher than those of industrial European countries. 
All countries showed high mobility of farmers’ sons 
into urban manual jobs and of sons of urban 
manual workers into white-collar occupations. 
Most of this mobility was in response to changes 
in the occupational structure. Probably the major 
differences in mobility rates of industrializing na- 
tions reflect differences in the degree and pace of 
industrialization. 

Nations that are at the same level of economic 
development and are industrializing at the same 
pace tend to exhibit similar rates of vertical occu- 
pational mobility. However, the pace and level of 
industrialization do not bear a unitary relationship 
to the degree of closure of an occupational struc- 
ture. Two nations with similar patterns of indus- 
trialization may differ in their degree of vertical 
mobility, because recruitment into any level may 
be from adjacent levels or from all occupational 
levels. Even where there is no structural mobility, 
the degree of generational occupational circulation 
may be high or low. 

Jackson and Crockett (1964) measured the de- 
gree of occupational mobility in the United States 
as revealed in three national surveys for 1947, 
1952, and 1957. Using three categories (farmer, 
manual, and nonmanual), they found that in all 
three periods almost half the sons had moved out 
of parental occupational levels. They concluded 
that about one-quarter of the movement was due 
to structural causes and one-fifth to other causes. 
Not enough differences were found among the 
three periods to establish a secular trend. 

Evaluation of research findings. Conclusions 
from mobility studies must be tentatively held. The 
crude occupational categories that are used mask 
much of the mobility that actually occurs. More- 
over, that all movement from skilled manual to 
white-collar jobs represents upward mobility in all 
stratification dimensions is questionable. Yasuda’s 
Japanese studies (1964) suggest that a more ac- 
curate method of studying intergenerational mo- 
bility is to compare the subject's first job with that 
of his father, to pay special heed to the mobility of 
the first son where primogeniture is practiced, and 


so on. Moreover, the significance of the mobility 
must be determined by ascertaining whether rela- 
tive changes in income and prestige in fact accom- 
pany occupational changes. It is even possible for 
nonmovement to represent mobility when the 
occupation itself is mobile. 


Careers 


Mobility may be examined either as a function 
of the occupational structure or in terms of indi- 
vidual careers, Three major types of definitions of 
“career” are found. The first emphasizes notable 
achievement in any occupation; the second, any 
pattern of occupational change; and the third, a 
succession of related occupations that are hier- 
archically arranged and through which a worker 
rises in an ordered sequence. The last definition 
most closely corresponds to the historic use of 
the word. 

As a construct, a career is the opposite of ran- 
dom job mobility. Careers imply a long, if not a 
lifetime, commitment to moving upward through 
a series of related occupations and statuses accord- 
ing to a schedule. They are, therefore, associated 
with situations in which occupational mobility is 
considered the norm. Typically, a career involves 
not only systematic education for the initial occu- 
pation but also systematic occupational experience 
in which each occupation is considered as tech- 
nical and social preparation for the succeeding 
ones. The cursus honorum in ancient, medieval, 
and later societies arose when the expertise needed 
to maintain certain private and public services 
could not be satisfied by isolated occupations. The 
very existence of careers implies the existence of oc- 
cupational associations to regulate training and to 
supervise mobility. These associations stabilize not 
only the occupational order but also society itself. 

Societal influence on individual careers precedes 
entry into the labor market and persists into the 
retirement period. Family socialization influences 
work values, occupational aspirations, amount of 
formal education received, the type of first employ- 
ment, and later employment opportunities. Social- 
ization continues on the job and affects future life 
chances and job mobility. 

In traditional societies or in societies at early 
stages of industrialization, career sequences are 
more or less institutionalized from one generation 
to the next. Some instances survive in modern 50- 
ciety. In these instances strong occupational as- 
sociations assure the orderly recruitment, training, 
and movement of incumbents through a series of 
statuses. 


_—— 


Seen historically, the ability of different occu- 
pational groups to define and control careers varies 
enormously. The ability of the colleagueship to 
build full, firm, and lasting commitments varies 
according to its degree of control over the forces 
affecting the occupation. Career instabilities have 
been greatest in those occupations with the lowest 
skill and educational requirements, the most ex- 
posure to technological changes, and the weakest 
organizations. 

Careers in industrial societies. In contemporary 
industrial society only a minority of the work force 
participates in careers. Yet the career remains a 
powerful ideal in the life plans of both individuals 
and occupational groups. Perhaps as a survival 
from status societies, some professions retain ca- 
reer characteristics (for example, law, military, 
clergy, medicine, and perhaps education). How- 
ever, careers are possible for others, such as clerks 
in governmental bureaucracies, managers in large- 
scale industries, and skilled workers and tech- 
nicians in growing enterprises. 

The very question of the nature and stability of 
careers in industrial society is the organizing 
framework for research on the subject. Wilensky 
(1960) has demonstrated that career opportunities 
are greater in work organizations that have more 
supervisory levels, a higher ratio of managers to 
workers, and a more rapid rate of growth. He also 
demonstrated (1961) that workers with orderly 
careers are more socially integrated not only in 
their occupations but in the broader community; 
compared to those who experience random job 
movement, they participate more in all forms of 
social organization, both formal and informal, in 
and out of work. Moreover, their nonwork and 
leisure activities are more closely integrated with 
their occupational life. 

Industrialization and the growth of large-scale 
work organizations have had enormous but contra- 
dictory effects on careers. While the number of 
possible careers increases with the growth of 
bureaucracies, the instabilities introduced by tech- 
nological and organizational changes impede or- 
derly escalations in income and social honor. The 
growing complexities of the industrial order de- 
mand longer periods of educational preparation 
and more retraining during the career. The grow- 
ing number of echelons and bureaucratic reorgan- 
izations creates more career stages but also more 
interruptions in orderly ascent. Increasing tech- 
nical specializations make many careers less visi- 
ble, thus shrinking their status audiences and 
enticements to recruitment. The dissatisfactions 
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arising from unanticipated and often unwanted 
changes and the lack of status satisfaction lead to 
the abandonment or switching of careers. If these 
conditions are aggravated, careers may cease to 
exist. 

Development of protective associations. Just as 
insecurities among manual workers stimulated the 
formation of labor associations, so insecurities 
among career-oriented white-collar workers have 
stimulated the formation of protective associations. 
The routinization of office work has prompted 
many white-collar workers to join unions, and the 
bureaucratization of professions has also stimu- 
lated the growth of protective associations. The 
so-called professionalization movement in many 
occupations represents in fact an attempt to reduce 
risks in planning careers. In seeking to specify 
educational requirements, conditions of work, pay- 
ment, promotions, and benefits, “professional” as- 
sociations seek to commit their members to the 
occupation, thereby decreasing their mobility and 
creating orderly careers, Success here varies ac- 
cording to types of work structures and their eco- 
nomic characteristics. Heretofore, it has been 
assumed that the managers responsible for intro- 
ducing changes in large-scale industrial, political, 
military, religious, educational, and welfare bu- 
reaucracies would be able to secure their own 
careers. However, the introduction of operations 
research and the computer to solve problems of 
management has threatened even managerial ca- 
reers. It is now problematic whether any occupation 
has its destinies in its own hands. 


WILLIAM H. FORM 


[See also LABOR FORCE, article on MARKETS AND MO- 
BILITY; PROFESSIONS; SOCIAL MOBILITY; STATUS, 
SOCIAL; WORKERS.] 
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Occupations and Social 


OCEANIAN SOCIETY 


Oceania refers to Australia and to those Pacific 
islands situated between (and including) the 
Hawaiian archipelago and the Marianas Islands in 
the north, Easter Island in the east, New Zealand 
in the south, and New Guinea in the west. These 
boundaries are essentially ethnological and, in 
some respects, arbitrary. Although only a few 
scholars think that there have been significant 
human interchanges—biological or cultural—be- 
tween this region and the Americas, the western 
boundary is anything but sharp. Prior to the 
colonial era people of the Marianas and West Caro- 
lines seem to have had little or nothing in common 
with the Ryukyuans to the north, but their past 
relations with the Philippines are clearly demon- 
strable in language, culture, and physique. Links 
between New Guinea and islands west of it are 
even more evident; in fact, the Moluccas constitute 
something of a transition zone. 


Geography 

Our concern with the physical environment of 
Oceania is twofold. First of all, we are interested 
in those environmental features which have had 
some relevance to the social behavior of peoples 
with nonmetallic technologies, nonurban. settle- 
ment patterns, and largely nonscientific ideologies. 
For such peoples the presence or absence of min- 
eral deposits, deep harbors, or natural grazing pas- 
tures was largely irrelevant, but these very factors 
did become relevant to native behavior through the 
intermediacy of alien whites and Asians. 

For the native Oceanians the region provided 4 
wide range of natural assets as well as a formida- 
ble array of liabilities (Oliver 1951). In Australia, 
the climate nowhere reached such extremes as to 
render any large zone entirely uninhabitable. In 
fact, the populace tended to concentrate, regard- 
less of climate, in places where natural foods were 
most abundant, i.e., in the humid and tropical 
north as well as in the temperate southeast. The 
natural foods relied upon by the hunting and gath- 
ering peoples included kangaroos, cassowarles, 
snakes, lizards, turtles, fish, grubs, fruits, roots, 
seeds—in fact, almost everything the land and 
water produced that was even conceivably edible. 
The Australians’ direct and, one might say, indis- 
criminately total reliance upon the continents 
given resources for their subsistence may help to 
explain many of the similarities among aborigin 
cultures noted by most students. But by the same 
token, local differences in the kinds and quantities 
of those resources also resulted in the development 


of some regional differences in other domains of 
cultural life. 

Unlike the Australians, other islanders were 
primarily gardeners; hence the factors of rainfall, 
topography, and soi] were of more immediate im- 
portance than direct availability of wild plants and 
animals. The islands of Oceania may be divided 
into several more or less distinctive types in regard 
to these features. 

The “continental” islands are New Guinea, New 
Britain, New Ireland, Bougainville, and the moun- 
tainous archipelagoes which culminate in Fiji in 
the east and New Zealand in the south. These 
islands rise from a vast submarine platform which 
extends outward from Asia. The bold relief and 
wide, diversity of soil types, coupled with local dif- 
ferences in climate, have produced numerous 
sharply distinctive natural areas: bleak mountain 
summits, fern-forested uplands, grassy plateaus 
and high valleys, magnificent rain forests, scrubby 
jungles, riverine swamps, foothills, sandy coastal 
shelves, flat offshore reef islets, etc. This geo- 
graphic diversity has contributed to the cultural 
diversity which is a hallmark of this portion of 
Oceania, 

The remaining islands of Oceania are much 
smaller, more dispersed, and consist of just three 
basic landforms: high volcanic peaks, low coralline 
atolls, and raised-coral “pancakes’—or combina- 
tions of these, each affected by differences in age, 
weathering, and climate. In addition, the proximity 
to supplies of marine food has served, in some 
places, to reduce the direct dependence upon soil. 

Opportunities for formulating and testing hy- 
potheses about human behavior are enhanced by 
the insular nature of the region, which provides 
the researcher with laboratorylike controls found 
in few other regions of the world. In island Oceania 
wide stretches of ocean or hazardous natural bar- 
riers helped to isolate human communities from 
one another for years or even centuries at a stretch; 
and the Australians, although in contact with each 
other, were themselves more or less isolated from 
the rest of humanity for many thousands of years. 
But before describing the uses that social scientists 
have made of data obtained in Oceania, we shall 
sketch the outline of mankind's history in the area, 
as reconstructed by archeologists, linguists, and 
ethnologists. This reconstruction is, of course, im- 
mensely interesting in itself as a chronicle of some 
fascinating chapters of human history; but its rel- 
evance in this article consists of the light it can 
shed about some of the events whose sequels pro- 
vide social science with such varied and amenable 
resources for research. 
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Ethnology 

Skeletal fragments and crude stone artifacts 
found on Java demonstrate that tool-making homi- 
nids inhabited at least the Greater Sunda Islands 
as early as the first interglacial period, but the 
oldest human remains yet found in Oceania (ie. 
in Australia) go back no further than ten to four- 
teen millennia. Since archeology is just beginning 
in Australia and New Guinea, it is reasonable to 
anticipate some deepening of their chronologies 
in due course. But it is interesting and probably 
indicative that no excavations carried out elsewhere 
in Oceania have revealed traces of humanity dating 
back beyond 3,500 years ago. It is simply unlikely 
that much earlier than that there were any boats 
in the western Pacific capable of reaching such 
places as Hawaii, New Zealand, or even Fiji. And 
as for movements from the east, I stated at the out- 
set my firm belief that Oceania’s populations and 
cultures derived ultimately from the southern and 
eastern shores of Asia. There may well have been 
added a few genes and a few culture traits from 
the Americas, but if such were the case they were 
relatively late and comparatively insignificant. 

There is no demonstrable basis for linking race 
with intellectual potential, but race—or at least its 
visible criteria—has some relevance to the student 
of social behavior in Oceania. It has figured, for 
example, in natives’ estimates of each other; and 
it has greatly influenced whites’ attitudes towards 
natives (e.g. the light-skinned, straight-haired 
peoples of Polynesia have by and large been treated 
with less contempt than their darker-skinned neigh- 
bors of Australia and the western islands). But 
knowledge of the genetic composition of Oceania’s 
population could conceivably also provide helpful 
clues concerning culture history. 

Few systematic studies of race have been carried 
out in Oceania, save in Australia and southeastern 
Polynesia, and the specialists differ in their inter- 
pretations of the findings. Although there is nearly 
universal agreement upon Asia's having been the 
source of Oceania’s populations, there is no con- 
sensus concerning the identity or the sequence of 
the several genetic strains that are evidently pres- 
ent in these populations. 

There is a difference of opinion even with re- 
spect to the make-up of Australia’s quite distinctive 
aboriginal population—the dark-skinned, curly- 
(not frizzly ) haired individuals with massive brow- 
ridges and low, broad noses. On the basis of 
some marked regional differences in physical fea- 
tures, some specialists posit three separate racial 
components: a short-statured negroid type; a 


256 OCEANIAN SOCIETY 


larger-bodied, lighter-pigmented, more hirsute type 
reminiscent of the Ainu of northern Japan; and 
a more slender, dark-skinned, curly-haired type 
similar to the Veddas of Ceylon. According to this 
view, these three types arrived in separate waves 
—or trickles—and have interbred somewhat, but 
not homogeneously, during the succeeding millen- 
nia, According to another view, the aborigines 
were of the same race to begin with and have 
developed their regional differences since arrival 
on the subcontinent. For the social scientist 
these contrary views are not without relevance: if 
the population can be shown to be tri-hybrid in 
origin, researches will logically focus on explaining 
the many cultural similarities found throughout the 
continent and vice versa. 

For the rest of Oceania the racial composition is 
even more complex and variously interpreted. The 
archipelagoes containing the so-called “continen- 
tal” islands, from New Guinea to New Caledonia 
and Fiji (but not New Zealand), are inhabited 
mainly by populations with frizzier hair and some- 
what darker skin colors than possessed by their 
neighbors to the north, east, and south. This cir- 
cumstance has led to the area being labeled “Mel- 
anesia” (“black islands”), a term which is rather 
inaccurate and has proved to be mischievously 
misleading. In the first place, although there are 
many dark brown and even coal black populations 
within Melanesia, there are also many others no 
more heavily pigmented than, say, natives of 
Tahiti or Tonga. Second, this regional division 
based on somatic criteria has been arbitrarily per- 
petuated by ethnologists in the cultural sphere. 

Within Melanesia the range of racial types (or 
subtypes) is very wide. Stature ranges from pyg- 
moid to tall, pigmentation from light copper to jet 
black, prognathism from absent to pronounced, 
etc., and there are no obvious correlations, direct 
or inverse, between these attributes. Some popula- 
tions look remarkably Australian (except for hair 
forms), others like frizzly-haired Mongoloids, and 
still others (with light pigmentation and high, 
beaklike noses) resemble no other physical types 
anywhere, 

Elsewhere in Oceania—in the far-flung archipel- 
agoes of Micronesia and Polynesia—physical types 
tend to be more uniform: the population becomes 
more “Mongoloid” and less “Negroid”; but the sim- 
ilarities (and differences) are not distributed in 
clear enough patterns to provide the specialists 
with unambiguous historical clues. 

In fact, there is enough ambiguity in the racial 
data available for Oceania to permit any number 
of different historical reconstructions (including 


one that posits an American Indian component: 
Asia, after all, is the ultimate source of Oceanians 
and Amerindians). One reconstruction, derived 
from the tri-hybrid Australian scheme, proposes a 
succession of racial immigrations of the foilowing 
order: Ainoid, Pygmy Negritoid, Veddoid, and 
Mongoloid. Another scheme includes Australoids 
(undifferentiated), both pygmy and full-statured 
Negroids, and Mongoloids. Still others (for some- 
what gratuitous reasons ) believe a so-called “Cauca- 
soid” element to be present, especially in the pop- 
ulations of Polynesia. 

Weighing all these alternatives, it seems least 
uncertain, and geographically most logical, that 
Australia and Melanesia were the first to be peo- 
pled, and by some combination of Negroids (short, 
or short and tall) and Australoids (or Ainoids— 
Veddoids); and that these separate strains inter- 
bred in varying degrees in different places. Nor is 
it unreasonable to believe that Mongoloid strains 
were the last to appear, leaving their genetic traces 
along the route, or routes. 

It is unlikely that archeologists will ever turn up 
enough skeletal remains to permit a detailed re- 
construction of Oceania’s whole racial history, and 
social scientists searching for precise and long- 
range historical guidelines cannot expect much 
help from this direction. However, the small sizes 
and relatively great isolation of so many of 
Oceania’s populations render them ideal “labora- 
tories” for studying microevolutionary phenomena 
—e.g., the relationship between physical variance, 
on the one hand, and social structure, ecology, or 
epidemiology, on the other. Here, indeed, are to be 
found ideal opportunities for anthropologists to 
practice what they preach about their concern with 
both cultural and biological aspects of mankind. 

The languages spoken by the Oceanians com- 
prise three great categories: Australian, Austro- 
nesian, and non-Austronesian (Capell 1962; Kliene- 
berger 1957). Quite apart from the intrinsic 
interest of the subject matter, the study of these 
languages, both descriptively and historically, is 
relevant to social science inquiry. Not only is 
knowledge of the local vernacular indispensable for 
all but the most superficial field research in any 
Oceanic society, but ethnographers—and especially 
those who have worked in Oceania—would prob- 
ably agree that a society’s language is a very im- 
portant part of its cultural inventory. And on the 
historical side, findings about language relation- 
ships, genetic and acculturational, provide the best 
evidence we have for culture-historical reconstruc- 
tion in general—and hence for comparative studies 
of social behavior. 


The native languages of Australia (including 
Tasmania) differ markedly among themselves 
in structure and vocabulary, but their outstanding 
student, Arthur Capell, considers them members 
of the same family (1956). Numerous attempts 
have been made to trace their relationships out- 
side Australia; so far these efforts have proved un- 
convincing, but it would not be surprising if future 
research were to turn up some links with non- 
Austronesian languages of neighboring New 
Guinea. 

Prior to the spread of English, Spanish, and 
French in recent centuries, Austronesian was the 
most far-flung family of languages in the world: 
its speakers were spread from Formosa and Malaya 
to Hawaii, Easter Island, and New Zealand (one 
of its western languages even became established 
on Madagascar), Outside Australia and certain 
parts of the “continental” islands, all the languages 
of Oceania are to be classified within this great 
family. 

For many decades it was the conventional practice 
of linguists to subdivide this family into four major 
(and implicitly more or less coordinate) branches: 
Indonesian (including Malay and all the Austro- 
nesian languages of the Philippines, the Sunda 
Islands, the Moluccas, etc., along with Malagasy 
(Madagascar), Cham (Cambodia), Li (Hainan), 
Jarai (Vietnam), Lati (southwest China), etc.; 
Micronesian (all the languages of Palau, the Mari- 
anas Islands, Caroline Islands, Marshall Islands, 
and Gilbert Islands); Polynesian (all the languages 
of Hawaii, Tonga, Samoa, New Zealand, Tahiti, 
Easter Island, etc.); and Melanesian (all Austro- 
nesian languages of New Guinea, the Solomon 
Islands, New Hebrides, New Caledonia, Fiji—ex- 
cept for certain Polynesian language “outliers” 
within the geographic zone of Melanesia). Thus 
the practice of subdividing Oceania according to 
so-called racial (Melanesia, “black islands”) or 
geographic criteria (Micronesia, “small islands”; 
Polynesia, “many islands”) was somewhat arbi- 
trarily carried over into linguistic classification and, 
as will be seen, into general cultural classification 
as well. 

Recent developments in linguistic science, in- 
cluding lexicostatistics and new methods of data 
processing, have stimulated a reappraisal of this 
conventional scheme (Capell 1962; Grace 1964). 
There is anything but consensus among the many 
linguists now studying Austronesian—some depend 
almost wholly on lexical data for their results; 
Others insist that grammatical considerations must 
also be taken into account—but the older four- 
class scheme has been generally abandoned. It is 
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now acknowledged that the languages of the 
Marianas Islands, Palau, and Yap are closer to 
those of the Philippines than. to any in Oceania 
itself. There is also common agreement that the 
several Polynesian languages (or dialects) are far 
too alike to justify placing them™in a genetic po- 
sition coordinate with the many -widely.-varying 
languages of Melanesia. It is in connection with 
the latter that the specialists are in least agree- 
ment. According to one view they remain some- 
thing of a genetically separate unit more or less 
coordinate with a comparable unit of “Indonesian”; 
while in another scheme they are classified into a 
dozen or more units of the subfamily order of 
branching. Also—and this has a direct bearing on 
long-range perspectives of social change—some 
writers view the Austronesian languages of Mela- 
nesia as fusions of the area’s numerous aboriginal 
(and non-Austronesian ) languages with immigrant 
(and, implicitly, quite uniform) Austronesian 
tongues: this is the “pidginization” theory, so called 
by analogy with present-day Melanesian pidgin, 
the “contact” language between Oceanians and 
whites throughout most of Melanesia. This view 
has been sharply challenged, both on linguistic and 
culture-historical grounds. 

In fact, among the Austronesian languages of 
Oceania it is only with respect to the closely inter- 
related Polynesian subgroup that historical relation- 
ships have been sufficiently established to provide 
the social scientist with bases for some controlled 
comparisons of social and cultural systems. One 
can better appreciate the attractive possibilities for 
this kind of research by taking note of the likeli- 
hood, suggested by lexicostatistics, that all the 
known Polynesian languages derive from a single 
language which began branching not much more 
than two thousand years ago, and that during their 
subsequent histories many of them had no contact 
with non-Polynesian speech. 

The label non-Austronesian has been given to 
those languages of island Oceania not classifiable 
as Austronesian; they are to be found on New 
Guinea, New Britain, New Ireland, and the north- 
west Solomon Islands, as well as on Halmahera 
and other islands of eastern Indonesia. From their 
distribution—mainly in the interiors of the large 
west Melanesian Islands—it has generally been 
assumed that they are survivors of the tongues 
spoken in this region before the spread of Austro- 
nesian, Unlike the Austronesian family and the 
languages of Australia, the non-Austronesian lan- 
guages have so far resisted the efforts of linguists 
to link them into a single genetically interrelated 
unit, although they do not appear to be quite so 
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fragmented as was once believed. In New Guinea, 
for example, linguists have discovered in the east- 
ern highlands a very extensive “stock” comprising 
some 750,000 speakers (Capell 1962); other such 
unities are likely to emerge as more professional 
linguists turn their attention to parts of Oceania 
where the native languages have not even been 
recorded, much less studied. 


Ethnography of Australia 


Australian aborigines (Elkin 1938; Berndt 1965) 
got their food by hunting, fishing, and collecting; 
despite occasional contacts with Macassarese and 
Papuans they appear never to have adopted agri- 
culture, And although they kept more or less tame 
dogs that helped them hunt larger game, they 
raised no animals for food. Men hunted (and 
fought) with spears, clubs, throwing sticks, or, in 
some areas, bows and arrows; women grubbed up 
roots and insects with digging sticks. Life was 
nomadic, in pursuit of widely scattered and season- 
ally variable food supplies; shelters were temporary, 
makeshift affairs. Some of the artifacts were fash- 
ioned out of stone, bone, and shell, but plants pro- 
vided the materials for most objects of daily life. 

It has been estimated that at the time of initial 
European colonization some two hundred years 
ago, there were no more than about 300,000 peo- 
ple inhabiting Australia—probably a fairly stable 
figure in view of their seemingly unchanging tech- 
nology and their millennia-long residence. It is not 
unlikely that the distribution of the population had 
also reached a point of stability in adjustment to 
the continent's several geographic zones, with the 
heaviest concentrations in the temperate southeast 
and tropical north and the lightest in the arid in- 
terior. Some three hundred languages are said to 
have been spoken in Australia, but these were not 
necessarily contiguous with cultural or political 
distinctions. 

The nuclear family, modally if not normatively 
monogamous, was the basic residential unit of 
society. In some areas, and during certain parts of 
the natural seasonal cycle, individual families trav- 
eled separately, and although males and females 
contributed differently, food was usually shared. 
When the availability of food permitted, but also 
for social and ritual purposes, several families con- 
gregated into bands (or “hordes”) of various sizes 
and degrees of integration. 

In addition to families and bands, Australian so- 
cieties were divided into various other kinds of 
social units based on locality, kinship, age, and sex 
—or combinations of these factors. One relatively 
simple and fairly widespread kinship structure con- 


sisted of unilinear and exogamous moieties. Socie- 
ties were sometimes divided into four or eight such 
parts. In the view of some analysts these arrange- 
ments functioned mainly to regulate marriages, 
while other writers consider them to be classifica- 
cation devices for the convenient ordering of one’s 
numerous kinfolk—i.e., all other members of one’s 
community. 

The factor of age also received emphasis in 
almost all aboriginal societies. Particularly for 
males, the cycle of growing up and aging was 
associated with a series of ritual events. These were 
carried out within the context of localized all-male 
sodality that was stratified into more or less age- 
graded subgroups. Some of these rituals included 
extreme forms of body mutilation (e.g., subinci- 
sion of the penis) along with ceremonial dances 
and recitations of great religious depth and drama. 
The form and content of these rituals, along with 
their theological connotations and their social func- 
tions, varied considerably from place to place; but 
they were widespread enough and similar enough 
to be considered a very characteristic—but of 
course, not distinctive—feature of aboriginal Aus- 
tralian culture. 

Another characteristic feature of Australian life 
was the absence of anything approaching occupa- 
tional specialization. Individual differences in skill 
and knowledge and stamina were recognized, but 
expert hunters, warriors, artists, magicians, flint- 
knappers, etc., were not relieved of the ordinary 
chores of subsistence, and they received few mate- 
rial rewards for their specialties, Some individuals 
undoubtedly produced goods that were surplus to 
their own families’ subsistence needs—such things 
as stone spear points, cordage, mineral pigments— 
or benefited from occasional windfalls of meat or 
fish. The limited local exchange and long-distance 
trade of these goods were usually carried out with- 
in the context of kinship and with some ceremonial 
elaboration. However, there were probably no bands 
capable of producing enough over-all surplus to 
sustain full-time specialists of any kind. 

Perhaps the most prestigious of skills was the 
ability to chant from memory the interminable 
myths, prayers, and formulas which formed indis- 
pensable parts of various rituals. Individuals pos- 
sessing this skill who had also moved up through 
the ranks of the age-graded men’s sodalities 
achieved a status that commanded some measure 
of authority in community affairs. Compared, how- 
ever, with most other societies in Oceania, the ine 
stitutionalization of authority in aboriginal societies 
was not very developed. 

No aspect of Australian life has attracted more 


“scientific” attention than the so-called religious 
beliefs and practices. Living, as the aborigines do, 
in symbiosis with their physical environment, they 
have animated it so anthropomorphically and so 
comprehensively that their perceptions of the uni- 
verse appear to contain no boundaries between 
mankind and the actual or imagined populace of 
nature. One of their most widespread beliefs, for 
example, consists of linking certain animals and 
plants—generically or individually—with each of 
their enduring social units or categories. Such link- 
ages are usually conceived of in terms of kinship 
and not infrequently involve restrictions against 
eating or rituals aimed at magical increase of the 
species involved. In some places even mountains, 
pools, stars, thunder, rain, and sneezing are either 
individually or generically assimilated into the social 
structure. The myths and rituals embodying these 
beliefs are as diverse and bizarre as they are long 
and dramatic. Fertility—of nature and of human- 
ity—is a theme which runs through many of them; 
and they are enacted through songlike recitation, 
dance, and instrumental music. 

Finally, this brief inventory of institutions would 
be incomplete without mention of the graphic art 
of aboriginal Australia. Students have only recently 
begun to study the rich domain of painting, carv- 
ing, and engraving—naturalistic and abstract, pub- 
lic and esoteric. Although these deserve serious 
enough attention on artistic grounds alone, their 
apparent associations with myth and ritual make 
them intriguing subjects for social science as well. 


Ethnography of island Oceania 


_ Technology and material culture. As noted ear- 
lier, agriculture was the basis of subsistence 
throughout all of island Oceania (Oliver 1951). 
Even on certain of those arid atoll islets where soil 
is lacking, natives laboriously imported soil for 
gardens (Barrau 1958). In places dependent main- 
ly on self-propagating tree crops some effort was 
occasionally spent in protecting and tending the 
plants, and some supplementary gardening was 
usually practiced as well. The main “tree” crops of 
the islands were coconuts, sago, breadfruit, panda- 
nus, and bananas. The first European visitors found 
coconut palms growing on nearly every inhabited 
island except Easter Island, New Zealand, and 
Chatham Island. These trees thrive best in lower 
altitudes near the coasts and provided islanders 
with food, drink, oil, containers, fibers, thatch, and 
Construction wood. Sago palms grow semiwild in 
many swampy areas, particularly on the larger 
ontinental islands; the starch extracted from the 
Palm’s pith was the staple food in many riverine 
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and coastal communities. Breadfruit is most pro- 
lific in the volcanic soils of the central and eastern 
islands; although fruiting only seasonally, this tree 
produces bounteously and requires little care. Some 
varieties of the pandanus, or screw pine, produce 
a fruit which can be made partly edible and which 
serves only as a famine food on richer islands but 
is the main vegetable food on some of the arid 
atolls, Bananas (including plantains), which grow 
in most of the moist tropical areas, varied widely 
in culinary importance, from a staple food to an 
occasional supplementary one. 

Of the root crops, both wet-land and dry-land 
varieties of taro were cultivated; yams were grown 
widely both for food and for purposes of display; 
and sweet potatoes were adapted to poorer soils 
and cooler climates. 

The islanders supplemented these crops with 
wild roots, stems, shoots, fruit, and leaves. The 
only part of Oceania in which natives cultivated 
rice was in the Marianas, another trait linking these 
islands with the Philippines. 

Each of the vegetable staples required different 
production techniques and resulted in a wide range 
of cultural variations. Sago, for example, could be 
collected at any time of the year and was preserved 
by a laborious process. In contrast, breadfruit re- 
quired little processing but fruited only once or 
twice a year and remained edible only in a fer- 
mented state. 

In comparison with the Australians, most Ocean- 
ian islanders spent little time hunting. A note- 
worthy exception occurred in New Zealand, where 
early inhabitants hunted to extinction the giant 
moa, a large, flightless, ostrichlike bird. On the other 
hand, fishing was a major activity wherever marine 
resources permitted. Streams, rivers, reefs, lagoons, 
and open seas were harvested by means of an ex- 
traordinary variety of tools, watercraft, and tech- 
niques. As in the case of agriculture, differences 
in emphasis on fishing together with differences in 
fishing techniques were reflected in other cultural 
domains—in religious beliefs and ritual as well as 
in the social structure of households and commu- 
nities. 

Canoes have played a central role in the lives 
of Oceanians, and they have been used for fishing, 
everyday transport, and, prehistorically, in the peo- 
pling of this world of islands. Some of the riverine 
and coastal peoples of New Guinea found shallow 
dugouts adequate for their purposes of moving 
about in calm waters, but most other islanders de- 
pended upon outrigger canoes or deep-hulled plank- 
built boats. Although some elements of this com- 
plex reflect the common southeast Asian origin 
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of Oceania’s seagoing heritage, there has developed 
a rich variety of local specialties—in boat construc- 
tion, ornamentation, and handling, as well as in 
navigational principles and skills. 

In many places the building and handling of a 
big canoe was an event of social importance, being 
one of the few instances of large-scale coordinated 
activity. For the social scientist these occasions re- 
veal otherwise unstated premises regarding di- 
vision of labor, authority, and exchange. In fact, 
in seagoing societies such as Tahiti the nomencla- 
ture applied to the various parts of their larger 
canoes was a metaphoric summary of the natives’ 
image of their political relations. 

Like Australians, the Oceanian islanders kept 
dogs—for pets, hunting aids, and sometimes for 
food. Most households also kept a few fowl—if 
“kept” is appropriate for the rather aimless rela- 
tionship in which the fowl were neither fed nor 
eaten with any regularity. It was only on remote 
Easter Island that fowl became important in na- 
tive economy and in ritual. Wherever islanders 
managed to introduce and keep them alive, pigs 
became much more important than dogs or fowl. 
They were eaten at feasts and used in ceremonial 
exchanges. In fact, so highly were pigs valued that 
in some societies they became the prime means 
and measure of political ascendancy. 

In societies like these, where food occupies such 
a dominant position—in productive energy, in so- 
cial interaction, in hierarchies of value, in cult 
focus, in symbolic expression, and so forth—the 
cooking and eating of a meal may provide social 
science with some of its most rewarding data. In 
this connection, then, it should be noted that tech- 
niques of food preparation vary within societies 
and among societies. Cooking was everywhere im- 
portant, although some fish and plant foods were 
occasionally eaten raw. Cooking itself varied from 
simple roasting and pot boiling to large-scale 
baking in community-size earth ovens. Even the 
most elaborate Hawaiian or Samoan menus and 
recipes did not compare with those of Asia, but in 
many places men (festal cooking was nearly every- 
where done by males) knew how to prepare pud- 
dings combining many ingredients in various pro- 
portions. 

Next to water the only beverage universally im- 
bibed was the liquid of unripe coconuts—at least 
where coconut palms grew. On many islands in the 
central and eastern Pacific natives drank kava (or 
“ava, etc.), a mildly narcotic liquid made from the 
root of a cultivated pepper plant. On some islands 
(e.g., Fiji, Tonga, and Samoa) kava drinking 
reached a point of high ceremonial elaboration. 


These ceremonials served to express and reinforce 
community integration and political status. West 
of the kava-drinking part of Oceania, and barely 
overlapping it, were areas of betel chewing extend- 
ing on into the south Asian mainland. In these 
areas betel chewing did not become as ceremonial- 
ized as kava drinking did elsewhere, but its use 
throughout the populations was more widespread. 

Plants were the source of nearly all the cordage 
and textiles made in island Oceania; loom weaving 
was restricted to the Marianas and West Carolines, 
but hand plaiting developed in some places to the 
level of a fine art. 

Matwork and barkcloth were the chief materials 
out of which most clothing, floor covering, bedding, 
sails, and temporary shelters were made. In some 
places finely textured mats and barkcloths circu- 
lated as highly valued objects in networks of re- 
distribution and intergroup exchange. Houses dif- 
fered widely in shape and size; some were built to 
accommodate only a small family, while others 
were spacious enough for hundreds of people. Com- 
parison of local differences can provide insights 
into human inventiveness and the processes of 
adaptation and also into historical relationships, 
but the nature of Oceanian housebuilding has even 
more direct relevance to the social scientist, inas- 
much as most such enterprises involve the actions 
of large numbers of people contributing materials 
and services according to conventional social pat- 
terns. House architecture often provides valuable 
insights into the residents’ views about their social 
universe—views which might otherwise remain 
inexplicit. The residences, for example, very rarely 
contain inner partitions, but for the occupants in- 
ternal space is divided into functionally and sym- 
bolically distinct “rooms”; in fact, in many places 
a house provides space for the living and for the 
dead, for spirits as well as mortals. 

Public structures of many types and utilizing 
varied construction materials were built in island 
Oceania. They served a wide variety of uses: clan 
refuges, exclusive men’s clubhouses, secular meet- 
ing places, temples, forts, theaters, athletic arenas, 
lovers’ trysts, craftsmen’s workshops—in fact, 
nearly everything but market places for buying and 
selling. 

Within recent years the graphic and plastic arts 
of Oceania have aroused keen interest among art 
historians and collectors. The skillfully executed 
masks, ceremonial implements, “idols,” and so on 
are also of interest to the social scientist because 
of their relevance to social behavior. Designs, for 
example, often express magical intent or super- 
naturally protect ownership or clan unity. Or, 


the roughly shaped, grotesque figure may in faith 
be the terrestrial resting place of a powerful and 
handsome god. We cannot begin to describe the 
great variety in materials, techniques, and designs 
found in Oceanic art objects, but the situation is 
not as chaotic as a rapid walk through a museum 
might lead one to believe. In fact, some surveys 
by anthropologically oriented experts have begun 
to delineate for all Oceania a manageably small 
number of distinctive artistic traditions, thereby 
providing social scientists with some new and stim- 
ulating possibilities for investigation (Linton & 
Wingert 1946; Guiart 1963a). 

Ideology and social structure. In the foregoing 
discussion we have dwelt mainly on what islanders 
did and what they made in connection with daily 
living, However, it should at least be pointed out that 
islanders did not go about the business of making 
a living without reflection, in slavish response to 
“custom.” On certain occasions islanders undoubt- 
edly acted because of time-honored and sanctioned 
precedent, but their actions were more frequently 
pragmatic. Perhaps the many different and often 
difficult kinds of physical environments met with 
in the course of their histories in Oceania had 
something to do with this, by placing a premium on 
flexibility and adaptability. Many of their actions 
were based on premises that we would call magical, 
but this is not to deny the presence of a “scientific” 
attitude toward their environment. 

As for the magical ingredient of their thinking, 
neither its logic (homeopathic, sympathetic) nor 
its content (animism, animatism) is distinctively 
Oceanian in any essential way. 

Turning now to the islanders’ pre-European so- 
cial behavior, we begin by acknowledging our 
inability to generalize about the region as a whole 
or about large segments of it. A great deal is known 
about the social life of certain island peoples, but 
there are many more societies about which nothing, 
or next to nothing, is known—with no prospect of 
ever gaining such knowledge in many cases be- 
cause the islanders’ native forms of society have 
completely disappeared under the impact of alien 
influences. And even with what is known—and 
among the studies of single island societies there 
are some of the world’s most complete ethnog- 
raphies—scholars are just beginning to push be- 
yond local description toward wider regional typol- 
ogies of the kind formulated for Australia (e.g., 
Hogbin & Wedgwood 1953; Sahlins 1958: Gold- 
man 1960), 

Settlement patterns. Although many excellent 
ethnographic descriptions treat patterns of resi- 
dence, few attempts have been made at the com- 
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parative study of settlement patterns. Perhaps the 
most typical form of settlement pattern in the 
islands—this is an impression, not an established 
fact—is the small four-to-five household hamlet 
or neighborhood; but there are also numerous in- 
stances of dispersed homesteads, at one extreme, 
and of densely settled villages, at the other. In this 
connection it is an interesting fact that some of the 
largest and most tightly integrated political units 
—e.g., on Tonga—comprised widely scattered 
homesteads, Villages rarely contained more than 
a thousand inhabitants; the average number was 
probably more like two to three hundred. Some of 
the larger villages were to be found alongside rivers 
or lagoons, but they have been noted in other kinds 
of settings as well. In some instances residences 
were clustered near the public places—temples, 
council houses, dance grounds, men’s clubhouses, 
etc.; in others the public places and dwellings were 
kept far apart. Some settlements were surrounded 
by stockades; others lacked defensive constructions 
despite their involvement in periodic warfare. 

Family. The nuclear family was certainly the 
most ubiquitous type of social group in island 
Oceania, although polygyny was permitted in most 
societies. Polygyny was practiced by only the most 
affluent—i.e., those men who could afford the 
bride price or other expenditures associated with 
marriage—but in some of the wealthiest societies 
even the most influential leaders had only one offi- 
cial wife at a time. 

There is evidence that polyandry was formerly 
practiced in some Polynesian-speaking societies, 
but little or nothing is known about its wider social 
contexts. 

With regard to matrimonial rules of residence, 
couples tended to reside near or with the husband's 
male patrilineal kinsmen. The next most prevalent 
pattern among those societies surveyed (Murdock 
1957) was residence near the wife’s female matri- 
lineal kinsmen; but in several other societies these 
alternatives were about equally favored. Still other 
alternatives have been recorded for other societies, 
e.g., residing close to the husband's matrilineal 
kinsmen. 

Even in societies allegedly ignorant of the male’s 
biological role in reproduction (Malinowski 1922) 
social roles of “maternity” and “paternity” were in- 
stitutionalized, although the nature of such roles, 
both in theory and practice, varied widely. At one 
extreme were those societies in which both mother 
and father shared the job of nurturing and social- 
izing their children, with property being trans- 
mitted through both parents. In contrast, there 
were some other societies wherein the sociological 
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father had little or nothing to do with his children’s 
specific upbringing or equipping beyond contribut- 
ing generally to the domestic commissary. In be- 
tween these extremes were numerous permuta- 
tions, usually reflecting each society's general 
conceptualization of kinship. 

Two other fairly characteristic—but of course 
not distinctive—features of island life had to do 
with membership in the family group. In some so- 
cieties, even when a child was recognized as the 
“biological” offspring of a man, the latter was 
called upon to validate the relationship before it 
could become socially operative. The other feature 
of widespread occurrence was the facility and the 
popularity of adoption, especially practiced in the 
eastern parts of the region. 

It is our impression that nuclear families—plus 
one or two other dependent relatives—constituted 
the most typical residential units in the majority 
of island societies, but there were numerous vari- 
ants. In some places households were much larger 
and consisted of composite families—either po- 
lygynous, stem, joint fraternal, joint sororal, or 
some other type. In other places a man spent most 
of his:sleeping and waking hours in his community 
men’s house, visiting his wife and children in their 
household only on occasion. Variations in house- 
hold composition were wide, as were variations in 
collective activity, in kinds and amounts of goods 
owned corporately, in symbols of unity, etc.; and 
all these facets of family and household life were 
surely related more or less directly to each soci- 
ety’s more general institutionalization of kinship. 

Although ties of kinship were not the only kind 
of social bond recognized and institutionalized in 
island societies, they were by all odds the most 
important. In most island societies, every member 
could claim (if not actually trace) some kinship 
tie with every other member. These kinship cate- 
gories each implied some normative pattern of be- 
havior no matter how attenuated by the remoteness 
of the tie or the influence of extraneous factors 
such as locality and social stratification. Indeed, 
relations across tribal and societal boundaries were 
more often than not dominated by considerations 
of kinship. 

Within the context of all-inclusive kinship, 
which characterized most island societies, there 
were, however, some wide differences in the actual 
groupings of kinfolk. In size such groups varied 
from small, sharply defined units to large ones with 
vague or overlapping boundaries. Some groups 
were bilateral in descent, others patrilineal or 
matrilineal. Some were stringently exogamous, 
while in others membership appears to have played 


no direct role in choice of mate. In some societies, 
like certain ones of highland New Guinea, groups 
formed by the male members of patrilineages were 
all-important—maritally, residentially, economi- 
cally, politically, and ritually. In other places actual 
groups of kinsmen—qua kinsmen—were scarcely 
discernible, either interactionally or symbolically. 

What little collation has been done in this do- 
main of social structure indicates that patrilineally 
structured groups predominated in New Guinea 
and matrilineal ones in central Micronesia and in 
parts of western Melanesia. Throughout most of 
Polynesia and in the rest of Micronesia the aggre- 
gates of kinfolk defined by common ownership of 
land and other valuables were ideally more “non- 
unilinear” in membership, although in actuality 
patrilateral ties preponderated, Elsewhere, in cen- 
tral and eastern Melanesia, there existed in close 
juxtaposition all these variants of kinship structure 
(Murdock 1957). 

Other social groups. In most island societies 
there were other kinds of associational ties which 
crosscut those of kinship—ties of coevality, of cult 
commitment, of occupation, and, most important, 
of coresidence. 

Age itself was less influential in island Oceania 
than it was in Australia. Authority and privilege 
did derive from seniority in some societies—espe- 
cially in some of those with patrilineal kin groups 
—but coevality as an organizing principle was only 
sporadically important (e.g., in parts of New 
Guinea and Melanesia, where painful male initia- 
tion rites served to usher boys into cult-focused 
men’s clubs). 

In many island societies, as throughout Aus- 
tralia, the mythical charters which rationalized and 
legitimatized kin groupings were embodied in con- 
gregational ritual. But, in addition, many island 
societies incorporated cult groups whose members 
were only incidentally kinfolk. Examples of such 
were the masoniclike men’s clubs of New Hebrides 
and the intertribal “Dionysiac” Arioi sect of eastern 
Polynesia. : 

Occupational specialization was more marked in 
island Oceania than in Australia, but groupings of 
specialists were rare. In Samoa there were “guilds 
of housebuilders, and in some other island societies 
one might discern the beginnings of other craft 
guilds or of “schools” of savant—priests, but that is 
about all. 5 

Political organization. In most island societies 
neighbors were also kinfolk—in fact or by fictional- 
ization—but coresidence was often more influen- 
tial than kinship as a basis for association. On the 
other hand, the size and degree of integration of 


such political units varied widely. At one extreme 
were numerous societies having no collective-action 
groups larger than localized extended families, At 
the other extreme were a few Polynesian societies 
containing highly organized, territorially based 
tribal units with many thousands of members. 
In between, and most typical, were societies whose 
political units were conterminous with small vil- 
lage or neighborhood communities, or with clus- 
ters of such communities, averaging perhaps a 
few hundred citizens and rarely exceeding fifteen 
hundred. 

Island political units differed not only in size 
but also in domain. Units for waging war varied 
from tightly knit regiments to undependable con- 
federacies of separate kin groups. Actions for the 
maintenance of internal order ranged from com- 
prehensive, centralized policing to uneasy interkin- 
group feuding, wherein the over-all leaders did 
little more than protect their own kin groups’ in- 
terests, In some places a political unit's members 
were all linked in redistributive networks involving 
frequent and copious flows of objects and services; 
in other places little or nothing was exchanged 
among the strata of social hierarchies. And finally, 
whereas in some societies the identities of the 
political units were symbolized and validated in 
influential myths and impressive ceremonies, in 
other places only the most discerning observer 
would have discovered clues to collective notions 
of unity, 

Succession to political leadership was hereditary 
in some island societies, nonhereditary in others; 
and there were differences within each category. 
In instances of hereditary succession, the principle 
of patriliny predominated; and even in societies 
whose kinship groups were matrilineal political 
offices usually passed from male to male. However, 
there were a few recorded instances, mainly in 
Polynesia, of high political office devolving upon 
females. 

Nonhereditary succession to political office char- 
acterized large portions of Melanesia. In what was 
Perhaps its most distinctive variant, wealth was 
an important steppingstone to power. In such 
cases, however, the prestige upon which power 
was based derived not so much from accumulating 
valuables but rather from disposing of them—in 
potlatchlike feasting or in conspicuous waste. 

But many island societies may not be so exclu- 
sively typed: in some, individuals born to high 
Office had also to prove themselves capable of exer- 
cising it; in other cases they had to vie for office 
with low-born individuals of outstanding ability. 
And in some societies these contrasting principles 
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of succession served to maintain situations of un- 
resolved internal conflict. 

Relations between political units were of many 
different kinds. Hostility colored most such rela- 
tions over the long run, but it was usually tempered 
either by periods of general truce or by only indi- 
vidual kin-group feuding. Moreover, even between 
traditionally hostile tribes it was customary for 
women to be exchanged and goods to be bartered. 
Some of the intertribal circuits extended over hun- 
dreds of miles, and while some of the transactions 
were conducted without direct contact between the 
principals (i.e., “silent trade”), others—including 
the famous kula trade of southeast New Guinea— 
involved mass expeditions and elaborate cere- 
monies (Malinowski 1922), Another institution 
typical of many parts of island Oceania was that 
of the trade partnership—i.e., a pact between two 
friends or kinsmen from separate political units 
providing reciprocal visiting and bartering rights 
even in periods of intertribal conflict. 

Many societies in island Oceania were to some 
degree stratified, but the phenomenon was most 
highly institutionalized in Polynesia, notably in 
Hawaii, Tahiti, Samoa, and Tonga, where three 
or even four strata were distinguishable. In these 
societies class status derived almost wholly from 
birth and birth order, and for higher-ranking indi- 
viduals class endogamy was so prescriptive that 
there developed castelike common-interest upper 
classes which cut across political boundaries. Po- 
litical and ceremonial leadership were closely 
linked with class status, but ability sometimes out- 
weighed birth, resulting occasionally in the relega- 
tion of highest-ranking persons to positions of little 
more than ceremonial pre-eminence (Sahlins 1958; 
Goldman 1960). 

In view of the wide variety of cultural traditions 
and social structures found throughout island Oce- 
ania, it becomes next to impossible to generalize 
comprehensively about the behaviors of individuals 
in these societies. Individual life cycles, for exam- 
ple, were institutionalized in many different ways. 
In some societies the onset of puberty was marked 
by physical mutilation and community-wide ritual, 
in others it was virtually ignored. In some places 
the aged were revered and deferred to, in others 
they were socially devalued. Females were perhaps 
nowhere treated as chattel, but their social and 
ritual roles ranged from that of a magically pol- 
luted minor to that of a semidivine chieftainess. 
Even innovation received widely differing valua- 
tions, not only from society to society but within 
the same society as well. In some communities, 
for example, the invention of new graphic designs 
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was discouraged while the composing of new songs 
was honored. Or, craft techniques remained rigidly 
traditional, while the “discovery” of new religious 
doctrines or magical formulas was socially re- 
warded. In fact, perhaps the only generalization 
one can make about islanders as individuals (and 
this in a manner both imprecise and impression- 
istic) is that in nearly all available descriptions of 
them they stood out as individuals—as distinctive, 
at least partly autonomous persons, not as mere 
faceless units of this or that social aggregate. 


History of European contact 


Prior to the sixteenth century there may have 
been direct contacts between Oceania and Asian, 
or even American, high civilizations, although they 
were not enough to revolutionize native ways of 
life. But Magellan’s discovery of the Marianas 
Islands in 1521 ushered in a new era which is still 
going on and which is destined to transform most 
of the region’s native societies. 

During the four and a half centuries since 
Magellan’s voyage tens of thousands of Westerners 
(also Japanese, Chinese, and Indians) have visited 
or resided in Oceania—not to mention the millions 
now established in Australia, New Zealand, and 
Hawaii, and the additional hundreds of thousands 
who swept through the islands during World 
War 11. Many Oceanians have also visited the out- 
side world, but up to now their influences upon 
their own native communities have been minimal. 
With the exception of Australia, New Zealand, and 
Hawaii, where the process of Westernization has 
proceeded at a faster tempo, the history of culture 
contact in Oceania can be described in terms of 
five distinctive but overlapping phases. 

(1) The phase of exploration began with 
Magellan and is still going on in parts of New 
Guinea. By 1830 the consequences of these visita- 
tions from the West were well underway, in the 
shape of depopulation (mainly through introduced 
disease) and murderous warfare (with the help 
of firearms). 

(2) Whalers, traders, and missionaries com- 
menced their operations about 1780, continuing 
until about 1850. (Spanish Catholic missionaries 
were active long before 1780 but only in the 
Marianas.) Depopulation and political turmoil con- 
tinued during this phase and were accompanied 
by widespread collapse of indigenous religious 
institutions and of religion-sanctioned political 
structures. 

(3) Around 1860, planters, labor recruiters, 
and merchants initiated change consequent upon 


the removal or shifting about of large segments 
of the male population for long periods of virtually 
forced labor, the introduction of money and cash- 
crop economy, and the heightened desire for West- 
ern manufactured goods. 

(4) Foreign governments began to assert ad- 
ministrative control over island populations over 
a hundred years ago, but interference with native 
political structures—including total replacement— 
was most direct during the half century before 
World War u. This phase also witnessed an in- 
crease in the native population, mainly because of 
improved medical services and an increased flow 
of Westerners into parts of the region where min- 
eral deposits were located. 

(5) The events of World War 11 served not only 
to speed up kinds of change already in process, 
including urbanization and money-based economy, 
but to stimulate other changes as well. The post- 
war improvement in interisland communication 
and transport gave rise to several dramatic devel- 
opments. Locally inspired movements to weaken 
political ties with the overseas ruling metropolitan 
powers and to advocate strengthened interregional 
cultural ties are among these new developments, 
although they are not necessarily fundamental 
to change. 

Despite the homogenizing effects of these sev- 
eral but predominantly Western influences, the 
various Oceanian societies retain a large measure 
of local variation. None are at exactly the same 
stage of Westernization: for example, one can 
contrast industrialized Nauru with the New Guinea 
population, only now exchanging stone tools for 
those of steel. And no two native societies have 
experienced the same mixture of Western in- 
fluence: even in New Guinea, for example, a 
community near a large coconut plantation has 
adjusted very differently from one near a mine; 
and the Polynesians in French Tahiti have become 
quite different from their ethnic cousins in British 
Samoa. 


Social science research in Oceania 


Although there are increasingly pressing politi- 
cal reasons why the rest of the world should begin 
to know something about Papuans or Fijians on 
Samoans, our present concern is with Oceanias 
significance for social science in general—with the 
research opportunities it has provided for formu- 
lating and testing universally valid methods and 
theories, and with the uses that have been made 
of such opportunities. The reaction, for example, 
by the natives of Bikini to resettlement away from 


their radiation-polluted home island is of course 
poignantly interesting and of some relevance to 
international politics; but study of this situation 
would have had little value for social science if its 
procedures had not provided possibilities for test- 
ing social science methods and making innovations 
in these methods and if its findings were not widely 
applicable (Mason 1957). 

Oceania has offered social scientists a very wide 
variety of social and cultural systems, many of 
them so strikingly exotic as to require major ac- 
commodations in some aspects of Western-based 
social scientific thinking. In addition, even as late 
as a few decades ago, when trained social scientists 
began their study of this region, they were observ- 
ing the end products of centuries or millennia of 
isolation from the rest of the world and even 
largely from one another. And third, the relatively 
small sizes, sharp boundaries, and (perhaps con- 
sequently) internal cultural homogeneity of most 
of these societies made it possible and indeed in- 
evitable for individual observers to investigate the 
functional relationships of many domains of be- 
havior—not just technology or kinship or art, but 
all three in themselves and in relation to each 
other. 

Research into Oceanian ways of life began 
nearly two centuries ago, when men like Banks, 
Bligh, and the Forsters went beyond the mere re- 
cording of personal experiences and of native 
bizarreness to carry out more or less pointed in- 
quiries into native institutions. Moreover, the re- 
ports contributed by such men were empirically 
Significant to the beginnings of comparative soci- 
ology in Europe. For the next century and a quarter, 
as more and better descriptions of Oceanians’ ways 
of life came to be produced by missionaries, ad- 
ministrators, and other island residents, the pro- 
fessors back home were able to use these data to 
Support theories or to compile vast syntheses (for 
example, Morgan, Durkheim, Frazer, Freud). But 
it was not until 1898 that social scientists left their 
armchairs to confront their subjects in person. 

In that year the Cambridge anthropological ex- 
pedition to the Torres Strait islands (between 
northern Queensland and New Guinea) took place 
and included such men as Haddon, Rivers, and 
Seligman. It was during this expedition that Rivers 
developed his genealogical method for recording 
kinship data, which has subsequently been such 
an indispensable tool in social anthropological re- 
Search everywhere. Between this expedition and 
the outbreak of World War 1 amateur and more 
or less competent observers residing in the region 
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continued to produce ethnographic accounts which 
were used by scholars in their compilations, but 
field research by trained social scientists was car- 
ried out by only a handful, notably Malinowski, 
Radcliffe-Brown, Thurnwald, Sarasin, Reche, Wil- 
liamson, Poech, Haddon, and Rivers. It is probably 
fair to say that only the first three (and Rivers, 
to a lesser extent) produced publications from their 
Oceanian data that have been influential in the 
subsequent development of general social science 
theory and method. 

Undoubtedly the outstanding landmark in so- 
cial science research in Oceania was the work of 
Malinowski, whose monographs on the Trobriand 
Islanders have never been surpassed in ethno- 
graphic artistry. His studies ushered in a new 
world-wide approach to anthropological research 
that has come to be known as functionalism. 
Radcliffe-Brown drew upon his field experiences in 
Australia (and elsewhere) to produce essays that 
have led him to be identified as a cofounder of 
“functional” anthropology, although he himself 
disavowed the label. Through their teaching and 
writings these two men virtually dominated social 
anthropology throughout the interwar period; and 
their students, and students’ students, still hold 
most of the important teaching positions through- 
out the British Commonwealth. 

In the interwar period more and more profes- 
sionally trained social scientists went to Oceania 
to carry out sociologically and psychologically ori- 
ented research, and after World War 11 the influx 
reached flood proportions and is not now visibly 
diminishing. Moreover, these research activities 
have been aided by a number of journals, mono- 
graph series, museums, libraries, and university 
departments devoted exclusively or at least pri- 
marily to Oceania. The rich ethnographic data 
resulting from field research in Oceania have been 
drawn on heavily by many other social scientists 
for inspiration and for information respecting the 
range and variation of human social behavior. 

The most influential innovation in social science 
research strategy and methodology to come out 
of Oceania was Malinowski’s experience of long 
residence in a native community and active par- 
ticipation in its activities. He worked exclusively 
in the native vernacular, focused his attention 
upon the prosaic as well as the dramatic aspects 
of native life, and collected (and published) masses 
of documentary evidence to support and enrich 
his generalizations. It is somewhat ironic that 
Malinowski’s style of field research has been more 
faithfully followed in Africa than in Oceania, with 
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the outstanding exception of Raymond Firth’s work 
in Tikopia (Firth 1936; 1939; 1940). 

Malinowski aimed at more or less total coverage 
of his native subjects’ way of life, and for some 
time after him this remained the objective of most 
social scientists working in the region. But this 
goal has increasingly given way to a narrower 
focus upon special aspects of native life, including 
economics, law, religion, ecology, acculturation, 
and education. 

Malinowski’s example of one-man field work has 
tended to prevail, although field research is com- 
ing to be conducted within the framework of 
larger-scale programs, such as the Coordinated In- 
vestigation of Micronesian Anthropology, the Tri- 
Institutional Pacific Program, the long-range New 
Guinea research program of the Australian Na- 
tional University, the University of Oregon’s study 
of resettled populations, the University of Wash- 
ington’s study of cultural and physical evolution in 
New Guinea, the Harvard study of social change 
in the Society Islands, etc. In this connection, at- 
tention should be called to the research activities 
of such organizations as the South Pacific Com- 
mission (an international body designed to im- 
prove the welfare of Pacific islanders) and the 
French government's Office de la Recherche Scien- 
tifique et Technique Outre Mer, which though 
aimed primarily at the solution of practical prob- 
lems have contributed useful data on some rapidly 
changing aspects of Oceanian ways of life. 

Turning now to the substantive contributions 
to general social science theory that have come out 
of research in Oceania—contributions in addition 
to the enrichment of the world’s ethnographic 
corpus—one again begins with the writing of Mali- 
nowski, who audaciously—although not always 
justifiably—challenged some of the basic assump- 
tions of economics, comparative law, semantics, 
and psychoanalysis, and who in addition popu- 
larized the functional viewpoint already mentioned 
(Firth 1957). For Malinowski functionalism con- 
sisted mainly of a proposition to the effect that all 
of a society's customs are mutually interdependent 
and an analytical principle based on viewing insti- 
tutions as instruments for satisfying basic human 
needs, The proposition has subsequently become 
an almost universally accepted canon among an- 
thropologists, but not much use has been found 
for the analytical principle. Radcliffe-Brown’s con- 
tributions to general social science theory have 
been mainly in the field of comparative sociology 
(see Radcliffe-Brown 1922), and although his in- 
terests were somewhat narrower than Malinowski's 
he has left a comparably deep imprint. Perhaps the 


most successful implementations in Oceanian re- 
search of the general methods and theories of 
Malinowski and Radcliffe-Brown have been done, 
respectively, by Firth (1936) and Warner (1937), 

It now remains to list some other investigations 
in Oceania which, in my opinion, have served most 
to enrich social science either by proposing or test- 
ing theory or by describing novel or comparatively 
important institutions. 

Major contributions to the sociology of kinship 
are to be found in the writings of Firth (1936), 
Warner (1937), Malinowski (1929), Radcliffe- 
Brown (1922), Elkin (1938), Mead (1934), R. M. 
Berndt and C. H. Berndt (1951), Meggitt (1962), 
and Goodenough (1951), Only from Africa have 
come works of comparable quality, Government 
and social control of relatively un-Westernized 
societies are usefully documented in the works of 
Malinowski (1926), Hogbin (1934), Guiart (1963), 
Oliver (1955), Pospisil (1958), and Berndt (1962). 
Useful studies of Oceanian economies are those 
by Malinowski (1922), Bell (1953), Salisbury 
(1962), and, especially, Firth (1939; 1959). The 
published works of Firth provide probably the full- 
est and most sophisticated treatment available on 
the economics of “primitive” societies. 

Among the most useful studies of the social 
contexts of belief and ritual are those of Firth 
(1940), Fortune (1932; 1935), Malinowski (1935), 
Warner (1937), Guiart (1951), and Williams 
(1940), In this connection should be mentioned 
Bateson’s stimulating, and in some respects novel, 
multifaceted analysis of ritual behavior (1936), 
which deserves far wider attention than it has thus 
far received. 

Many richly illustrated works have been pub- 
lished concerning the widely varied and extraor- 
dinarily elaborated graphic art tradition of Oce- 
ania, but only a few seek to relate these to social 
behavior, mainly those of Elkin et al. (1950), 
Mountford (1956), Firth (1936), and Guiart 
(1963b). 

For all the relevance of the ecological approach 
to Oceania, only a few investigators have dealt 
intensively and theoretically with the ecological 
aspects of Oceanian society: for example, Fosberg 
(1963), Sahlins (1958; 1962), Warner (1937), 
Mead (1938-1949), Thompson (1949), Firth 
(1959), and Raulet (1960). Nor has much ad- 
vantage been taken of the region’s inviting pros- 
pects for exploring the relationships between bi- 
ology and society (Oliver & Howells 1957; Oliver 
1954; Borrie et al. 1957), although a project along 
these lines is now underway at the University of 
Washington (Watson 1963). 


Every reliable description of a contemporary 
Oceanian society will of necessity touch on the 
processes and results of Westernization; in addition 
there are numerous published reports directly fo- 
cused upon such events, Such studies range from 
comprehensive, multitopic community or “tribal” 
ethnographies, including those of Firth (1959), 
Hogbin (1939; 1951), Mead (1956), Sahlins (1962), 
and Barnett (1949), to narrowly delimited con- 
cern with single institutions. Among the studies 
of changing economic systems those of Belshaw 
(1954; 1955) are outstanding; and Salisbury’s 
quantifiably documented description of the intro- 
duction of Western goods into a stone-age New 
Guinea society is unique (1962). Studies of chang- 
ing leadership are also well represented in the 
works of Spoehr (1949), Force (1960), Keesing 
and Keesing (1956), Guiart (1952), and Oliver 
(1951). 

Studies of urbanization are also beginning to 
proliferate, as are those of resettlement (Belshaw 
1957; Spoehr 1963). But the phenomenon of 
change that has excited most interest among social 
scientists in Oceania has been nativistic move- 
ments of various kinds, especially the so-called 
“cargo cults” that have proliferated with such en- 
ergy and speed throughout many parts of New 
Guinea (Worsley 1957; Mead 1956). 

And finally, in the subdiscipline of psychological 
anthropology, it may be justifiably claimed that 
ethnically oriented field research actually began in 
Oceania, and that the region has provided the in- 
spiration and the testing grounds for some of the 
subdiscipline’s most gifted and productive investi- 
gators. In Gladwin’s excellent survey of the subject 
he notes that practically all explicit anthropological 
contributions to the study of cognitive process have 
stemmed from Oceania; two major approaches to 
a genuinely anthropological (i.e., cultural) theory 
of personality development have been developed 
by Mead, Beaglehole, and their colleagues (Gladwin 
1961), 

In conclusion, although Oceania is everywhere 
undergoing change, and at varied but generally 
accelerated rates, the region continues to offer un- 
excelled research opportunities in social science. 
The terms of reference for such research are 
changing, but the phenomena are if anything more 
dynamic and more directly relevant to phenomena 
elsewhere. It would be surprising, to this writer 
at least, if any future investigator should equal 
e marks set by Malinowski, Radcliffe-Brown, 
ha or Mead in methodological pioneering or in 
ie eoretical achievement. But the very size of the 

gions of researchers now engaged in the region 
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will ensure rich returns for social science, includ- 
ing a measure of data reliability not always char- 
acteristic of work done in the days of sporadic and 
solitary research. 

Douctas L. OLIVER 


[Other relevant material may be found in COMPONEN- 
TIAL ANALYSIS; EXCHANGE AND DISPLAY; and the 
biographies of CODRINGTON; HADDON; MALINOWSKI; 
RADCLIFFE-BROWN; RIVERS; THURNWALD.] 
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OCKHAM, WILLIAM OF 


William of Ockham, political theorist and phi- 
losopher of science, was born about 1280, probably 
at Ockham, a village in the county of Surrey, near 
London, He.entered the Franciscan order, study- 
ing and lecturing at Oxford University from about 
1309 to 1323. In 1324 he was summoned to Avi- 
gnon by Pope John xxi to answer charges of having 
expounded heretical doctrines. It appears, however, 
that the commission appointed to investigate the 
charges against him reached no final agreement 
concerning the heretical character of his teachings. 

Ockham was still in detention in Avignon in 
1327, the year that Michael of Cesena, the general 
of the Franciscan order, arrived there to answer 
for his attacks upon the papal position on evan- 
gelical poverty. Ockham became involved in the 
Issue, siding with his general against the pope. In 
May 1328, evidently fearing the worst, Ockham, 
along with Michael and several others, fled for pro- 
tection to the court of the pope’s political enemy, 
Louis of Bavaria, and the pope promptly excom- 
municated them. Ockham retaliated with the pen: 
during the next twenty-odd years he directed a 
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series of bitter polemics at successive popes and 
their exalted claims of power. 

With the death of Louis of Bavaria in 1347, Ock- 
ham’s position became hopeless, and he appears to 
have taken steps to reconcile himself with the 
church. A formula of submission was drawn up, 
but it is not known whether Ockham actually 
signed it. He died in Munich about 1349, probably 
a victim of the Black Death. 

Political thought. Despite their polemical tone, 
Ockham’s political writings are essentially con- 
servative in outlook. In general he favored moder- 
ate opinions in regard to the hotly contested polit- 
ical issues of his time. He appears to have been 
interested not in political theory as such but rather 
in the special problem of the relation between 
church and state. Ultimately, his position, which 
remained rooted in the traditionally established 
separation between the two powers, contributed 
little to the solution of the knotty problem of 
precise definition. 

Specifically, Ockham maintained that the spir- 
itual power is autonomous because of its divine 
origin. Within its own realm it retains absolute 
independence, subject only to divine law. Neither 
the emperor, then, nor any other representative of 
the secular arm, has the right to interfere in 
church affairs, except in the instance of some evi- 
dent breach of divine law—as, for example, when 
a heretic occupies the papal seat. 

Similarly, the secular power is derived from God 
through the consent of the people. It too retains 
absolute autonomy within its proper sphere, sub- 
ject only to natural law. Neither the church, then, 
nor the people, can divest the emperor of his 
power, except in the instance of an evident breach 
of natural law—as, for example, when he proves 
dangerous to the general welfare. 

Philosophy of science. Ockham was enormously 
influential in shaping and giving impetus to the 
empirical trend which came to dominate thought 
in the late Middle Ages. Science, for Ockham, con- 
sists in a body of statements about natural entities 
and occasions. In all cases, the ultimate ground for 
the truth of a stated fact is the actual being of the 
things about which the fact is true. The scientific 
proposition, that is, is itself related to matters of 
physical fact as sign is to significatum; hence, it is 
objective circumstance alone that lends verity to a 
scientific statement. Truth, then, and falsity, when 
predicated of a proposition, mean that there obtains 
or does not obtain agreement between the proposi- 
tion as formulated and the fact as given. 

Clearly, in the light of the possibility of em- 
pirical verification as thus envisioned by Ockham, 


270 ODUM, HOWARD W. 


there is a great disparity between such statements 
as “Socrates is walking” and “Time is composed of 
instants.” In the former case, verification is rela- 
tively simple. In the latter, however, verification is 
not so simple: where and how does one seek out 
and encounter “time”? How does one demonstrate 
that its structural components are “instants”? Still, 
if Ockham’s notion of the relation obtaining be- 
tween truth, scientific assertion, and empirical fact 
is to avoid a breakdown, such verification must, in 
some way, be possible. It was with this problem in 
mind that Ockham noted the distinction between 
absolute terms (nomina absoluta) and connotative 
terms (nomina connotativa). 

Absolute terms, for Ockham, have real defini- 
tions,. while connotative terms are nominally de- 
fined. Thus, when the elements entering into the 
definition of a term signify only the individual 
entities for which the defined term can stand, the 
term is absolute. When, on the other hand, the ele- 
ments entering into the definition of a term do not 
all signify the same individual entities for which 
the definiens is a sign, the term is connotative. 

As Ockham saw it, the physical universe admits 
of but two varieties of actual existent: substance 
and quality, Hence, there are but two kinds of 
absolute terms: terms properly ascribed to the cate- 
gory of substance (concrete absolute terms) and 
terms properly ordered under the category of 
quality (abstract absolute terms). Neither class of 
actual existent, however, is ever experienced in 
abstraction from contingent circumstance. Sub- 
stances, that is, are never apprehended per se apart 
from their accidental determinations, any more 
than qualities are ever experienced apart from 
change; and since the written, spoken, or conceived 
expression of these contingent circumstances is in- 
variably couched in connotative terms, it is the 
connotative term which is first in the order of 
“coming-to-know.” But all such terms, according 
to Ockham, are susceptible of resolution into func- 
tions of absolute terms. 

The significance of the distinction and ultimate 
relationship between absolute and connotative 
terms for Ockham’s theory of empirical verification 
is quite clear. Unless there is some way of exhibit- 
ing connotative terms as functions of absolute 
terms whose significata, in turn, can be sought out 
and confronted in the physical realm of substances 
and qualities, definition as well as demonstration 
would involve circularity, or infinite regress, and 
the possibility of empirical verification of scientific 
statements as conceived by Ockham would be 
vitiated. 

In sum, Ockham’s approach provides the philo- 
sophic basis for the formulation of two regulative 


questions to which he would submit all physical 
descriptions, laws, and hypotheses: (1) What do 
the terms that we use in these physical explana- 
tions actually mean? and (2) How can we be 
assured of the truth of that which is asserted? In 
the last analysis, Ockham allowed as meaningful 
only those assertions that are reducible to state- 
ments whose elements designate observable en- 
tities, and as true only those that are satisfied by 
an existing state of affairs. 

HERMAN SHAPIRO 


[Other relevant material may be found in ScrENTIFIC 
EXPLANATION. ] 
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Howard W. Odum (1884-1954), American 50- 
ciologist, was born in Bethlehem in rural northern 
Georgia. He attended Emory University in Atlantā, 
Georgia, and the University of Mississippi and then 
continued his education in the North. At Clark 
University, he studied under G. Stanley Hall, ob- 
taining a PH.D. there in 1909; and at Columbia 
University, where he studied under Franklin H. 
Giddings, he received a second PH.D. in 1910. Im- 
portant influences on his work were to come from 


Wundt, Sumner, Spengler, Geddes, and Sauer. He 
returned to the South to teach, first at the Uni- 
versity of Georgia and then at Emory University. 
After 1920 he taught at the University of North 
Carolina, where he remained the rest of his life. 

Odum’s entire career was devoted to the develop- 
ment of a social science that would change the 
nation’s image of the South and the South’s image 
of itself, At North Carolina he established the de- 
partment of sociology, the School of Public Wel- 
fare, the Institute for Research in Social Science, 
the department of planning, and the journal Social 
Forces, As a sociologist he cultivated four fields, 
all related to one another as well as to his over-all 
aim. They were race relations, regional studies, 
social planning, and folk sociology. 

In his first important work, on the social life 
and folk culture of the Negro (1910), Odum made 
use of both sociological and literary materials. His 
Rainbow Round My Shoulder (1928), describing a 
black Ulysses, a wandering roustabout, is a work 
of literary artistry as well as of perceptive social 
analysis. Odum continued to write about the Negro 
for forty-five years, and the combination of his 
humane sympathy and social-scientific point of 
view did much to improve race relations. He never 
ceased to assert that American democratic prin- 
ciples of equal opportunity and justice apply to 
the Negro and that the Negro is capable of par- 
ticipating in the society on a basis of equality. 

Odum was keenly aware of the extent to which 
the South lagged behind the rest of the nation, 
culturally, socially, and economically. In a full- 
scale regional analysis, Southern Regions of the 
United States (1936), he used cultural-statistical 
indices to document the immaturity of Southern 
development. He hoped that such evidence would 
induce the South to try to catch up with the rest 
of the nation. He sought new standards for the 
region not only in race relations but also in public 
welfare, higher education, regional planning, and 
penal reform. Indeed, he saw the South as adher- 
ing to an obsolete sectionalism and urged that this 
Sectionalism be replaced by regionalism so that 
ie Turner (1932) called a nation of conflict- 
ng sections would become a regional—national 
equilibrium. Odum favored cultural diversity and 
Tease essays showed his delight in regional 
Ta rary movements. He stressed the importance 
= State, regional, and national planning (Folk, 

egion, and Society .. . pp. 413-426). 
pees vid system of folk sociology was left incom- 
the es death. He characterized the folk as “a 
isis rsa! constant in the world of variables,” “sur- 
P Pa es basic elements of new cultures; remnants 

» and asserted that “when old civilizations, 
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such as that of Greece and Rome, pass away new 
cultures arise from the folk” (Folk, Region, and 
Society . . . p. 224). He did not, however, idealize 
the folk; in fact, he regarded Hitlerism, which he 
abhorred, as basically a folk movement. His pri- 
mary concern was to understand the process of 
change from folk culture to mature state civiliza- 
tion, from a society characterized by folkways to 
a transitional one characterized by “technicways” 
and finally to one in which “stateways” predominate. 

In the folk society, Odum believed, the principal 
factor in social change is the slow growth of folk- 
ways, mores, institutions, and moral order. The 
transitional technicways are the habits and cus- 
toms that develop as adjustments to the innova- 
tions of science and technology. (They are not the 
scientific techniques themselves.) Technicways 
rapidly override existing folkways and morality; 
they change institutions and require massive 
changes in law. State civilization, finally, is char- 
acterized by intellectualism and cultural special- 
ization, by centralization of power, and, in some 
instances, by totalitarianism. State society is called 
upon to develop stateways as sanctions for the 
new technicways. In some instances it succeeds 
in developing such stateways; in other instances 
it fails. Thus, in the area of business, as the folk- 
ways of simple partnership fell before the technic- 
ways of large-scale organization, these technicways 
were indeed regularized by the stateways we now 
call corporation law or government control of busi- 
ness. But the technicways produced by such a new 
technique as contraception, namely, family plan- 
ning and premarital sexual freedom, have not 
generally been regularized in appropriate state- 
ways. Again, nuclear fission, a complex of scientific 
techniques, furnishes new technicways of war, but 
no stateways of international sanction and control. 
To Odum, there was an equal danger that social 
change could lead to inadequate controls or to 
totalitarian power. The state society that arises as 
a transitional society attempts to incorporate ac- 
cumulating technicways into an ever-growing body 
of stateways may ultimately repress the sponta- 
neity, vitality, and freedom of the folk. 


Rupert B. VANCE 


[See also REGION.] 
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Public office may be incompetently filled or its 
duties inefficiently performed. This indeed is often 
the case, but it is not the same as misuse of office. 
Misuse implies the doing of something improper; 
and the essence of the impropriety is the replace- 
ment of a public motive by a private one. When the 
satisfaction sought is material gain, the misuse is 
usually termed “corruption.” But private satisfac- 
tion may take various other forms, and the term 
“corruption” is sometimes used to include them as 
well. Further, the beneficiaries may extend beyond 
oneself to one’s family, one’s friends, one’s com- 
munity. The holder of a public office is said to 
have misused his position when, in pursuit of 
private satisfaction as distinguished from the pub- 
lic interest, he has done something which he ought 
not to have done or refrained from doing some- 
thing which he ought to have done. 

In the narrower use of the word as a criminal 
offense on the part of a public servant, corruption 
needs to be more precisely defined. For such a 
definition we may turn to the Indian Penal Code 
(sec. 161). According to the code, a person is 
guilty of corruption who 


being or expecting to be a public servant, accepts, or 
obtains, or agrees to accept, or attempts to obtain from 
any person, for himself or for any other person, any 
gratification whatever, other than legal remuneration, 
as a motive or reward for doing or forbearing to do 
any official act or for showing or forbearing to show, 
in the exercise of his official functions, favour or dis- 
favour to any person or for rendering or attempting 


to render any service or disservice to any person, with 
the Central or any State Government or Parliament or 
the Legislature of any State or with any public servant 
as such... . 


However, we are concerned with corruption and 
misuse of office in a somewhat wider context. For 
example, members of Parliament are not, strictly 
speaking, public servants; nevertheless they do 
hold public office and may misuse their office, 
Honorary magistrates are public servanis even 
though, by definition, they get no remuneration for 
their work. Obviously, there is nothing to prevent 
an honorary magistrate from being as corrupt as 
a stipendiary magistrate. For instance, the story 
is told of a retired government officer who, on the 
grounds that he could not make both ends meet 
on his pension, prayed that he be given the op- 
portunity of serving the state as an honorary 
magistrate! So long as obligations, duties, or func- 
tions of a public nature are attached to any posi- 
tion, salaried or honorary, big or small, the misuse 
of that position for private ends remains a possi- 
bility. Nor need corruption be confined to public 
office. In an extended sense, it may be said to be 
coextensive with all dishonest discharge of public 
obligations. Thus, an authorized dealer of rationed 
articles who sells at a price higher than the con- 
trolled price is guilty of corruption in the discharge 
of a delegated obligation. Indeed, in this larger 
sense, corruption has been described as “the ac- 
quisition, exercise and delegation of authority 
according to self interest rather than merit” 
(Wraith & Simpkins 1963, p. 56). 

An illustration of corruption at the stage of 
acquisition or attempted acquisition of authority 
is to be found in what takes place before and 
during elections. The conduct of public office is 
tainted at the very source when those who seek 
to be members of legislatures, and perhaps min- 
isters, obtain from the rich large donations for 
themselves or their parties and proceed to bribe 
the electors in order to gain votes. In commenting 
on standards of honesty in public administration 
in democratic countries, several writers point to 
the crucial importance of the norms set and ob- 
served during elections to legislative assemblies, 
senates, and parliaments (Gorwala 1953, p. 23; 
Bolles 1960, p. 183; Wraith & Simpkins 1963, 
chapter 2, part 2). 

Historical background, Corruption is at least 
as old as the pyramids. Says Kipling in “A General 
Summary” of the history of corruption: 


Who shall doubt “the secret hid 
Under Cheops’ pyramid” 

Was that the contractor did 
Cheops out of several millions? 


This perhaps will be dismissed as fanciful, but a 
concern with corruption is shown in the Egyptian 
Book of the Dead in the form of a supplication to 
Osiris: “I knew no wrong. I did no evil thing... . 
I did not diminish the grain measure. I did not 
diminish the span. I did not diminish the land 
measure. I did not load the weight of the balances. 
I did not deflect the index of the scales” (quoted 
in Goodman 1963, p. 81). This may be taken as 
indicative of what was in fact taking place fairly 
extensively in Egypt at that time. 

Seldom has administration—and the civil service 
—been developed to so high a degree of sophisti- 
cation as in ancient China. According to Con- 
fucius, “When a country is in chaos, it is shame 
to be rich and an official”; and it is on record that 
he thoroughly condemned Yang Ho, the powerful 
but corrupt official of Lu. Mencius, a follower of 
Confucius, said to King Hui of Liang, “If a king 
says ‘what will profit my kingdom?’ the high offi- 
cials will say, ‘what will profit our families?’ and 
the lower officials and commoners will say ‘what 
will profit ourselves?’ . . . Let your Majesty speak 
only of humanity and righteousness.” 

In ancient India, Kautilya, writing on political 

science, administration, and economics, said in his 
Arthasdstra: 
Just as it is impossible not to taste the honey or the 
poison that finds itself at the tip of the tongue, so it 
is impossible for a government servant not to eat up at 
least a bit of the king’s revenue. Just as fish moving 
under water cannot possibly be found out either as 
drinking or not drinking water, so government servants 
employed in the government work cannot be found 
out while taking money for themselves. 


The city-states of Greece, which in some respects 
represent the highest levels attained by European 
civilization, had one feature in common with the 
most primitive tribal organizations: the smallness 
of the unit of government and the consequent 
direct contact between the ruler and the ruled. In 
these circumstances, there may have been occa- 
Sional abuse of power through tyranny or open 
@ggrandizement, but little or no misuse of it in 
the sense of corruption and the subterfuge that 
Sea accompanies corruption. Nevertheless, by 
eto fifth century, corruption assumed fairly 
: Proportions in the Greece that had emerged 
eon the older city-states. 
ms shite aed pageantry, spoils, and wealth helped 
aint uce corruption ona large scale into Roman 
by ne I The continual search for revenues 
tan y and church, the weakness of the far-flung 
N y Empire, social and economic dispari- 
FEV nic helped concentrate power, wealth, and 

S in the privileged few—all these provided the 
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basis and the opportunity for extensive corruption 
in medieval Europe. The sale of judicial offices 
was restored in France in the fifteenth century. 
Writing at the turn of the sixteenth century, Bacon 
(in an essay on public office) listed corruption 
as one of the four vices of authority and exhorted 
those who held public office to “avoid not only the 
Fault but the Suspicion.” (Some years later he was 
condemned before the House of Lords for bribery 
and was deprived of office.) Philip Stubbes, the 
author of Anatomie of Abuses (1583), described 
how corruption had entered the very citadels of 
learning. One had to bribe one’s way into a college 
in Oxford. “Except one be able to give the Regent 
or Provost of a House a piece of money, ten pounds, 
twenty pounds, yea a hundred pounds, a yoke of 
fat oxen, a couple of fine geldings or the like, 
though he be never so toward a youth, nor have 
never so much need of maintenance, yet he come 
not there I warrant you.” Some eighty years later, 
Samuel Pepys was adjured by his patron, the earl 
of Sandwich, that “it was not the salary of any 
place that did make a man rich, but the opportunity 
of getting money while he is in the place.” 

As Parliament grew in importance during the 
seventeenth century, bribery and corruption be- 
came an organized feature of British parliamen- 
tary elections. This continued right through the 
eighteenth century, so much so that Gibbon was 
moved to speak of corruption as “the most infal- 
lible symptom of constitutional liberty.” During the 
eighteenth century the evil had been accentuated 
by the competition for parliamentary seats between 
the old landed aristocracy and the new nabobs 
who came back with the spoils of colonial rule, 
mostly from India, “Many of the merchants who 
had gone out to India from Manchester found 
bribery to be just as useful in the Commons as in 
Calcutta, and sometimes they combined to pur- 
chase a borough” (Wraith & Simpkins 1963, p. 67). 

There is little doubt that colonialism and im- 
perial exploitation contributed to the further de- 
terioration of standards of public conduct in Brit- 
ain, The greatest of the empire builders had been 
among those who had “shaken the pagoda tree” 
most violently: Clive and Warren Hastings, for 
example, although Hastings was finally exonerated 
and Clive went on record as saying, “By God, Mr. 
Chairman, at this moment I stand astonished at 
my own moderation.” It is necessary to guard 
against the error of blaming these men for stand- 
ards which were those of their age and environ- 
ment. But in some instances they fell even below 
those standards, as Hastings did, for example, in 
the cases of Chait Singh of Benares and the begums 
of Oudh (Oxford History of India 1919). 
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What is remarkable, however, is that simulta- 
neously with this devaluation of standards there 
were elements within Britain which would not 
allow the lower values to be taken for granted. 
Whatever their political motivations, it was these 
elements which ruthlessly insisted on charges being 
formulated, allegations being brought out into the 
open, and judgment passed by the highest tri- 
bunals of the day, 

In the last analysis, it is only the conscience of 
a country, a live public conscience, that can up- 
hold moral standards in the conduct of the busi- 
ness of the country—whether it be political, legis- 
lative, judicial, or executive. What has to be noted 
in this brief delineation of the historical back- 
ground, especially of Britain, is that such a moral 
awareness can grow, even in periods of consider- 
able corruption and malpractice, until a phase is 
reached when the public conscience appears able 
to operate effectively in the spheres of politics and 
administration. The remarkable fact about Britain 
is that there was—and perhaps there still is—such 
a phase which began before the middle of the 
nineteenth century. This fact is of special impor- 
tance when we attempt to analyze a little more 
closely the circumstances in which corruption 
flourishes or dwindles in public life. 

Significance of corruption. It is sometimes 
urged that it is not really possible to combat cor- 
ruption; that since it will neither diminish nor 
disappear, it would be much better to learn to live 
with it. It is pointed out that even when high stand- 
ards began to assert themselves in Britain, a special 
occurrence such as a war—with all the implications 
of government contracts for military supplies—was 
enough to upset the balance, as witness the Boer 
War scandals. Moreover, according to this point of 
view, corruption is ubiquitous. Japan, for example, 
had its naval scandals of 1914. In the United States 
the whole way of life that is congenial to private 
competition is also conducive to public corruption: 
the society is affluent, the values are acquisitive, 
and the operative social forces—in administration 
as in politics—are those which flow from aggres- 
sive and ostentatious private wealth. The United 
States has had some great men as presidents and 
many good men at all levels of government; but 
more typical of the society itself are those who 
figured in the whisky ring scandal during Pres- 
ident Grant's tenure, the Donald Dawson episode 
in President Truman's time, and the Sherman 
Adams interlude during the regime of President 
Eisenhower. At the opposite pole, economically, 
are the developing countries: India, Pakistan, In- 
donesia, and Nigeria, for example. All the data 


available point to there being a large element of 
corruption in the administration of these countries. 
Is the combating of corruption, then, either prac- 
ticable or worthwhile? 

An answer may be attempted from two different 
angles; first, the standpoint of latter-day British 
experience, and, second, the significance of corrup- 
tion for developing countries. There is little doubt 
that from about the middle of the nineteenth cen- 
tury, Britain managed to achieve, and on the whole 
to maintain, a level of integrity in public life, pol- 
itics, and administration which has rarely been 
witnessed in the history of the world. Perhaps we 
can regard this period as commencing from the 
civil reforms of 1854, although substantial legis- 
lative reforms—of elections, for example—had 
started as early as the Reform Act of 1832. But 
if the British example proves anything, it is that 
statutes by themselves cannot purify public life; 
that, although legislative reform can support, it 
must in turn be supported by various other opera- 
tive elements in the body politic; and that, while 
striving to reflect the highest in the public con- 
science, legislation cannot be much in advance of 
public conscience as a whole. The evolution of 
responsible political parties; the growth of com- 
munications; the spread of education; the emer- 
gence of an able and independent civil service to 
which recruitment was made entirely on merit 
(curiously, it was the Indian Civil Service that 
provided the model); national prosperity which, 
since it was based on competitive trade and indus- 
try, postulated a certain standard of national 
honesty; the development of professional integrity; 
the love of voluntary associations and the propaga- 
tion through such associations of the standards of 
professional integrity—all contributed to Britain's 
high degree of freedom from corruption. 

Considering the history of Britain, the degree of 
her freedom from corruption is a hopeful and sig- 
nificant example for developing countries, espe 
cially those whose economic progress is politically 
based on a democratic polity. Developing countries 
literally cannot afford corruption. It would be self- 
defeating for them to adopt the attitude that cor- 
ruption is inevitable and that they must therefore 
learn politically, economically, and administratively 
to reconcile themselves to an existence which is 
interwoven with corruption, For corruption saps 
development and makes nonsense of planning. This 
is especially true of those plans which are intended 
for the benefit of the weak and underprivileged- 
Corruption weights the already overweighted scale 
in favor of the rich and the privileged. Perhaps 
most serious of all is the frustration, the loss of self- 


respect, the insidious growth of cynicism it causes 
all around, particularly among young administra- 
tors—often the very ones responsible for develop- 
ment. It “vitiates policy, weakens administration 
and undermines public confidence” (Gorwala 1953, 
p. 14). Implementation is the essence of economic 
planning, and corruption has an almost fatal effect 
on the efficiency of execution. As an example, con- 
sider “speed money,” the system of giving bribes 
to expedite orders. According to the Indian Com- 
mittee on the Prevention of Corruption (1964) 
speed money, in addition to being an objectionable 
practice, actually aggravated delays. The govern- 
ments of the developing countries can no more af- 
ford to ignore corruption than to be apathetic to the 
poverty of their people. 

The importance of environment. Corruption is 
not an isolated phenomenon. In the first place, it is 
fostered by certain kinds of social environments, 
including in that term a whole variety of political, 
social, economic, administrative, and ethical factors, 
such as the size, form, and functions of the state; 
the degree of power the state wields over the citizen 
and his daily life; the extent of social divisions and 
socioeconomic disparities; the level and spread of 
education; the prevalent values and loyalties; the 
efficiency of the administration; and the traditions 
of the civil service. Second, it is allied to a number 
of other forms of misuse of official power and 
tends to occur along with them. 

In the same category as corruption are those 
forms of misuse of authority which result in some 
undue benefit to a favored individual (favoritism), 
telative (nepotism), coreligionist (sectarianism), 
member of the same clan or community (commu- 
nalism), and so forth. Patronage and influence go 
hand in hand with corruption, and all three indi- 
cate a very poorly developed social conscience, for 
Which personal profit and private loyalty take prece- 
dence over public duty. Quite often, the private 
loyalties (to clan or tribe, for example) must once 
have served a useful purpose—had “survival value” 
—in a different social context and under a different 
AURE organization, This is particularly so in de- 
recite countries, But these countries must realize 
ahs ey cannot both cling to the narrower loyal- 
ont tacts age and at the same time make a suc- 
eset emocratic planning in a modern state. The 
ie risa of Britain shows that older loyalties can 
hada ea place to new social values: “Britain has 
thatit aa years to get accustomed to the idea 
is thas apical to advance the claims of rela- 
p.33) “Lise posts” (Wraith & Simpkins 1963, 
in ie a eee help, but only if they are viable 
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held widely, strongly, and persistently enough to 
have a significant influence on political and admin- 
istrative action. Waves of moral enthusiasm sweep 
a country, sometimes in conjunction with a politi- 
cal revolution, but fail to have a permanent effect. 
Indeed, when the initial momentum is lost, more 
harm than good may well have resulted. An atmos- 
phere in which high ideals are accorded only lip 
service is among the most congenial for the growth 
of corruption; it helps disseminate exactly those 
double standards which enable the bribetaker to 
maintain his self-respect while presenting one face 
to the world and another to his client. Again, an 
ideological revolution accompanied by violence may 
have a sudden and drastic effect on corruption. But, 
since even fear cannot last forever, it remains to 
be seen how long the results of such a reformation 
will persist in the societies concerned. In the last 
analysis, it is not the sanctions of the state, but 
the disciplined transformation of society to new 
ends and purposes, that can have a really lasting 
effect on corruption. 

Among the best environments for corruption is 
a prosperous society which cherishes affluence as 
an end, competition as a means, and ostentation 
as a value. For those who are most powerful in such 
a society, the machinery of the state has signifi- 
cance only insofar as it can help to further their 
own aims, and there is a constant attempt to manip- 
ulate the machinery. The ethics of private business 
make persistent inroads on the ethics of govern- 
ment: gifts are given, entertainment offered, and 
contacts established. In such a society small busi- 
ness is at a disadvantage, because it has less access 
to the seats of power. In the developing countries 
this disadvantage is aggravated by the dependence 
of business on government for raw materials, cred- 
it, export licenses, etc. Since many countries launch 
on development for sheer survival, the state takes 
on powers which are inevitably large and dispenses 
resources which are inevitably scarce. That is how 
development (like war) tends to promote what, in 
terms of its successful prosecution, it can least af- 
ford: corruption. 

Inhibiting corruption. Drawing from the ex- 
perience of different countries, it is possible to 
make a list of those factors, and especially of those 
measures—legislative, administrative, and other— 
which may be regarded as corrective of corruption. 
They may be classified as general, preventive, and 
punitive. 

The most potent general remedy for corruption 
is public opinion. Nothing can overcome corrup- 
tion if the public is prepared to tolerate it. On the 
other hand, no government can afford to be compla- 
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cent about corruption if the public is not prepared 
to tolerate it. In this, the press has a most impor- 
tant part to play, both as the vigilant guardian of 
the public interest and as the vigorous exponent of 
public views. 

Elections can be a primary source of corruption. 
They cannot be fought without money, and if com- 
petition is uninhibited and expenditure unrestrict- 
ed, the money required can be considerable. This 
money can come only from rich firms and pros- 
perous individuals, who never pay without expec- 
tation of return. Hence the need to make elections 
cheap and the expenditure on them limited and 
accountable; also to restrict the amount of individ- 
ual donations. If this is effectively enforced, cor- 
ruption will have been controlled at one of its 
principal sources. Since, in actual practice, parties 
keep pace with individuals in observing double 
standards, election laws are in many countries 
more an adornment of the statute book than obliga- 
tory guidelines for the actual conduct of elections. 

An independent judiciary is a great asset for a 
country struggling with corruption. Mention may 
also be made of the ombudsman, who is directly 
answerable to parliament as protector of the citizen 
against the harassment of officialdom; first insti- 
tuted in the Scandinavian countries and later 
adopted elsewhere, as in New Zealand, this insti- 
tution has only limited relevance to the fight 
against corruption in developing countries. 

In considering the preventive aspect, one must 
deal with both ministers and civil servants—and, 
indeed, the whole machinery of government. In the 
long run, no civil service can rise above the level 
of its ministers. Some countries, such as India, 
have drawn up codes of conduct for ministers. 
Among other obligations, a minister must from 
time to time declare his assets and liabilities to the 
prime minister. He also must sever connection with 
any business he was employed in before becoming 
minister; undertake not to raise funds except 
through recognized trusts; and agree not to accept 
valuable gifts except from close relatives. A similar 
code applies to civil servants. One has, however, to 
draw a sharp distinction between the formulation 
of such codes and their strict observance in actual 
practice. 

In regard to the civil service, two or three other 
points need emphasis. Clearly, the proper recruit- 
ment and training of civil servants is of great im- 
portance, as is the upholding of standards and 
traditions. No less essential is the proper alloca- 
tion of authority. Delegation should not be unnec- 
essarily withheld, just as it should not be unwisely 
extended. Above all, it should be unambiguous. 


Nothing encourages corruption so much as the dif- 
fusion and blurring of responsibility. Finally, it 
is necessary that government servants should be 
properly housed and adequately paid. These ele- 
mentary requirements are often so neglected as 
positively to encourage corruption. 

All punitive procedures—investigation, inquiry, 
and judgment—should be speedy, effective, inde- 
pendent, and impartial. On the one hand, it is es- 
sential that the public should not have scope for 
the suspicion that the government is anxious to 
shield a minister or an official. On the other hand, 
it is equally necessary that the functionary con- 
cerned should have no room for apprehension that 
justice will not be done to him. For detection and 
general surveillance, vigilance commissions have 
been established in some countries. While not 
wholly independent of the executive, they have a 
right to report to the legislature. High-placed judi- 
cial tribunals and high-powered national panels 
are among the institutions established or proposed 
for dealing with ministerial misconduct. 

There is no lack, then, of administrative inge- 
nuity in devising measures to counter corruption. 
Also necessary, but not so readily forthcoming, are 
the social will and the political willingness to erad- 
icate the evil. 


B. VENKATAPPIAH 


[See also BUREAUCRACY; CIVIL SERVICE; CONFLICT OF 
INTEREST; ECONOMIC GROWTH, article on NONECO- 
NOMIC ASPECTS; POLICE; POLITICAL MACHINES; and 
the biography of KAvuTILya.] 
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William Fielding Ogburn (1886-1959), Ameri- 
can sociologist, was born in Butler, Georgia. He 
took his B.s, at Mercer University and received his 
PH.D. in 1912 from Columbia University. Between 
1911 and 1918 he taught economics, political sci- 
ence, history, and sociology at Princeton University, 
Reed College, and the University of Washington. 
He was chairman of the department of economics 
and sociology at Barnard College from 1919 to 
1927 and also taught at Columbia during this 
period. He then became professor of sociology at 
the University of Chicago and subsequently was 
appointed to the Sewell L. Avery distinguished 
service chair 

He retired in 1951, at which time he began 
traveling about the world, following his ever- 
broadening interests and his desire to see and learn 
more. He lectured at the universities of Calcutta 
and Delhi in India, and at Nuffield College, Ox- 
ford, and traveled extensively through Asia, the 
southwest Pacific, Europe, and Latin America. In 
addition, he was visiting professor of sociology at 
Florida State University. 

Ogburn held many responsible posts with the 
federal government, most of them concerned with 
such problems as retail prices and the cost of liv- 
ing. For a number of years he was chairman of 
the Census Advisory Committee. He was president 
of the American Sociological Association, president 
of the American Statistical Association and editor 
of its Journal, the first president of the Society for 
the History of Technology (founded in 1959), a 
vice president of the American Association for the 
Advancement of Science, and chairman of the 
Social Science Research Council. In addition, he 
was in demand as a labor mediator. 

The general concern with quantitative method- 
ology and thinking and the emphasis placed on 
Statistics at Columbia when Ogburn was a stu- 
dent there were important early influences on his 
Sought The courses he took at Columbia from 

ranklin H. Giddings in sociology, Edward L. 
Thorndike in education, Henry L. Moore in eco- 
ae and Franz Boas in anthropology all in- 
be quantitative methods and materials. This 
b “i on a quantitative approach was reinforced 
ae work he did on statistical cost-of-living 
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1964, p. ix). A 1915 article on the limited effec- 
tiveness of initiative and referendum in achieving 
reform was based on extensive statistical analysis. 
After the war, Ogburn gave up the role of reformer, 
although he continued to serve as a consultant to 
government agencies. He had come to doubt that 
idealism per se could accomplish very much, and 
he grudged the time that reformist activities took 
away from scientific ones. There is an interesting 
description by Duncan of the intellectual reorienta- 
tion that accompanied Ogburn’s abandonment of 
the reformist role: 

The conviction grew that the ideological component 
of reform movements was largely rationalization of 
wishes, a phenomenon akin to the fantasy or day- 
dream, Seeking an understanding of this phenome- 
non, Ogburn read widely in psychoanalysis and himself 
underwent analysis. He came to appreciate the per- 
vasiveness of bias in the absence of facts and devel- 
oped the idea that social scientists should seek a 
knowledge of their own prejudices and rationalizations 
to the end of reducing the output of theories shaped 
primarily by emotion and desire. Yet, such self-knowl- 
edge would not, of itself, lead to a social science, “for 
we cannot have a science without measurement. And 
science will grow in the social studies in direct ratio 
to the use of measurement.” (ibid., pp. x-xi) 


From his student days on, Ogburn never con- 
fined his interests to a single, narrowly defined 
social science. At Columbia he studied economics 
with John Bates Clark, E. R. A. Seligman, and 
Henry R. Seager. In the 1920s he was stimulated 
by the economist Wesley C. Mitchell and the an- 
thropologists Robert H. Lowie and Alfred L. 
Kroeber. It was his contact with cultural anthro- 
pology that made him modify the stress on psycho- 
logical motivation derived from the study of psy- 
choanalysis; he became increasingly aware that 
“any investigation of motives is likely to be super- 
ficial without the recognition of cultural factors, 
which can be discerned only through historical in- 
vestigation” (Duncan 1964, p. xi). Throughout his 
career Ogburn emphasized the interdependence of 
the various fields of social science, and his own 
work exemplified that interdependence. 

Methodology and theory. Ogburn’s perennial 
concern with verifiable knowledge is epitomized in 
his insistence that the question “How do you know 
it?” be answered. In thus emphasizing the need for 
proof he helped to change the social sciences, par- 
ticularly sociology, from disciplines dominated by 
social reformers and philosophers to ones based on 
the most nearly correct observations possible of the 
social world. Indeed, the emphasis on quantitative 
methods in sociology characteristic of the post- 
World War m period can be attributed in large 
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measure to Ogburn’s influence, both directly, and 
indirectly through his students. Although Ogburn 
frequently used statistics, he was not a methodolo- 
gist per se. He never insisted that statistical method- 
ology is the only way of obtaining verifiable data 
or that statistical methods provide all the relevant 
information (see 1934; Ogburn & Bose 1959). To 
him methods per se only served the pursuit of “big 
ideas.” 

Ogburn wished to differentiate between trivial 
findings (however well verified) and important 
findings, that is, findings relevant for advancing 
the social sciences as scientific disciplines or for 
dealing with some pressing social problems. Here 
he thought the relevant question is “What of it?” 
Ogburn never really relinquished his interest in 
social problems, and especially in his teaching he 
gave them salience. Examples of what he con- 
sidered to be big questions are: What should the 
functions of government be? How can we raise our 
standard of living? How can nations get along 
with each other? How are beliefs and ideas shift- 
ing? What are the roles of heredity and culture in 
shaping personality? Although he realized that 
these big questions cannot all be answered and 
that the few answers which available data and sta- 
tistical methodology can provide may be of little 
use, he nevertheless felt impelled to try to deal with 
such questions and one way or another to obtain 
as much verifiable knowledge as possible about 
them. 

Just as Ogburn did not believe in methodology 
for its own sake, so was he wary of an unques- 
tioning commitment to theory. He thought of the- 
ory as “the organization of knowledge,” which per- 
mits the researcher to see relationships and to 
construct hypotheses (1955). While he did not give 
much explicit attention to theory, referring to theo- 
rizing offhandedly as “Let’s think about it,” he did 
believe that his own explication of the problem of 
social evolution constituted a major contribution to 
sociological theory. 

Social change. Ogburn was preoccupied con- 
tinuously with the topic of social change. Until the 
first part of the twentieth century, writers on this 
subject had supposed that human society devel- 
oped and grew as biological evolution proceeded, 
some peoples having more advanced societies and 
cultures than others because they were more ad- 
vanced biologically. Then Franz Boas, in his work 
on the mind of primitive man and on the mental 
capabilities of various peoples, showed that all 
peoples have much the same inherent mental capa- 
bilities. Furthermore, he showed that there had 
been little, if any, biological evolution in the human 


race in the last few thousand years, while there 
had been very considerable social evolution. There- 
fore, culture must have grown and changed inde- 
pendently of any biological factors. Ogburn accepted 
the view that the explanation of social change must 
be sought in culture rather than in the biological 
nature of man. In his first systematic treatment of 
change, Social Change, With Respect to Culture and 
Original Nature (1922), he listed four factors 
which explain cultural evolution: invention, ac- 
cumulation, diffusion, and adjustment. 

Among these factors invention is basic. “Inven- 
tion is defined as a combination of existing and 
known elements of culture, material and/or non- 
material, or a modification of one to form a new 
one” ([1950] 1964, p. 23). Inventions result from 
the operations of three further factors: mental 
ability, demand, and the existence of cultural ele- 
ments that can be recombined into new inventions. 
Individuals with the needed mental abilities have 
appeared in all societies, but the larger the popu- 
lation is, the more such individuals there will be. 
Demand is a factor that not only contributes to the 
making of inventions but also determines whether 
an invention, once made, will be used. An invention 
not used is, in effect, an invention not made and 
one which is not likely to serve as a base for future 
inventions, Finally, the more cultural elements 
there are, the easier it is to produce new inventions 
by combining several already existing ones. 

Accumulation occurs when more new elements, 
that is, inventions, are added to the cultural base 
than are lost by obsolescence. A society or culture 
may be equated with an accumulation of inven- 
tions, and the larger that accumulation is, the easier 
it is to make new inventions. In theory, inventions 
should accumulate at an exponential rate, since 
any two or more existing elements may be com- 
bined to produce a new invention, Furthermore, 4 
combination of an increasing population and an 
increasing cultural base should lead to a greatly 
accelerated rate of invention. In reality, of course, 
exponential growth seldom lasts long. “Reality les- 
sens the slope or flattens it out eventually, perhaps 
to begin growing exponentially again for a time. 
Exponential growth-may work out to be irregular— 
now popularly called cyclical” (ibid., p. 26). 

Diffusion is the spread of an invention from one 
area to another. It is not necessary for each sepa- 
rate population group, tribe, or nation to make 
its own inventions. Given means of communica- 
tion, an invention made in one place can travel 
widely, Hence, those nations living at communica: 
tions crossroads may make enormous progress 
simply by receiving a wide variety of inventions 


from various sources. The strategic location of the 
ancient Greeks may account for the great cultural 
advances they made: they adopted and integrated 
the inventions and ideas (ideas are a form of in- 
vention) that were diffused to them. The reverse 
of this is that geographically isolated peoples tend 
to be culturally “backward.” 

Adjustment is the final factor producing cultural 
evolution. Since all elements of a culture are inter- 
related, any pronounced change in one element 
will of necessity result in changes in other elements. 
Each time an invention is made, some social change 
has occurred that requires adjustment. An inven- 
tion (material or nonmaterial) in one part of a 
culture may create an adjustment in a related part 
of the culture. “These adjustments do not take 
place instantaneously but are made after a delay 
and are called ‘cultural lags.’ Over the long course 
of social evolution, measured in thousands of years, 
cultural lags are invisible. At any particular mo- 
ment, however, they may be numerous and acute” 
(ibid., p. 30), Duncan has stressed that, in Og- 
burn’s conception, “The lag relationship comes into 
play when it can be demonstrated that, of two 
hitherto closely related and mutually compatible 
parts of culture, one changes in such a way as to 
disrupt the relationship and impair the compati- 
bility” (Duncan 1964, p. xvi). 

In a society where few inventions have occurred 
and where, therefore, little social change has oc- 
curred for a long time, the various elements are 
adjusted to each other. “. . , When a significant in- 
vention occurs in one part of culture, the balance 
is disturbed; change is set up in the other related 
parts as a process of adjustment to the new inven- 
tion. Thus social evolution goes forward by inven- 
tions which produce a disequilibrium in society, 
which in turn sets up forces which seek a new 
equilibrium” ({1950] 1964, p. 30). 

According to Ogburn, a common pattern of 
Social change in the twentieth century is for a tech- 
nological advance to affect an economic organiza- 
Hon; this in turn causes a change in a social institu- 
See as the family or government, and finally 
ea ange in the social philosophy of a people. 
foie ee that he did not assert the uni- 
ie P S this pattern. He saw it as an elabora- 
om the economic interpretation of history, 

ich placed the technical factor before the eco- 
nomic one (1936). 
ee began his discussion of social 
tion to be th inventions, he did not consider inven- 
noted previ e unique cause of all social change. As 
RRS iously, there must be a demand for an 

or it will not be used; such demand may 


OGBURN, WILLIAM FIELDING 279 


precede or follow the invention. “This demand may 
vary over time and is a cultural variable. . . . Thus 
there may have been much more demand for in- 
ventions during the past hundred years than for- 
merly” ([1950] 1964, p. 24). 

Apparently Ogburn never explored in detail the 
social prerequisites of the adoption of an invention. 
That he was aware of their importance is indicated 
by his emphasis on demand and by the inclusion 
in Technological Trends (which he planned and 
edited) of a chapter entitled “Resistances to the 
Adoption of Technological Innovations,” written by 
Bernhard J. Stern. Ogburn himself wrote in the 
introduction that resistance to invention can come 
from a number of sources, among them technical 
faults, high production costs, state opposition on 
social grounds, and adverse popular opinion (U.S. 
National Resources Committee 1937, p. 7). Thus, 
Ogburn’s linear sequence—technological change 
(i.e., invention) affects the economic organization, 
which affects social institutions (including govern- 
ment) which, in turn, finally cause a change in 
the social philosophy of people—can in fact be con- 
verted into a circular sequence: the social phi- 
losophy of people may affect technological change 
(i.e., the demand for and use of a new object or 
idea) which, in turn, may affect social institutions. 
Since Ogburn considered all parts of a culture to 
be interrelated, he would surely have accepted this 
reformulation by which nontechnological elements 
in a culture affect the technological. 

In his research, Ogburn actually attempted to 
measure social change as well as to explain it. To 
do this he considered very specific aspects of social 
trends—changes in the family, the city, etc.—from 
a limited geographical and chronological point of 
view. The best known reports of his research are 
probably in Recent Social Trends (see President's 
Research Committee on Social Trends 1933). Most 
of his studies were made of social changes in the 
United States and referred to the twentieth century, 
prior to which very few statistics were available. 
He studied various trends separately and then 
sought to ascertain the extent to which they im- 
pinged upon or reinforced one another. 

One generalization that he reached from this 
study of social trends is that movement continues 
in the same general direction for long periods of 
time; “long,” to be sure, is relative, and some trends 
are measured in decades, while others are meas- 
ured in centuries. Ogburn emphasized that people 
are most often aware only of what are in fact 
deviations from long-term trends; because these 
deviations are so highly visible they often obscure 
the major direction of change. Ogburn was con- 
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vinced that trends in any of our twentieth-century 
modes of behavior were likely to continue for a 
number of decades. He believed, therefore, that one 
could project at least a decade or two into the fu- 
ture with a probability of being correct about nine 
times out of ten. Examples of twentieth-century 
trends are the continuously increasing role of gov- 
ernment and the growth of large businesses. Such 
social trends can be deflected or slowed down only 
with considerable effort; it would require unimagi- 
nably great efforts to redirect or stop them de- 
liberately. 

Ogburn’s research and writing on specific topics 
such as the family, cities, population, legislation 
and voting, war, and international relations are 
closely, even inseparably, related to his work on 
social change. The outlines of particular social 
trends quickly become blurred, merging in broader 
patterns of change, As Duncan has written, “If, in 
a ‘changing society’ whose parts are closely inter- 
related, hardly any aspect of life remains constant, 
the study of social trends can accept few limita- 
tions on its substantive scope” (1964, p. xix). 

Influence. Ogburn had a direct influence on his 
students—particularly, of course, on those who 
prepared dissertations under his guidance or who 
served as his research assistants. (Probably all of 
the latter are named in one or another of his pub- 
lications, often as junior authors.) These students 
and assistants were indoctrinated with the need for 
evidence. Most of his students accepted his stress 
on quantitative data, and some even became promi- 
nent statisticians. A secondary influence on many 
sociology students came from the introductory 
textbook Sociology (1940), which Ogburn wrote 
with Nimkoff; by 1964, this book had gone through 
four editions. 

As early as the 1920s Ogburn emphasized the 
interdisciplinary approach that became popular in 
social science after World War m1. He did research 
in all areas of social science and drew his collabo- 
rators from the different disciplines. One of his 
outstanding works, The Social Sciences and Their 
Interrelations (1927), which he and Alexander 
Goldenweiser, an anthropologist, edited, is an early 
work of interdisciplinary cooperation, dealing with 
anthropology, economics, history, political science, 
sociology, ethics, law, religion, statistics, psychol- 
ogy, philosophy, and even biology and the natural 
sciences, One of the early leaders of team research, 
Samuel A. Stouffer, was a student of Ogburn’s and 
was greatly influenced by him. 

As one of the founders and first president of the 
Society for the Study of Technology, Ogburn tried 


to advance this field. It is too early to know whether 
or not sociologists will take an active part in this 
society and turn to the study of technological 
change. 

Ogburn’s main contribution to sociology was in 
the field of culture and cultural change; to him this 
was the heart of sociology. He never was particu- 
larly concerned with certain topics now of interest 
to some people who consider themselves to be soci- 
ologists, e.g., “small group relations” or “sociology 
as an aid to business.” Yet there is a difficulty in 
working in the field of social change, caused by the 
fact that techniques are not available that permit 
the carrying out of intellectually satisfying re- 
search. How does one, for example, ascertain 
whether the American culture is becoming “child- 
oriented” rather than “parent-oriented”? The prob- 
lem becomes particularly acute if one is interested 
in quantitative analysis. 

While Ogburn and other sociologists were grap- 
pling with this research problem, other social scien- 
tists were developing techniques for ascertaining 
public opinion. Methodologically, it turned out to 
be comparatively easy to develop satisfactory polls 
and surveys for learning (more or less accurately) 
what individuals thought and believed, but a gen- 
eration of experience with these methods showed 
that although they were useful for answering some 
practical questions, especially in the field of mar- 
keting, they were unable to provide answers to the 
“big questions.” Sociologists therefore are still con- 
fronted with the question Ogburn raised—how to 
study social change. 

Since World War 1, interest in the study of 
social change has been greatly stimulated because 
of the efforts of the economically developed nations 
to aid the underdeveloped ones. Vast sums of 
money and much thought are being devoted to the 
task of effecting socioeconomic change. Although 
some sociologists are still trying to use survey tech- 
niques and to question individuals in an effort to 
determine how to assist the underdeveloped coun- 
tries, this kind of research appears to be producing 
little of value. Ogburn’s work, however, does pro- 
vide relevant guidelines to the appropriate ways of 
helping the underdeveloped countries undergo 
social and economic transformation: his concep- 
tion of the interrelation of all parts of culture im- 
plies that the problem must be attacked on @ very 
wide front, and his insistence on the inherent con- 
tinuity of social trends in a society suggests that 
these trends must be taken into account when 


trying to effect socioeconomic change. 
A.J. JAFFE 


[For the historical context of Ogburn’s work, see Soci- 
oLocy; and the biographies of Boas; CLARK, JOHN 
Bates; GIDDINGS; KROEBER; Lowre; MITCHELL; 
Moore, HENRY L,; SELIGMAN, EDWIN R. A.; THORN- 
DIKE; for discussion of the subsequent development 
of his ideas, see CREATIVITY, articles on SOCIAL 
ASPECTS and GENIUS AND ABILITY; INTEGRATION, 
article on CULTURAL INTEGRATION; PREDICTION; SCI- 
ENCE, article on THE SOCIOLOGY OF SCIENCE; TECH- 
NOLOGY, article on TECHNOLOGY AND INTERNATIONAL 
RELATIONS; TRANSPORTATION, article on SOCIAL AS- 
pects; and the biography of STOUFFER.] 
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OLIGARCHY 


The word “oligarchy” and the concepts which it 
symbolizes originated in ancient Greece. In its 
basic use, the word identified one of the general 
forms of government recognized by the Greeks: 
that in which political government is conducted 
by a few persons or families. It was also used 
more narrowly, by Aristotle for example, to refer 
to the debased form of aristocracy, that is, to gov- 
ernment by the few or by a faction. The term 
“oligarchy” was also used to refer to the small group 
of persons who enjoyed a monopoly of political 
control in oligarchic governments; the term usu- 
ally had the added sense that the oligarchy ruled 
in its own rather than in the public interest. For 
Aristotle, classification of governments rested on 
two independent variables: the number of persons 
who ruled and the purposes served by their rule. 
Oligarchy was present when a few persons ruled 
for their own satisfaction. 

Development of the concept. The original uses 
of the term were associated with particular social 
and political regimes and with intellectual modes 
of analyzing them. Typically, societies were small 
and traditional and rested on established classes, 
including a slave class. Within Greek cities citizen- 
ship status often identified a large but still minor- 
ity class that could at least claim to participate in 
political decisions. Whatever the changes in politi- 
cal forms, this “upper class” was relatively stable 
by reason of property holding, authority relations 
with other classes, social position, and so on, and 
oligarchy could reasonably be expected to be suc- 
ceeded by other known forms of government. Clas- 
sical analysts found oligarchies to be endemic 
among ancient states, but they viewed them as 
unstable since they rested on military, economic, 
and leadership factors which were transitory as 
compared with the continuing forces which sup- 
ported the relatively large upper classes in tradi- 
tionalist societies. 

In the modern view, these classical conceptions, 
including oligarchy and the ideas associated with 
it, are far too simple for effective analysis. Indeed, 
classical writing makes it clear that the concep- 
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tions based on the formal structure of governments 
were not adequate even then, in spite of the par- 
ticular emphasis given to form. Greek analysts 
dealt with the phenomena of power, with the im- 
portance of procedures, and, of course, with the 
paramount role of values. These matters were 
merged with discussions of political form, but the 
elements were not clearly discriminated. The subtle- 
ties and complexities of Greek political thought do 
not appear to good advantage in this particular 
classificatory system. 

To the extent that the word “oligarchy” was em- 
ployed in the Roman and medieval periods, it 
appears to have been used in the Aristotelian sense 
of a perverted form of government. However, the 
emphasis on ideas and values of citizenship and 
law during the long period of Roman stability 
turned analytical attention away from forms of 
government as such. Even though the instabilities 
of feudalism did generate oligarchs, many of the 
writers of the early Middle Ages were more pre- 
occupied with problems of legitimacy, since these 
problems had been given new urgency by the com- 
plexities of church-state relationships. Neverthe- 
less, the word itself was coupled to its classical 
meaning and became an active term in political 
vocabularies following the rediscovery of Aristotle. 
In the late Middle Ages and early modern times, 
political instability among renascent states and a 
revival of secular realism restored earlier classifi- 
catory tools in significant measure. Machiavelli, 
for example, stipulated and used the Aristotelian 
classification of the types of government, including 
that of oligarchy, even though his political analyses 
could not be contained by them. 

Significant use of the term “oligarchy” by stu- 
dents of sociopolitical phenomena has lessened in 
recent centuries. This is not to say that the term 
has been wholly abandoned or that notable efforts 
have not been made to adapt the concept of oli- 
garchy to modern circumstances. Indeed, until rel- 
atively recently, general treatises on government 
have usually given some degree of credence to the 
classification of governments which contains oli- 
garchy as one form (see, for example, Sidgwick 
1891, or Wilson 1889). Some more specialized 
treatments have also sought to adapt the classical 
formula to new analyses, In 1832 George C. Lewis, 
mindful of parliamentary claims, adapted the clas- 
sical scheme, including its oligarchical component, 
to the new realities of political sovereignty in his 
Remarks on the Use and Abuse of Some Political 
Terms. 

However, these adaptations were, at best, diffi- 
cult to maintain in the context of the most impor- 


tant social phenomena among Western nations in 
the nineteenth and twentieth centuries: industri- 
alization and urbanization; the radical intensifica- 
tion of nationalism; the improvement and spread 
of representative parliamentarianism and of par- 
liamentary democracy; warfare conducted as the 
total effort of national populations; autocracy re- 
defined by twentieth-century totalitarianism; and 
political leadership vastly heightened by new tech- 
nological and organizational resources. 

In contrast to students of politics, spokesmen 
for ideological movements frequently used the con- 
cept of oligarchy to proclaim against degenerate 
oligarchs. Liberal and democratic reformers did the 
same thing in their works. Similarly, socialists dis- 
covered ruling coteries within the capitalist class. 
Their protests were in the language of opinion 
rather than systematic analysis. However, the scale 
of experience in which the classical concept of 
oligarchy was formulated could not produce, even 
for the socialist movements, ideas comprehensive 
enough to account for nationalism, or the complex 
semiegalitarianism of liberal societies, or the dyna- 
mism of historical necessity expressed through the 
warfare of classes. 

One group of twentieth-century scholars, includ- 
ing Ostrogorskii, Michels, Pareto, and Mosca, was 
particularly concerned with a redefinition of oli- 
garchy and its successes and failures. These writers 
sought an accommodation between forces of social 
class and of elitism and found it, to a large extent, 
in the idea of class oligarchies. Michels’ “iron law 
of oligarchy” made the point most explicitly: elitist 
control within the party structures that are an im- 
plementing response to social conflict is as inevi- 
table as class conflict itself. Michels, in company 
with others, provided a usable new definition of 
oligarchy, which was distinctive but had obvious 
kinship with the traditional form-of-government 
idea. But it is pertinent that these writings and 
their constituent ideas are now almost always in- 
dexed under a heading like “elitism” or “leadership 
rather than under the word “oligarchy.” 

Uses of the concept. The meanings associated 
with the word “oligarchy” have remained remark- 
ably stable throughout the centuries. Dictionary 
definitions accord very closely with the classical 
uses of the word. As a matter of fact, Websters 
New International Dictionary does indeed recog- 
nize a wider range of meanings, including extra; 
political meanings, but “government by the few 
remains the primary definition. Several concomi- 
tants to this unusual consistency are relevant. First, 
the word has retained its usefulness as a technical 
symbol for scholars whose interest in political phe- 


nomena is historical and descriptive rather than 
contemporary and analytical. Unlike many other 
key words, “oligarchy” does not require an elaborate 
etymology to establish historically prior uses. Sec- 
ond, the word and its associated ideas have not 
proved notably useful for nineteenth-century and 
twentieth-century analysis, and this period has 
witnessed its declining use by students of current 
societies. Among many illustrations, it may be 
noted that in Agger, Goldrich, and Swanson 
(1964), the authors do not find it necessary to 
mention oligarchy, although the book is concerned 
with the power of elites. Nor does the elaborate 
classification system of the annual International 
Bibliography of the Social Sciences find room for 
the word. Finally, the new uses for the term, espe- 
cially in its application to nonpolitical phenomena 
(for example, “oligarchy” in school systems) have 
been largely promoted by lay users. The epithet 
“oligarch” is more apt to be found in a novel than 
in a technical treatise on politics. 

The political phenomena associated with what 
previously was labeled as oligarchy, that is, effec- 
tive rule by a small and often irresponsible element 
in a society, have been given great attention by 
current analysts. However, data assembled for the 
analyses and the methods used make the inade- 
quacy of the concept of oligarchy apparent almost 
at once. Some contrasts will make the point. First, 
traditional treatments of oligarchy tended to re- 
strict the role of the oligarch to simple control 
telationships based on command. Current analyses 
of leadership by a minority define a plurality of 
toles and include conditional responses between 
leaders and followers. Second, the simple distinc- 
tion between the few and the many is replaced by 
a Continuum of relationships among members of a 
Polity who display quite different responses at dif- 
ferent times and under differing circumstances. In- 
trrelations between different social strata, groups, 
and organizations, including their leaders, and po- 
litical leaders are repetitive and contingent. Third, 
the concept of power runs through a range marked 
a one end by forced compliance backed by penal- 

es and at the other by the gentleness of influence 
hy Persuasion. The formal classical specification 
oy as a class of government largely 
p> ided the question of the techniques and other 

fans by which this form of government was main- 
ot Finally, contemporary analysis of elite 
Mlino is broadly social rather than narrowly 
7 na § Politically relevant leadership by “the few 
ete at many points in the society and not in 
ne S of government alone. In conclusion, there- 
What has happened to the concept of oligarchy 


OLIGOPOLY 283 


in recent decades may be taken as an index to what 
has happened in the social sciences. 


Tuomas P. JENKIN 


[See also AUTOCRACY; DICTATORSHIP; ELITES; LEGITI- 
Macy; MONARCHY; TOTALITARIANISM; and the biog- 
raphies of ARISTOTLE; MICHELS; MILLS; Mosca; 
OSTROGORSKII; PARETO.] 
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OLIGOPOLY 


Oligopoly, the economist’s analogue to oligarchy 
in political science, is defined as a market situation 
where independent sellers are few in number. The 
origin of the term is not clear, but it is known to 
have appeared in the original, 1518 Latin version 
of Thomas More’s Utopia. Common usage of the 
term in English writings, however, dates from the 
1930s (see Chamberlin 1957). The central analyt- 
ical problem with which the theory of oligopoly is 
concerned is how each of the few sellers reacts to 
the economic activities of its rivals in order to 
bring about determinate equilibrium solutions. The 
theoretical problem is therefore conceptually dif- 
ferent from, and vastly more complex than, that of 
either pure monopoly or pure competition, The 
monopolist’s revenue function is identical with the 
market demand schedule; that of the pure competi- 
tor is easily derivable from a given market price 
which, by assumption, cannot be affected by any 
one of the numerous competing firms. Hence, 
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neither the monopolist nor the purely competitive 
firm must consider how alternative actions by rival 
firms will affect its own revenue possibilities—the 
monopolist has no rivals and the competitive firm 
has so many that it can ignore any one of them. 
Oligopolists not only have rivals, but they have so 
few of them that each is large enough to affect the 
others significantly. Their prices, outputs, and other 
relevant variables are therefore interdeterminate. 

The problem of interdeterminacy among oli- 
gopolists can easily be shown by considering the 
simple maximizing equations of contemporary eco- 
nomic theory. It is generally assumed, at least as 
a first approximation to reality, that business firms 
seek to maximize profits. The validity of the as- 
sumption very likely varies directly with the degree 
of competition in the market. Highly competitive 
firms earn only a competitive return on capital even 
when they pursue profits maximization. Hence, 
should they pursue any other objective, they would 
earn less than a competitive rate of return, Firms 
with some degree of market power may at least 
seek another objective, such as maximum revenue 
or market share, subject to a minimum profits con- 
straint, and still survive (Baumol 1959, pp. 54-68). 

Profits, m, are defined as total revenue, R, less 
total costs, C. In turn, R and C are, respectively, 
price and unit costs multiplied by the quantity pro- 
duced and sold. That is, R = pq, and C = cq. Fora 
monopolist the R schedule for various alternative q's 
is readily determinable, since the price under given 
demand conditions depends entirely on the quantity 
the monopolist offers on the market: p= F(q). 
Hence, the monopolist’s equilibrium output and 
price are determined when 7=R-—C is maxi- 
mized. Under pure competition the same equations 
apply, except that in R= pq, p is determined by 
the market and is treated by each competitor as a 
parameter. 

For each oligopolist, on the other hand, the price 
it receives is dependent not only upon the quan- 
tity it supplies (q,) but also upon the quantities 
supplied by its rivals (q2, qs, ***, qn). Therefore, 
P=P(q+q:+qs + +q) and Ri =pq,. Ac- 
cordingly, unless each oligopolist makes certain 
assumptions about the quantities its rival will sup- 
ply, its own revenue schedule (R; ) is formally inde- 
terminate. Moreover, the interdependency of oligop- 
olists encompasses far more variables than their 
respective outputs: in the modern industrial mar- 
ket such variables as pricing, advertising, and 
innovational activities and strategies are also de- 
terminants of equilibrium. Theoretically, and as a 
practical matter, each oligopolist must evaluate 
these variables for all its rivals before it can 


rationally select its own course of action so as to 
maximize its profits. Furthermore, in the selection 
process it must allow for whatever effect its own 
course of action will have on the variables it must 
evaluate. This complex interdependency of firms’ 
decisions with respect to the important market 
variables is the essential and distinguishing fea- 
ture of oligopoly. It also explains, as will be dem- 
onstrated below, why oligopoly theory predicts less 
satisfactorily the behavior of business firms to 
which it applies than do the theories of monopoly 
and competition. 

The unsatisfactory and inconclusive state of 
contemporary oligopoly theory leaves an important 
gap in our knowledge of the operative mechanics 
of modern industrial economies containing a sig- 
nificant ingredient of private capitalism. Since 
World War 1 the more developed countries, espe- 
cially Canada, Japan, and those of western Europe, 
have placed increased reliance on competitive mar- 
ket forces to regulate their respective national 
economies—greater reliance than ever before, To 
secure at least a workable minimum of competi- 
tive market forces Austria, Belgium, Canada, Den- 
mark, France, West Germany, Ireland, Japan, 
Luxembourg, the Netherlands, Norway, Sweden, 
Switzerland, and the United Kingdom have enacted 
legislation limiting cartel activities; the Italian 
parliament is committed eventually to enact similar 
legislation, on which it has deliberated for several 
years. Both the European Coal and Steel Com- 
munity and the European Economic Community 
(European Common Market) treaties contain pro- 
visions prohibiting certain cartel activities and re- 
strictive business practices. In the United States 
antitrust legislation has been strengthened, and the 
laws have been administered with increasing vigor. 
Clearly, in much of the advanced industrial world 
the pace and direction of economic activity are left 
to market forces preserved by laws designed to pre- 
vent private firms from frustrating these forces 
through cartels and restrictive business practices. 
[See ANTITRUST LEGISLATION; CARTELS AND TRADE 
ASSOCIATIONS. | 

Yet in all these countries many industries con- 
form to some variant of oligopoly. In the United 
States, concentration ratios—the share of output 
accounted for by the four largest firms—exceed 
50 per cent in many manufacturing industries. The 
level of concentration in manufacturing in Canada 
and the United Kingdom is generally higher than 
in the United States (Rosenbluth 1955, pp. 71-79), 
as is probably the case in most other industrialized 
countries. In Denmark, West Germany, and the 
United Kingdom, dominant firms (usually defined 
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as single firms accounting for as much as 25 per 
cent of a relevant market) are sufficiently evident 
to be given special attention in legislation govern- 
ing restrictive business practices. The prevalence 
of oligopoly, and the present inadequate state of 
oligopoly theory, leaves a serious void in our under- 
_ standing of how market forces govern a significant 
portion of the economic activity in those countries 
which rely on them. The void persists in spite of 
the heroic efforts made since the 1840s to close it. 


Models 


The Cournot solution. The first formal solution 
to the problem of oligopolistic interdependence is 
associated with Antoine Augustin Cournot, the 
French economist and mathematician (1838). Con- 
temporary theory textbooks often identify Cournot 
only with the classical duopoly (two-firm) solution. 
It is true that his examples consisted of two-firm 
markets, but this seems clearly to have been a mat- 
ter of simplifying the presentation. At each point 
he extended his analysis to the three-, four-, and 
n-firm cases. 

Essentially, Cournot resolved the problem of 
oligopolistic interdependence by postulating that 
each (firm) independently sought maximum profits 
under the assumption that its rivals’ outputs were 
fixed. That is, given the output of its rivals, each 
oligopolist supplied that share of the remaining 
demand which maximized its profits. (The simi- 
larity in this regard of Cournot’s oligopoly model 
and the much more recent models of dominant-firm 
Price leadership will become apparent later on in 
the treatment of price leadership.) However, until 
they reached a stable equilibrium, any change in the 
quantity offered by one firm necessarily forced the 
others to alter their offerings too. Accordingly. 
while each rival proceeded to maximize profits on 
the assumption that it had no direct effect on the 
outputs of other firms, the outputs of all were made 
interdependent through the indirect effect: i.e., 
each move by one forced a countermove by the 
others, 

Cournot’s greatest contribution to the problem 
was his recognition of the strategic interdependence 
of oligopolists and the set of reaction curves he 
designed, which, under the foregoing assumptions, 
ad a stable equilibrium solution (see Figure 1). 
ee: curve mm, is defined as those profits- 
ali ae quantities firm 1 will offer for sale for 
ia a Eble quantities supplied by firm 11; firm 
River ogous reaction curve is m,n.. Thus, if firm 
nea to offer the quantity x’, which would be its 

x S-Maximizing quantity if firm 1 offered the 
quantity y,, firm m would then react by offering 
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Figure 1 — Cournot reaction curves* 


* Notation is as it appeared in Cournot’s original publication. Di and D: are 
respectively the outputs of Firm 1 and Firm 11. 


Source: Cournot 1838, 


the quantity y; but then firm 1 would react by ad- 
justing its quantity to x”, forcing firm 1 in turn to 
offer quantity y:. The reaction process would con- 
tinue until the quantities supplied by firm 1 and 
firm m were respectively x and y, the equilibrium 
output for each firm. The equilibrium is stable, 
since the outputs of the two firms will eventually 
settle at x and y, irrespective of the assumed initial 
offering of either firm. 

The implications of Cournot's oligopoly solution 
for welfare theory were extremely important, al- 
though rarely explicitly stated. He carefully pointed 
out that his reaction model, built upon formal inde- 
pendence among the participating oligopolists, yield- 
ed a lower price and a larger output than would be 
the case were they to enter into an agreement 
({1838] 1960, pp. 79-80). Moreover, the larger the 
number of participants, the lower the price and 
the greater the output. Cournot’s generalized price 
equation for markets comprising identical firms 
yielded an equilibrium when D + np(dD/dp) = 0, 
where D is quantity demanded, is the number of 
firms, and p is price. When oligopolists entered 
into a collusive agreement, n became unity and 
p became the monopoly price. The larger the num- 
ber of independent firms, the more closely p ap- 
proximated the purely competitive price. 

The defects in Cournot’s model for predicting 
oligopoly behavior in the real world have been 
pointed out by many. Bertrand, Marshall, Pareto, 
and Edgeworth all argued that it failed to allow for 
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the incentive confronting each firm to lower its 
price and enlarge its sales (Chamberlin 1933, 
pp. 36-37 and appendix A). Marshall noted that 
this incentive would be particularly strong under 
conditions of increasing returns, The most system- 
atic criticism of the theory of oligopoly as devel- 
oped up through the first quarter of the twentieth 
century, however, is associated with Edward H. 
Chamberlin. 

The Chamberlin model. Chamberlin’s criticism 
of the Cournot solution and its variants stemmed 
from their essential shortcoming: none of the solu- 
tions conformed perfectly to the hypothesis that 
each seller maximized its profits. Realistically, a 
profits-maximizing oligopolist must take account 
of its indirect as well as its direct effect on price. 
As Chamberlin put it, “When a move by one seller 
evidently forces the other to make a counter move, 
he is very stupidly refusing to look further than his 
nose if he proceeds on the assumption that it will 
not” (1933, p. 46). Chamberlin sought to remedy 
this defect by having oligopolists at the outset rec- 
ognize their mutual dependence; he thus gave birth 
to the notion of conjectural interdependence or 
oligopolistic rationalization. 

When each firm takes into account its total effect 
on price, under certain rigid assumptions, the 
oligopoly solution is the monopoly price. This price 
provides the maximum profits for the group as a 
whole and, when sellers recognize that their for- 
tunes are mutually dependent, the maximum profits 
for each. None of the oligopolists will reduce its 
price below the monopoly price, since it knows 
beforehand that others will be forced to follow, and 
thereby reduce the profits of all. Chamberlin em- 
phasized that this results when each firm inde- 
pendently seeks to maximize profits, provided the 
firms are so few in number that each must consider 
both the direct and the indirect effect it has on 
market price, When firms are sufficiently numerous 
to ignore their indirect effect, price will settle at 
the level given by the Cournot solution; when they 
are sufficiently numerous to ignore their direct 
effects as well, price will fall at once to the com- 
petitive level. Hence, in the Chamberlin model, 
unlike Cournot’s, there is no gradual descent to 
the competitive price as the number of firms is 
increased, Instead, a discontinuity occurs some- 
where along the spectrum of firm numbers between 
one and many, whereby the oligopoly solution 
changes radically from the monopoly price to the 
competitive price. 

The Chamberlin oligopoly solution was received 
with enthusiasm. Students of industry had often 
observed monopolistic behavior in industries that 


were not, by the classical definition, monopolized, 
Chamberlin’s theory provided an explanatory hy- 
pothesis for such behavior. In the United States, 
antitrust practitioners quickly translated Chamber- 
lin’s oligopolistic rationalization into the legal doc- 
trine of “conscious parallelism,” and in several 
cases the government successfully prosecuted oli- 
gopoly as illegal monopoly under the Sherman Act 
(Nicholls 1949). While such victories were ap- 
plauded by some, they were viewed with alarm by 
others. Chamberlin’s oligopoly solution rested on 
the rational actions of enlightened oligopolists seek- 
ing maximum profits. Moreover, the incidence of 
oligopoly in the United States, and elsewhere, was 
high. Hence, the judgments handed down in those 
cases implied either of two conclusions, both highly 
disturbing: business firms must stop behaving 
rationally or a significant percentage of United 
States industry would be declared in contravention 
of the antitrust laws. 

In time, recognition of the restrictive assump- 
tions underlying the Chamberlin solution revealed 
that much oligopoly behavior was still left unex- 
plained. Chamberlin had assumed static conditions 
in which oligopolists confronted identical cost func- 
tions and considered only price and output as vari- 
ables. In the real world, as Chamberlin himself 
carefully pointed out and elaborated, these highly 
restrictive assumptions are seldom fulfilled. The 
modern business firm may typically confront rel- 
atively few rivals, but it has available a wide variety 
of business policies and strategies to employ against 
them. Advertising and research budgets, product 
change, marketing methods, and nonprice terms of 
trade, as well as prices, are variables over which 
most oligopolists exercise some degree of discretion, 
and how they exercise it is the essence of the oli- 
gopoly problem. Empirical studies of oligopolistic 
industries have yet to reveal a systematic pattern 
of behavior with respect to such variables. Accord- 
ingly, models of oligopoly confined to only two of 
them, e.g., price and output, can scarcely be ex- 
pected to provide an explanatory hypothesis valid 
for oligopoly generally. As Fellner (1949) and 
Markham (1959) have pointed out, in oligopoly 
the struggle for competitive advantages over rivals 
logically centers on nonprice activities. The new 
product, process, or advertising campaign is much 
less susceptible to immediate detection and imita- 
tion than the new price schedule is. The significant 
contribution of Chamberlin’s work in the subject 
is not that oligopolists can generally be expected 
to behave as a monopoly but, rather, that they W 
select competitive strategies more complex anı 
more subtle than simple price competition. 


The game theory model. Recognition of the 
variety of competitive strategies open to oligopolists 
has led in recent years to a search for formally 
determinate solutions through the application of the 
mathematical theory of games [see GAME THEORY]. 
Since game theory is expressed in terms of matrix 
algebra, it has at least the pedagogical advantage 
over Cartesian diagrams of being able to handle a 
large number of strategies and counterstrategies 
simultaneously. The decision problem confronting 
each oligopolist in the game theory model consists 
of choosing an optimal strategy from some finite 
number of strategies, where a strategy is defined 
as the selection of a particular value for the vari- 
ables over which the oligopolist has some degree of 
control. For each strategy selected by one oligopo- 
list, there is a corresponding set of payoffs, or 
profits, one for each strategy selected by other 
oligopolists. An example of such a set of payoffs 
for an industry comprising two firms, A and B, 
each having available three possible strategies, may 
be put in the form of Table 1, where the pairs of 


Table 1 


values represent the profits A and B will earn for 
all combinations of strategies they can use; e.g., if 
A uses strategy 1 and B uses strategy 3, A sustains 
a loss of 2 and B earns profits of 8. (The illustra- 
tion is a nonzero-sum game. Zero-sum games, in 
which the algebraic sum of all gains and losses 
equals zero, have limited applicability to oligopoly. ) 
Which strategy each firm will in fact use depends 
entirely on its propensity to gamble and on the in- 
telligence it accords its rival. In the above example, 
if both A and B could assume that each would 
Cooperate with the other so that the combined 
ned of both would be maximized, A would use 
ee 1 and B would use its strategy 1. This 
uae be the cooperative equilibrium solution. 
ob if each rival distrusted the other, neither 
Ni use its strategy 1, because in doing so, B 
ano sca a profit of 8 and force A to take a loss 
eras oe its strategy 3, and A could earn a 
eater and force B to take zero profits by using 
Race Ea 2. Hence, if A and B each assume the 
EE oth will use strategy 3 and earn a minimum 

t of 3 each. The strategies are compatible, and, 
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so long as these conditions prevail, the industry is 
in equilibrium. 

Theory of games models can be made much 
more complex, involving mixed strategies whereby 
each oligopolist seeks to maximize its minimum 
level of expected earnings by shifting from one 
price strategy to another, in accordance with some 
predetermined selection scheme. Such an optimum 
mixed strategy may be determined by linear pro- 
gramming techniques [see PROGRAMMING]. But it 
is sufficiently clear from the foregoing simple pure- 
strategy model that formally determinate game 
theory solutions assume a much higher order of 
knowledge than business firms possess. If it is true, 
as economists and business spokesmen frequently 
assert, that the typical firm can only roughly ap- 
proximate the demand schedule for its product, it 
is virtually certain that its knowledge of all the 
possible strategies available to its rivals and their 
respective payoffs must be extraordinarily incom- 
plete. Entrepreneurs, however prudent they may be, 
pursue different strategies when confronted with 
incomplete knowledge than they do when all the 
possible outcomes are known. In truth, in a dynamic 
oligopolistic industry the process of creating com- 
petitive strategies is as essential to an understand- 
ing of industrial behavior as is how they are 
employed. Moreover, as game theory models are 
expanded to more nearly approximate actual oli- 
gopolistic situations, they become so complex that 
they are virtually meaningless. Perhaps the most 
heroic effort yet made in applying game theory to 
oligopoly markets can be credited to Shubik (1959). 
But as one reviewer of Shubik’s work (Kaysen 
1960) has observed, the gap between his game 
theory model of oligopoly and the United States 
automobile and cigarette industries, to which he 
applied it, is enormous. On the other hand, a model 
capable of handling five or six automobile pro- 
ducers, each having a half-dozen or more strate- 
gies, would have to be of such tremendous size 
that virtually nothing meaningful could be said 
about it. 

The Cournot, Chamberlin, and game theory mod- 
els do not exhaust the list of deductive generaliza- 
tions on oligopoly behavior; but they are landmarks 
in the development of oligopoly theory since the 
1840s and adequately illustrate its strengths and 
weaknesses. In its evolution the theory has unques- 
tionably been strengthened. To Cournot oligopoly 
was simply an intermediate range on the spectrum 
of firm numbers between one and many and was 
distinguishable in no fundamental sense from clas- 
sical monopoly or competition. Chamberlin made 
oligopoly a distinctive form of market organization 
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guided by its own modus operandi. Game theorists 
have recognized this distinctive character and have 
sought to design models capable of handling the 
variables over which oligopolists exercise discre- 
tion. But a theory of oligopoly generally valid for 
all industries comprising a few firms has yet to be 
developed. The absence of such a theory is attrib- 
utable to the inherent complexity of the problem 
and to the heterogeneity of oligopoly itself. 

Partial models. Oligopoly theory includes ex- 
planatory hypotheses for limited aspects of oligop- 
oly behavior as well as the foregoing more general 
models. The apparent rigidity of oligopoly prices 
and the uniformity of price among oligopolists have 
been accorded particularly close attention since the 
world-wide depression of 1929-1933. The theoreti- 
cal construction generally offered as a rationale for 
rigid oligopoly prices is the kinked demand curve; 
that for uniform prices, dominant-firm price leader- 
ship, or Chamberlinian conjectural interdependence 
as expressed in some form of barometric or con- 
spiratorial price leadership. 

The kinked demand curve hypothesis, advanced 
almost simultaneously by Paul Sweezy (1939) and 
a group of Oxford economists (Hall & Hitch 1939), 
rests on the assumption that each oligopolist visu- 
alizes two distinct demand schedules for its output, 
one that obtains if all rivals meet any new price it 
sets and another that obtains if rivals do not meet 
such new prices. The second is more elastic than 
the first because the sales of any one oligopolist 
will respond much more to price changes that rivals 
do not match than to those they do. However, if 
oligopolists reason that their rivals will meet all 
price reductions but will not follow price increases, 
a kink occurs in the demand schedule at the cur- 
rently prevailing price, since the first demand curve 
is relevant for all contemplated price reductions 
and the second for all contemplated price increases 
(see Figure 2), The kink in turn produces a gap 
in the firm’s marginal revenue schedule, the gap 
being proportional to the differences in the elastici- 
ties of the upper and lower segments of the demand 
curve D, D; . Because of the gap, costs may fluctuate 
over a significant range without disturbing the 
equality between marginal cost and marginal rev- 
enue. Accordingly, a given price may prevail for 
long periods of time. 

This theory of rigid oligopoly prices is incon- 
sistent with the various theories of price uniformity 
among oligopolists, such as Chamberlin’s theory 
and the several theories of price leadership (Mark- 
ham 1951), Under dominant-firm price leadership, 
the large firm, or partial monopolist, sets a price 
and lets the rest of the industry sell as much as it 


wishes at that price. In the absence of a dominant 
firm, price leadership may emerge as a vehicle for 
carrying out a tacit price conspiracy, or simply 
because, by custom, one firm serves as a price 
barometer. Under any one of the three forms of 
price leadership, the presumption is that price fol- 
lowers match the price leader’s increases as well as 
its decreases, and hence the oligopolist’s demand 
curve contains no kink. 

Stigler’s analysis of oligopoly prices (1947) 
casts considerable doubt on the validity of the 
kinked demand curve hypothesis. In his price com- 
parisons he found that oligopolies with dominant- 
firm price leaders generally tended to have less 
flexible prices than those possessing no price leader. 
If the kink existed, the opposite should have been 
the case. Moreover, according to Stigler, there is no 
logical basis for assuming the existence of such 
kinks. To exist, they must be created by entre- 
preneurs, and to create artificial obstacles to profit- 
able price increases would be inconsistent with the 
objective of maximum profits. 

Administered prices. Rigid oligopoly prices, 
often referred to as “administered” prices, still 
stand in need of a satisfactory explanation. How- 
ever, in recent years precisely what needs to be 
explained has itself changed. Until the end of 
World War oligopoly prices demonstrated con- 
siderable inflexibility, upward and downward, in 
comparison with competitive prices. In fact, a lead- 
ing American economist who served as a price con- 
trol official during the war has argued convincingly 
that oligopoly prices were relatively easy to admin- 
ister by public authority because typically they 
were administered privately (Galbraith 1952). In 
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Figure 2 — The kinked demand curve 


the postwar inflationary period certain oligopoly 
prices have tended to demonstrate considerable up- 
ward flexibility in business upswings but very little 
downward flexibility in business downswings. Ac- 
cordingly, the so-called administered price group 
has, at least in a statistical sense, contributed sig- 
nificantly to the postwar inflation in the United 
States. 

‘At least four independently conducted studies of 
the problem have concluded that such price be- 
havior is attributable much more specifically to 
market power than to administered prices (Adel- 
man 1961; Galbraith 1957; Stigler 1962; Markham 
1964), In fact, the term “administered price” prob- 
ably does more to cloud than to clarify the issue. 
Administered prices have generally been defined as 
those prices arrived at within the firm as an inte- 
gral part of its decision-making process, as distin- 
guished from those generated entirely by the 
impersonal interplay of market forces. Nearly all 
prices communicated to the public through pub- 
lished price lists are, by this broad definition, ad- 
ministered. But the oligopolistic prices contributing 
to the postwar inflation have been relatively few, 
confined principally to highly concentrated indus- 
tries that deal with strong labor unions. Clearly, 
neither oligopoly nor administered prices are mas- 
sive undifferentiated wholes but, instead, exhibit 
wide varicties of behavioral patterns. 

Retrospect and prospect. George Gaylord Simp- 
son, commenting on the social theorist, once 
observed: “Each student thus actually put his par- 
ticular theory into the data, and it is not surprising 
that each then gets his own theory out of these 
data when he is through” (1949, p. 139). 

In certain respects oligopoly theory has devel- 
Oped like this, There are many varieties of indus- 
trial structures, each exhibiting its own peculiar 
behavioral pattern, that can technically be defined 
as oligopoly. Some conform to Chamberlinian con- 
Jectural interdependence, others to one of the sev- 
eral price leadership models; some give factual 
Support to the kinked demand curve hypothesis, 
others exhibit reasonably flexible prices; some oli- 
gopolists appear to use sophisticated strategies, 
N to rely heavily on price concessions. Oligop- 
oly is far less homogeneous than the term “few 
sellers” might imply. 

Eee is a great need for developing the impor- 
Beakin Pesi facts on oligopoly which will permit 
eet own an unsatisfactory and meaningless 
oes A into classes useful for analytical pur- 
etie ndeed, until this is done, the plethora of 
RA e subtheories must remain untested, and 

ive mechanics by which large sectors of 
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most private enterprise economies are governed 
must remain obscure. The large number of industry 
studies published since the 1940s in the United 
States, Canada, the United Kingdom, Japan, and 
western Europe have provided a substantial begin- 
ning in amassing the relevant facts. When appro- 
priately classified and subjected to the powerful 
tools of modern statistical analysis, they should 
reveal the stochastic laws of various types of oli- 
gopoly. For the important purposes of administering 
the growing body of antimonopoly laws in capital- 
istic societies, of effectively using monetary and 
fiscal policies to combat inflation and unemploy- 
ment, and of designing and administering policies 
to stimulate economic growth, such predictive laws 
are obviously essential. Oligopoly, in its variety of 
forms, is and will remain a significant sector of 


private enterprise. 
Jesse W. MARKHAM 


[See also GAME THEORY; INDUSTRIAL CONCENTRATION; 
Monopoty; and the biography of Cournor.] 
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OPERANT CONDITIONING 
See LEARNING, article on INSTRUMENTAL LEARN- 
ING. 


OPERATIONISM 
See POSITIVISM. 


OPERATIONS RESEARCH 


The roots of “operations research” (commonly 
referred to as OR) go back at least to the industrial 
revolution, which brought with it the mechaniza- 
tion of production, power generation, transportation, 
and communication. Machines replaced man as a 
source of power and made possible the development 
of the large industrial, military, and governmental 
complexes that we know today. These developments 
were accompanied by the continuous subdivision of 
industrial, commercial, military, and governmental 
management into more and more specialized func- 


tions and eventually resulted in the kind of multi- 
level structure that today characterizes most or- 
ganizations in our culture. 

As each new type of specialized manager ap- 
peared, a new specialized branch of applied science 
or engineering developed to provide him with assist- 
ance. For example, in industry this progression 
began with the emergence of mechanical and chem- 
ical engineering to serve production management 
and has continued into more recent times with the 
development of such specialties as industrial en- 
gineering, value analysis, statistical quality control, 
industrial psychology, and human engineering, 
Today, no matter how specialized the manager, at 
least one relevant type of applied science or engi- 
neering is available to him. 

Whenever a new layer of management is cre- 
ated, a new managerial function, that of the exec- 
utive, is also created at the next higher level. The 
executive function consists of coordinating and 
integrating the activities of diverse organizational 
units so that they serve the interests of the organ- 
ization as a whole, or at least the interests of the 
unit that contains them. The importance of the 
executive function has grown steadily with the in- 
crease in the size and complexity of industrial, 
military, and governmental organizations. 

The executive function in business and industry 
has developed gradually. The executive was not 
subjected to violent stimuli from new technology 
as was, for example, the manager of production. 
Consequently the executive “grew” into his prob- 
lems, and these appeared to him to require for their 
solution nothing but good judgment based on rele- 
vant past experience. The executive, therefore, felt 
no need for a more rigorous scientific way of look- 
ing at his problems. However, the demands on his 
time grew, and he sought aid from those who had 
more time for, and more experience with, the 
problems that he faced. It was this need that gave 
rise to management consulting in the 1920s, Man- 
agement consulting, however, was based on experi- 
ence and qualitative judgment rather than on 
experimentation and quantitative analysis. The 
executive function was left without a scientific arm 
until World War 1. 

The major difference between the development of 
military executives and of their industrial counter- 
parts is to be found in the twenty-year gap between 
the close of World War 1 and the opening of World 
War 1. Because there was little opportunity to use 
military technology under combat conditions dur- 
ing this period, this technology developed $00 
rapidly for effective absorption into military tactics 
and strategy. Thus, it is not surprising that British 


military executives turned to scientists for aid when 
the German air attack on Britain began. Initially 
they sought aid in incorporating the then new radar 
into the tactics and strategy of air defense. Small 
teams of scientists, drawn from any disciplines 
from which they could be obtained, worked on 
such problems with considerable success in 1939 
and 1940. Their success bred further demand for 
such services, and their use spread to Britain’s 
allies—the United States, Canada, and France. 
These teams of scientists were usually assigned to 
the executive in charge of operations, and their 
work came to be known in the United Kingdom 
as “operational research” and in the United States 
by a variety of names: operations research, oper- 
ations analysis, operations evaluation, systems anal- 
ysis, systems evaluation, and management science. 
The name operations research was and is the most 
widely used in the United States. 

At the end of the war very different things hap- 
pened to OR in the United Kingdom and in the 
United States. In the United Kingdom expenditures 
on defense research were reduced. This led to the 
release of many OR workers from the military at 
a time when industrial managers were confronted 
with the need to reconstruct much of Britain’s 
manufacturing facilities that had been damaged 
during the war and to update obsolete equipment. 
In addition the British Labour party, which had 
come into power, began to nationalize several major 
and basic industries. Executives in these industries 
in particular sought and received assistance from 
the OR men coming out of the military. Coal, iron 
and steel, transport, and many other industries 
began to create industrial OR. 

In contrast to the situation in Great Britain, 
defense research in the United States was increased 
at the end of the war. As a result military OR was 
expanded, and most of the war-experienced OR 
workers remained in the service of the military. 
Industrial executives did not ask for help because 
they were slipping back into a familiar peacetime 
pattern that did not involve either major recon- 
struction of plant or nationalization of industry. 

x During the late 1940s, however, the electronic 
EPH en became available and confronted the in- 
mA manager with the possibility of automation 
i AERP eee ment of man by machines as a source 
oe i . The computer also made it possible for 
aries o control more effectively widely spread 
ae Panele activities because of its ability to 
quickly aa amounts of data accurately and 
ig ete Revise the spark that set off what 
tevalo vee been called the second industrial 

. In order to exploit the new technology 
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of control, industrial executives began to turn to 
scientists for aid as the military leaders had done 
before them, They absorbed the OR workers who 
trickled out of the military and encouraged aca- 
demic institutions to educate additional men for 
work in this field. 

Within a decade there were at least as many 
OR workers in academic, governmental, and in- 
dustrial organizations as there were in the military. 
More than half of the largest companies in the 
United States have used or are using OR, and 
there are now about 4,000 OR workers in the 
country. A national society, the Operations Re- 
search Society of America, was formed in 1953. 
Other nations followed, and in 1957 the Interna- 
tional Federation of Operational Research Societies 
was formed. Books and journals on the subject 
began to appear in a wide variety of languages. 
Graduate courses and curricula in OR began to 
proliferate in the United States and elsewhere. 

In short, after vigorous growth in the military, 
OR entered its second decade with continued growth 
in the military and an even more rapid growth 
in industrial, academic, and governmental organ- 
izations. 

Essential characteristics of OR. The essential 
characteristics of OR are its systems (or executive ) 
orientation, its use of interdisciplinary teams, and 
its methodology. 

Systems approach to problems. The systems 
approach to problems is based on the observation 
that in organized systems the behavior of any part 
ultimately has some effect on the performance of 
every other part. Not all these effects are significant 
or even capable of being detected. Therefore the 
essence of this orientation lies in the systematic 
search for significant interactions when evaluating 
actions or policies in any part of the organization. 
Use of such knowledge permits evaluation of ac- 
tions and policies in terms of the organization as a 
whole, that is, in terms of their over-all effect. 

This way of approaching organizational problems 
is diametrically opposed to one based on “cutting a 
problem down to size.” OR workers almost always 
enlarge the scope of a problem that is given to 
them by taking into account interactions that were 
not incorporated in the initial formulation of the 
problem. New research methods had to be devel- 
oped to deal with these enlarged and more compli- 
cated problems. These are discussed below. 

As an illustration of the systems approach to 
organizational problems, consider the case of a 
company which has 5 plants that convert a natural 
material into a raw material and 15 finishing 
plants that use this raw material to manufacture 
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the products sold by the company. The finishing 
plants are widely dispersed and have different 
capacities for manufacturing a wide range of fin- 
ished products. No single finishing plant can manu- 
facture all the products in the line, but any one 
product may be produced in more than one plant. 

Many millions of dollars are spent each year in 
shipping the output of the first group of plants to 
the second group. The problem that management 
presented to an OR group, therefore, was how to 
allocate the output of the raw-material plants to 
the finishing plants so as to minimize total between- 
plant transportation costs. So stated, this is a well- 
defined, self-contained problem for which a straight- 
forward solution can be obtained by use of one of 
the techniques of OR, linear programming [see 
PROGRAMMING]. 

In the initial phases of their work, the OR work- 
ers observed that whereas all the raw-material 
plants were operating at capacity, none of the fin- 
ishing plants were. They inquired whether the 
unit-production costs at the finishing plants varied 
with the percentage of capacity in use. They found 
that this was the case and also that the costs 
varied in a different way in each plant. As a result 
of this inquiry the original problem was reformu- 
lated to include not only transportation costs, but 
also the increased costs of production resulting 
from shipping to a finishing plant less material 
than it required for capacity operation. In solving 
this enlarged problem it was found that increased 
costs of production outweighed transportation costs 
and that a solution to the original problem (as for- 
mulated by management) would have resulted in 
an increase in production costs that would have 
more than offset the saving in transportation costs. 

The OR workers then asked whether the in- 
creased costs of production that resulted from un- 
used capacity depended on how production was 
planned, and they discovered that it did. Conse- 
quently, another related study was initiated in an 
effort to determine how to plan production at each 
finishing plant so as to minimize the increase in 
unit-production costs that resulted from unused 
capacity. In the course of this study of production 
planning it also became apparent that production 
costs were dependent on what was held where in 
semifinished inventory. Therefore, another study 
was begun to determine at what processing stage 
semifinished inventories should be held and what 
they should contain, Eventually the cost of shipping 
finished products to customers also had to be 
considered. 

In the sequence of studies briefly described, it 


was not necessary to wait until all were completed 
before the results of the first could be applied. 
Solutions to each part of the total problem were 
applied immediately because precautions had been 
taken not to harm other operations. With each suc- 
cessive finding previous solutions were suitably 
adjusted. Eventually some change was made in 
every aspect of the organization’s activities, but 
each with an eye on its over-all effect. This is the 
essence of the systems approach to organizational 
problems. 

The interdisciplinary team. Although division 
of the domain of scientific knowledge into specific 
disciplines is a relatively recent phenomenon, we 
are now so accustomed to classifying scientific 
knowledge in a way that corresponds either to the 
departmental structure of universities or to the pro- 
fessional organization of scientists that we often 
act as though nature were structured in the same 
way. Yet we seldom find such things as pure physi- 
cal problems, pure psychological problems, pure 
economic problems, and so on, There are only 
problems; the disciplines of science simply repre- 
sent different ways of looking at them. Nearly 
every problem may be looked at through the eyes 
of every discipline, but, of course, it is not always 
fruitful to do so. 

If we want to explain an automobile’s being 
struck by a locomotive at a grade crossing, for ex- 
ample, we could do so either in terms of the laws 
of motion, or the engineering failure of warning 
devices, or the state of physical or mental health 
of the driver, or the social use of automobiles as an 
instrument of suicide, and so on. The way in which 
we look at the event depends on our purposes in 
doing so, A highway engineer and a driving in- 
structor would look at it quite differently. 

Though experience indicates a fruitful way of 
looking at most familiar problems, we tend to deal 
with unfamiliar and complicated situations in the 
way that is most familiar to us. It is not surprising, 
therefore, that given the problem, for example, of 
increasing the productivity of a manufacturing 
facility, a personnel psychologist will try to select 
better workers or improve the training that workers 
are given. A mechanical engineer will try to me 
prove the machines. An industrial engineer will 
try to improve the plant layout, simplify the oper- 
ations performed by the workers, or offer them 
more attractive incentives. The systems and pro- 
cedures analyst will try to improve the flow of 
information into and through the plant, and so on. 
All may produce improvements, but which is best? 
For complicated problems we seldom can know in 


advance. Hence it is desirable to consider and eval- 
uate as wide a range of approaches to the problem 
as possible, OR has greatly enlarged our capacity 
to deal with all the complexities of and the ap- 
proaches to a given problem and has therefore 
expanded our opportunities to benefit from the use 
of interdisciplinary teams in solving problems. 
Since more than a hundred scientific disciplines, 
pure and applied, have been identified, it is clearly 
not possible to incorporate each in most research 
projects. But in OR as many diverse disciplines are 
used on a team as possible, and the team’s work 
is subjected to critical review by as many of the 
disciplines not represented on the team as possible. 
Methodology. Experimentation lies at the heart 
of scientific method, but it is obvious that the kind 
of organized man—machine system with which in- 
dustrial, military, and governmental managers are 
concerned can never be brought into the laboratory, 
and only infrequently can such systems be manip- 
ulated enough in their natural environment to 
experiment on them there. Consequently, the OR 
worker finds himself in much the same position as 
the astronomer, and he takes a way out of his diffi- 
culty much like that taken by the astronomer. If 
he cannot manipulate the system itself, he builds 
a representation of the system, a model of it, that 
he can manipulate. In OR such models are abstract 
(symbolic) representations that may be very com- 
plicated from a mathematical point of view. From 
a logical point of view, however, they are quite 
simple. In general they take the form of an equa- 
tion in which the performance of the system, P, is 
expressed as a function, f, of a set of controlled 
variables, C, and a set of uncontrolled variables, U: 


P = f(C,U). 


The controlled variables represent the aspects 
of the system that management can manipulate, 
for example, production quantities, prices, range of 
product line, and so on. Such variables are often 
called decision variables since managerial decision 
making may be thought of as assigning values to 
these variables. The uncontrolled variables repre- 
Sent aspects of the system and its environment 
et Significantly affect the system’s performance 
an, not under the control of management, for 
te as e, product demand, competitors’ prices, cost 

material, and location of customers. 
mee re of performance of the system may 
he ste cult to construct since it must reflect 
fens ive importance of each relevant objective 
ees eee This measure is sometimes 
e criterion or objective function since it 
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provides the basis for selecting the “best” or “better” 
courses of action. 

Limitations or restrictions may be imposed on 
the possible values of the controlled variables. For 
example, in preparing a budget a limitation is nor- 
mally placed on the total amount that may be 
allocated to different departments, or there may be 
legal constraints on the decision-making activities 
of managers. Such restrictions can usually be ex- 
pressed mathematically as equations or inequalities 
and can be incorporated in the model. 

Once the decision maker's choices and the sys- 
tem involved have been represented by a mathe- 
matical model, the researcher must find a set of 
values of the controlled variables that yields the 
best (or as close as possible to the best) perform- 
ance of the system. These “optimizing” values may 
be found either by experimenting on the model 
(ie., by simulation) or by mathematical analysis. 
In either case the result is a set of equations, one 
for each controlled variable, giving the value of 
that controlled variable relative to a particular set 
of values of the uncontrolled variables and other 
controlled variables that yields the best perform- 
ance of the system as a whole [see SIMULATION]. 

If the problem is a recurrent one, then the values 
of the uncontrolled variables (for example, de- 
mand) may change from one decision-making 
period to another. In such cases a procedure must 
also be provided for determining when values of 
the uncontrolled variables have undergone signifi- 
cant change and for adjusting the solution appro- 
priately. Such a procedure is called a solution-control 
system. 

The output of an OR study, then, is usually a 
set of rules for determining the optimal values of 
the controlled variables together with a procedure 
for continuously checking the values of the uncon- 
trolled variables. It must be borne in mind, how- 
ever, that a single, unified, and comprehensive OR 
study is seldom possible in an organization of any 
appreciable size. Rather, what usually occurs is a 
sequence of interrelated studies, each of which is 
designed to be adjustable to the results of the 
others. 

Ten years of constructing and working with 
models of managerial problems in industry have 
shown that, despite the fact that no two problems 
are ever exactly alike in content, most problems 
fall into one, or a combination, of a small number 
of basic types. These problem-types have now been 
studied extensively so that today we have consider- 
able knowledge about how to construct and solve 
models that are relevant to them. Adequate defini- 
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tions of these problem-types require more space 
than is available here, but the following brief 
characterizations indicate their nature. 

Inventory problem—to determine the amount of 
a resource to be acquired or the frequency of acqui- 
sition when there is a penalty for having either too 
much or too little available. 

Allocation problem—to determine the allocation 
of resources to a number of jobs where available 
resources do not permit each job to be done in the 
best possible way, so as to do all (or as many as 
possible) of the jobs in such a way as to achieve 
the best over-all performance, given criteria for 
measuring performance. 

Queuing problem—to determine the amount of 
service facilities required or how to schedule arrival 
of tasks at service facilities so that losses associ- 
ated with idle facilities, waiting, and turned-away 
tasks are minimized. 

Sequencing problem—to determine the order in 
which a set of tasks should be performed in a 
multistage facility so as to minimize costs associ- 
ated with the performance of the tasks and delays 
in completing them. 

Routing problem—to determine which path or 
route through a network of points or locations is 
shortest (or longest), has maximum (or minimum) 
capacity, or is least (or most) costly to traverse 
subject to certain limitations on the paths or routes 
that are permissible. 

Replacement problem—to determine when to re- 
place instruments, tools, or facilities so that acqui- 
sition, maintenance, and operating and failure 
costs are minimized. 

Competition problem—to determine the rule to 
be followed by a decision maker that yields the best 
results when the outcome of his decision depends 
in part on decisions made by others. 

Search problem—to determine the amount of 
resources to employ and how to allocate them in 
seeking information to be used for a particular 
purpose so as to minimize the costs associated with 
the search and with the errors that can result from 
use of incorrect information. 

The future of OR. OR has been primarily con- 
cerned with the executive's decision-making or con- 
trol process. There are, of course, other approaches 
to improving the performance of organizations, for 
example, selecting better personnel, providing bet- 
ter personnel training, better motivating personnel, 
accelerating their operations through work study, 
changing equipment and materials, modifying com- 
munications, changing organizational structure. 
This multiplicity of available approaches presents 
the executive with the additional problem of select- 


ing which approaches to pursue. He seldom has an 
objective basis for doing so. Clearly it would be 
desirable to develop an integrated and comprehen- 
sive approach to organizations, one that rationally 
selects from or combines different points of view. 
OR and other systems-oriented interdisciplinary 
research are taking steps to develop such an over- 
all approach to organizational problems. This is 
leading to mathematical descriptions of organiza- 
tional structures and communications systems, 
thus providing the ultimate possibility of integrat- 
ing studies of organizational structure, communi- 
cation, and control. 

Precise solutions of some limited problems of 
organizational structure have already been found. 
For example, given an organization’s over-all ob- 
jective and a description of its task and environ- 
ment, it is possible to determine the number and 
types of units into which the organization should 
be divided and the objectives to be assigned to these 
units so as to minimize inefficiency arising from 
the organization’s structure. This is a problem in 
structural design. Or, given an organization that 
has an inefficient structure, it is possible to deter- 
mine the types of decentralized control to be ap- 
plied to decentralized decision making so as to 
minimize inefficiency. This is a problem in struc- 
tural control. 

Such developments are leading to an integrated 
theory of, and generalized methodology for, re- 
search on organized systems. Since all of these 
systems are, in some sense, social systems, the 
participation of the social scientist in these inter- 
disciplinary efforts is essential. 


RUSSELL L. ACKOFF 
[See also SYSTEMS ANALYSIS.) 
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OPPENHEIMER, FRANZ 


Franz Oppenheimer (1864-1943) was born in 
a suburb of Berlin, the son of a rabbi. His first 
career was in medicine which he studied at the 
universities of Freiburg and Berlin; he received his 
M.D, at Berlin in 1885. After he had practiced 
medicine for a decade, his interests changed. He 
became a student of political and economic affairs 
and earned his PH.D. at the University of Kiel in 
1908. In 1909 he became an unsalaried lecturer 
at the University of Berlin, and in 1919 he moved 
as full professor of economics and sociology to the 
University of Frankfurt. Ten years later he retired 
for reasons of health and moved to a rural settle- 
ment near Berlin that he had helped to establish 
before World War 1. Like so many of his academic 
colleagues, Oppenheimer left Germany in 1933; he 
subsequently taught as a visiting professor in 
France, Palestine, and the United States. He died 
in Los Angeles. 

Oppenheimer conceived of sociology as a general 
comparative science of collective life, past and 
present. Sociology seeks to establish laws that ex- 
plain change. He was essentially interested in social 
dynamics, and he used theoretical models like 
comparative statics,” “kinetic processes,” and 
Structural changes” in order to explain change. 
He related his sociological theories to the psychol- 
ogy of human drives expounded by William Mc- 
Dougall, and he was influenced by such economic 
theories as Hermann Gossen’s marginal utility the- 
ory and Theodor von der Goltz’s theorem that the 
tural exodus corresponds to the proportion of agri- 
cultural land in large estates. These economic the- 
ee coed Oppenheimer’s central attack on 
a and monopoly” (Bodensperre), which blocks 
` Ss to free land as workable property. The aboli- 

on of the land monopoly seemed to him indis- 
Pae for a “normal” and therewith “just” 
te There are themes in Oppenheimer's system 
tae from the federalism of Konstantin 
hear an Proudhon, and the anthropological 

ties of Schmidt and Koppers are also elements 

of the system, 
: 4 dea saw prehistoric tribal conditions as 
Saitfetes hae if not idyllic, phase of human 
i - +ne great divide between tribal and civ- 
Society occurred when nomadic tribes of 


OPPENHEIMER, FRANZ 295 


camel and horse herdsmen and Nordic maritime 
nomads (Vikings) set on their course of conquest, 
subjugating sedentary populations to the east and 
west. Their conquests led to the land monopoly, 
tribute payment, and exploitation. All sorts of other 
monopolies emerged after the land monopoly. 

Successive forms of the state also emerged. Ini- 
tially, conquerors and conquered were integrated 
into a larger society held together by a new “frame 
group,” the “primitive state,” with its characteristic 
legal system; social stratification by class and status 
implemented this process of integration. The primi- 
tive state was succeeded by the Mediterranean 
“seacoast state” of antiquity. Agrarian capitalism 
combined with slavery ultimately led to the down- 
fall of ancient Mediterranean civilization. Field 
slavery was impossible in the northern alpine 
climate, and the “secondary herdsmen” conquerors 
of the Teutonic tribes consequently organized a 
feudal society. Further stages in the development 
of the state were the estate society, early modern 
absolutism, and the modern constitutional state 
that emerged in the revolutions of 1649 (in Eng- 
land), 1789 (in France), 1848 (in several Euro- 
pean countries), and 1905 (in Russia). The legal 
division of societies into estates was everywhere 
abolished in these revolutions, but the land monop- 
oly remained. 

Oppenheimer was proud to acknowledge his in- 
debtedness to Ludwig Gumplowicz, and he traced 
back the theory of the exogenous origin of the 
state via Lorenz von Stein to Saint-Simon, Ger- 
rard Winstanley, and Ibn Khaldūn. As an agrarian 
socialist, Oppenheimer opposed dogmatic Marxism, 
although he accepted Marx's social determinism 
with reservations and learned from his analyses 


of class and ideology. 
H. H. GERTH 


[For the historical context of Oppenheimer’s work, see 
the biographies of GOSSEN; GuMpLowicz; IBN 
Kuatptn; McDoucatt; PROUDHON; SAINT-SIMON; 


SCHMIDT; STEIN.] 
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ORESME, NICOLE 


Nicole Oresme (c. 1325-1382) was a man of 
broad interests, among which were both moral 
philosophy and the natural sciences, notably phys- 
ics and astronomy. Born in a village near Caen, 
Oresme studied at the University of Paris and took 
his degree in theology. He was a bursar at the 
collége at Navarre from 1348 until 1356, when, 
having received his master’s degree, he was ap- 
pointed grand master of his college. In 1361 he 


received a canonry at Rouen and soon afterward 
another at the Sainte-Chapelle in Paris. In 1364 
he became dean of the cathedral chapter of Rouen, 
but he was dispensed from taking up residence 
there at the special request of King Charles v. 
Oresme’s career reached its zenith in 1378 when 
he was elevated to the episcopal see of Lisieux, 
where he died four years later. 

Although a scholar rather than a man of action, 
Oresme enjoyed the confidence of Charles v, king 
of France, who reigned from 1364-1380, and he 
was for many years a member of the council. In 
this capacity Oresme was able to influence policy, 
especially monetary policy, in accordance with his 
own ideas. At the behest of his royal master, he 
translated Aristotle’s Politics, Nicomachean Ethics, 
and Economics from the Latin into French. (The 
Economics, of course, is not a treatise on econom- 
ics in the modern sense but a work dealing with 
household management.) To the text of all these 
translations Oresme, in medieval fashion, added 
his own glosses. 

At the king’s bidding, Oresme also translated 
into French De caelo et mundo (“Concerning 
Heaven and Earth”), an exposition of Aristotle’s 
cosmological system. He challenged this system by 
postulating the theory that the earth rotates in the 
center of a motionless universe. Although far from 
anticipating the Copernican system, such a theory 
at least demonstrates an inclination toward inde- 
pendent inquiry that was hardly common in the 
fourteenth century. Even more remarkable are 
Oresme’s views on the atomistic composition of 
matter. 

Oresme is remembered, however, less for his 
scientific views or his translations from Aristotle 
than for his Latin treatise on money, Tractatus de 
origine, natura jure, et mutationibus monetarum 
(c. 1373b, translated into English as De moneta in 
1956). The author himself prepared a French ver- 
sion under the title Traictie de la premiére inven- 
tion des monnoies. Although the Latin version was 
republished several times in the course of the six- 
teenth, seventeenth, and eighteenth centuries, 
Oresme’s treatise attracted little attention until it 
was “rediscovered” in 1862 by Wilhelm Roscher, 
who hailed it as a work of great originality contain- 
ing the first formulation of sound monetary prin- 
ciples. Such claims are rather extravagant, since 
Oresme anticipated neither the quantity theory of 
money nor even Gresham's law, although he did 
note that debasement will lead to the disappearance 
of the better coins and leave the realm void of gold 
and silver (c. 1373b, chapter 20). 

The Tractatus, indeed, is more concerned with 


monetary policy than with monetary theory. Ores- 
me’s approach to economics is purely scholastic, 
that is, ethica! and legal. He was an Aristotelian 
who, in the introductory and theoretical chapters 
of his treatise, carried his analysis little further 
than the Greek philosopher did, and confined him- 
self to some trite remarks on the inconveniences 
of barter and the functions of money as a standard 
of value and as a medium of exchange. 

Oresme’s originality lies mainly in his elabora- 
tion of policies that permit money to perform its 
functions adequately. Money should be stable, and 
Oresme, therefore, vigorously opposed debasement 
of the currency and the use of minting privileges 
as a source of revenue. Debasing the currency is, 
in his opinion, cheating and committing a crime 
worse than usury (c. 1373b, chapter 17). Money, 
according to Oresme, was instituted for the benefit 
of the common weal rather than of the prince, who 
should not alter the standard of the currency ex- 
cept as a last resort and with the people’s consent 
(consensus populi). An emergency warranting this 
step would be, for example, the defense of the 
realm against a foreign foe, or a change in the 
proportionate value of gold and silver that would 
necessitate a corresponding adjustment of the 
coinage. Progressive debasement, Oresme pointed 
out, benefits only a small minority of money dealers 
and bankers, whereas it harms the community at 
large by disturbing trade, undermining the sanctity 
of contracts, upsetting the existing social order, 
and bringing ruin to the recipients of rents, pen- 
Sions, and other fixed incomes. The policy advo- 
cated by Oresme was in fact adopted by Charles v, 
who maintained the same monetary standard from 
1365 until the end of his reign. 

Oresme did not favor free coinage. He recom- 
mended that the cost of minting be borne by the 
Hoe: that is, not by the prince or the states, 
ivan z by those who deliver bullion to the mint. 
ea seigniorage charges ought not to be 
a s would encourage inflation—but just 

cient to cover the actual costs of coinage. 

ieee Oresme's treatise represents the sum- 
‘ike Scholastic achievement on the subject of 
reas ET unsatisfactory in many respects and 
Ere ie Anam on later writers on the same 
rites Nee e exception of Gabriel Biel (c. 1483), 
tenets A is brief, mediocre, and unoriginal. 
canal concern with monetary problems is 
doctors of a rather than scholastic. The scholastic 
Progress ae sixteenth century did make some 
ahate of accepted a crude quantity theory as 
subject Piece but they did not delve into the 

any thoroughness. The basic defect 
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of the scholastic school of economics stems from its 
normative approach to economic and monetary 
problems, which warps its analysis in this field, as 
elsewhere. 

RAYMOND DE ROOVER 


[For the historical context of Oresme’s work, see Eco- 
NOMIC THOUGHT, article on ANCIENT AND MEDIEVAL 
THOUGHT; and the biography of Aquinas.] 
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The articles under this heading review the field 
of organizational sociology. Directly related are the 
entries ADMINISTRATION; BUREAUCRACY. Discus- 
sions of particular types of organizations may be 
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ORGANIZATIONAL INTELLIGENCE 
METHODS OF RESEARCH 


I 
THEORIES OF ORGANIZATIONS 


An organization comes into existence when ex- 
plicit procedures are established to coordinate the 
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activities of a group in the interest of achieving 
specified objectives. The collective efforts of men 
may become formally organized either because all 
of them have some common interests or because a 
subgroup has furnished inducements to the rest to 
work in behalf of its interest. Factory workers or- 
ganize themselves into unions to bargain collec- 
tively with management, and management has 
organized the workers’ tasks for the purpose of 
producing goods marketable for a profit. Unions 
and factories exemplify formal organizations, as 
do government bureaus and political parties, armies 
and hospitals. 

There are two basic principles that govern social 
life, and organizations manifest one of these. So- 
cial structures may emerge as the aggregate result 
of the diverse actions of individuals, each pursu- 
ing his own ends, or they may reflect the joint 
endeavors of individuals pursuing commonly ac- 
cepted ends. Thus, as individuals and groups in a 
community compete, enter into exchange relations, 
and use their resources to exercise power over 
others, an economic system and a class structure 
develop, which reveal organized patterns of social 
conduct, although nobody has explicitly organized 
the endeavors of individuals. The government of 
a society and a football team, on the other hand, 
are social structures deliberately established to 
achieve certain objectives, and the regularities ob- 
servable in them reflect deliberate design. The dis- 
tinction is essentially the one made by William 
Graham Sumner between “crescive” and “enacted” 
institutions. Social systems produced by formally 
enacted procedures, rather than merely emergent 
forces, are organizations. The distinction is an 
analytical one, since crescive and enacted forces 
typically interact in their effects on social systems. 
Nevertheless, the distinction finds concrete expres- 
sion in the difference between organizations and 
other collectivities. 

Whenever groups of men associate with one an- 
other, social organization develops among them, 
but not every collectivity has a formal organization. 
The defining criterion of a formal organization— 
or an organization, for short—is the existence of 
procedures for mobilizing and coordinating the 
efforts of various, usually specialized, subgroups 
in the pursuit of joint objectives. If all relations 
among the members of organizations and all their 
activities were completely predetermined by formal 
procedures, however, organizations would evidently 
not pose meaningful problems for scientific in- 
quiry, because everything about them could be 
ascertained by simply examining the official blue- 
prints and procedure manuals. Actually, social in- 


teraction and activities in organizations never cor- 
respond perfectly to official prescriptions, if only 
because not all prescriptions are compatible, and 
these departures from the formal blueprint raise 
problems for empirical study, Paradoxically, there- 
fore, although the defining characteristic of an 
organization is that a collectivity is formally organ- 
ized, what makes it of scientific interest is that the 
developing social structure inevitably does not co- 
incide completely with the pre-established forms. 


Weber's theory 


Max Webers theory of bureaucracy (1922a; 
1922b) addresses itself to both the problem of the 
changing social organization of modern society and 
that of the typical features of the formal organi- 
zations that pervade it. On the one hand, Weber 
was concerned with the increasing bureaucratiza- 
tion of ever wider aspects of contemporary social 
life. On the other hand, he presented the classical 
analysis of the typical social structure of the com- 
plex, large organization. Accordingly, his writings 
have given rise to two distinct traditions in the 
social sciences—historical studies of bureaucrati- 
zation in societies and empirical research on com- 
plex organizations. 

Authority. Weber ({1922b] 1957, pp. 124- 
132) emphasized that the foundation of a legiti- 
mate social order is a common value orientation 
that effects social control and compliance with au- 
thoritative commands in a society. The authority 
structure, in other words, is the core of the social 
organization, and authority structures can be dif- 
ferentiated in terms of the value systems that legit- 
imate them. On the basis of this criterion Weber 
distinguished three main types of authority. First, 
traditional authority is legitimated by the belief 
in the sanctity of tradition, according to which a 
given person or group, usually defined by heredity, 
is preordained to rule over the rest, as exemplified 
by the doctrine of the divine right of kings. Second, 
charismatic authority rests on a value orientation 
that considers the pronouncements of the leader 
of a social movement to be inspired by super 
natural powers, and this conviction prompts the 
converts to his cause willingly to obey the leader's 
commands. Legal authority, finally, is legitimated 
by a formalistic belief in the supremacy of a body 
of legal norms, whatever their specific content; 
these impersonal principles include the require- 
ment to follow directives originating from desig- 
nated official positions, regardless of the persons 
who occupy these positions. Legal authority is char 
acteristic of bureaucratic organizations and, gen- 
erally, of modern societies. Weber ([1922a] 1946, 


pp. 204-216) traced the historical conditions that 
promoted bureaucratization, such as a money econ- 
omy, the absence of slavery, the large size of a 
collectivity, the complexity of administrative tasks, 
and the growth of capitalism, and his historical 
analysis convinced him of the inevitable trend 
toward increasing bureaucratization in the modern 
world. 

Bureaucracy. The importance bureaucracy as- 
sumes in modern life led Weber to give its analysis 
a central place in his theory of society, Although 
he examined primarily government bureaus, his 
principles apply to complex formal organizations 
in general. The basic question he asked was how 
collective endeavors must be organized to ration- 
alize complex responsibilities that require the joint 
efforts of many men. In short, what are the prin- 
ciples of rational administration on a large scale? 
It is not entirely clear whether his answer to this 
question was intended as a normative theory, indi- 
cating what a large organization should be like to 
maximize efficiency, or as a descriptive theory, out- 
lining what the prototype of existing complex or- 
ganizations actually is. In any case, his theory 
specifies the typical characteristics of bureaucra- 
cies and analyzes the way in which these charac- 
teristics depend on each other. 

Large collectivities confronted by complex ad- 
ministrative tasks are most likely to become bu- 
Teaucratically organized. Such formal organization 
involves an extensive division of labor, with spe- 
Cialized responsibilities being assigned to trained 
experts as their circumscribed professional duties. 
A separate administrative apparatus tends to de- 
velop, whose members are responsible for main- 
taining the organization itself, including notably 
liz lines of communication, rather than for con- 
tributing directly to attaining its objectives. An- 
Other distinctive attribute of bureaucracy is that 
official positions are organized into a hierarchy of 
AEA the scope of which is precisely defined 
v TAi ies rules. Operations too are governed 
a eem system of rules and regulations. 
ie Sitter: etachment is expected to prevail in 

he ee of duties and in official relations. 
iove ery of bureaucracies are full-time em- 
Tene ave stable careers with some advance- 
solitieg ‘Nia leans since explicit personnel 
aca S bei from arbitrary dismissal and 
merit ‘and AS ased on objective criteria, such as 
eane R These are the major traits of 
example E although Weber noted others, for 

» the proliferation of written documents 


and A ý 
ats a ease with which personnel can be substi- 
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Functional interdependence. Weber presented 
an implicit functional analysis of the interdepend- 
ence of the characteristics of bureaucracy, with ra- 
tional efficient administration as the criterion of 
function. Effective accomplishment of complex ad- 
ministrative tasks on a large scale requires that 
they be subdivided into specialized responsibilities, 
which can be readily handled by individuals, and 
that professionally qualified experts be appointed 
to discharge these responsibilities. This pronounced 
division of labor creates serious problems of co- 
ordination, particularly in a large organization. A 
special administrative staff is needed to maintain 
channels of communication and coordination, and 
a strict hierarchy of authority serves to effect the 
coordination of diverse tasks in the pursuit of or- 
ganizational objectives by enabling superiors on 
successive levels to guide, directly or indirectly, 
the performance of increasingly wider circles of 
subordinates. But close supervision of all decisions 
is inefficient and produces strains. The system of 
official rules is designed to standardize operations 
and restrict the need for direct supervisory inter- 
vention largely to extraordinary cases. Professional 
training and official rules notwithstanding, how- 
ever, strong emotions and personal bias are likely 
to interfere with the ability to make rational de- 
cisions. The emphasis on impersonal detachment 
has the function of precluding the intrusion of 
such irrational factors into official decisions. Lest 
the impersonal discipline in the hierarchical bu- 
reaucracy alienate its members, secure careers 
lessen this burden and promote loyalty to the or- 
ganization. 

In brief, the problems created by one condition 
in the organization stimulate the development of 
another condition to deal with them. A number of 
interdependent processes of this kind give rise to 
the constellation of features characteristic of the 
typical bureaucracy, as conceptualized by Weber. 
He held that these features of an administrative 
organization and, especially, their combination are 
“capable of attaining the highest degree of effi- 
ciency” ({1922b] 1957, p. 337). 


Political and industrial organizations 

While Weber's is the sociological classic on the 
subject, theories of organizations have been ad- 
vanced by scholars in other fields. To be sure, since 
formal organizations are largely a modern phe- 
nomenon and exist only in rudimentary form, if 
at all, in simple societies, anthropologists have 
been little concerned with them. In anthropologi- 
cal studies “organization” refers generally to the 
kinship structure, which is not a formal organiza- 
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tion as here defined, although it is a highly formal- 
ized social structure in these societies. It makes 
sense, however, in studying almost any society, to 
speak of complex political and economic organiza- 
tions; and theories of organizations have been de- 
veloped in these fields. 

The political government is the most encom- 
passing formal organization in any society, and 
political scientists as well as philosophers have 
theorized about the state for more than two thou- 
sand years, But the unique characteristics of or- 
ganized government—notably its encompassing 
scope and its monopoly of force—discourage com- 
parison with other organizations. The case is dif- 
ferent for particular government agencies, on 
which Weber centered attention, and also for po- 
litical parties. 

The “iron law” of Michels. In his famous study 
of socialist parties and unions in Germany, Michels 
(1911) proposed an “iron law of oligarchy,” with 
which he accounted for transmutations in these 
organizations. Political parties and labor unions, 
according to Michels, invariably become bureauc- 
ratized, regardless of how egalitarian their ide- 
ology. The reason is that centralized control with 
the help of an administrative staff is essential for 
success in elections or bargaining. The experience 
leaders gain and the victories they achieve make 
them virtually indispensable. Since a strong or- 
ganization is important for implementing the radi- 
cal program, the leadership becomes increasingly 
concerned with fortifying the organization and pro- 
tecting it against attacks from conservative forces, 
even if doing so requires abandoning the original 
radical ideals for more moderate ones to pacify 
the powerful opposition, In sum, Michels held that 
inevitable organizational processes transform egali- 
tarian parties or unions into centralized bureauc- 
racies dominated by officials and turn their radical 
goals into modest reform programs. 

Conditions in the International Typographical 
Union challenge Michels’ thesis, inasmuch as an 
internal two-party system in this union prevented 
the development of a self-perpetuating bureau- 
cratic oligarchy. Research revealed that a major 
factor that preserved democratic processes was the 
existence of many semiautonomous organizations 
within the union’s boundaries—diverse associa- 
tions and clubs of union members as well as strong 
union locals (Lipset et al. 1956). These “inter- 
mediate associations” protected the democratic 
system by mobilizing membership participation in 
union affairs, providing a training ground for 
leadership, and serving as organizational bases for 
opposition to the incumbent leaders. The conclu- 


sions of this study do not so much negate the the- 
sis that effective unions become bureaucratized as 
refine it by indicating that intermediate organiza- 
tions can stem the trend toward centralization of 
power, which is otherwise inherent in bureaucra- 
tization. 

Political parties and the party system. Du- 
verger’s theory of political parties (1951) analyzes 
how their organization is affected by the political 
system in which they operate. Election by a simple 
majority promotes a two-party system, inasmuch 
as third parties suffer severe handicaps under it, 
while proportional representation encourages the 
developraent of many parties, If there are only two 
parties, each derives advantages from moving to- 
ward the center, because by doing so it can gain 
some support from moderates without losing much 
from extremists, who have nowhere else to go. In 
a multiparty system, by contrast, parties are in- 
clined to adopt extremist programs and, in the case 
of minor parties, even demagogic policies, as each 
seeks to differentiate itself from the others. Du- 
verger’s theory is of general interest as one of the 
few systematic attempts to relate the attributes of 
organizations to the nature of the broader system 
in which they function. 

The theory of administration. Whereas stu- 
dents of organizations have concentrated on their 
internal structure and paid little attention to their 
interrelations, economists have intensively investi- 
gated the relations among firms in the economy 
and inquired little into their internal organization, 
just as most political scientists have been more 
concerned with the larger political system than 
with the internal operations of government agen- 
cies. The specialized fields of business and public 
administration evolved to study these organiza- 
tions, with the primary aim of helping to solve 
practical problems of management. An early ex- 
ample is Taylor's “scientific management” (1911). 
It has been called a “physiological organization 
theory” (March & Simon 1958), because it stresses 
the influence of the capacities of the human Or- 
ganism on physical performance, although it also 
emphasizes pay incentives as well as time-and- 
motion studies. 

The traditional administrative theory, which was 
developed between the two world wars, addresses 
itself more directly to the principles that govern 
complex organizations. Thus, Luther Gulick spec 
fied purpose, process, clients, and place as four 
basic factors to be considered in organizing an 
administrative establishment; M. P. Follett sug- 
gested principles for effecting coordination an a 
large organization; and V. A. Graicunas derived 


from a mathematical model of social relations his 
well-known axiom of limited span of control (see 
Gulick & Urwick 1937). Unlike Weber's general 
theory, these principles are oriented toward the 
practical problems of administration, but like his 
theory, they conceive of an organization as a ra- 
tional instrument for implementing objectives and 
policies. 

Criticisms of this conception of organization 
stimulated subsequent theoretical developments. 
The rational model has been attacked as an ab- 
straction that ignores actual human relations and 
behavior in organizations and, in particular, the 
nonrational elements in human conduct and their 
implications for operations. Moreover, exclusive 
concern with the formally instituted conditions 
in the organization has been criticized for failing 
to take into account the informal patterns that 
emerge and modify social relations and the per- 
formance of tasks. Finally, the implicit assumption 
that organizations are functional systems in equi- 
librium has been assailed as misleading, because 
it tends to neglect dysfunctions, conflict, and 
change. While some of these criticisms were di- 
rected primarily by students of business manage- 
ment against traditional administrative theory and 
others by sociologists against Weber, both trends 
converged in a growing concern with informal 
organization. 


Informal organization 


Earlier students of organizations realized, of 
course, that actual practice did not follow the 
formal blueprint in every detail. Later researchers 
discovered, however, that these departures from 
official procedures are not idiosyncratic but become 
Socially organized. These social patterns, infor- 
mally organized by the participants themselves, 
complement those formally organized for them by 
management. Furthermore, as Chester Barnard 
Pointed out (1938, pp. 115-123), the informal 
organizations that always arise in formal organ- 
izations are essential for operations. 

Pe multargoualy with Barnard, a research team 
rane at the same insight from their study of in- 
TAA ‘cea in an electrical equipment factory. 
Gena erger and Dickson (1939) found that the 
sah al relations in a work group assumed a dis- 
ene structure, with subgroups and status dif- 
ened and that informal norms emerged that 
aes the performance of workers. Specifically, 
seats were expected by their fellows neither to 
rien too fast nor to produce too slowly, and 
See ns from these group norms were penalized 
ridicule, loss of status, and, eventually, ostra- 
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cism. In other words, workers informally organized 
themselves to control output, and an informal 
status structure complemented this organized so- 
cial control. The informal organization, therefore, 
had important implications for operations. 

Research on “human relations in industry,” as 
it came to be called, proliferated widely after this 
pioneering study. For example, the effect of group 
cohesion and of the relation between supervisor 
and subordinates on productivity were intensively 
and repeatedly investigated at the University of 
Michigan. One main conclusion of these studies 
was that close supervision lowers productivity, 
whereas an interest in the welfare of subordinates 
raises it (Kahn & Katz 1953). In this research tra- 
dition, the study of the social relations among the 
members of the organization came to replace any 
concern with the character of the organization it- 
self [see INDUSTRIAL RELATIONS, article on HUMAN 
RELATIONS]. 

Merton’s concept of goal displacement. In con- 
trast to the students of human relations, whose 
findings have implications for Weber's theory, al- 
though they did not directly address themselves to 
it, Merton (1940) explicitly called attention to 
shortcomings of Weber's analysis. He showed that 
bureaucratic procedure, although designed to fur- 
ther administrative efficiency, often has unantici- 
pated consequences that actually impede it. His 
central concept is that of displacement of goals, 
which refers to a notion implicit in Michels. The 
emphasis on strict discipline and adherence to 
rules in bureaucracies, intended to assure consist- 
ency and impartiality, leads officials to think of 
formal procedures not simply as means for accom- 
plishing certain objectives but as ends in them- 
selves. The rigid conformity resulting from this 
displacement of goals prevents adaptation to new 
situations and generally interferes with effective 
operations. Bureaucratic arrangements have dys- 
functions as well as functions for rational adminis- 
tration, since their unanticipated consequences re- 
currently create problems—a phenomenon that 
Weber consistently neglected. 

Case studies. A number of case studies of bu- 
reaucracies have attempted to refine Weber's theory 
by following Merton’s lead and by applying the in- 
sights of industrial research. Thus, a government 
agency in a hostile local environment was found 
to adjust to this situation through informal co- 
optation of opposition elements into its leadership 
structure, but this method of organizational adjust- 
ment gave rise to fundamental revisions in official 
policies (Selznick 1949). Gouldner’s study of an 
industrial organization found that managerial suc- 
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cession promotes bureaucratization (as implied by 
Weber’s theory), because a new manager, unfa- 
miliar with informal practices, is constrained to 
rely on official procedures to implement his direc- 
tives. But such exercise of authority that rests on 
bureaucratic rules and discipline should be dis- 
tinguished from professional authority that rests 
on technical expertness, a distinction Weber failed 
to make explicit (Gouldner 1954). 

Bureaucratization and professionalization are 
alternative means for rationalizing action by creat- 
ing order out of disorder. If bureaucratic routines 
have been established to handle most problems in 
an organization, the professional expert capable of 
coping with the remaining problems that have not 
been routinized is likely to acquire more actual 
power than his formal position prescribes (Crozier 
1963), Impersonal evaluation of performance on 
the basis of statistical criteria in bureaucracies may 
engender anxiety and competition, which impede 
effective operations, but unofficial practices tend 
to develop, such as informal consultation among 
colleagues, that relieve these difficulties and re- 
store operating efficiency (Blau 1955). 

The general conclusion that emerges from these 
studies is that procedures formally instituted for 
specific purposes in organizations recurrently cre- 
ate disturbances in other respects, and the informal 
patterns that typically arise to cope with these dis- 
ruptions often produce a basic reorganization of 
operations. Whereas the focus on informal prac- 
tices and relations permitted some refinement of 
Weber's theory, it also led investigators increas- 
ingly away from the study of the fundamental 
structural features of complex organizations. This 
is most evident in theories of administrative de- 
cision making. 


Organizations as equilibrium systems 


Barnard (1938) conceived of organizations as 
equilibrium systems of coordinated effort in which 
participants make contributions in return for in- 
ducements, and this concept has been elaborated 
by Simon (1947; cf. March & Simon 1958). The 
two major parts of the theory are concerned with 
the “inducement-contribution balance” and with 
the structure of decision making in organizations. 

The problem of motivation. An organization 
must furnish sufficient inducements to motivate 
individuals to participate in and contribute to it. 
The utility of the individual's contributions can be 
defined in terms of the alternatives foregone by 
making them. A direct measure of the utility of 
the inducement-contribution balance is the indi- 
vidual’s satisfaction with his participation. Zero 
satisfaction, however, does not necessarily result 


in the decision to leave the organization, since the 
latter is a function of both the desire to move and 
the perceived ease of moving. Provided certain as- 
sumptions about the utility functions are made, 
independent indirect measures of the inducement- 
contribution utilities are also possible. 

The problem of rationality. Simon considers 
that the primary function of administrative organi- 
zation is to provide a framework for rational deci- 
sion making. It is beyond a human being’s capacity 
to make the complex decisions required for 
implementing a large organization’s goals ration- 
ally, because value as well as factual judgments 
are involved and because no man can weigh the 
innumerable possible alternatives. The administra- 
tive structure is designed to furnish every member 
of the organization with both the factual informa- 
tion and the value premises necessary to enable 
him to make rational decisions among limited al- 
ternatives, No rational choice among various ends 
is possible; only the choice among different means 
for a given end can be rational. The hierarchical 
organization of responsibility defines the ends for 
every official and thereby restricts all decisions to 
choices among means for given ends. The decisions 
on one level in the hierarchy specify the responsi- 
bilities for the next lower level, so that all officials 
have the ends of their tasks set for them by their 
superiors and can rationally choose among the 
appropriate means for reaching these ends. 

Simon’s theory is a rational model of organiza- 
tion, as is Weber’s, and both have been criticized 
for their rationalistic assumptions. Otherwise, how- 
ever, the two theories are diametrical opposites. 
Whereas Weber analyzed the attributes of organ- 
izations and was little concerned with the behavior 
of individual members, Simon and March con- 
centrated on the motivation of individuals in or- 
ganizations and paid little attention to the various 
characteristics of bureaucratic structure. Indeed 
Simon's theory is really not one of organizations 
but a theory of human behavior in the con- 
text of organization. He takes for granted the con- 
ditions in the organization and raises questions 
about human motivation, while the central problem 
of an organizational theory is the social structure 
of organizations rather than the psychology of de- 
cision making. The reaction to Weber and tradi- 
tional administration theory for ignoring actu 
conduct in formal organizations has led all the way 
to an organizational theory that deals with such 
conduct to the exclusion of the very characteristics 
of organizations themselves! 

Parsons’ model of organizations. Parsons ne 
presented a different equilibrium model of orga! 
zations (1960), which treats them as natural soc! 


systems, and which contrasts with both Weber's 
rational model (see Gouldner 1959) and Simon’s 
individual-centered model. Parsons’ general theory 
stipulates that all social systems must solve four 
basic problems: adaptation to the environment, 
goal achievement, integration of subunits into the 
larger system, and latency, that is, maintenance 
of the value patterns over time. Since formal or- 
ganizations are the typical mechanism for imple- 
menting social objectives in the modern world, 
they are part of the goal-achievement system of 
the society. Furthermore, organizations can be 
classified by their functions as belonging in part 
to one of the four systems; economic organizations, 
for example, are partly elements of the adaptation 
system, Finally, as social systems, organizations 
themselves must meet the four generic system 
problems by developing appropriate subsystems. 
Thus, subsystems become differentiated for mo- 
bilizing resources to meet problems of adaptation, 
for organizing “production” to achieve the organi- 
zation’s goals, for winning the loyalty of members 
and coordinating their efforts in order to integrate 
the system, and for relieving tension and main- 
taining consensus on legitimating values. Such 
treatment of organizations as natural systems evi- 
dently minimizes their distinctive traits. 


Prospects of organizational analysis 


Three foci can be distinguished in the study of 
organizational life; they may be called “role an- 
alysis,” “group analysis,” and “organizational an- 
alysis.” In the first case, the focus is on the be- 
havior of individuals in their specific roles as 
members of organizations, as illustrated by studies 
of Career patterns, work satisfaction and produc- 
tivity; attitudes of soldiers toward combat, and ad- 
ministrative decision making. The second focus 
1S on the structure of social relations in work 
Stoups—for example, the significance of group co- 
bso for performance, the differentiation of in- 
te al Status in groups, the exchange of advice 
A ited among colleagues, and generally pat: 
‘Be ue organization. Third, organiza- 
fi nalysis proper focuses upon the system of 

errelated attributes that characterize the or- 
Se itself rather than its component parts, 
ee its size, the division of labor in it, its 
of eae and the degree of centralization 
ie ‘ol. To be sure, these are closely related 
E fi those on one level cannot be investi- 
mines ‘ithout reference to those on the others; but 
Hen, x can systematic inquiry center on all of 
es ee The crucial question is the 
terns į Ocus of the theory, that is, which pat- 

it seeks to explain—role behavior, group 
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structures, or organizational systems—and which 
ones are taken as given or as accepted principles 
in providing the explanation. 

The focus of organizational analysis. Social 
systems constitute “nesting series,” as Parsons has 
noted, with each system typically being encom- 
passed by wider systems and having, in turn, sub- 
systems within its boundaries. Organizations are 
the environment in which work groups develop 
their social structures, and these group structures 
provide the social setting for the role behavior of 
individual members. At the same time, organiza- 
tions function within the context of communities, 
and these exist in the wider framework of societies 
and cultures. While human conduct is the funda- 
mental datum for all studies of social as well as 
psychological systems, the theoretical principles 
abstracted from discrete empirical observations 
have an explicit or implicit focus on one of their 
systems of relationships. 

The three types of analysis differ in respect to 
the system referent of the variations they seek to 
explain and the variables they treat as problemati- 
cal rather than given. Although role analysis is 
often concerned with the significance of different 
bureaucratic conditions or informal relations, it 
considers them as given stimuli and tries to explain 
how they affect behavior, for instance, decision 
making. Group analysis similarly treats the formal 
characteristics of organizations as given conditions 
and investigates their influence on the informal 
relations and group structures that develop. Neither 
of them questions why the observed constellation 
of organizational attributes has come into exist- 
ence, whereas organizational analysis does pre- 
cisely that. Theories of role behavior and group 
structures in the context of formal organizations 
should be distinguished from genuine theories of 
organizations, which explain how differences in 
some organizational characteristics lead to differ- 
ences in others. Only the comparative focus on 
variations among organizations can produce such 
theories. 

A theory of organizations should not only estab- 
lish the prevailing relationships between organiza- 
tional features but also elucidate them in terms of 
the underlying processes that produce them. It is 
not enough to indicate, as Weber did, that bureauc- 
ratization facilitates substitution of personnel or 
that impersonal performance standards and im- 
partial treatment of clients tend to occur together 
jn bureaucracies. It is also important to know what 
processes account for the association between these 
factors. 

The contribution of case studies of bureaucracies 
is that they have intensively investigated these in- 
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tervening processes, which Weber's theory had neg- 
lected. Case studies consequently help to explicate 
the association between characteristics by show- 
ing, for example, that bureaucratic procedures and 
easy substitutability of personnel are related be- 
cause new officials unfamiliar with informal prac- 
tices usually must resort to formal procedures, or 
that impersonal criteria for evaluating perform- 
ance promote impartiality by causing officials to be 
so concerned with achieving specific results that 
they set aside their personal feelings about clients. 
Despite these contributions of case studies to organ- 
izational theory, however, its foundations must be 
supplied by the comparative analysis of many 
organizations. 

Need for comparative research. Systematic 
comparisons of many organizations are necessary 
not only to test the propositions of organization 
theory but also initially to establish the relation- 
ships the theory is designed to explain. The major 
limitation to case studies of organizations is not 
that they cannot demonstrate the validity of theo- 
retical propositions; Weber's analysis could not fur- 
nish such demonstrations either, since he lacked 
the empirical data required for this purpose. Their 
major limitation is rather that research in a single 
organization diverts attention from organizational 
analysis proper, and directs it instead to the analy- 
sis of roles and group processes in the organiza- 
tional context. But for such intensive analysis of 
internal processes to make its potential contribu- 
tion to organizational theory, it should be comple- 
mented by comparative studies that determine the 
concomitant variations of organizational character- 
istics. The beginning of a research trend in this 
direction can be observed, as exemplified by a series 
of studies on the relationship between the size of 
organizations, their complexity, and the relative 
size of their administrative apparatus (summar- 
ized in Blau & Scott 1962, pp. 225-227). The re- 
sults of such comparative research promise to sup- 
ply the empirical foundations needed to refine 
Weber's conceptions and build a systematic theory 
of organization. 

PETER M. BLAU 


[See also ADMINISTRATION, article on ADMINISTRATIVE 
BEHAVIOR; BUREAUCRACY; SOCIAL CONTROL, article 
on ORGANIZATIONAL ASPECTS; and the biographies 
of BARNARD; FOLLETT; MICHELS; SUMNER; WEBER, 
Max.] 
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1 
ORGANIZATIONAL GOALS 


In a modern society, where large organizations 
have acquired unprecedented importance, social 
scientists have increasingly sought to understand 
the nature of organizational goals—what they are, 
what shapes or determines them, what their impact 
is upon the organization and environment, and 
how they change. This concern with the properties 
of goals themselves is to be distinguished from the 
many organizational studies that utilize the con- 
cept of goal only to set the stage for inquiry into 
other matters. Thus, in studies of morale or pro- 
ductivity, from the explorations of Elton Mayo 
and his followers to the more sophisticated work 
of such contemporary social scientists as Rensis 
Likert (1961), we do not find the question of 
organizational goals treated as problematical. For 
example, it is generally taken for granted that the 
only managerial goal with which the researcher 
need concern himself is increased productivity, 
from which it follows that the principal research 
problem is to find out how subordinates can be 
made to conform to or share this goal. In this 
article, however, we shall be concerned with the 
work of authors who have made considered asser- 
tions about the properties of organizational goals. 
7 Three recent traditions have contributed to the 
literature on goals. In the 1940s a number of econ- 
omists questioned the basic assumption of micro- 
economic theory that the goal of the firm was to 
maximize profit or return on investment; the theory 
was criticized as overrational, and it was suggested 
that other goals might be at least as important as 
Profit (see the review in Cyert & March 1963). 
The debate continues; but while some interesting 
Concepts have emerged, they have remained largely 
untested by empirical research. 

Another tradition has stemmed from those soci- 
Ologists who tend to see formal organizations as 
complex institutions with evolving goals and a 
aT, of their own, rather than as rational 
a the designed to pursue a single, rational 
Haka e most influential of these sociologists are 
ii ee Robert K. Merton, and Philip 
ioe all of them have acknowledged the im- 

‘ance of the observations of Robert Michels 
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(1911) on the evolving character of European 
political and trade-union movements. The body of 
theory that originated with these authors is some- 
times referred to as “institutional” sociology. 

The teaching and research of the institutional 
sociologists have provided the stimulus for a great 
many studies of such “people-changing organiza- 
tions” as mental hospitals and prisons, both of 
which, since they are concerned with treatment 
as well as custody, are obvious examples of organi- 
zations pursuing two goals at once. Since the 
implicit aim of many of these studies has been 
to change the goals of the organization, this tradi- 
tion has paid some attention to clarifying and 
operationalizing the concept of goal (see Zald 
1963; Perrow 196la; Cressey 1961; Conference 
Group 1960). 

A variable concept. None of the three tradi- 
tions in the literature on goals has succeeded in 
developing rigorous or even very self-conscious 
conceptual tools. In part this is because almost any 
sequence of behavior can be cut into small or large 
pieces, each of which can be said to be goal- 
directed. What is designated as a goal, as well as 
what is used as evidence for the existence of a 
goal, depends for the most part on the purpose of 
the study being made. 

Since there is only a relative distinction between 
means and ends and since, therefore, any end or 
goal can be seen as a means to another goal, one 
is free to enter the “hierarchy of means and ends” 
(Simon [1947] 1961, pp. 63-66) at any point. 
Moreover, a goal may be said to exist on the basis 
of the statements of officials, questionnaire data 
from employees, observations of behavior, or infer- 
ences about contributions to a larger social system. 
Accordingly, there is a very wide range of organ- 
izational behavior that can be interpreted as goal- 
directed, and the scope of general statements about 
the effect of goals on an organizational structure, 
changes in goals, goal displacement, and so forth, 
will inevitably vary with the main interest of the 
researcher. 

Thus, to say, as Talcott Parsons does (1960, 
p. 17), that an organization is defined as pursuing 
a specific goal is perhaps as true as the much more 
frequent observation that organizations have mul- 
tiple goals. Indeed, much of the dispute about the 
role of profit in economic organizations hinges on 
differing conceptions of goals. For those concerned. 
with macroeconomic theory (the theory of the mar- 
ket place), the assumption that profit is the only 
important goal is quite useful; for those concerned 
with studying individual firms or sectors of the 
economy, it may be quite misleading. When the 
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National Foundation for Infantile Paralysis shifted 
from an exclusive concern with polio to a concern 
for a variety of diseases, it was, from one point of 
view, a case of “goal succession” (Sills 1957, pp. 
253-265), but not from other perspectives, such 
as the view that the goal is research on disease, or 
conserving the organization, or even (following 
Michels) maintaining positions of prestige and 
power for the top executives. In these views, only 
the means had changed. 


Categories of goals 

To provide some order in this subject area and 
to illustrate the work on goals that has been done, 
we shall distinguish six categories of goals, recog- 
nizing that the number could be smaller or larger. 
Three have external referents—society, the public 
in contact with the organization, the investors— 
and will be referred to, respectively, as societal 
goals, output goals, and investor goals. The other 
three have internal referents, that is, to the organ- 
ization and its members. They are system goals 
(survival, growth, etc.), product goals (the de- 
fining characteristics of the product such as quality, 
availability, styling), and the somewhat residual 
category of derived goals (those which make use 
of the power the organization generates in the pur- 
suit of the other goals). 

The relationships between these categories may, 
in some cases, be those of means to ends, but not 
necessarily, To assume a means-end hierarchy is 
to assume that behavior is completely integrated 
and that all behavior is therefore functionally in- 
dispensable, By contrast, we assume, not that all 
things contribute to a single ultimate end, but that 
organizations may pursue a variety of goals, that 
these goals may be incompatible or in competition 
with each other, and that an organization may 
attend to its different goals in sequence rather than 
all at once. 

Goals as societal functions. At the most ab- 
stract level the goal of an organization (or more 
usually, a type of organization) is expressed in 
terms of its function for society. One such scheme, 
advanced by Talcott Parsons and others (1953, pp. 
183-186), posits four functions that must be ful- 
filled in any social system [see SYSTEMS ANALYSIS, 
article on SOCIAL SYSTEMS]. This scheme enables 
one to classify organizations according to their 
characteristic functions, For instance, a steel firm 
is in the “goal-attainment” sector, presumably con- 
cerned with material production, while a correc- 
tional institution is in the “integrative” sector, con- 
cerned with integrating deviants into society. A 
similar classification is that of Etzioni (1961), who 


posits economic, order, and cultural goals for or- 
ganizations. Little analysis of concrete organization 
can bé done at this level; but these typologies have 
the merit of causing us to ask how organizations 
are legitimated by society and how changes in a 
society's values will affect the growth of types of 
organizations. Such questions can lead to analysis 
of broad changes in organizational goals; for in- 
stance, the development of a religious sect into a 
church can be seen as a case of an organization’s 
transition from an “integrative” function to a 
“latency” or “pattern-maintenance” function (Clark 
1948). Similarly, an analysis of the British East 
India Company might focus upon the shift from 
trade and transport to political domination in the 
interests of British foreign policy. Organizations 
may even develop from fulfilling a single, basic 
societal function, such as the production and mar- 
keting of agricultural products, to providing for all 
the functional needs of a social system (Diamond 
1958). 

Goals as output. At a less abstract level it is 
convenient to distinguish organizations according 
to output or general type of product; the referent 
is the public constituency of the organization. 
Shifts in the application of resources are analyzed 
at this level, as when the emphasis in deviance 
control shifts from incarceration to probation or 
preventive work. Peter Blau and Richard Scott 
(1962, pp. 42-44) use an essentially output cate- 
gory of goals when they distinguish organizations 
in terms of the question, Cui bono? (Who bene- 
fits? ). They distinguish four types of beneficiaries: 
membership (for example, the members of volun- 
tary associations); owners (covering most business 
organizations); clients (such as the clients of a 
welfare agency); and society (a category filled 
almost exclusively by governments). This approach 
draws our attention to the central problems of each 
type of organization; thus, membership organiza- 
tions have the problem of maintaining a demo- 
cratic form of administration, and owner organiza- 
tions are chiefly concerned with efficiency. 

A number of studies focus mainly on organiza- 
tional output and can thus be easily related to the 
schema outlined above. James Thompson and 
Frederick Bates (1957; compare Thompson & 
McEwen 1958; Pittsburgh, University of, 1959) ex- 
amine how output goals are affected by the tech- 
nology employed: thus mining organizations are 
tied to a static output, while hospitals have adapt- 
able technologies and can shift or multiply their 
outputs (for example, by branching out into 1 
search or education). However, most case studies 
of organizations tend to range over several cate 


ORGANIZATIONS: Organizational Goals 


gories of goals. In Selznick’s study (1949) of the 
Tennessee Valley Authority (TVA) it appears that 
the output goals of that organization were flood 
control and fertilizer production; with this perspec- 
tive, we can see how the need to maintain these 
outputs influenced the organization’s ability to 
pursue specific “product” goals, such as assistance 
to farmers and the development of recreation 
areas. 

Investor goals. A more specific referent for 
goals is the return to investors. Whether investors 
provide capital, labor, or legitimation, at least some 
organizational efforts must be directed toward this 
source of support. In the case of business and in- 
dustrial firms return on investment is the goal most 
frequently cited, even in periods when capital is 
generated internally, and stockholders therefore 
play a passive role. Profit alone is not a sufficient 
characterization, for it may be a long-run or short- 
run profit, with differing consequences for the or- 
ganization. More importantly, profit may be dis- 
tributed to groups other than the investors, as 
when it is reinvested, or held for reasons of organ- 
izational security. 

In the case of public welfare organizations the 
legislature or authority that provides funds and 
legitimation may simply be content to have the 
organization exist as evidence of some effort on its 
part and take no interest in the organization as 
long as there are no important complaints about 
the way funds are spent or about the adequacy of 
the effort; under such circumstances the organiza- 
tion is free to pursue other goals. The same is often 
true of economic organizations. In private welfare 
organizations investors may demand that the or- 
ganization engage in activities that bring public 
eny or confer prestige upon the donors. Such 
era have manifold consequences for the or- 
noe aa (Perrow 1961b). In Selznick’s TVA study 
dana nd ne investor goals included the promo- 
fies “es ic power, thus placing limits on the 
Gator product goals that could be pursued. For 
ie ch “ed a on the public power issue had to 
Stee ee help and resources to the 
needy sea l-to-do farmers rather than to the 
hia PE i nother study, by Burton Clark (1960), 
TR eee. goals constituted the “admin- 
college's of that seriously weakened a junior 
e. orts to cope with the educational de- 

3 its students. 
mine Turning to internal referents, the 

y key = is concerned with the goals established 
Bian ie (and perhaps investors, if they 
stabilit ve role) regarding the size, growth, 

Y, market share, domain, and so forth, of 
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the organization. These are characteristics of the 
collectivity as a whole, independent of other refer- 
ents, such as output or social function. The com- 
mon assertion that the primary task of an organ- 
ization is to survive refers to this goal category. 
The concept of organizational equilibrium (Bar- 
nard 1938; compare Simon 1947), where inputs 
balance outputs, is also based on a survival goal. 
While many propositions at this level are highly 
tautological and tell us little, this goal category is 
useful. To know what an organization is willing to 
pay for survival, security, or growth, or what it 
considers to be the domain upon which others shall 
not encroach can be crucial for understanding its 
behavior. For instance, some economic organiza- 
tions are generally considered to be highly security 
conscious, and others are risk takers; likewise, 
some business organizations appear to be more 
concerned with growth than with profits, however 
the latter are distributed. William Baumol (1959) 
notes that an increase in sales is a frequent goal 
of business organizations even at the expense of 
profits, while the du Pont corporation is fond of 
announcing that it prefers a high return to stock- 
holders over sales growth. Herrymon Maurer 
(1955, pp. 35-39) mentions that in the early his- 
tory of the United States there was a point beyond 
which some prominent owners would not expand, 
both because a larger business would take too 
much of their time and because their businesses 
already brought in sufficient return. Attitudes 
toward growth and size are examined at length in 
David Granick’s report (1962) on European busi- 
nessmen; goals at this level in Europe, it would 
appear, differ markedly from those in the United 
States. This may be because of differences in the 
way in which elites are selected, as well as the 
predominance in Europe of closely held firms. 

The behavior of voluntary associations can often 
be explained in terms of survival goals. For in- 
stance, the leaders of the Townsend organization, 
a group promoting the interests of older people, 
were apparently willing to go along with a change 
in output goals (from political action to recrea- 
tion) rather than insist upon the ideological purity 
which would have further weakened what was in 
any case a dying organization (Messinger 1955). 
The Woman's Christian Temperance Union did not 
choose to reactivate broader welfare goals when 
their brief prohibition victory turned into a rout; 
thus the organization appears to be dying because 

owth, or even survival, is less important than 
ideological purity (Gusfield 1955; 1957). These 
examples certainly suggest that the system cate- 
gory of goals should figure heavily in studies of 
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organizational behavior; however, although excel- 
lent work has been done in a variety of organiza- 
tional settings, little systematic attention has been 
paid to the topic. 

Goals as product characteristics. Perhaps the 
most fruitful goal category for the study of specific 
organizations is that of product characteristics. 
Here the referent extends from key executives to 
the leaders of such functional groups as sales, pro- 
duction, engineering in economic organizations, or 
the clinical staff, child supervisors, and work staff 
in correctional institutions. Product goals are a 
subcategory of output goals; they are most useful 
for comparisons between similar organizations. 
Such dissimilar organizations as a steel firm and 
a correctional institution, for instance, have output 
goals that admit comparison only in the most gen- 
eral terms; the goals and associated behavior of 
two or more steel firms, however, can be compared 
in terms of actual product characteristics. 

The product goal includes such things as the 
type, quality, quantity, cost, styling, and availa- 
bility of the goods or services produced. Thus, a 
recent study of several juvenile institutions (Street 
et al. 1966) contrasts the consequences for each 
organization of adherence to such different product 
goals as bringing about extensive changes in char- 
acter and personality, inculcating obedience, or 
merely deterring, punishing, or containing inmates. 
Economic organizations have not been systemati- 
cally studied in these terms, but casual evidence 
from a variety of sources suggests that they differ 
widely in their “character” because of different 
commitments to different types of products. This 
category of goals is also heavily emphasized by 
Philip Selznick (1957), who gives examples from 
business and military organization. 


Goal changes 


“Succession of goals,” a phenomenon identified 
by Robert K. Merton and intensively discussed in 
a study by David L. Sills (1957, pp. 253-265), 
generally involves changes in product character- 
istics rather than in categories of output. Evidence 
of this may be found in a study of the Young Men’s 
Christian Association, which distinguishes various 
points at which the organization took up new goals 
while maintaining the same general “output” of 
serving individuals (Zald & Denton 1963). Changes 
in product characteristics are also discussed in an 
analysis of a general hospital by Charles Perrow 
(1963): as control of the organization, in response 
to changes in technology and consumer demand, 
passed from trustees to doctors, and then from 
doctors to administrators, the characteristics of the 
product goals also changed, together with investor 


goals and derived goals. Thus, although the output 
goals (as measured by official statements ) changed 
only gradually, the “operative” goals (measured in 
terms of effective policies) showed several very 
pronounced changes, including a shift from em- 
phasis on serving indigent Jews to one on serving 
middle-class Jews and gentiles, an increase in re- 
search as opposed to teaching, and the develop- 
ment of programs that competed with non-Jewish 
hospitals, whereas previously such programs had 
been avoided. 

Derived goals. In the course of producing 
goods and services and meeting such threats to 
survival as competitive products, dissatisfied in- 
vestors, or changes in demands of the consuming 
public, an organization generates a good deal of 
power that is not directly related to these require- 
ments, It has the power to hire and fire, relocate 
its plant, develop or enfeeble its employees, invest 
in times of imminent recession, donate to educa- 
tional or other community purposes, support ideo- 
logical positions, and so on. Sometimes these 
efforts are minor or trivial and have few apparent 
consequences for the organization or the environ- 
ment. However, they can also represent goals that 
are essential for an understanding of the organiza- 
tion and its impact. We shall call them derived 
goals, since the ability to pursue them is derived 
from the existence and behavior of the organiza- 
tion but is not considered essential to its conduct. 
Thus, a firm may sacrifice dividends, executive 
salaries, sales, stability, and growth because it en- 
gages in a protracted effort to avoid unionization, 
even though similar firms do not. Some firms may 
distribute profits to employees in the form of bene- 
fits far beyond those which similar firms find 
necessary. One firm may hire minority groups, 
while its equally successful competitor does not. 
Indeed, the whole question of the “social goals” of 
business organizations, evident since the indus- 
trial revolution but receiving increasing attention 
today, is primarily a question of derived goals: 
How is the power the organization generates to be 
used or controlled? Concern ranges from the psy- 
chological development of employees to impact on 
foreign policy. For instance, Chris Argyris argues 
that a great service the industrial world can offer 
to our society at large is to develop “fully func; 
tioning human beings who aspire to excellence 
(1962, p. 5). Organizations have the power to en- 
rich the lives of their members in other senses 
too; Michels and many other commentators (see, 
for instance, Lipset 1960, chapter 12) have noted 
the ability of organizations to raise the status o! 
their leaders and offer them perquisites that ate 
hard to abandon. Pursuit of status and financia! 
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rewards, derived from the existence of the enter- 
prise, has led, it is asserted, to an innate conserva- 
tism that causes deflection of the original goals, 
because the organization’s ability to pursue more 
precarious ends, such as political or economic mili- 
tancy, is thereby weakened. 

Studies of the impact of informal social rela- 
tions upon organizations generally provide ex- 
amples of derived goals. Pursuit of derived goals 
may be the occasion for goal succession, so that 
what was once incidental to the aim of the organ- 
ization becomes its product goal. Thus, for mem- 
bers of the Townsend organization the derived goal 
of fellowship supplanted the product goal of po- 
litical action (Messinger 1955). 

Goal displacement. The phenomenon of goal 
displacement, probably the most widely noted char- 
acteristic of organizational behavior (Sills 1957, 
pp. 62-69), is often a case of pursuing derived 
goals, Essentially it is asserted that adherence to 
prescribed means on the part of employees may 
interfere with goal achievement; the means be- 
come ends in themselves. Indeed, many of the ex- 
amples in the literature cited by Sills are not con- 
cerned with goals at all, but result from inflexible 
means that should have been changed to meet new 
Situations, or from inefficient or inappropriate 
means that did not contribute to the goal of the 
organization, or from the application of an unreal- 
istic performance standard on the part of the re- 
Searcher (for which, see Etzioni 1960; Gouldner 
1955). Furthermore, many organizations would 
fall into disorder if employees were free to discard 
rules and regulations and themselves attempt to 
interpret, from their own truncated perspective, 
What the goals of the organization are and how 
they should be met. 

Nevertheless, subgroup or personal goals, which 
ia derived from participation in the organization 
Mee essential for the achievement of goals 
ae an derived goals, can certainly contribute 
a a displacement and thus alter the character 

e organization, As William Starbuck (1965, 
taba points out, in young organizations 
Ribera inne goals rather than to tasks or social 
factions is likely to be high; but with time, satis- 
an o afforded simply by participating in the 
an oe that grow up. Commitment to goals 
sya sain about changes in goals tend to dimin- 
of noe er with concern about the manipulation 

zational resources that displace goals. 


G ni 
oals and interest groups 
ke two levels—product goals and derived 
A Sane an analysis of the organization as 
ition or set of interacting interest groups. 
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rather than as an integrated entity. In fact, most 
discussions of conflict over goals view the organ- 
ization in these terms and utilize the product cate- 
gory or the derived category. One statement of this 
view focuses upon shifting coalitions of groups 
with parochial interests and notes that each may 
have a “file drawer” of goals from which it selects 
particular ones at particular times (Cyert & March 
1963, pp. 26-43). These goals may, of course, be 
attended to sequentially and therefore need not be 
integrated. The same authors (1963, pp. 36-38) 
propose the notion of “organizational slack”—what 
the economist would term “economic rent”—which 
is a surplus of resources that encourages the pur- 
suit of a variety of goals, a pursuit that would be 
curtailed in less prosperous times. 

The development of group interests within the 
organization is generally taken to be a negative 
factor in much of the human-relations literature, 
since the interests are felt to be uncontrolled by 
top management (for example, see the critical re- 
view by Krupp 1961, especially chapter 3). How- 
ever, it has been pointed out that groups such as 
sales, auditing, personnel, or production represent 
latent sources of energy and commitment and are 
expected to put forth their views forcefully and 
protect the integrity of their operation; if they fail 
in this responsibility, the organization loses the 
benefits of their perspective (Selznick 1957), Ac- 
knowledged conflict over product characteristics 
(“Is reliability more important than styling?”) or 
over derived goals (“What is our responsibility to 
the community?”) openly serves to determine “the 
fundamental prize” in organizations—the nature 
of the organizational character ( Selznick 1949, 


p. 181). 


Goal analysis—a continuing problem 


As this discussion of goal categories has sug- 
gested, the sources of goals and the consequences 
of goals, both for organizations and for society, are 
manifold. Broad social changes set the stage for 
technological developments, which in turn de- 
termine, within broad limits, the range of goals 
possible in types of organizations (Perrow 1965). 
Changes in one category of goals can produce 
changes in others, as when derived goals become 
defined as essential, or investor goals require shifts 
in product goals. More specific causes abound, such 
as changes in the source of recruitment of per- 
sonnel (Selznick 1957, pp- 47-111), government 
regulation (Keller 1963), or pressures for forward 
integration (e.g., distributing and marketing func- 
tions added to productive functions, as discussed 
in Chandler 1962). The variety of sources asserted 
to generate or change goals reflects, in part, the 
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variety of goal categories utilized. But it also re- 
flects the essentially dependent nature of goals. 
Organizations are not born with a fixed structure 
or a stable internal guidance system directing them 
to some precise end; they are subject to countless 
internal and external forces, and both their com- 
petencies and liabilities change with a changing 
environment. Nor do these changes occur at a uni- 
form rate; societal and output goals may change 
at a glacial pace, but the others may often fluctu- 
ate with the seasons. 

If goals are essentially dependent phenomena, 
they may be no less dependent than other elements 
of organizations, such as task structure and social 
structure. Technology and the structure of tasks 
necessary to produce changes in the raw material 
may alter goals or be altered by goals. The social 
structure, or systems of relationships that surround 
the work effort, may also influence as well as be 
influenced by goals. This perceived fluidity of or- 
ganizations, coupled with, or abetted by, our crude 
and ad hoc conceptual tools and the relativity of 
means and ends, poses a nearly insuperable prob- 
lem for goal analysis. The serious student of organ- 
izational goals finds the matter so complex and 
categories and concepts so interdependent that there 
is no certainty about what should be labeled a 
goal, where it comes from, how it changes, and 
what impact it has. Nevertheless, the best work 
done in attempting to understand complex organ- 
izations has had to confront the question, To 
what variety of ends is organizational behavior 
patterned and motivated? The future theories of 
organization will inevitably founder unless they do 
the same. 

CHARLES PERROW 


[See also ADMINISTRATION; INDUSTRIAL RELATIONS, arti- 
cle on THE SOCIOLOGY OF WORK; and the biographies 
of Mayo; MicuELs,] 
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EFFECTIVENESS AND PLANNING OF CHANGE 


Concepts of organizational effectiveness typically 
focus on the degree to which the organization ac- 
complishes its objectives. A business firm is effec- 
tive if it makes a profit, a welfare agency if it 
provides professional service of high quality, the 
Department of State if it achieves its substantive 
goals related to foreign policy. This concept has 
Jed scholars like Blau and Scott (1962) to suggest 
that organizational effectiveness be defined in terms 
of “who benefits.” Thus, a business organization is 
effective if it reduces costs (because the owners 
have profit as their objective). Unions are effective 
if they have strong rank-and-file participation (be- 
cause the union is created to serve the needs of 
the membership ). 

The limitations of this view can be seen when 
we realize that presently there are more large busi- 
ness firms than trade unions concerned about 
increasing rank-and-file participation and designing 
work to be more meaningful (which increases 
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costs). Trade unions, on the other hand, are be- 
coming more conscious of cost and efficiency, more 
worried about “marketing” procedures (Argyris 
1964), and less concerned about their internal 
democracy, Recently, officials of the U.S. Depart- 
ment of State, after a searching study, concluded 
that one of the most significant factors contributing 
to organizational ineffectiveness is the subservience 
of the administrative activities to the substantive 
activities. Guest (1962) found that an organization 
which for years had focused primarily on making 
a profit, to the neglect of its human resources, had 
to pay more attention to the effectiveness of its 
internal human system. Argyris (1960) found that 
an organization which for years had overempha- 
sized its “people-centeredness” had to reduce this 
emphasis or face the prospect of destroying itself. 

These illustrations point up the fact that the 
moment one considers the “ongoing” property of 
organizations, one must add two dimensions to 
make the concept of organizational effectiveness 
more valid. They are, first, the organization’s capac- 
ity to maintain its internal system in working order 
so that it can solve problems effectively and, sec- 
ond, the relative cost of transforming the human 
energy inputs into useful organizational outputs. 
Organizational effectiveness, then, is the balanced 
or optimal emphasis upon achieving objectives, 
problem-solving competence, and human energy 
utilization, It may be possible, in the short run, to 
emphasize one of these activities at the expense of 
the others, but eventually all must be optimally 
expressed. 

How should an organization be constructed if it 
is to be effective? Max Weber suggested creating 
a system characterized by specialization of effort, 
a clearly defined authority structure, formal posi- 
tion descriptions, and a sense of impersonality. 
“Scientific management” and early public admin- 
istration theorists, plus military organizational 
specialists, would agree with Weber but would say 
it somewhat differently. They would emphasize 
such factors as work specialization, unity of com- 
mand, unity of direction for any given unit, span 
of control, and employee loyalty. 


The causes of dysfunctional activities 


Toward the end of World War 1 a number of 
social scientists began to study organizations by 
observing the actual behavior of the participants. 
As expected, they did find people highly specialized 
(especially at the lower levels), reporting to one 
supervisor, and organized as members of a unit 
that had one primary function. But they saw more. 


They observed some people who reported to several 
superiors (Shartle 1956); some who bypassed their 
superiors ( Arensberg & McGregor 1942): some who 
enlarged their work much beyond the prescribed 
specialization (Richardson & Walker 1948); and 
some who fought their work and their superiors 
(Dalton et al. 1946), the incentive system (Whyte 
et al. 1955), and managerial controls such as 
budgets (Argyris 1952). In other words, they 
found an informal system in addition to the formal 
one. Indeed, the more they studied the behavior in 
organizations, the more they became impressed 
with the scope and potency of the informal activi- 
ties, especially the dysfunctional ones (Gouldner 
1959). 

What causes these dysfunctional activities? Why 
do people seem to fight the system? McGregor 
(1960) and Likert and his associates (1961) found 
some of the problems to be related to the frequent 
use by management of production-centered, de- 
pendency-producing, submission-oriented leader- 
ship styles. Whyte (1961) agreed substantially with 
this finding, although he conceptualized it as the 
initiation of action downward being much greater 
than the initiation of action upward. 

Later, the thesis was developed that all the major 
components of organization—structure, leadership, 
managerial controls, incentive systems, and tech- 
nology—tend to have a similar impact on the em- 
ployees, that is, to place them in a dependent, 
submissive position where they can use few of their 
central abilities (Argyris 1964). It was predicted 
that this tendency should be more pronounced as 
one goes down the administrative hierarchy and 
as the demands of the job control the individual to 
a greater degree. The opposite should be the case 
as one rises in the hierarchy and as the individual 
has greater control over the job demands made 
upon him. (The existence of large stock options 
for high executives may blunt the first part of this 
proposition, because the executive may be willing 
to become dependent and submissive in order not 
to lose the lucrative rewards available to him at the 
end of the stipulated period.) From this point of 
view, the underlying causal factors of dysfunction- 
ality are the basic managerial principles, or what 
some sociologists call the “organizational impera- 
tives”: work specialization, chain of command, 
unity of direction, etc. The impact of such factors 
as technology, organizational structure, managerial 
controls, and leadership is seen as a manifestation 
of these principles. It is not true, therefore, 4S 
some have suggested, that this approach to organ- 
izations ignores the part played by technology: 


rather, it refuses to consider technology apart from 
management. 

It is also important to point out that this ap- 
proach is not limited to the lower levels of em- 
ployees and thereby to a particular social class 
(Strauss 1963). It has been successfully applied 
to scientists in 11 university, industrial, and gov- 
ernmental research laboratories (Farris 1962), to 
a national sample of nearly two thousand man- 
agers ranging from presidents to first-level super- 
visors (Porter 1962), to a study of a thousand 
managers in five firms (Carnarius 1962), as well 
as to a national sample of lower-level employees 
(Gurin et al. 1960), two extensive reviews of the 
literature (Blauner 1960; Argyris 1957), and a 
depth study of nearly five hundred employees at 
the lower levels (Kornhauser 1962)—to mention 
only a few of the published studies. Moreover, the 
relevance of social class is questioned by the recent 
work of Mann and Williams (1962). They found 
that employees whose social class remained con- 
stant while their work was altered to provide them 
with greater control and use of their abilities 
changed their attitudes about their work in the 
direction of desiring more challenge and more re- 
sponsible work. The frequently cited example of 
Dubin’s results (1956) that employees do not place 
work as their central life interest is not a discon- 
firmation of this point of view; indeed, I would 
Predict such results. It may be that such employee 
apathy is a reaction to the very conditions that are 
described above. 

Nor does this point of view make predictions 
about employee happiness. Its only prediction is 
that as one goes down the organizational hierarchy, 

_ the employees should report less control over their 
Work world. Even Blauner (1964), who found that 
Most employees are satisfied, confirmed the fact 
that decreasing control over work is reported by 
employees as one goes down the hierarchy; this 
finding was also confirmed by Inkeles (1960) and 

aldamus (1961) in studies of employees in other 
countries, 

This viewpoint also predicts that the dysfunc- 

onal attitudes and informal activities that have 
en so thoroughly documented—apathy, absentee- 
ee unionization, decreasing emphasis upon 

SA ia values and increasing emphasis upon mate- 

ae (Erich Fromm’s “market orientation”), 
ing Ree Motivating power of wages, and increas- 
tions ew tend to arise under two condi- 
do i uch reactions will occur when individuals 

want to be primarily dependent, submissive, 

Confined to a position where they can use few 
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of their abilities, but the organization requires them 
to be so; or when individuals do want to be depend- 
ent and submissive, and their organization requires 
that they use more of their abilities and take on 
more responsibility, 

On the other hand, these informal activities 
should not tend to occur when individuals do want 
to have more control and responsibility and to use 
more of their abilities, and the organization per- 
mits it; or when individuals do not want to take 
on responsibilities and do not want to have control 
over their work, and the organizational context does 
not require it. (Apathy, indifference, and aliena- 
tion may be exceptions, in the last case, because 
these qualities represent the employees’ needs.) If 
data can be produced to show that the informal 
activities listed above do occur under the latter 
conditions or do not occur under the former con- 
ditions, then this viewpoint would be in trouble. 

Nor does this view make the assumption that 
all people want to “actualize” themselves, in the 
sense of expressing some higher-order needs. It 
merely states that if people want to actualize their 
higher-order needs and if their job requirements 
make them dependent and submissive and mini- 
mize their control, then certain difficulties will 
occur. However, if self-actualization is taken to 
mean the expression of any need (e.g., autonomy 
or dependence), then all that we are claiming is 
that whenever employees’ needs are frustrated, the 
consequences will be the ones described above. One 
would also predict that the greater the control, the 
responsibility, and the variety of individual experi- 
ences involved in the worker's job, the less the 
frustration, dissatisfaction, and alienation he will 
experience; this proposition is supported, for exam- 
ple, by Blauner’s finding (1964) that the degree of 
alienation and dissatisfaction decreased as the job 
became more challenging and permitted greater 
control. In terms of understanding organizational 
effectiveness, the main point is that most of these 
informal activities (with the exception of trade 
unionism) tend to inhibit problem-solving effective- 
ness and consume much human energy in non- 
productive activities. 


Assumptions and values of management 
According to McGregor (1960), the principles 
of organization as described by Weber, the scien- 
tific management school, and the military theorists 
in reality imply certain assumptions about em- 
ployees: (1) the average person has an inherent 
dislike for work and will try to avoid it; (2) because 
of their dislike for work, most people have to be 
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coerced to put forth an adequate day’s work toward 
company goals; and (3) the average worker wants 
to be closely supervised and to avoid responsibility. 
Some of the protagonists of traditional organiza- 
tional theory have questioned these assumptions. 

Recently, in both a national sample of executives 
(Porter 1964) and an international study of 3,000 
managers in 14 countries (Haire et al. 1963), the 
researchers found little belief in the capacity of 
others for initiative, individual action, and leader- 
ship. In all but three countries managers viewed 
the ordinary individual as preferring to be directed, 
wishing to avoid responsibility, and possessing little 
talent for leadership. 

Employees feel these basic assumptions and, as 
is pointed out above, tend to adapt by behaving in 
an apathetic manner and seeking less responsi- 
bility; thus, the worker adapts by making the man- 
agers’ assumptions self-fulfilling prophecies. Man- 
agement, to put it dramatically, tends to create a 
world that makes its own administrative strategy 
self-defeating. 

Before we can turn to the question of how to 
change the situation, we need to look more closely 
at the upper levels of organization. My own sug- 
gestion is that the Weberian-scientific manage- 
ment concepts of organization imply three basic 
values of effective human relationships at the upper 
levels and that, to the extent these values are in- 
ternalized, they will tend to create an executive 
subsystem that carries on many unintended and 
self-defeating dysfunctional activities. 

The first basic value is that the important hu- 
man relationships are those which are related to 
achieving the organization’s objective. One study 
found, for example, that in over 265 different types 
and sizes of meetings, the executives always tended 
to focus their behavior on “getting the job done” 
(Argyris 1966). Out of literally thousands of units 
of behavior, almost none were spent in analyzing 
and maintaining group effectiveness. This was true 
even though in many meetings the group's effec- 
tiveness “bogged down” and the objectives were not 
being reached because of interpersonal factors. 
When the executives were interviewed and asked 
why they did not spend some time in examining 
the group operations or process, they replied that 
they were there to get a job done, and besides, “If 
the group isn’t effective, it is up to the leader to 
get it back on the track by directing it.” 

The second value is the emphasis on cognitive 
rationality and the de-emphasis on the rationality— 
indeed, the existence—of feelings and emotions, 
This value influenced the executives to see cog- 


nitive, intellectual discussions as “relevant,” “good 
work,” and so on. The emotional and interpersonal 
discussions tended to be viewed as “irrelevant,” 
“immature,” “not work,” and so on. As a result, 
when emotions and interpersonal variables became 
blocks to group effectiveness, all the executives re- 
ported (and were observed doing so) that they 
should not deal with them. For example, if there 
was an emotional disagreement, they would tell the 
members “to get back to facts” or “keep personali- 
ties out of this.” In other words, the value that 
effectiveness was primarily a cognitive dimension 
tended to coerce the men to suppress and deny the 
emotional and interpersonal variables that influ- 
enced their group’s effectiveness. 

The third value suggests that human relation- 
ships were most effectively influenced through uni- 
lateral direction, coercion, and control, as well as 
by rewards and penalties that sanctioned all three 
values. This value of direction and control was im- 
plicit in the chain of command and also in the 
elaborate managerial controls that had been de- 
veloped within the organizations. 

The impact of these values can be considerable. 
For example, to the extent that individuals dedi- 
cate themselves to the value of intellectual ration- 
ality and “getting the job done,” they will tend to 
be aware of and emphasize the cognitive, intellec- 
tual aspects of work and consciously or uncon- 
sciously to suppress the interpersonal and emo- 
tional aspects, especially those which do not seem 
relevant to achieving the task. As the interpersonal 
and emotional aspects of behavior become sup- 
pressed, organizational norms will tend to arise 
that coerce individuals to hide their feelings. Theit 
interpersonal difficulties will either be suppressed 
or be disguised and brought up as cognitive, tech- 
nical, intellectual problems. Under these conditions 
the individual may tend to find it very difficult to 
develop competence in dealing with feelings and 
interpersonal relationships. Also, the individual 
may build defenses, both psychological and 0r- 
ganizational. 

Another way to prevent individuals from violat- 
ing the organizational values and from embarrass- 
ing one another is to block out, that is, refuse to 
consider consciously or unconsciously, ideas that, 
if explored, could expose suppressed feelings. Such 
a defensive reaction in the organization eventually 
inhibits the effectiveness of the search process in- 
volved in decision making, The participants tend 
to limit themselves to those ideas and values which 
are not threatening and do not violate organiza 
tional norms. Consequently, the members decrease 


their openness to new ideas and values. As the 
degree of openness decreases, the capacity to ex- 
periment also decreases, and the fear to take risks 
increases. As the fear to take risks increases, the 
probability of experimentation decreases. This, in 
turn, decreases the range or scope of openness, 
which, in turn, decreases risk taking. We have an- 
other closed circuit, which becomes an additional 
cause of organizational ineffectiveness. 

Moreover, the same factors help to keep the true 
state of affairs from the awareness of those who 
could change the situation. This can result in a 
sense of resignation among the people below (i.e., 
“the top will never change”), which, in turn, in- 
hibits full and open discussion during the more 
innovative decision-making sessions and conse- 
quently decreases internal commitment to the more 
innovative decisions, increases subordinate and 
superior “gamesmanship,” and increases executive 
blindness about their relationships with their sub- 
ordinates (Argyris 1962). 


Optimizing organizational effectiveness 


Formal organizations may have built into their 
designs the seeds for many nonproductive, dys- 
functional, energy-consuming activities at all levels 
which tend to result in increasing organizational 
rigidity, organizational defensiveness, and inter- 
group conflict, as well as less effective decision- 
making processes (especially at the upper level). 
In short, organizations have a built-in tendency 
toward ineffectiveness or disorganization (as is 
apparently the case in physical nature). 

Can this tendency toward ineffectiveness be de- 
creased and the tendency toward effectiveness 
strengthened? What changes would this require at 
the different levels of the hierarchy? Beginning 
with the lower levels, we need to redesign the tech- 
nology, the managerial controls such as budgets, 
and the financial incentives, as well as the jobs 
themselves, in order to increase the degree to which 
the employees use their more important abilities, 
have control over their work, and develop feelings 
of essentiality toward it. 
ae, may these objectives begin to be accom- 
chet ? First, jobs can be enlarged so that indi- 
‘ee S have more responsibility and use more of 
aa central abilities. In spite of the increased 
fe ng and redesign costs, apparently the long- 

nge financial returns can be attractive (Conant 
eee 1965). Some very preliminary research 
ee sts that feelings of essentiality can be effec- 
ieai Produced if the organizational unit is Op- 

y undermanned” (Barker 1962; Sweetland & 
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Haythorn 1961). For example, branch banks that 
were designed to be small models of the main office 
were found to be more productive, and employee 
commitment and cooperation in them was found 
to be much higher, as compared with the main 
office, partially because the branches had been 
optimally undermanned (Argyris 1954). 

Gillespie (1948) has shown that employees can 
conduct their own work studies that lead to effec- 
tive work design. Likert and Seashore (1963) have 
presented examples of how budgeting can be 
altered to provide employees more control over the 
budgets, which, in turn, can lead to decreased costs 
as well as to increased employee commitment to 
further cost reduction, Lawler (1966) has shown 
that pay can be an effective incentive if employees 
see a clear connection between good job perform- 
ance and their pay. When pay is seen as a reward 
for good performance, it is a particularly potent 
incentive because it becomes a form of recognition 
and a mark of achievement, thereby satisfying sev- 
eral important needs. McKersie ( 1963) has sug- 
gested that by including greater employee partici- 
pation in the setting of performance standards and 
by having standards that are simple and stable and 
provide quick rewards, the incentive plan of the 
future can create the perception that pay is a re- 
ward for good performance. Of course, employees 
who do not seek to use their abilities, who are not 
motivated by autonomy needs, and who do not 
make work a central life interest may be threat- 
ened by these changes (see Strauss 1963). 

Turning to the upper levels of the organization, 
technology, controls, and structure become less 
important and interpersonal relationships among 
the executives become more important. Executives 
with power can alter the components of formal 
organization if they wish. However, they tend to 
have much more difficulty in altering their own 
behavior, especially the conformity and mistrust 
within their executive subsystem, as well as the less 
effective decision-making activities (Argyris 1966). 
According to my analysis, change at the upper 
levels should begin with changing the values and 
the behavior of the executives. Lewin and Grabbe 
(1945) have pointed out that changing values and 
behavior is not an easy process and that it should 
begin by “unfreezing” and re-educating the indi- 
vidual. Schein (1963), following Lewin, has pro- 
posed that effective change requires several ele- 
ments: the involvement of the people in the mutual 
determination of goals, a reliance on publicly 
shared data, a voluntary mutual interaction of the 
client and the change agent, the freedom of the 
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client and consultant to influence each other, and 
an emphasis on the importance of making in- 
formed decisions (Bennis 1963). 

Methods of organizational change. How can 
some of the above changes be instituted in organ- 
izations? Four approaches have been developed, to 
date, and used in various combinations. Mann 
and Likert (1952) have attempted organizational 
change by feeding back the research results to the 
members at all levels. The individuals are encour- 
aged to disagree with and modify the findings and 
to develop their own strategies for change based 
upon those parts of the diagnosis which they ac- 
cept. Each group is free to ask for further data and 
to feed information to, and to try to influence, those 
above as well as those below them, Seashore and 
Bowers (1963) published a systematically docu- 
mented study of this type of change process, which 
resulted in a significant improvement in employee 
satisfaction; machine waste and absentee rates also 
improved, but not significantly. 

The second approach focuses on two-man and 
small-group relationships. Argyris (1965), Sofer 
(1961), and Rice (1965) have published illustra- 
tions of three relatively different variations of this 
approach. The basic thrust of Sofer and Rice is to 
rely on tensions released through anxiety reduc- 
tion: that is, they try to help the executive become 
aware of his real anxieties and then help him to 
overcome them. Argyris focuses more on helping 
executives to increase their interpersonal compe- 
tence, which, in turn, should decrease their de- 
fensiveness and increase the effectiveness of their 
problem-solving and decision-making activities, 
Rice is willing to focus on unconscious material 
and Sofer is less so, while Argyris explicitly at- 
tempts to stay away from such material, 

The third approach is concerned with the reduc- 
tion of intergroup rivalries and the management of 
conflict. Presently organizations are so structured 
that competing groups are created and the power- 
ful overcome the weak in political warfare. How- 
ever, organizations could be structured so that each 
group can maintain its identity and seek resolution 
of conflicts through functional relations. This 
means creating new organizational policies and 
developing executives who are capable of bringing 
the competing groups together as groups. Each 
group discusses the problem and seeks, in private, 
to agree on its perceptions and attitudes toward the 
other groups and toward itself. Then representa- 
tives of the groups talk together in the presence of 
other members of the organization. This joint meet- 
ing is followed by further private discussion within 
each group regarding the way the group is per- 


ceived by others. Thus the groups develop an un. 
derstanding of the discrepancy between their views 
(Blake et al. 1965). 

The fourth method for organizational change is 
the laboratory method of education, at the core of 
which is the T (for training )-group. Briefly, the 
T-group experiences are structured so that indi- 
viduals are placed in a learning experience where 
they are in control of their learning and can focus 


upon their behavior in a relatively isolated but 
accepting “culture.” In this training group they can 
give and receive feedback about the impact of 


their own behavior as well as the behavior of 
others. Thus, it is hoped that they will develop a 


deeper awareness and acceptance of themselves 
and others (as well as of the dynamics of small 
groups) and, if the experience lasts long enough 
(or if they learn fast enough ), increase their inter- 


personal competence (Schein & Bennis 1965). 

Variants of the laboratory method of education 
have been used with encouraging results. For ex- 
ample, Blake and Mouton conducted a change pro- 
gram at all levels of an organization (Blake et al. 
1964). L. B. Barnes and L. E. Greiner conducted 
an independent evaluation of this program (see 
Blake et al. 1964). Although the degree of meth- 
odological rigor possible was limited by the field 
situation, the authors provide quantitative and 
qualitative data to show significant increases in 
productivity, profits, and employee commitment to 
organizational effectiveness. Argyris (1965) te- 
ports a change in the values and behavior of a top 
management group as a result of a laboratory pro- 
gram; the change is known to have lasted for 
nearly a year (unfortunately, further follow-up 
data were not available). Harrison (1962) con 
ducted an independent evaluation of the same pro- 
gram and reported similar but not conclusive Te- 
sults, while Argyris (1965) later found that the 
laboratory method altered both the values and the 
behavior of a board of directors. These behavioral 
and attitude changes, as reported in both quanti- 
tative and qualitative terms, were still evident after 
16 months, at the time the study ended. Increases 
in organizational effectiveness were also observed, 
tanging from the development of new products 
through the change of top officers who were ae 
effective, to a decrease in interpersonal tension 
among the board members. 


Evaluating Strategies for change l 
_ In the mid-1960s a number of large organi 
tional change programs are in process, each its 
which has a built-in research activity to study Í 
effectiveness. For example, Herbert Shepard 4 


his associates are involved in the organization 
change program of one of the largest space labora- 
tories in the United States, and a separate research 
team is studying the change. Friedlander’s prelimi- 
nary results (1966) show that this program is be- 
ginning to have important positive effects. The 
National Training Laboratories is conducting a 
two-year change program in a very large industrial 
research organization; again, a team of social 
scientists (half of whom are not members of that 
network) are studying the results. The coupling of 
systematic research with each of these programs 
should help us to evaluate more effectively the 
results of the changes as well as the differential 
effectiveness of the several change strategies. How- 
ever, much more research is needed, especially 
studies of a longitudinal nature. 

To date, all the organizational changes influ- 
enced by social scientists have been in the direc- 
tion of creating more “organic” organizations. This 
approach has been variously called “participative 
group” (Likert), “problem solving” (Bennis ), “open 
system” (Louis B. Barnes), “human relations” 
(Eugene Litwak), “Theory Y” (McGregor), and 
“flexible organization” (Rice). The organic organ- 
ization is characterized by decentralization of de- 
cision making; an emphasis on mutual dependence 
and cooperation based on trust, confidence, and 
high technical or professional competence; a con- 
stant pressure to enlarge tasks and interrelate them 
so that the concern for the whole is emphasized; 
the decentralization of responsibility for, and use 
of, information, rewards, and penalties; the re- 
sponsibility of participants at all levels for develop- 
ing and maintaining loyalty and commitment at 
as high a level as possible; and an emphasis on 
status through contribution to the whole and 
through intergroup and interindividual cooperation. 
Such an organization assumes that people are 
capable of being responsible, committed, and pro- 
ductive and that they desire a world in which the 
rationality of feelings and interpersonal relation- 
ships is as valued as cognitive rationality. 

Although I have been involved in helping some 
organizations become more organic, I do not be- 
lieve that the organic organization is the final 
answer, I believe that organizational effectiveness 
can be optimized by using any one of at least four 
Tek organizational strategies or structures, each 
i ich would have a set of decision rules guid- 
ng the administrators in their use. The first is the 
Pyramidal structure; the second, a Likert-type 
aa structure; the third, a structure where 
cae obtain power according to their functional 

‘ibutions; and fourth, a structure where all 
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members have equal power and may not abdicate 
their basic responsibilities. 

In addition, I believe that there are four differ- 
ent types of leadership patterns, ranging from 
the traditional authoritarian leadership to a more 
“reality-centered” leadership (which Likert calls 
“cultural” and “supportive”). In reality-centered 
leadership the power, rewards and penalties, con 
ditions of membership, and control of information 
can reside with one leader, or they can be increas- 
ingly shared by all the members, Mann (1965) has 
identified three basic skills in reality-centered lead- 
ership: administrative, human relations, and tech- 
nical skills. The importance of each of these skills 
varies with the level of supervision, the technology 
involved, the degree of professional work involved, 
and situational changes, such as a change to elec- 
tronic data-processing equipment. 


It is too early to evaluate the usefulness of these 
views in regard to the organization of the future. 
Much more research is needed. The only available 
“evidence” is from a few industrial firms that are 
experimenting in limited fashion. Brown (1960) 
has reported that profits and problem-solving effec- 
tiveness have increased in firms that have created 
such substructures as a representative system to 
represent employees’ views, a legislative system 
that allows every member to participate in formu- 
lating and implementing policy, and an executive 
system in which employees have significant influ- 
ence on the present and future operations of the 
firm. Non-Linear Systems, an electronic manufac- 
turing firm near Los Angeles, reported an increase 
of 30 per cent in gross income by giving employees 
more responsibility in designing the product, in 
enlarging their jobs, and in controlling most of the 
accounting process (Kuriloff 1966). Gomersall and 
Myers (1966) reported significant financial gains 
through decreased costs and increased productivity 
achieved by giving the employees greater control 
over the design of their work. Union Carbide and 
General Electric have been so favorably impressed 
with the results of the change processes outlined 
above that they have trained change agents and 
created departments for organizational diagnosis 
and change. Here again, studies of whole organiza- 
tions over a long period of time are needed. Under- 
standably, it is not easy to get executives to agree 
to permit behavioral scientists to experiment with 
the very core components of their organization. 
Perhaps, in addition to encouraging partial experi- 
ments in organizations, organizational simulation 
may provide a vehicle by which more of the totality 
and complexity can be studied. As the findings in 
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this field become better validated, we can expect 
greater theoretical precision, methodological so- 
phistication, and, for the practitioner, the proba- 
bility of less risk and greater returns. 


CHRIS ARGYRIS 


[See also BUDGETING; INDUSTRIAL RELATIONS; LEADER- 
SHIP; WoRKERS.] 
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ORGANIZATIONAL INTELLIGENCE 


oe Sather, process, interpret, and communicate 
technical and political information needed in 
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decision making is to fulfill the intelligence func- 
tion. The staff and consultants involved are typi- 
cally men of specialized knowledge and training, 
who serve as experts. Many of them are, or once 
were, “intellectuals.” This article draws on a mili- 
tary analogy and brings together the scattered liter- 
ature on the intelligence function in complex 
organizations in order to offer hypotheses about 
the determinants of the uses of intelligence and 
about the structural and ideological roots of intel- 
ligence failures. 

Increased availability of intelligence has been 
assured by the so-called knowledge explosion. This 
may be seen in the exponential expansion of scien- 
tific knowledge, knowledge-producing professions 
and industries, research and development units, in- 
formation services and machines, and the media 
of communication themselves. It is also evident in 
the growing emphasis on education, not just in the 
school but in the home, on the job, in the church, 
and in the armed forces (Machlup 1962). It is 
symbolized in the spectacular spread of electronic 
computers and automatic control systems. In indus- 
try and politics, as in the armed forces and the 
agencies of national security, the greater ease of 
obtaining and processing information has induced 
more information consciousness among executives, 
who are now more willing to use both overt sources 
of intelligence, such as government data and pri- 
vate surveys, and covert sources, such as electronic 
snoopers. 

Information is more available; so is manpower. 
The swift growth of the World War m OSS, of its 
successor, the CIA, of the FBI, of the intelligence 
units of both the armed forces and the police, of 
research in every agency of government—a trend 
common to every rich country—has yielded a pool 
of potential manpower for industry, labor, political 
parties, indeed, for every type of organization. 
Many intelligence specialists in such agencies find 
their skills transferable; a free labor market insures 
their use. Another growing source of manpower, 
tapped mainly for covert intelligence, is mainte- 
nance technicians and engineers, men preferred as 
undercover operators in industry because they can 
move about unnoticed. 

In short, the opportunity to gather, process, 
transfer, and deliver information has expanded; 
the relevant manpower is available; and there has 
been a corresponding increase of information con- 
sciousness among officials and executives. 


Determinants of the use of intelligence 


Organizations vary a great deal in the extent to 
which they use staff services; the types of experts 
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they use also vary, To explore these variations, I 
shall delineate types of experts found in all com- 
plex organizations (Wilensky 1956, pp. 33-108) 
and relate their incidence to general propositions 
about the reasons for large budgets for intelligence. 

There are four major determinants of the man- 
power, time, and money that a formal organization 
allocates to the intelligence function: (1) the de- 
gree of its conflict or competition with the environ- 
ment—typically related to the extent of its in- 
volvement with, and dependence on, government; 
(2) the degree of dependence on internal support 
and unity; (3) the degree to which internal opera- 
tions and external environment are believed by the 
organization’s managers to be capable of rationaliza- 
tion, that is, characterized by predictable uniform- 
ities and therefore subject to planned influence; 
and affecting all of these, (4) the size and struc- 
ture of the organization—the heterogeneity of its 
membership, the diversity of its goals, and the cen- 
tralization of its authority system. On all counts we 
can expect that the typical formal organization in 
modern society, whatever its products or services 
and whatever its cultural context, will make in- 
creasing use of experts, 

Contact men. The more an organization is in 
conflict with its social environment or depends on 
it for the achievement of its central goals, the more 
resources it will allocate to the intelligence func- 
tion, and the more of those resources will be spent 
on experts we might call “contact men.” The con- 
tact man supplies political and ideological intelli- 
gence the leader needs to find his way around 
modern society; he mediates the relations of the 
organization with the outside world. He is primar- 
ily concerned with, and skilled in, dealing with 
facts about, and techniques of, changing the 
thoughts, feelings, and conduct of men through per- 
suasion and manipulation. He is valued for his 
knowledge of the political and social topography of 
the containing society, for his “contacts,” which are 
so well developed that they become nontransfer- 
able, and for his skills in exploiting these contacts 
—skills in private inquiry, consultation, negotia- 
tion, mediation. Typical titles in this area include 
general counsel; lobbyist, Washington representa- 
tive, legislative representative, or trade-association 
representative; public-relations, community-rela- 
tions, or press-relations man. The backgrounds of 
such specialists are heterogeneous; most have held 
jobs in the bureaucracies with which their em- 
Ployers deal. Their contacts are used mainly for 
keeping information channels open for the speedy 
transmission of accurate political intelligence, typ- 
ically between competing staffs (Milbrath 1963, 


pp. 256 ff., 269, 339; Wilensky 1956, pp. 61-79; 
Wildavsky 1964). In addition to serving as a listen- 
ing post, the contact man interprets his employers 
or clients to important publics, real or imagined, 
and those publics to the organization he represents. 

Internal-communications specialists. A second 
major force shaping the use of experts is the prob- 
lem of internal control. The more the organization 
depends on the unity and support of persons, 
groups, factions, or parties within its membership 
for the achievement of its central goals, the more 
resources it will devote to the intelligence function, 
and the more of those resources will be spent on 
experts we might call “internal-communications 
specialists.” 

The internal-communications specialist supplies 
political and ideological intelligence the leader 
needs to maintain his authority. He transmits pol- 
icies to the membership or reports to the leader on 
membership sentiments and opinion, or he does 
both; he helps induct new members and train ac- 
tivists and leaders. He is primarily concerned with, 
and skilled in, changing the thoughts, feelings, 
and conduct of members. He is valued for his prop- 
aganda and group work skills, his ability to gauge 
group reactions, and his knowledge of the politics 
and personalities of the employing organization. 
Typical titles include: editor of the house organ, 
education, recreation, or training director, person- 
nel- or industrial-relations officer, safety director, 
security man, community- (i.e., race) relations 
man. Some auditors or controllers also belong in 
this list. Backgrounds of such specialists, while 
diversified, typically include administrative, ideo- 
logical, or political jobs that provide a grasp of the 
internal operations and structure of the organiza- 
tion in question. 

Heterogeneous in origin and activity, these men 
fulfill similar functions. They build executives 
prestige and facilitate their control over the mem- 
bers; they supply information about the perform- 
ance, politics, and morale of various organizational 
units; they locate and train new leaders; and they 
allay minority group pressures that threaten sta- 
bility. Occasionally they function to maintain mo- 
rale in crisis situations (Wilensky 1956, pp. 80 ff. 
Myers & Turnbull 1956; Controllership Foun- 
dation 1954). Political parties, unions, voluntary 
associations—formally democratic organizations 
that cannot assume member support but must win 
it—are especially prone to use these specialists. In 
general, the more directly accountable the leader is 
to the rank and file, the more he needs to know 
about their state of mind. Further, the more the 
leader depends on the rank and file’s compliance 
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with organizational rules and goals, the more he 
needs to know about the quality of the rank and 
file's performance. 

Facts-and-figures men. A third major source of 
variation in the number and types of experts used 
js the uneven rationalization of social life. The 
more an organization sees its external environment 
and internal operations as rationalized—that is, as 
subject to discernible, predictable uniformities in 
relationships among significant objects—the more 
resources it will devote to the intelligence function 
and the more of those resources will be spent on 
experts we might call “facts-and-figures men.” The 
facts-and-figures man supplies technical economic, 
legal, or scientific intelligence that helps the leader 
build his case in relations with outsiders and mem- 
bers, fend off attacks, and compete with rival or- 
ganizations for markets, power, and prestige. He 
is primarily concerned with, and skilled in han- 
dling, empirical data; he is expected to produce 
quick, simple answers to complex technical ques- 
tions as well as to make judgments of the power 
and intent of competitors and enemies. His human- 
relations skills are less prominent. Typical titles of 
facts-and-figures men include: director of research, 
of planning and analysis, of reports and estimates, 
or simply, of intelligence; economist, statistician, 
accountant, actuary, pension and insurance con- 
sultant; industrial engineer, human engineer; and 
the ubiquitous “management consultant.” Included 
here are those lawyers who specialize in court lit- 
igation, case building for negotiations and arbitra- 
tion, drafting contracts and briefs, or appearances 
before administrative agencies; it also includes 
those lobbyists who accent “research.” The back- 
grounds of facts-and-figures men are mainly in the 
law, college teaching, accounting, market and con- 
Sumer research, operations or systems research, 
engineering and the physical sciences, and in the 
Social sciences. On the average, they have more 
formal education than other intelligence specialists. 
intent ures men function both to supply 

ation and to mobilize support. In internal 
Fplicy deliberations and decision making generally, 
EN Seem a “rational-responsible bias—a 
aaa Sees examination of alternatives, of rel- 
Siar “ie beyond the organization’s power, 
tions Rena consequences. In external rela- 
Rein, uild pressure on outside parties via the 
Public eo and the public, imparting a tone of 
Petbilade Sponsibility to executive rhetoric. They 
utivea via impress legislative committees, exec- 
Betas, cies, and judicial or quasi-judicial boards, 

a Bie a general reorientation toward data. 

ngthen the morale and conviction of their 
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employers, and so of the members, thus boosting 
bargaining power with rivals (Wilensky 1956, 
pp. 39-60). 

We see here a kind of dialectic of expertise. 
Facts-and-figures men are preoccupied with ration- 
al argument and criteria; their technical compe- 
tence compels opposing parties to be more careful 
or honest in their use of information, to match 
each other expert for expert, fact for fact. 


When the facts count 


Among intelligence specialists, no conviction is 
stronger than the notion that their main function, 
whatever their intentions, is “backstopping,” or 
rationalizing established policy. And administrative 
leaders who hire experts tend to derogate technical 
intelligence and accent the importance of political 
and executive skills they themselves possess. 

There is no doubt that much policy is improvised 
and later, with the help of experts, made to appear 
planned. Nor is there doubt that all experts, includ- 
ing facts-and-figures men, contribute to the self- 
esteem of their employers by lending them prestige 
and “fronting” for them with various publics, 
“taking the heat” when mistakes are made, and, in 
general, creating the verbal environment of the 
organization (Wilensky 1956, pp. 33-38 ). But such 
observations obscure both the conditions under 
which the “facts” do count and the significance of 
“mere slogans.” 

A study of the interplay of technical economic— 
legal considerations and political—ideological con- 
siderations in union decision making found that 
the area in which union staff experts have their 
highest sustained influence in the entire decision- 
making process is that of problems in public rela- 
tions and governmental relations—issues such as 
taxes, tariffs, welfare legislation, and world politics, 
which, while objectively important to the members, 
are seen by top officers as far from the core func- 
tion of the union. But some problems are so tech- 
nical that the specialized knowledge of facts-and- 
figures men becomes overriding even at the core of 
union functioning, despite the presence of political 
considerations (ibid., pp. 181-195). 

Similarly, a study of governmental budget 
making—an eminently “political” process—sug- 
gests that a good “case” often counts. While prece- 
dent and political support are the major determi- 
nants of who gets what the government has to give, 
substantial departures from previous budgets and 
new programs with no precedent receive careful 
scrutiny and have a better chance if accompanied 
by well-documented arguments. Moreover, the cul- 
tivation of political support and official confidence 
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are themselves partly a product of how well the 
homework is done. U.S. Bureau of the Budget exam- 
iners depend for their information on the agency 
they are assigned to investigate; the agency often 
converts the examiner into an advocate of partic- 
ular programs by a sensible, even open, flow of 
information. Slippery statistics, grossly inaccurate 
reports are avoided; the expert from the Bureau of 
the Budget is referred to agency experts whose 
data he can trust (Wildavsky 1964, pp. 3, 13, 39, 
56, 62-67, 136, 147, 169). If trends in federal 
budgeting practices of the 1960s continue—that is, 
if there is a further shift from “ceiling budgets” or 
from “incremental budgeting” based on previous 
decisions to a “planning—programming budgeting 
system” (PPBS) based on calculations of the cost 
effectiveness of alternative-program outputs—then 
reliance on technical data will increase [see 
BUDGETING, article on BUDGETING AS A POLITICAL 
PROCESS, and compare Hitch 1965]. 

This story is repeated in industry, where an en- 
terprise may use “share of the market” as an opera- 
tional guide to simplify its calculations and yet at 
the same time be forced to resort to special cost 
studies, carefully scrutinized, whenever there is de- 
viation or innovation. And an auditor, investigating 
the operations of a local plant, can more easily be 
converted into its advocate at central headquarters 
if he learns to trust the local sources of his infor- 
mation (Cyert & March 1963; Controllership Foun- 
dation 1954). 

Facts-and-figures men who command technical 
intelligence obviously are given more discretion 
where the problems are technical. Less obviously, 
they also carry more weight when the organization 
is weak in grass-roots political resources. Among 
Washington lobbyists, for instance, representatives 
of small organizations with limited political re- 
sources, such as humanitarian organizations and 
specialized trade associations, accent research in 
their lobbying strategy, in contrast to large-mem- 
bership organizations, such as farm groups, vet- 
erans groups, and labor unions, who incline toward 
grass-roots campaigns and publicity (Milbrath 
1963, pp. 227 ff., 343, 348). Paradoxically, this 
may give an advantage to the weak, whose case, 
if strong and technical, can count for something 
[see LOBBYING]. 

Technical and political intelligence. World War 
1 saw an accelerated use of social science by gov- 
ernment and the military as well as the emergence 
of freewheeling experts who combined training in 
physical science with some understanding of social 
and political life. Through the devices of the stand- 
ing advisory committee, contract research, and the 


support of semiautonomous operations research in- 
stitutes, the government began to tap more general- 
ized advice from the academic community (Price 
1954). A model for the nonmilitary uses of such 
advice can be derived from the best of the military 
examples: small, flexible, loosely organized, inter- 
disciplinary groups in RAND, an advisory corpora- 
tion engaged in systems analysis for the United 
States Air Force (B. Smith 1964; for similar exam- 
ples from Britain, see Jones 1947). There is an 
urgent demand for broad policy advice on those 
political or administrative issues that rest heavily 
on technical intelligence. To cope with such issues, 
administrative leaders are turning to a new breed 
of experts who can interpret the work of separate 
disciplines, combine the functions of contact men 
with those of internal-communications specialists 
and facts-and-figures men, supply a blend of tech- 
nical and ideological intelligence, and in general, 
enrich the verbal environment of elites. Needless 
to say, there is more demand for than supply of 
such experts. 


Intelligence and structural complexity 


Within a context of increased availability of in- 
formation and of growing demand for generalized 
policy advice, it may be hypothesized that (1) an 
organization in conflict with its environment, espe- 
cially one deeply involved with government, will 
turn toward contact men; (2) an organization 
heavily dependent on internal solidarity will hire 
internal-communications specialists; (3) an organ- 
ization that sees its external relations and internal 
operations as rationalized will depend on facts-and- 
figures men. 

These general determinants of the amount and 
type of resources devoted to the intelligence func- 
tion are reflected in more familiar attributes of 
structure, Briefly, the more complex the structure, 
the more use of experts, The main locus of the 
managerial revolution is the large organization with 
many specialized units to coordinate, great hetero- 
geneity of membership (manifested in a diversity 
of goals), and much centralization. Other things 
being equal, the larger the size of an organization 
the greater is its public impact, the more intense 
are its problems of internal control, and the more 
resources are available to it for the intelligence 
function. The more specialization there is, the more 
interdependent the specialized parts become, and 
the greater is the cost if any one part fails. Greater 
specialization thus calls for greater resources for 
each part of the organization to keep track of the 
other parts, and for more staff at the center to 
coordinate the whole. Moreover, greater specializa- 
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tio increases the difficulty of recruitment and 
training and the amount of effort needed to secure 
information about morale and performance. Fi- 
nally, the more heterogeneous the membership or 
constituents of an organization, the more ambig- 
uous, diffuse, diversified, and numerous its pur- 
poses or products; this more complex structure of 
goals means more variables to consider and a more 
urgent problem of coordination. In other words, 
size, specialization, centralization, heterogeneity of 
membership and goals—all of them attributes of 
structure—generate the need for experts insofar 
as they increase the number and diversity of social 
units in the environment that must be taken ac- 
count of, aggravate the problem of internal control, 
and intensify the search for uniformities, thereby 
multiplying formal rules. The law for the computer 
age is: Programmed activity either drives out un- 
programmed activity or drives it underground. 


The quality of intelligence 


To explain the great expansion in intelligence 
and major variations in the employment of experts 
is not to understand how their work is organized. 
Nor does that explanation tell us much about varia- 
tions in the quality of intelligence. The knowledge 
explosion intensifies an old problem: how to draw 
from a highly compartmentalized body of knowl- 
edge the messages—whether in the form of ques- 
tions, of insights, or of evidence—relevant to policy 
and get them into the room where decisions are 
made. Sources of failure are legion. Even if the 
initial message is accurate, timely, and relevant, it 
may be translated, condensed or completely blocked 
by personnel standing between the sender and the 
intended receiver; if it does get through, it may be 
only in distorted form. If the receiver is in a posi- 
tion to use the message, he may screen it out be- 
nue it does not fit his preconceptions, because it 
an through a suspicious or poorly regarded 
is any (because he is in the wrong), because it 
Wess edded in piles of inaccurate or useless mes- 
eae information theorists call excessive 
ae in the channel) or simply because too many 
fen a are being transmitted to him (“informa- 
the a load”) [see INFORMATION THEORY]. When 
tera ant message is not in the system or when 
it (he ped receiver is not in a position to act on 
ae : clearly constrained by forces beyond his 
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top ea into the minds of general staffs and 
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commanders the urgent warnings about Japan’s 
intentions supplied by Far Eastern code analysts is 
said to have been caused by the lack of a central 
intelligence organization (Ransom 1958, p. 58; 
see also ibid., pp. 41, 54-56; and Wohlstetter 
1962, pp. 125 ff., 311 ff., 395). Yet 25 years later, 
after a reorganization of the intelligence function 
along these lines, two American presidents were 
led into disastrous military adventures based largely 
on miscalculation by intelligence agencies commu- 
nicating at the top misleading pictures of the sit- 
uations in Cuba, the Dominican Republic, and 
Vietnam. The vast expansion of the American gov- 
ernment’s intelligence apparatus has not prevented 
successes in the uses of intelligence; neither has it 
prevented a string of fateful failures. (For exam- 
ples of both, see Ransom 1958, pp. 60 ff.; Snyder 
& Paige 1958.) 

Intelligence failures are rooted in structural 
problems that cannot be fully solved; they express 
universal dilemmas of organizational life that can 
be resolved in various ways at varying costs. In all 
complex social systems, hierarchy, specialization, 
and centralization are major sources of distortion 
and blockage of intelligence. The quality of intel- 
ligence is also shaped by the prevailing concepts of 
intelligence, the types of problems to be confronted, 
the stages of growth of the organization, and the 
cultural or ideological contexts of decision. 

Hierarchy. Insofar as the problem of organiza- 
tional control is solved by rewards of status, power, 
and promotion, the problem of obtaining accurate, 
critical intelligence is intensified. For information 
is a resource that symbolizes status, enhances 
authority, and shapes careers. At every level, hier- 
archy is conducive to concealment and misrepre- 
sentation. When subordinates are asked to trans- 
mit information that can be used to evaluate their 
performance, their motives for making it look good 
are obvious. 

As for the opportunity to distort information, 
middle-level managers, and even lower-level em- 
ployees, sometimes have a near monopoly of wis- 
dom about feasible alternatives. For instance, ob- 
servers of man-paced factory jobs have noted the 
ingenuity of machinists who invent and hide cut- 
ting tools that do the work more efficiently than 
the prescribed tools and thereby permit more 
worker control of the pace. Although automation 
and centralization may change this, first-line super- 
visors still have indispensable practical knowledge 
of both unofficial work behavior and “bugs” in the 
technical system. Local plant managers in multi- 
plant systems know the limits of their productive 
capacities far better than does central headquarters 
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(Whyte et al. 1955; Sayles 1958; Dalton 1959; 
Ward 1967). 

Matching the motive and opportunity of the sub- 
ordinate to remain silent are the superior’s motive 
and opportunity to close his ears. The common 
belief that staff experts should be “on tap, not on 
top” functions to maintain line authority and re- 
duce the status of the staff. Thus, if an organiza- 
tion has many ranks and in its administrative style 
and symbolism it emphasizes rank, it will display 
the greatest distortion and blockage in the upward 
flow of information. Where the pyramid is tall and 
narrows sharply at the top, providing a long promo- 
tion ladder for only a few, there are many time- 
servers at lower ranks who have neither informa- 
tion nor the motive to acquire it. In the middle, 
among the nonmobile, there are many defensive 
cliques that restrict information to prevent change, 
many mutual aid and comfort groups that restrict 
information because of resentment of more am- 
bitious colleagues, and many coalitions of ambitious 
men who share information among themselves but 
pass on only the portion that furthers their careers. 
For the purposes of intelligence, the optimal shape 
of the hierarchy would be flat (since an organiza- 
tional structure with few ranks makes for more 
accurate and more speedily diffused information) 
with a bulge in the middle (that is, there would be 
more specialists who had information, and many 
potential managers motivated to command it), 

Whatever the shape of the hierarchy, however, 
to extract information from those who have it typ- 
ically requires the bypassing of regular organiza- 
tional structures. Efforts to resolve the dilemma of 
hierarchy versus need for intelligence include team 
or project organization, devices for communicat- 
ing out of channels, machinery for investigation 
and inspection, performance checks, and reliance 
on informed outsiders, 

The higher in the organizational hierarchy one 
goes, the less can the kinds of problems encoun- 
tered be translated into terms that are part of the 
existing structure of specialized knowledge and the 
less, therefore, can a decision be programmed. Only 
at the lower levels of policy deliberation can the 
specialized expert tackle a specialized problem with 
a chance of solving it by the precise methods of 
science (Price 1954, p. 164). Further, the role of 
the expert at any level is self-changing: if he is 
successful within his sphere of competence, if his 
advice is taken on matters where his specialized 
knowledge is relevant, he is likely to be chosen for 
tasks outside that sphere of competence, where his 
specialized knowledge will very likely be irrelevant 
(Wilensky 1956, pp. 209 ff.). This is why there 


is so urgent a demand for generalized advisors at 
the top. It is also why an appropriate form of or- 
ganization for industrial research and development 
or, more broadly, for the use of experts anywhere 
is the task force—a team or project of diverse spe- 
cialists who are brought together to solve a limited 
range of problems and are then reassigned when 
the task is done. Several swiftly growing corpora- 
tions based on sophisticated technology, such as 
the IBM Corporation, have adopted this form 
(Shepard 1956; Goldner 1965; Kornhauser 1962, 
pp. 50 ff.). It is possible that the task force is best 
for crisis situations, higher policy deliberations, 
policy-oriented research, and technical develop- 
ment, while other forms of organization are ap- 
propriate for basic research. 

Additional defenses against the information 
pathologies of hierarchy include various reporting 
services and statistical controls. These include data 
on the cost variances of individual departments, 
trend reports on “invisible” operations (machine 
performance or the consumption of operating sup- 
plies), accounting statements for “profit centers,” 
and special “problem-solving” studies done by teams 
drawing on accounting information, engineering 
estimates, and industrial engineering standards 
(Controllership Foundation 1954). In interpreting 
such data, top executives even in very hierarchical 
organizations are not entirely defenseless: they 
have built up a sense of reasonableness regard- 
ing performance reports, and they can check sub- 
mitted data for consistency. Finally, there is a 
universal check in every type of economy: the 
quality of performance of one organization is 
known by other organizations that receive its prod- 
ucts. Thus, a Soviet metal fabricator who must 
work with defective steel will complain no less than 
his counterpart in the United States (Ward 1967). 

Perhaps the most fruitful and common response 
to hierarchy is communication out of channels: 
contact men, on salary or retainer, keep in touch 
with outsiders who have a detached or critical view 
of the organization; internal-communications spe- 
cialists, some of whom combine close ties to local 
leaders with loyalties to central headquarters, Te- 
port on local performance and morale. A variety 
of marginal men at points along the organization's 
boundaries supply supplementary and often crucial 
intelligence. 

Modern military services evidence all of these 
responses to hierarchy. Like executives everywhere, 
military elites have moved from coercion and com- 
mand to persuasion and manipulation. Instead of 
a pyramidal structure, they now favor a diamond 
bulging in the middle; instead of ritualistic close- 
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order drill, an accent on the training mission; in- 
stead of a mechanical assimilation of the army way, 
a reduction in symbols of rank; instead of stand- 
ing operating procedures (SOP), group command 
conferences and informal briefings, bypassing nor- 
mal channels (Janowitz 1960, pp. 38 ff.). 

Specialization. As a source of information 
blockage and distortion, specialization may be more 
powerful than hierarchy. The organization of the 
armed forces and industry alike encourages rivalry 
and restriction of information. Lines of organiza- 
tion become lines of loyalty and secrecy; each de- 
partment restricts information that might advance 
the competing interests of the others. While infor- 
mation can also be used to persuade potential allies 
and facilitate accommodation with rivals, it is more 
commonly hoarded for selective use in less collab- 
orative struggles for power and position. 

The dilemma of intelligence versus specializa- 
tion is twofold: specialization is essential to the 
efficient command of knowledge but antithetical 
to the penetrating interpretation that bears on high 
policy. Specialization and its concomitant, inter- 
unit rivalry, frequently block the sharing of accu- 
rate information, but if problems of upward com- 
munication can be solved, rivalry can result in the 
specification of clashing alternatives and the pres- 
entation of opposing cases, both of which are great 
gains. The main cost of specialization in intelli- 
gence is parochialism, or the production of mis- 
leading or irrelevant information—a result of the 
familiar limitations of the expert (Laski 1931). 
The professionally biased intelligence producer re- 
mains too distant from the intelligence user and 
too ignorant of policy needs; he is also forced to 
compete with other producers for the support and 
guidance of the user (Kent 1949, pp. 81, 94 ff.). 
a ae from constructive rivalry is another mat- 
a A 5 Seam on administrative styles and struc- 
lives at expedite the free flow of rival perspec- 
ae fae solutions to the responsible executives 

ir general advisers. 

as this structural dilemma, especially in 
a oes dependent on technical intelligence, 
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stadt 1960, pp. 154 ff.). However, if rival depart- 
ments are forced to a consensual decision in com- 
mittee, the gains of calculated competition cannot 
be secured (Sorensen 1963, pp. 62-63; Possony 
1941, pp. 226-227; Schlesinger 1965, p. 209). 

The firmest generalization that can be made in 
this area is that the greater the number of ranks 
and the greater the number of organizational units 
involved in a decision process, the more the distort- 
ing influence of rank and jurisdiction and the 
greater the chance of an intelligence failure. 

Centralization. Related to the information pa- 
thologies of hierarchy and specialization is the 
dilemma of centralization. If intelligence is lodged 
at the top, too few officials and experts with too 
little accurate and relevant information are too far 
out of touch and too overloaded to cope. On the 
other hand, if intelligence is scattered in many 
subordinate units, too many officials and experts 
with too much specialized information may engage 
in dysfunctional competition, may delay decisions 
while they warily consult each other, and may dis- 
tort information as they pass it up (Wiles 1962, 
pp. 141 ff.; Ward 1967; Kent 1949, p. 81). 

Centralized intelligence presents the same dan- 
gers as specialized intelligence: it keeps the collec- 
tion of data too far from their use in policy; it en- 
courages agreed-on estimates that may conceal 
strong disagreement and that in any case do not 
reveal the weights of diverse opinions (these dis- 
advantages are only partly offset by such devices 
as the dissenting footnote, which themselves result 
from horsetrading and logrolling); and it competes 
with its own subsidiaries for scarce personnel and 
documentation facilities (Ransom 1958, pp. 81, 
94, 197 ff.). Centralization contributes one addi- 
tional danger peculiar to itself: the acquisition of 
unnecessary responsibility (“empire building”) is 
always accompanied by cries of duplication and 
inefficiency among the units to be absorbed or elim- 
inated. After administrative reform, a more unified, 
centralized intelligence agency producing a unified 
consensual judgment fosters an illusion of security 
and reliable intelligence that can, as in the case of 
the Bay of Pigs invasion, conceal fantasies at the 
highest level. 

In the minds of political, military, and industrial 
elites the advantages of centralized intelligence 
have apparently tended to outweigh these dangers. 
Most executives have been less concerned with pre- 
serving the independence and objectivity of their 
experts than with controlling them. For their part, 
the experts, seeing a chance for greater influence, 
have not been loath to secure guidance from the 
top. And in the reorganization of national strategic 
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intelligence in the United States, the image of Pearl 
Harbor—the signals scattered, no central interpre- 
tive agency alert to them—has been commanding. 

This debate about departmental or local versus 
central or national intelligence has obscured two 
fundamental points. First, there is need for inter- 
pretive skills at every point where important de- 
cisions are taken; the ideal is the close integration 
of data collection and evaluation in every depart- 
ment and division, and within the limits of budget 
and personnel, at every level (Wasserman 1960; 
Kent 1949, p. 81). Second, various degrees and 
kinds of decentralization are appropriate for var- 
ious purposes and measures of effectiveness. For 
instance, a study of the controller’s department in 
seven multiplant companies—all of which aimed 
to provide high-quality information services at min- 
imum cost and, incidentally, to facilitate the train- 
ing of accounting and operating executives—found 
that the effectiveness of decentralization depended 
on the type of information to be provided (Con- 
trollership Foundation 1954). If the main aim is 
to keep score (“Am I doing well or badly?”) and to 
direct attention (“What problems should I look 
into?”), then it is best to hire competent professional 
men who look to the controller's department for 
promotion, attach them to local factories and dis- 
trict offices, and put them under the factory man- 
ager. The formula for success is much geographical 
decentralization, some decentralization of formal 
authority, but very limited decentralization of loy- 
alty. For the purpose of keeping score, the decen- 
tralization of location and authority makes it easier 
to gain access to resource documents and to know 
the reliability of source records. For the purpose 
of directing attention, such decentralization gives 
operating executives direct and active contact with 
accounting people and thus more confidence in the 
operating standards and performance reports going 
to the boss. For the purpose of preserving the qual- 
ity of data available to central headquarters for 
special studies and big policy, it is equally impor- 
tant to maintain loyalty to the controller's depart- 
ment through recruitment of professionally com- 
mitted men who look there for promotion. Ideally, 
the formal organization puts accountants close 
enough to the situation to be accepted by factory 
management and obtain roughly accurate informa- 
tion; at the same time, career-line loyalty and pro- 
fessional identification provide enough social sup- 
port for them to apply standards. 

In short, there is a balance to be struck among 
various kinds of decentralization—an optimal, or 
at least “workable,” set of gains and losses depend- 
ing upon the purposes to be pursued. This is an 
area where research has barely begun. 


Doctrines of intelligence. Concepts of the in- 
telligence function deeply affect the organization 
of intelligence, its recruitment base, and hence its 
quality. Administrative leaders often evince anti- 
intellectualism, narrow empiricism, and a demand 
for secrecy in odd combination with a demand for 
scientific prediction and quantitative estimates. 
This combination leads to foolish ideas about the 
proper organization and possibilities of intelligence, 

If users of intelligence think of it as “facts” 
rather than “evaluated facts,” if they are imbued 
with the notion that more raw “information” to 
“fill in gaps” is their great need, they are likely to 
make intelligence a distinct, subordinate function, 
to separate sharply collection of data from inter- 
pretation, and to exclude experts from policy delib- 
erations. They will attract either crude empiricists 
or conformists content to “backstop” the preconcep- 
tions of policy makers. Thus, intelligence adminis- 
trators in government and in the military have typ- 
ically been lawyers or soldiers trained in school and 
in practice as empiricists—suspicious of plans and 
predictions, hostile to generalizations and abstrac- 
tions. Similarly, intelligence staffs have been com- 
posed mainly of historians, lawyers, journalists, 
and soldiers (Kendall 1949, pp. 550-551; Hilsman 
1956, pp. 81, 96; Millikan 1959, p. 163; Wasser- 
man 1960). The most influential Washington lob- 
byists and union staff experts have been lawyers 
(Wilensky 1956, pp. 230 ff.; Milbrath 1963, p. 157). 
The background and outlook of the intelligence 
producer often match those of his boss; both some- 
times display a pervasive anti-intellectualism, a 
general resentment of the outsider, and an exag- 
gerated belief in practical experience. In response, 
students of intelligence, like the writers of intro- 
ductory textbooks in social science, still tend to 
belabor the points that the facts never speak for 
themselves and that information should be gath- 
ered in relation to questions and categories for 
analysis. The main questions remain open: What 
types of experts, with what background and train- 
ing, make the best interpreters, and what concepts 
and organization of intelligence attract and moti- 
vate them? és 

The common distinction between “intelligence 
(information gathering) and “operations” (clan- 
destine action) also has consequences for recruit- 
ment. While fact gathering appeals to naive realists, 
whose interpretive capacities are limited, the eX- 
citement of operations attracts the kind of adven- 
turers and activists who guided the American 
government into the Bay of Pigs. Two related con- 
cepts of intelligence further complicate the prob- 
lem of organization and reduce the quality of 
information services—the accent on short-run pre- 
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diction, reflected in a pressure for direct, simple 
answers to immediate questions, and the accent 
on secrecy. Given the familiar difficulties of predic- 
tion in social science and politics, intelligence esti- 
mates, like stock market forecasts, are heavily 
hedged, as in: “There is a serious possibility that 
Red China will intervene, but there are some 
grounds for optimism.” Intelligence users and op- 
erators, incongruously demanding from researchers 
“all the facts” at the same time as they demand 
short, speedy, journalistic estimates of future de- 
velopments, use the inevitable hedging and the 
frequent failures of prediction to justify their anti- 
intellectualism, their naive separation of “facts” 
from “know-how,” and their denigration of research. 
Faced with repeated disconfirmation of short-run 
predictions, they are susceptible to the “cry-wolf 
syndrome,” which discounts all predictions and 
warnings whatsoever (Wohlstetter 1962, p. 52). 
The notion that secret sources are intrinsically su- 
perior is reflected in loyalty-security systems and, 
in the case of government, the segregation of clan- 
destine operations from research. Students of in- 
telligence agree that while covert, or “black,” intel- 
ligence, like overt intelligence, has expanded in 
every sphere of modern life (U.S. Congress... 
1965; R. Smith 1956), policy makers grossly exag- 
gerate its importance and reliability. They also 
agree that an emphasis on secrecy impairs critical 
judgment in the production and interpretation of 
intelligence and dulls the sense of relevance. At 
the same time, secrecy blinds the executive to the 
Increasingly available open sources (Kent 1949, 
P. 168; Schlesinger 1965, pp. 248, 258). 

The distorting effects of secrecy can be discerned 
both in the type of personnel attracted to secret 
Operations and in the peculiar conditions for in- 
tellectual work that secrecy imposes. The more 
feeds the smaller the intelligent audience, the 
aid systematic the distribution and indexing of 
een the greater the anonymity of authorship, 
ae e more intolerance for deviant views and 
ce of life (B, Smith 1964, p. 78; Merton 1957). 
tives r restrictive atmosphere reduces the incen- 
wore 7. treat ie by the best scholars, whose 
te ife and thought are not always orthodox 
Aa Who work best with critical appraisal and 
TAA and with wide-open access to the best 
eS Thus, the security regulations and loyalty 
e Eations of the McCarthy era in the United 
ae ay aitten cautious mediocrities, dedicated 
tte t ost impoverished clichés of the cold war, 
nace 0 the top of the foreign service (Schlesinger 

> P. 411; Kent 1949, pp. 69, 74, 146-147; 

ansom 1958, p. 178). 

We consider the motives of secret agents, in- 
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formers, stool pigeons, and other spies, as well as 
the stigma under which they labor, the limitations 
of covert intelligence become obvious. In order to 
protect their sources, clandestine agencies want 
complete control of data collection and operations; 
they therefore have a natural penchant for the kind 
of compliant people who make good security risks. 
The motives of such people have been noted by 
students of police work, of military intelligence, of 
foreign affairs: fear, insecurity, revenge, envy, Te- 
morse, money—a shaky basis for objective report- 
ing (Harney & Cross 1960, pp. 33-39; Bontecou 
1953, pp. 131-135; Hilsman 1956, pp. 21) fi; 
Schlesinger 1965, pp. 230 ff.). While the classic 
spy of fiction and practice struggles successfully to 
obtain startling information, his home office, know- 
ing that in sensitive areas the enemy always leaks 
false information, is likely either to discount ac- 
curate information (Moyzisch 1950; Seabury 
1954) or fall for fabrications (Ransom 1958, 
pp. 168-169). 

In the end, the most reliable intelligence sources 
for competing organizations are open; the best 
data, seldom secret, are the actions of the other 
party (compare George 1959). Yet, at the very 
time that open sources multiply and communica- 
tions technology speeds the processing and flow 
of information, we may be elevating the position 
of the spy. Since the onset of World War 1, the 
reservations of diplomats and administrative 
leaders about spying have weakened (Vagts 1956, 
pp. 65, 70-71); in government and industry alike, 
the budgets and influence of spies have expanded 
(R. Smith 1956; U.S. Congress . . . 1965; Interna- 
tional Union . . . 1964, pp. 323). 

An emphasis on secret information not only 
threatens individual privacy; it can demoralize an 
organization. Such practices as prying into the per- 
sonal lives of employees to fend off the efforts of 
rivals to obtain secret information, or conducting 
disguised surveys to uncover prounion sentiment, 
can build resentment, provoke strikes, increase 
labor turnover, and reduce efficiency. 

This area of study is rife with sensationalism; 
systematic research is rare. The conditions that 
would give substance to an Orwellian nightmare 
are only dimly understood. 

The nature of the problem. The nature of an 
executive decision itself shapes the uses and qual- 
ity of intelligence because it affects the number, 
kinds, and organization of experts called to serve. 
To illustrate, I will suggest a few hypotheses using 
the variables of urgency, cost, innovation, and 
certainty. 

First, when the decision is urgent, the distortions 
of hierarchy, specialization, centralization, and doc- 


328 ORGANIZATIONS: Organizational Intelligence 


trine are minimized. Ironically, “hasty” decisions 
made under pressure may on the average be better 
than less urgent ones. 

Compare the two Cuban crises that dominated 
the Kennedy years. In the Bay of Pigs affair, plan- 
ning, begun in the Eisenhower administration, was 
lengthy, discussion was rank-oriented, the range of 
serious exploration of alternatives and conse- 
quences limited, and the outcome disastrous. In 
the more urgent missile crisis, discussion was egal- 
itarian; protocol, seniority, and rank counted for 
little; a wide range of alternatives was explored in 
depth (Sorensen 1965, pp. 679 ff.); and, by the 
standards of the Bay of Pigs, the decisions, in both 
process and consequence, were superior. Urgency 
functioned to overcome many information pathol- 
ogies. Similarly, President Truman’s rapid decision 
to intervene in Korea, with the modest objectives of 
restoring the status quo ante and preserving Allied 
unity (Snyder & Paige 1958), was clearly more 
sensible than the later series of more leisurely de- 
cisions culminating in a new war aim of a unified 
noncommunist Korea—an aim which provoked 
Chinese intervention (Neustadt 1960, p. 139). 

The greater the costs, risks, and uncertainty, 
and the more significant the changes in methods 
and goals involved, the more intense is the search 
for information, but the stronger is the weight of 
established policy and vested interests. Decisions 
involving many people, much money, great uncer- 
tainty or great risks, and major innovations evoke 
action and advice from every specialized unit at 
every level of the hierarchy, thereby increasing the 
dangers of paralyzing delays and of distortion or 
blockage of communication. 

Perhaps the soundest hypothesis we can derive 
from these diverse considerations is that only those 
“big” (that is, costly, risky, or innovative) policy 
decisions that are also very urgent are likely to 
activate intelligence of a high quality, because in 
such cases deliberation moves out of channels 
toward men of generalized wisdom, executives and 
experts alike, who can then communicate infor- 
mally and effectively at the top. But when crises 
become routine, the effects of urgency on intelli- 
gence may be modified; if there are too many turn- 
ing points, too many critical junctures, the usual 
response to the extraordinary event—the cutting 
of new channels of intelligence—is inhibited. (For 
Opposing hypotheses, see March & Simon 1958, 
pp. 168-175, who offer a theory that derives social 
organization from individual psychology. ) 

Organizational processes. Any analysis of the 
determinants of the quality of intelligence must 
be sensitive to the flow of time and talent. 


Because little is known about the growth of or- 
ganizations and hence there is no agreement on 
possible “stages” of growth, we cannot systemati- 
cally relate the problem of intelligence to processes 
of structural change. The literature provides only 
contradictory clues (see Haire 1959, pp. 292-293; 
March & Simon 1958, pp. 185-187; Schumpeter 
1942, pp. 132 ff. in 1947 edition). Applying our 
scheme for the analysis of intelligence failures, 
however, we can hypothesize that stages of organi- 
zational growth will affect the quality of intelli- 
gence because they affect the types of experts hired 
and the problems they confront, the doctrines of 
intelligence, and the salience of structural distor- 
tions of intelligence. 

Assume a new, swiftly growing organization in 
a fast-changing environment. Its problems—un- 
routinized, unprecedented—are mainly those of 
internal control and external competition and con- 
flict. It must build a social base of members, em- 
ployees, customers, clients, or constituents; it must 
select and shape a core of top personnel committed 
to its mission (compare Selznick 1957). While 
high degrees of centralization may prevail, it is 
more likely that hierarchy and specialization will 
be de-emphasized. And while secrecy may be de- 
manded in intelligence doctrine as in everything 
else, urgency will shape many decisions and free 
the flow of information, secret or not. In short, 
information pathologies are minimized. Aggressive, 
expansionist organizations in touch with a fluid 
environment apparently have the best intelligence 
systems (Ransom 1958, p. 46). 

Suppose that the organization becomes institu- 
tionalized (“established”) at a slow growth rate 
and that it negotiates many stable arrangements 
with its environment (Cyert & March 1963, p. 119). 
The shift in problems and personnel is not simply 
from innovation to execution, from idea men to 
orderly bureaucrats (March & Simon 1958, p. 187; 
Lazarsfeld 1959, p. 22). It is a more complicated 
shift from entrepreneurial careerists and dedicated 
missionaries to professionals (some of whom are 
bureaucratic in mentality but others of whom are 
program-minded innovators), with a scattering of 
enduring old missionaries (Wilensky 1956, pp. 138- 
153, 170-174; Wilensky 1964, pp. 150-151). It is 
also a shift from ideological intelligence emphasiz- 
ing problems in the strategy of conflict to technical 
intelligence emphasizing problems of negotiation 
and administration; a shift from less routinized to 
more routinized problems; and a shift from most 
urgency to least. Organizational (or national) 
myths become more fixed, the counter-stereotyping 
activities of intellectuals more difficult. Hierarchy 
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and specialization become prominent. Insofar as 
there is also a shift away from secrecy, informa- 
tion pathologies may be reduced, but all the other 
changes listed would seem to reduce the accuracy, 
relevance, and timeliness of intelligence. 

Related to the growth of organization is the 
process of succession. In the succession crises 
characteristic of rapidly changing organizations, 
there may be greater play for policy-oriented in- 
tellectuals. In the routine succession characteristic 
of large, stable organizations, there may be a strong 
bias in favor of established policy and official 
prejudice. The new man, like his predecessor, 
strongly desires continuity; he must avoid frighten- 
ing key incumbents and potential supporters below. 
The outgoing official wishes to avoid being dis- 
credited. Moreover, the new man requires briefing, 
which both the outgoing administration and hold- 
over experts are glad to give. 

Sheer frequency of succession exerts an influ- 
ence apart from the mechanism of turnover. Other 
things being equal, high turnover of administra- 
tive leaders discourages the expression of critical 
opinion in the short run; the newcomers tend to 
keep quiet until they learn the lay of the land, 
build confidence among superiors and subordi- 
nates, acquire a political base, or in other ways 
solidify their positions. Frequent, institutionalized 
succession is one reason for the reluctance of the 
United States government to liquidate error in such 
places as Cuba and Vietnam—a striking continuity 
of policy through a succession of men so different 
in viewpoint and style as Eisenhower, Kennedy, 
and Johnson. i 
_ A severe limitation of this analysis of process 
is that we cannot assume a straight-line trend— 
that is, an inevitable slowing down of growth rates, 
increasing stability of the environment, a decline 
in urgency, a reduction in succession crises, or the 
like. Until we have a diversified analysis of typical 
ves and their correlates, research in this 

will remain primitive. 


Cultural and national variations 
ae remaining piece of the picture is the most 
tes Although my analysis has been 
' in general terms applicable to complex or- 
iC ee in every modern state, cultural—ideo- 
ieni contexts obviously condition the uses and 
ce of intellectuals and experts as well as 
tee ee of intelligence itself. But no compara- 
national ies show how. Moreover, some of these 
e, cultural” variations turn out to be them- 
The e Structural variations already discussed. 
main distinctions for our purposes are be- 
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tween command and market economies and be- 
tween totalitarian and pluralist polities. First, 
economic and political contexts affect the balance 
between types of intelligence used. The contact 
man, for instance, is everywhere called to service 
because of the entanglements of diverse private 
and public bureaucracies defending and promoting 
special interests. The greater the public impact of 
these struggles, the more government intervention 
and the more indispensable the lobbyist, the pub- 
licist, and the lawyer. If labor unions and public 
utilities deeply involved with government in a 
market economy must rely on the ideological- 
political intelligence of contact men, major indus- 
trial firms in a command economy are even more 
dependent on such information. In a Soviet-type 
economy, “planners’ tension” (an excess of ad- 
ministratively ordered output over capacity output) 
necessitates an immense formal apparatus for 
communicating and enforcing orders, and a com- 
plicated system of unofficial expediters and fixers 
who try to circumvent the orders so that goals 
may be met—which creates a demand for more 
planners to keep track of what is going on (Wiles 
1962, pp. 132-135; Berliner 1957; Dahl & Lind- 
blom 1953; Devons 1950). 

More generally, totalitarian states, with their 
penchant for ideological indoctrination, allocate a 
larger share of all intelligence resources to political 
intelligence. This is true both for contact men and 
for internal-communications specialists. Finally, 
totalitarian states tend to obliterate the distinction 
between internal and external intelligence [see IN- 
TELLIGENCE, POLITICAL AND MILITARY]. 

Second, economic and political contexts shape 
the quality of the technical, economic, and legal 
data that are generated, reported, or both. Com- 
mand economies and totalitarian political systems 
strengthen the structural roots of intelligence dis- 
tortion. Hierarchy, of course, is more prominent, 
so the temptation and opportunity to conceal, 
misreport, and mishandle data are enormous 
(Dahl & Lindblom 1953, pp. 254-271; Wiles 1962, 
pp. 222 ff.). Moreover, if central planning is 
emphasized, information-processing costs soar, un- 
certainty is great, and problems of data collection 
and interpretation are often insoluble (Ward 1967). 

Secrecy, too, is more prominent and the infor- 
mation pathologies associated with it more highly 
developed in totalitarian states. The national varia- 
tions here are considerable. It is said that Russian 
regimes, whether tsarist or communist, have 
shown a passion for secrecy not found in the West. 
One has the impression, however, that no gov- 
ernments have ever been more obsessed by fear 
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of espionage or more organized to protect secrets 
than such modern totalitarian systems as Hitlers 
Germany and Stalin’s Russia. Similarly, one could 
argue that among free societies the distorting 
effects of secrecy are felt least in the United States 
and most in parliamentary democracies with aris- 
tocratic traditions, such as Great Britain. In the 
decentralized federalism of the United States, in 
which the separation of powers is emphasized, 
officials and experts can act out parochial loyalties 
by “leaking” secret information to the press, by 
conducting investigations, or by encouraging ex- 
posés of rivals. The publicity consciousness of the 
American public combines with easygoing libel 
laws to free newspapermen from normal con- 
straints. Secrets cannot be kept for long. In con- 
trast, administrative or scientific subordinates in 
Britain owe their main allegiance to the Cabinet, 
and hardly anything justifies public deviation from 
its policies, let alone the use of restricted informa- 
tion as a weapon in the struggle for power; French 
officials also enjoy intensive privileges of secrecy 
(Price 1954, pp. 138-139). 

Unfortunately, the bright light of publicity in 
the United States may not, on balance, improve 
the reliability, relevance, and timeliness of intelli- 
gence. The pressure of publicity evokes a counter 
pressure for the safeguarding of secrecy. The op- 
portunity and desire to expose political enemies 
and their secrets are forces making for extremism, 
as a result of which truth is assassinated along 
with the characters of the men who seek it. Shils 
(1956) has provided a sensitive analysis of se- 
crecy-fearing extremism, as institutionalized in the 
loyalty-security programs of the 1950s. To design 
studies of the amount of secrecy, of its effects on 
recruitment, morale, and the conditions for intel- 
lectual work in various countries and organizations 
would tax the ingenuity of social scientists. But no 
set of problems in this area is more important. 

Related to cultural variations in the incidence 
and effects of secrecy are variations in ethno- 
centricism among elites. Ethnocentricism in the 
modern state is occasionally a major source of 
intelligence failures (Hilsman 1956, pp. 103 ff.; 
Kendall 1949, p. 543; Kautsky 1965, p. 15; Ran- 
som 1958, p. 171; Wasserman 1960, pp. 167-168). 
National stereotypes helped lead American states- 
men to faulty interpretations of intelligence in the 
period before Pearl Harbor, when Far Eastern spe- 
cialists were systematically kept from communi- 
cating accurate judgments not only because of 
their subordinate positions but because they were 
believed to be too immersed in “the Oriental point 
of view” (Wohlstetter 1962, pp. 102, 395). This 


was also true in the Korean case (Snyder & Paige 
1958, p. 241 in 1962 edition). If American leaders 
in the early 1960s were having a difficult time 
putting aside their delusions about a unitary “Sino- 
Soviet bloc,” imagine how hard it was for the xeno- 
phobic men in Peking to recognize a thaw in the 
cold war. These examples suggest that elites vary 
in their ability to transcend ethnocentric molds. 
The sources, amount, and effects of such variation 
deserve study. 

Insofar as national ideologies proscribe particu- 
lar academic disciplines or encourage antiexpert 
or anti-intellectual fervor, they shape the quality 
of intelligence at its source. During the period of 
maximum Stalinist terror the Soviet regime wiped 
out the science of genetics by persecuting Men- 
delian “deviationists,” pronounced quantum me- 
chanies inconsistent with dialectical materialism, 
and labeled mathematical economics as bourgeois— 
idealist. This last step directly affected the techni- 
cal intelligence needed for economic planning. It 
is difficult to separate ideological from social con- 
flicts here. Objection to particular intellectual per- 
spectives may rationalize a general mistrust of 
troublesome experts and technicians, a mistrust 
inspired by their indispensability, their love of 
autonomy, and the great social distance between 
them and party rulers. Within free societies a 
similar phenomenon is found in labor movements 
and labor parties (Wilensky 1956, pp. 260-269). 
Perhaps the surest generalization is that national 
ideologies accenting either workers’ control or the 
dictatorship of the proletariat tend to subvert effec- 
tive organization of the intelligence function by 
exacerbating universal differences between men of 
knowledge and men of power. 

A final determinant of the use and quality of 
intelligence that might be called cultural—ideologi- 
cal is the time perspective of elites (Speier 1952, 
p. 448). The roots of national variations in the 
range of foresight typically employed by adminis- 
trative leaders might be sought in the frequency 
and mechanism of succession. The frequent, TOUS 
tine succession of leaders in stable democracies, 
while encouraging continuity of past policy, may 
discourage long-range plans. The repeated succes- 
sion crises of unstable totalitarian regimes may 
also permit little long-range planning. If not too 
frequent, however, succession crises may open: up 
Opportunities for long-range planning by executives 
and staff with training and outlook vastly different 
from those of their predecessors. It is possible that 
Mussolini, who ruled for 21 years, and Stalin, who 
ruled for 27, had longer time perspectives than 
those of their contemporary counterparts in the 
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Western democracies or the shaky military dicta- 
torships of Latin America. In view of the common 
assumption that the other fellow has a plan—if 
not a plan for a Thousand Year Reich, then at least 
one for next year—it is remarkable that the soci- 
ology of complex organizations has so little to say 
about the conditions that foster the failure of fore- 


sight. 


Research needs 


Information has always been a source of power, 
but it is now increasingly a source of confusion. 
In every sphere of modern life, the chronic con- 
dition is a surfeit of information, poorly integrated 
or lost somewhere in the system. At the same time, 
available technology and expert staff have made 
it possible to cope with the problem. Whatever the 
national variations, elites in every rich country are 
being moved to overcome the information overload 
and even to break through comfortable slogans 
and grasp reality. Their success in understanding 
internal operations and external environment is 
affected by the shape of the organizations they 
command and by their defenses against informa- 
tion pathologies. 

Some gains in the quality of intelligence are 
possible from a reorganization of the intelligence 
function; for instance, if the hierarchy is too tall, 
it can be flattened; or experts can be given more 
autonomy by having them report to fewer bosses. 
Such redesign of structure, however, meets its 
limits where it entails large losses of efficiency, 
coordination, and control, and where structural 
resistance to change is strong. 

Unfortunately, intelligence failures are built into 
complex organizations. Thus, alert executives are 
everywhere forced to bypass or diversify the regular 
Machinery and seek firsthand exposure to intelli- 
pace sources both inside and outside the organ- 
zation. They typically move to points along the or- 
aegon boundaries: looking toward the bottom, 
N on internal-communications specialists; 
talk alae they rely on contact men. They 
ethic oe and researchers investigating their 
acca tions; they establish study commissions 
ik teview boards comprised entirely of outsiders, 
che members of British royal commissions 
E eedu 1965); they institutionalize complaints 
Ca res and thereby subject themselves to sys- 

ee independent criticism from below, as in 
ae ae of the ombudsman; they assemble ad hoc 
ne ees, advisory corporations, kitchen cabi- 

a ee advisers, personal representatives. 
the eon intelligence agents may constitute 

important and reliable source of organiza- 
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tional intelligence. They are sufficiently sensitive 
to the culture of the executive to communicate 
successfully with him and sufficiently independent 
to provide detached judgment. They bring to bear 
the multiple perspectives of marginal men. 

Research implications are apparent. We need 
studies that compare the performance of organiza- 
tions or governments that rely most and least on 
(1) unofficial sources and (2) various types of 
outside sources, both official (study commissions, 
grievance commissioners, advisory corporations, 
task forces) and unofficial (journalists, scholars ). 
For instance, is a sophisticated reporter working 
with open sources better than a secret agent work- 
ing with top-secret information ( Wohlstetter 1962, 
pp. 130, 169; Schlesinger 1965, pp. 247-248 )? 
The question could not be settled without compari- 
sons of both good and bad estimates among large 
populations of outsiders (e.g., columnists in all 
the prestige papers) and insiders (e.g., all top staff 
in major intelligence agencies) on several policy 
issues. More generally, estimates of the costs and 
gains for intelligence of various strategies for by- 
passing or diversifying official channels might be 
feasible. An equally important area for research 
is the effect of the new information technology on 
the problem of intelligence (for leads, see Simon 
1960; Shultz & Whisler 1960; and Kaufmann 
1964). 

Given the institutional roots of intelligence fail- 
ures and the urgency of so many big decisions, 
what counts is the top executive's preconceptions— 
what he has in mind when he enters the room 
and must act. The role of experts and intellectuals 
in shaping these preconceptions, both inside and 
outside the organization, is little understood. For 
instance, in college and in on-the-job training 
programs, business executives and government 
officials are increasingly exposed to social-science 
perspectives. We do not know the impact of such 
education on them. If it does not yield direct 
answers to their immediate questions, perhaps it 
does break through their cruder stereotypes, en- 
hance their understanding of themselves and their 
organizations, alert them to the range of relevant 
variables, and make them more skillful in the use 
of experts. 

Similarly, the symbols that surround the execu- 
tive in his daily life shape his orientation. In 
foreign affairs, experts invent slogans that rein- 
force or diminish policymakers’ adherence to the 
wartime myth of the single enemy. For example, 
the image of the Sino-Soviet bloc fostered so much 
rigidity in the State Department and its publics in 
the 1950s and early 1960s, that President Kennedy, 
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seeking very minor modifications of America’s 
China policy in accordance with the dramatic 
signs of the Sino-Soviet split, felt himself a captive 
of both his foreign office and the mass media 
(Schlesinger 1965, pp. 410-430, 483 ff.). Self- 
righteous cold war rhetoric, rooted in the facts of 
past conflicts, served to perpetuate policies long 
outmoded (Neustadt 1960, p. 139; Wasserman 
1960; Sorensen 1965; Kautsky 1965). On the other 
hand, such doctrines of the arms-control experts 
as “stable nuclear deterrence” and “transitional 
deterrent” moved debate inside the Kennedy ad- 
ministration toward a disarmament policy cul- 
minating in the 1963 test-ban treaty (Schlesinger 
1965, pp. 470 ff., 889 ff.). Thus, the most impor- 
tant function of the general adviser may be to set 
the tone of policy discourse—to reduce or increase 
the number of recognized options and to bolster 
or break up the brittle slogans that blind executive 
vision. 

Harotp L. Witensky 


[Directly related are the entries ADMINISTRATION; 
BUDGETING; BUREAUCRACY; INFORMATION STORAGE 
AND RETRIEVAL; INTELLECTUALS; INTELLIGENCE, 
POLITICAL AND MILITARY; LOBBYING; PRIVACY; 
STRATEGY. Other relevant material may be found in 
BUSINESS MANAGEMENT; DECISION MAKING; INFOR- 
MATION THEORY; KNOWLEDGE, SOCIOLOGY OF; LEAD- 
ERSHIP; MILITARY; ORGANIZATIONS, article On THE- 
ORIES OF ORGANIZATIONS; OPERATIONS RESEARCH; 
PLANNING, SOCIAL; War.] 


BIBLIOGRAPHY 


pressure groups in the formulation and administration of 
public policy; see, for instance, Schattschneider 1935 (the 
tariff); Macmahon et al, 1941 (work re ; Selznick 1949 
(the Tennessee Valley Authority); and Meyerson & Banfield 
1955 (public housing), A sociological treatment of organi- 
zational intelligence based on cases from international rela- 
tions, politics, welfare, and industry is Wilensky 1967, 
General books on science and the soc ial order include 
Barber 1952; Machlup 1962; Gouldner & Miller 1965, 


BAILEY, STEPHEN K. 1950 Congress Makes a Law: The 
Story Behind the Employment Act of 1946. New 
York: Columbia Univ. Press, 

BARBER, BERNARD 1952 Science and the Social Order, 
Glencoe, Ill.: Free Press. 

Barirz, LorEN 1960 The Servants of Power: A History 
of the Use of Social Science in American Industry, 
Middletown, Conn.: Wesleyan Univ. Press, 

BERLINER, JosEPH S. 1957 Factory and Manager in the 
U.S.S.R. Cambridge, Mass.: Harvard Univ. Press. 

BonTEcov, ELEANOR 1953 The Federal Loyalty—Security 
Program. Cornell Studies in Civil Liberty. Ithaca, N.Y.: 
Cornell Univ. Press. 

Brooxs, Jonn N. 1963 The Fate of the Edsel and Other 
Business Adventures. New York: Harper. 

CONTROLLERSHIP FOUNDATION 1954 Centralization vs. 
Decentralization in Organizing the Controller's De- 
partment; A Research Study and Report, by Herbert A. 
Simon et al. New York: The Foundation. 

Cyerr, RicHarp M.; and Marcu, James G. 1963 A Be- 
havioral Theory of the Firm. Englewood Cliffs, N.J.: 
Prentice-Hall. 

Daut, ROBERT A.; and LINDBLOM, CHARLES E. 1953 Pol- 
itics, Economics, and Welfare: Planning and Politico- 
Economic Systems Resolved Into Basic Social Proc- 
esses. New York: Harper. 

DALTON, MELVILLE 1959 Men Who Manage: Fusions of 
Feeling and Theory in Administration. New York: 
Wiley. 

Devons, ELY 1950 Planning in Practice: Essays in Air- 
craft Planning in War-time. Toronto: Macmillan; 
Cambridge Univ. Press. A 

Fras, Epwarp S. 1965 Economic Advice and Presiden- 
tial Leadership: The Council of Economic Advisers. 
New York: Columbia Univ. Press. 

GEORGE, ALEXANDER L. 1959 Propaganda Analysis: A 
Study of Inferences Made From Nazi Propaganda in 
World War II. Evanston, Ill.: Row, Peterson. 

Gotpner, Frep H. 1965 Demotion in Industrial Man- 
agement. American Sociological Review 30:714-724- 

GOULDNER, ALVIN W.; and MILLER, Seymour M. 1965 
Applied Sociology: Opportunities and Problems. New 
York: Free Press. 

Hamre, Mason 1959 Biological Models and Empirical 
Histories of the Growth of Organizations. Pages 272- 
306 in Foundation for Research on Human Behavior, 
Modern Organization Theory: A Symposium. Edited 
by Mason Haire. New York: Wiley. 

Hanser, CHARLES J. 1965 Guide to Decision: The Royal 
Commission. Totowa, N.J.: Bedminister Press. 

Harney, MaracHı L.; and Cross, Jonn C. 1960 The 
Informer in Law Enforcement. Springfield, Il.: 
Thomas. 

Hitsman, RocER 1956 Strategic Intelligence and Na- 
tional Decisions. Glencoe, Ill.; Free Press. 

Hiren, CHARLES J. 1965 Decision-making for Defense. 
Berkeley: Univ. of California Press. 

INTERNATIONAL UNION, UNITED AUTOMOBILE, AIRCRAFT 
AND AGRICULTURAL IMPLEMENT WORKERS OF AMER- 


ORGANIZATIONS: Organizational Intelligence 


ca (UAW), NINETEENTH CONSTITUTIONAL CONVEN- 
tion, Marcu 20-27, 1964 1964 Proceedings. At- 
lantic City. N.J.: UAW. 

Janowitz, Morris 1960 The Professional Soldier: A So- 
cial and Political Portrait. Glencoe, Ill.: Free Press. 
> A paperback edition was published in 1965. 

Jones, R. V. 1947 Scientific Intelligence. Journal of the 
Royal United Service Institution 92:352-369, 

KAUFMANN, WiLLIaM W. 1964 The McNamara Strat- 
egy. New York: Harper. 

Kautsxy, Jonn H. 1965 Myth, Self-fulfilling Prophecy, 
and Symbolic Reassurance in the East-West Conflict. 
Journal of Conflict Resolution 9:1-17. 

KENDALL, WILLMOORE 1949 The Function of Intelli- 
gence. World Politics 1:542-552. 

Kent, SHERMAN 1949 Strategic Intelligence for Ameri- 
can World Policy. Princeton Univ. Press. 

Kyorr, Kraus 1964 Failures in National Intelligence 
Estimates: The e of the Cuban Missiles. World 
Politics 16:455—467. 

KorNHAUSER, WILLIAM 1962 Scientists in Industry: 
Conflict and Accommodation. Berkeley: Univ. of Cali- 
fornia Press 

Lasxi, HaroLD J. 1931 The Limitations of the Expert. 
London: Fabian Society. 

LazaRsFELD, PauL F., 1959 Reflections on Business. 
American Journal of Sociology 65:1-31. 

Macuiup, Frirz 1962 The Production and Distribution 
of Knowledge in the United States. Princeton Univ. 
Press. 

Macmanon, ArtHUR W.; MiLLerT, J. D.; and OGDEN, 
Gtapys 1941 The Administration of Federal Work 
Relief. Published for the Committee on Public Admin- 
istration of the Social Science Research Council. Chi- 
cago: Public Administration Service, 

Marcu, James G.; and Simon, HERBERT A. 
izations. New York: Wiley. 

MarsHatt, S. L. A. 1953 The River and the Gauntlet: 
Defeat of the Eighth Army by the Chinese Communist 
Forces, November, 1950, in the Battle of the Cong- 

i chon River, Korea. New York: Morrow. 

ERTON, RoweRT K. 1957 Priorities in Scientific Dis- 
oh A Chapter in the Sociology of Science. Amer- 

Mes ican Sociological Review 22:635-659. 
ae Martin; and BANFIELD, Epwarp C. 1955 

olitics, Planning, and the Public Interest: The Case 
Me Public Housing in Chicago. Glencoe, Il.: Free 

ee > A paperback edition was published in 1965. 
nee Lester W. 1963 The Washington Lobbyists. 

ivi ago: Rand McNally. 
ree F. 1959 Inquiry and Policy: The Rela- 
ae nowledge to Action. Pages 158-180 in Daniel 
Stas (editor), The Human Meaning of the Social 

Nea ces. New York: Meridian. 

EREN C. 1950 Operation Cicero. New York: 
as Be -McCann. 

aha mane A.; and TURNBULL, Jonn G. 1956 Line 

esi taff in Industrial Relations. Harvard Business 

Neat view 34, no. 4: 113-124. 

E EA E. 1960 Presidential Power: The 
ack esta eadership, New York: Wiley. > A paper- 
tions. ion was published in 1962 by Science Edi- 

Per 
oF oe S. 1946 The Future of American Se- 

Piven igence. Washington: Infantry Journal. 

Micah io _ 1957 Strategic Intelligence Pro- 

+ Basic Principles. New York: Praeger. 


1958 Organ- 


333 


Possony, STEFAN T. 1941 Organized Intelligence: The 
Problem of the French General Staff. Social Research 
8:213-237. 

Prick, Don K. (1954) 1961 Government and Science: 
Their Dynamic Relation in American Democracy. 
New York Univ. Press. > A paperback edition was 
published in 1962. 

Ransom, Harry Howe 1958 Central Intelligence and 
National Security. Cambridge, Mass.: Harvard Univ. 
Press. 

SAYLES, LEonaRD R. 1958 Behavior of Industrial Work 
Groups: Prediction and Control. New York: Wiley. 
SCHATTSCHNEIDER, ELMER E. 1935 Politics, Pressures, 
and the Tariff: A Study of Free Private Enterprise in 
Pressure Politics, as Shown in the 1929-1930 Revi- 
sion of the Tariff. Englewood Cliffs, N.J.: Prentice- 

Hall. 

SCHLESINGER, ARTHUR M. Jr. 1959 The Age of Roose- 
velt. Volume 2: The Coming of the New Deal. Bos- 
ton: Houghton Mifflin. 

SCHLESINGER, ARTHUR M. Jr. 1965 A Thousand Days: 
John F. Kennedy in the White House. Boston: Hough- 
ton Mifflin. 

SCHUMPETER, JosePH A. (1942) 1950 Capitalism, So- 
cialism, and Democracy. 3d ed. New York: Harper; 
London; Allen & Unwin. + A paperback edition was 
published by Harper in 1962. 

SEapury, PauL 1954 The Wilhelmstrasse: A Study of 
German Diplomats Under the Nazi Regime. Berkeley: 
Univ. of California Press. 

SELznick, Putte 1949 TVA and the Grass Roots: A 
Study in the Sociology of Formal Organization. Uni- 
versity of California Publications in Culture and So- 
ciety, Vol. 3. Berkeley: Univ. of California Press. 

SELZNICK, PHILIP 1957 Leadership in Administration: 
A Sociological Interpretation. Evanston, nl.: Row, 
Peterson. 

Sneparp, Hersert A. 1956 Nine Dilemmas in Indus- 
trial Research. Administrative Science Quarterly 1: 
295-309. 

SHERWOOD, RoserT E. (1948) 1950 Roosevelt and Hop- 
kins: An Intimate History. Rev. ed. New York: Harper. 

Suits, Epwarp 1956 The Torment of Secrecy: The 
Background and Consequences of American Security 
Policies. Glencoe, Ill.: Free Press. 

SHULTz, GEORGE P.; and WHISLER, THOMAS L. (editors ) 
1960 Management Organization and the Computer: 
Proceedings of a Seminar Sponsored by the Graduate 
School of Business, University of Chicago and the 
McKinsey Foundation. Chicago, University of, Gradu- 
ate School of Business, Studies in Business. Glencoe, 
Ill.: Free Press. 

Simon, HERBERT A. 1960 The New Science of Manage- 
ment Decision. Ford Distinguished Lectures, Vol. 3. 
New York: Harper. 

SIMON, HERBERT A.; SMITHBURG, Donatp W.; and THOMP- 
son, Victor A. 1950 Public Administration. New 
York: Knopf. 

SmirH, Bruce L. R. 1964 Strategic Expertise and Na- 
tional Security Policy: A Case Study. Volume 13, 
pages 69-106 in J. D. Montgomery and A. Smithies 
(editors), Public Policy: A Yearbook of the Graduate 
School of Public Administration, Harvard University. 
Cambridge, Mass.: Harvard Univ. Press. 

Smit, Ricnarp A. 1956 Business Espionage. Fortune 
53, no. 5:118—121, 190, 192, 194. 

Snyper, Ricuarp C.; and Paice, GLENN D. 1958 The 
United States Decision to Resist Aggression in Korea: 


334 ORGANIZATIONS: Methods of Research 


The Application of an Analytical Scheme. Adminis- 
trative Science Quarterly 3:341-378. > Reprinted in 
Richard C. Snyder et al., Foreign Policy Decision- 
making, 1962, 

SORENSEN, THEODORE C. 1963 Decision-making in the 
White House: The Olive Branch or the Arrows. New 
York: Columbia Uniy. Press. 

SORENSEN, THEODORE C. 1965 Kennedy. New York: 
Harper. 

Sperer, Hans 1952 Social Order and the Risks of War. 
New York: Stewart. 

THOMPSON, V. 1950 The Regulatory Process in OPA 
Gas Rationing. New York: King’s Crown Press. 

TUCHMAN, BARBARA 1962 The Guns of August. New 
York: Macmillan, 

U.S. CONGRESS, SENATE, COMMITTEE ON THE Jupiciary, 
SUBCOMMITTEE ON ADMINISTRATIVE PRACTICE AND 
PRocEDuRE 1965 Invasions of Privacy: Government 
Agencies. Parts 1-3. Hearings held February 18- 
March 3, 1965; April 13—June 7, 1965; and July 13- 
August 9, 1956, Washington: Government Printing 
Office. 

Urwick, LYNDALL (1937) 1954 Organization as a Tech- 
nical Problem, Pages 49-88 in Luther H. Gulick et al. 
(editors), Papers on the Science of Administration. 
New York: Columbia Univ., Institute of Public Ad- 
ministration. 

Vacrs, ALFRED 1956 Defense and Diplomacy: The Sol- 
dier and the Conduct of Foreign Relations. New 
York: King’s Crown Press. 

Warp, BENJAMIN 1967 The Socialist Economy: A Study 
of Organizational Alternatives, New York: Random 
House. 

WASSERMAN, BENNO 1960 The Failure of Intelligence 
Prediction. Political Studies 8: 156-169. 

Wayte, WILLIAM F. et al, 1955 Money and Motivation: 
An Analysis of Incentives in Industry. New York: Har- 
per. 

Witpavsky, AARON B. 1964 The Politics of the Budget- 
ary Process. Boston: Little. 

Witensky, Harotp L. 1956 Intellectuals in Labor Un- 
ions: Organizational Pressures on Professional Roles. 
Glencoe, Ill.: Free Press. 

WiteNsKY, HaroLD L. 1964 The Professionalization of 
Everyone? American Journal of Sociology 70:137- 
158, 

Witensky, HaRotp L. 1967 Organizational Intelligence: 
Knowledge and Policy in Government and Industry. 
New York: Basic Books. 

Wires, P. J. D. 1962 The Political Economy of Com- 
munism. Cambridge, Mass.: Harvard Univ. Press. 
WOHLSTETTER, ROBERTA 1962 Pearl Harbor: Warning 

and Decision. Stanford Univ. Press. 


v 
METHODS OF RESEARCH 


A formal organization consists of a set of people 
who are engaged in activities coordinated by the 
relatively consistent expectations that these people 
have about one another and about the purposes of 
the organization. Such expectations define a set 
of organizational statuses or offices, each status 
with a set of roles linking it to the other statuses 
with which it interacts. Organizational research 


examines how people behave in their organizational 
roles and how organizations behave as collective 
units. Here we will consider the types of data col- 
lection and measurement, and research design and 
analysis used in organizational research. 


Data collection 


Organizational research uses three main sources 
of data: qualitative observation and interviewing, 
surveys of organization members, and institutional 
records. Qualitative methods of data collection are 
described elsewhere [see FIELD work], Surveys 
within organizations take three forms: surveys of 
one stratum only; surveys of two or more strata 
within the organization (for instance, workers and 
management, or students and faculty); and “rela- 
tional surveys” of linked pairs or sets of role part- 
ners. In surveys of two or more organizational 
Strata (“multistratum surveys”) individuals are 
usually sampled at random within status groups 
and are identified only by the general status each 
one occupies. In “relational surveys” each individ- 
ual is identified by the specific others with whom 
he interacts, and the sample is set up to include 
these role partners. When a sample of individuals 
is asked to provide the information on their role 
partners, who are not themselves interviewed, we 
have a “pseudo-relational survey.” 

Institutional data include records made by or- 
ganizations for their own use; information from 
directories, which can be turned into “data banks”; 
and “institutional questionnaires,” sent to samples 
of organizations, to be filled out by one or more key 
informants in each organization. 


Measurement 


The formal aspect of measurement that is most 
significant for organizational research is the rela- 
tion of the units from which the basic data are 
gathered to those which are characterized by the 
measurement. Information can be gathered on the 
organization as a whole or on component parts of 
it, such as individuals, interacting pairs and sets, 
and subgroups of members. Data on component 
parts can be aggregated in various ways to charac- 
terize the organization; and data on the organiza- 
tion can be considered as a “contextual char- 
acteristic” of the members. The following types of 
characteristics of collectives and their members 
have been distinguished by Lazarsfeld and Menzel 
(1961), and further discussed by Barton (1961, 
pp. 2—3 and appendix 1). , 

Integral or global characteristics of a unit do 
not derive from aggregation of members, pairs, OF 
sets within that unit, but from properties of the 
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unit as a whole. In the case of an organization, 
examples of such properties are its physical equip- 
ment, formal rules, budgets, programs, collective 
events, and collective outputs. 

Relational characteristics derive from informa- 
tion concerning the relationship between a unit and 
other units. In the case of an individual, they in- 
clude popularity, measured by number of choices 
received on sociometric questions; participation in 
an occupational community, measured by the num- 
ber of friends who belong to the same occupation; 
supportiveness of political environment, measured 
by the proportion of his associates who vote the 
same way as the respondent; cosmopolitanism, 
measured by the number of extraorganizational 
contacts he has; and so on. For collective units, 
relational characteristics would include the amount 
of communication between one unit and various 
others in its environment, the frequency of coopera- 
tion or conflict with other units, the volume of eco- 
nomic transactions between them, and so on 
(Levine & White 1961; Litwak & Hylton 1962). 

The underlying model for relational data is a 
“who-to-whom” matrix showing the value of the 
relationship for each pair within a group. There 
may be different types of relationships (who likes 
whom, who gives orders to whom, who talks shop 
with whom, etc.), each of which generates a matrix 
[see SOCIOMETRY]. Summary scores for individuals 
are found in the marginal totals of each matrix. 
Other operations can give us the number of second- 
order and higher-order connections which an indi- 
vidual has (through his immediate partners) and 
the characteristics of his interpersonal environ- 
ment, Organizations likewise have higher-order con- 
nections and an interorganizational environment. 

Aggregate measures characterize a collective in 
ras of distributions of data concerning its mem- 

ers. The simplest aggregate measures are additive 
aon such as rates and means. Thus the mo- 
fion S units may be measured by the propor- 
on a eir men who feel strong loyalty to the 
oa “ey climate of a college by the pro- 
aut a its faculty members who are liberal; the 
iia ea of a school by the mean IQ of its stu- 
a denis these cases, as we would expect, there 
iti ae haa seg between the individual 

Sad Ea ective properties; relationships true on 
tonal iey oe level are duplicated on the organiza- 
aan n However, this kind of correspondence 
cal falla, ways exist, as discussions of the “ecolog- 

Davis as have shown (Robinson 1950; Duncan 
can also b 3; Goodman 1953). Additive measures 
Organization, used as indicators of more complex 

nal characteristics that are not simply 
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the sum of individual traits: for example, the pro- 
portion of an organization’s total personnel that 
is in administrative positions may indicate how 
bureaucratized the organization is. 

A peculiar type of aggregate property arises when 
researchers ask samples of members for their per- 
ceptions of the organization or parts of it, and add 
up their answers to obtain a kind of aggregate per- 
ception, For example, an army company may be 
classified as having authoritarian leadership if a 
large proportion of members report that the leader 
behaves in an authoritarian way (Selvin 1960). 
A problem here is that different groups within an 
organization may evaluate the same behavior dif- 
ferently: Halpin (1956) found school principals 
reporting themselves as high on “consideration” for 
teachers; the school boards agreed with them, but 
the teachers themselves did not. 

The variance of the distribution of member data 
may also be used to characterize the collective. 
Thus the range or standard deviation of the in- 
comes of an organization’s members can be used 
as a measure of the organization’s equalitarianism, 
and the same measures, when applied to the ability 
levels of a high school class or the values of mem- 
bers of a union local, can indicate the degree of 
heterogeneity in the class or of value consensus in 
the union. Equality, homogeneity, and consensus 
are emergent properties on the collective level, 
without individual counterparts. 

Finally, the correlations between variables with- 
in a collective can be used as aggregate measures. 
For instance, the correlation of values with rank 
in the American army indicates its degree of “value 
stratification” (Speier 1950). Similarly, the corre- 
lation of military rank with external social status 
measures the degree of ascription as compared 
with achievement in the organization’s promotion 
system. 

The members of the collective from which data 
are aggregated need not be individuals; they may 
be smaller collectives within the larger one. Thus 
an aggregate measure of the degree of local democ- 
racy within national unions might be the propor- 
tion of locals within a union that had contested 
elections. 

When we have data from several status groups 
within a number of organizations, we can obtain 
aggregate measures for each stratum. An organiza- 
tion might thus be characterized as having high 
morale in the upper strata but low morale among 
the rank and file, as having authoritarian top man- 
agers and democratic foremen, and so on. The 
differences between different strata, in rates or 
mean scores on various attitudes, behaviors, back- 
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ground data, etc., are actually a form of correla- 
tional index. 

Relational-pattern measures (sometimes termed 
structural or sociometric measures) are aggrega- 
tions of relational data on the members of a collec- 
tive unit. Group cohesion is often measured in this 
way by the ratio of in-group to out-group choices 
when each member of a group is asked whom he 
chooses or would choose as a close friend; group 
integration may be indicated by the over-all fre- 
quency with which group members communicate 
with one another [see COHESION, SOCIAL]. More 
complex measures would include the extent to 
which relations within a group form self-contained 
cliques. Patterns of relationship among subunits— 
for example, the amount of conflict between sub- 
groups in an organization—can also characterize 
a collective. Just as relational characteristics of 
individuals are indicated by their row or column 
scores in a who-to-whom matrix, relational pattern 
measures for a collective derive from the matrix 
as a whole—that is, from the distribution or pat- 
terning of pair relations. 

Contextual measures arise when we characterize 
a member by the properties of the collective of 
which the member is a part. Being dissatisfied is 
an integral property of the individual; being a 
member of a work group where a high proportion 
are dissatisfied is a contextual property, derived 
from the aggregate of individual properties of mem- 
bers. Being a member of a conflict-ridden organiza- 
tion is a contextual property derived from a rela- 
tional-pattern property of the collective; attending 
a college with a large library is a contextual prop- 
erty derived from an integral property of the col- 
lective. 

In a multistratum survey of many organizations, 
contextual data can also be derived from different 
strata. For example, aggregate data on university 
faculty are contextual for students, and character- 
istics of foremen are contextual for the workers 
under them. 

Organizations, too, have contexts, such as the 
larger organization of which they are part or the 
industry or community in which they are located. 
Locals may be part of a “democratic” or an “un- 
democratic” national union; business firms may be 
located in communities with growing or shrinking 
populations, or belong to competitive or concen- 
trated industries. 

In summary, individual members of an organiza- 
tion have integral, relational, and contextual prop- 
erties, Collectives likewise have integral properties, 
relational properties (with regard to other organ- 
izations), and contextual properties (those of larger 


collectivities of which they are members). But in 
addition, collectives have aggregate properties de- 
rived from their members’ integral characteristics, 
and relational-pattern properties derived from their 
members’ relationships, 


Research designs 

We have distinguished five main sources of data: 
qualitative observation, one-stratum surveys, multi- 
stratum surveys, relational surveys, and institu- 
tional data; this is the first of three dimensions on 
which research designs can be classified. A second 
dimension is the number of organizational units 
studied: “case studies” of one organization, “com- 
parative studies” of two or several organizations; 
and “large-sample studies” of enough organizations 
to make statistical analyses with organizations as 
units of analysis. (Either a whole organization or 
a formal subunit can be the focus of study; a study 
of 100 work groups within a large corporation 
would be a “large-sample study” of work groups.) 
The third dimension is whether data are gathered 
for one point in time or whether comparable data 
are gathered for each of two or more time periods. 
Studies over time permit analysis of change and 
inferences of causal relationships. 

A typology of designs for organizational studies 
is generated if we run these three dimensions 
against one another (see Table 1, in which each 
row of six cells derives from a different source 
of data). 


Analysis of data on organizations 


The different types of research design outlined 
in Table 1 permit several basic types of analysis, 
some qualitative and some quantitative. Each type 
of analysis can be performed for units at different 
levels of aggregation—for individuals, for linked 
sets of individuals, or for organizations. We will 
examine each type of analysis in turn, indicating 
its limitations as well as the problems that it per- 
mits us to study. è 

Ideal-type comparison. A study of one organiza- 
tion at one point in time, using qualitative methods, 
provides a very limited scope for analysis. Such 
analysis usually involves comparing the case at 
hand with a theoretically derived ideal type— 
Weber’s model of bureaucracy, for instance, OF the 
economists’ model of a firm composed of rational 
economic men. Thus, an analysis of bureaucratic 
patterns in the navy officer corps describes a nuni 
ber of striking behavior patterns (avoiding respon- 
sibility, ritualism, insulation from the outside 
world, and ceremonialism) and concludes: “The 
military variant of bureaucracy may thus be view 
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Table 1 — A typology of designs of organizational studies 


One 
Qualitative case study 
Davis 1948 


Qualitative case study over time 
Selznick 1949 

Gouldner 1954 

Roethlisberger & Dickson 1939 


One-stratum one-unit survey 


Walker & Guest 1952 


One-stratum one-unit panel survey 
Stouffer et al. 1949 
Multistrotum one-unit survey 


Stouffer et al, 1949 


Multistratum one-unit panel survey 


Newcomb 1943 
lieberman 1956 


One-unit relational survey 


Weiss 1956 
Organizational Stress 1964 
Stogdill et al. 1956 


One-unit relational panel survey 


W, Wallace 1964 
A. F. C. Wallace & Rashkis 1959 


Case study using one-time institutional 
data 


Case study using institutional data 
over time 


NUMBER OF UNITS STUDIED 


Several 
Qualitative comparative study 
Coser 1958 
Form & Nosow 1958 


Qualitative comparative study over time 
Guest 1962 


One-stratum comparative survey 
Katz & Hyman 1947 


One-stratum comparative panel survey 
Multistratum comparative survey 
Georgopoulos & Mann 1962 

Illinois, University of 1954 


Mann & Hoffman 1960 
Multistratum comparative panel survey 


Comparative relational survey 


Comparative relational panel survey 
Morse & Reimer 1956 

Comparative study using one-time 
institutional data 


Harbison et al. 1955 


Comparative study using institutional 
data over time 


Many 
Many qualitative case studies 
Udy 1959 


Many qualitative case studies over time 


One-stratum survey of a large sample of 
organizations 

Lazarsfeld & Thielens 1958 

Bowers 1964 

One-stratum panel survey in a large 
sample of organizations 


Multistratum survey of a large sample 
of organizations 


Multistratum panel survey of a large 
sample of organizations 


Relational survey of a large sample of 
organizations 

Kahn & Katz 1953 

Gross et al. 1958 


Relational panel survey of a large 
sample of organizations 


Study of a large sample of organizations 
using one-time institutional data 

Faunce 1962 

Douglass 1926 

Study of a large sample of organizations 
using institutional data over time 


Brown 1956 Haire 1959 


a a skewing of Weber's ideal type by the situa- 
ohm elements of uncertainty and standing by” 
3 ‘ai [1948] 1952, p. 384). Similarly, the authors 
ea a famous Western Electric study (Roethlis- 
hic, & Dickson 1939) sought to discover, by 
Phat: of qualitative interviewing and observation, 
ae er factory workers behaved like a set of dis- 
when ae rational economic units and concluded, 
tion poera with such phenomena as group produc- 
Sin Fae and informal leaders, that they did not. 
sie Re, case studies thus can disclose the exist- 
kone Permena that raise problems for the 
ated s from which ideal types have been gener- 
-Such studies therefore inspire further research 


Lipset et al. 1956 


of more complex design to find the conditions 
under which the ideal—typical or the deviant phe- 
nomena occur. 

Multivariate comparison of a few cases. When 
we have data on two or several organizations— 
whether these data are qualitative, aggregated from 
surveys, Or compiled from institutional records— 
we can use some form of “quasi-experimental” anal- 
ysis (also known as “quasi-correlational” analysis ). 
The simplest form is “strategic paired comparison,” 
a method that calls for two organizations that are 
formally similar but differ strongly in one respect. 
The presumed consequences of this difference are 
then explored in detail. Thus Coser (1958) com- 
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pared a medical ward with a surgical ward in the 
same hospital; the wards, though formally similar, 
presented many differences in the actual exercise 
of authority and the informal relations of doctors 
and nurses. Mann and Hoffman (1960), using a 
survey of top managers, foremen, and workers, 
compared an automated power plant with a non- 
automated one. Treating the results as a “quasi- 
experiment,” they suggest that automation in- 
creased the men’s sense of having an influence on 
plant operations, as well as their interest and satis- 
faction, but also aggravated tensions related to the 
job. 

In these two examples, the researchers translated 
a gross institutional characteristic (medical versus 
surgical, automated versus nonautomated) into 
several more general sociological characteristics 
(time pressure on decision making, job enlarge- 
ment, interdependence of components), which 
were capable of explaining the observed differences 
in the dependent variables in terms of general so- 
ciological propositions (for example, when there 
is more time pressure on decision making, there 
will be less group consultation). Having only two 
cases, however, they could not test the relative ef- 
fects of the several proposed explanatory variables 
or control for the effects of other variables. 

When researchers have somewhat more cases, 
they can locate each organization in a multidimen- 
sional classification of explanatory variables. Thus 
a study of six organizations’ responses to disaster 
found seven characteristics that differentiated the 
more effective from the less effective ones; each 
organization represented a particular pattern of 
these seven attributes (Form & Nosow 1958). Such 
an analysis is suggestive but not conclusive, since 
there is only one case per cell. 

Qualitative study of change. Most qualitative 
case studies that go beyond description of formal 
structure necessarily examine time sequences of 
the behavior of organization members. These may 
disclose the “microprocesses”—the exercise of in- 
formal pressures, the ways of getting around rules, 
the immediate causes of deviance—that maintain 
normal equilibrium in the organization. Many ex- 
amples of such microprocesses were observed in 
the Western Electric study. A worker who went 
beyond the group norm of output was ridiculed or 
“pinged” on the arm by fellow workers. A researcher 
entering the room was mistaken for a time-study 
man, and everyone slowed down. Changes in pro- 
duction often fo!lowed off-the-job personal problems 
(Roethlisberger & Dickson 1939). More recently, 
it was found that episodes of patient disturbance 
in a mental hospital followed staff disagreements 


on the patients’ treatment (Stanton & Schwartz 
1954). 

The logic of sequence analysis is post hoc, ergo 
propter hoc; but this kind of reasoning can be mis- 
leading in the absence of experimental or statis- 
tical controls to eliminate accidental or spurious 
factors. Researchers therefore try to test their inter- 
pretations by observing as many repeated sequences 
as they can and by locating changes precisely in 
the time sequence. This procedure can become a 
qualitative form of time-series experiment [see 
EXPERIMENTAL DESIGN, article on QUASI-EXPERI- 
MENTAL DESIGN; see also Campbell & Stanley 1963]. 

Qualitative studies over long time periods, par- 
ticularly if they cover major organizational changes 
such as succession, growth, and reorganization, 
permit derivation of relationships between organ- 
izational variables. Such studies also test hypotheses 
about the functions of various parts of the system 
by observing what happens when one part of it is 
changed, For instance, McCleery (1957) studied 
the process of change in a prison from an authori- 
tarian to a more liberal regime. His main argument 
was that the arbitrary behavior of the old regime 
created such insecurity that the inmates accepted 
an exploitative elite of “old cons” who interpreted 
and negotiated with the authorities, and who con- 
trolled disorderly inmate behavior that might have 
jeopardized their own special privileges. At the 
same time, this administrative arbitrariness made 
the inmates hostile to the official programs of re- 
habilitation. When the new administration created 
fairer procedures and more communication with 
the inmates, this hostility was reduced, but the 
power of the inmate elite was reduced still faster, 
so that an outburst of disorder resulted. A new 
equilibrium was finally achieved without an ex- 
Ploitative elite and with more rehabilitative activity. 

Several operations can be distinguished here: 
identifying the “anatomy” of the system—the major 
formal and informal status groups; identifying the 
key variables characterizing each group; reporting 
the values of these variables at several stages in 
the process of change; deriving causal relationships 
among them; and locating these causal relation- 
ships in a functional model of the whole system 
(see Figure 1). This model can be analyzed to 
account for the initial equilibrium, the sequence © 
changes, and the final equilibrium (Barton 
Anderson 1961). 

That such a system of relationships was 
to exist in a single case is hardly conclusive e zi 
dence that it exists in all such cases. Comparat va 
study of several organizations can begin to provic™ 
checks against alternative possibilities and to spe 
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rehabilitation program disorder 


Figure 1 — Causal relationships in prison disorder 


Source: Adapted from Barton & Anderson 1961. 


cify conditions under which the relationships hold. 
Thus, Guest (1962) compared two studies of suc- 
cession of a top manager and proposed no fewer 
than seven variables in an effort to explain why 
the two men adopted opposite policies. 
Multivariate analysis of individuals. A survey 
Of individual attitudes and behavior within a single 
organization can analyze the correlates of job satis- 
faction, productivity, mobility aspirations, or any 
other organizationally related attitudes and behav- 
iors, in exactly the manner of ordinary public- 
Opinion studies, except that detailed information 
On organizational status and activities can be added 
to the usual limited background data. For instance, 
Stouffer and his colleagues (1949) analyzed the 
elation of rank, combat experience, length of serv- 
ice, and similar organizational-status variables to 
Morale, mobility aspirations, etc., for the army as 
@whole and for various components. 
of ltivariate analysis of linked sets. Analysis 
4 characteristics of pairs or sets of role partners 
i possible by relational samples. This is 
ae pee the role set corresponds to an organ- 
+ i subunit—as in studies of the correlation of 
“tive isor behavior with worker morale or produc- 
T (see, for instance, Kahn & Katz 1953). In 
a poy (1955) of 40 air crews, behavior of 
e mander was described for three variables 
crew's eo perceptions of his crewmen; each 
Biur a prescriptions for their commander were 
en, for the same three variables; and a meas- 
crew e conformity of commander behavior to 
Prescriptions was derived. His conformity 
en be correlated with other attributes of 


I$ Crew. A 
: » Such as its cohesi of con- 
sensus, sion and degree of 


Where the role set does not correspond to an 
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organizational subunit, “snowball sampling” can 
be used. For example, Kahn and his associates 
(Organizational Stress . . . 1964) took 53 managers 
as “focal persons” and interviewed two superiors, 
two subordinates, and three pairs of each. Tensions 
of the focal person could then be related to char- 
acteristics of his role set. The variety of measures 
that can be created with such samples is shown by 
a study by Stogdill and his colleagues (1956), who 
surveyed 47 role sets of a superior and one or two 
subordinates in a research agency, measuring role 
expectations and perceived role behavior for self 
and other for each of 45 role elements. From these 
basic data they derived measures of each pair's 
consensus on role expectations, conformity of be- 
havior to own and partner's expectations, agree- 
ment on perceptions of behavior, and differentia- 
tions or division of labor between the two statuses, 
both perceived and expected. 

Multivariate analysis of organizations. When- 
ever we have data on a large number of organiza- 
tions or subunits within organizations, it is possible 
to apply the same methods of analysis that are used 
in survey analysis of individual data. The organiza- 
tional data may derive from a large set of qualita- 
tive studies containing comparable data, from sur- 
veys that have sufficient samples within a large 
number of organizations to provide aggregate data, 
or from institutional data derived from records, 
“data banks,” and surveys of informants by means 
of institutional questionnaires, Thus Udy (1959) 
classified 82 production organizations, using data 
from anthropological monographs compiled in the 
Human Relations Area Files. He found a high cor- 
relation between the degree of complexity of their 
technology and the number of levels of authority 
(see Table 2). 

Blau and his colleagues (1966) studied a large 
sample of public bureaucracies with an institu- 
tional questionnaire which measured size, division 
of labor, professionalization, and centralization of 
authority. Quite complex conditional relationships 
were found; for example, in organizations with few 
professionals, division of labor was positively re- 
lated to centralization of authority, while in highly 
professionalized organizations the relationship was 
the reverse. 


Table 2 — Authority and technological complexity in 82 
organizations 


INDEX OF TECHNOLOGICAL COMPLEXITY 
Number of levels 


of authority High Low 
3 or more 28 3 
2 or less 5 46 


Source: Udy 1959, p. 584. 
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Very large samples of organizations can be 
studied by using published directories, data banks 
of institutional information, or questionnaires to 
informants in each unit. Thus, Faunce (1962) 
studied 753 union locals by passing out a question- 
naire at the U.A.W. convention, and Douglass 
(1926) studied 1,000 city churches by using in- 
formant questionnaires. 

Contextual analysis of individuals. When we 
have drawn samples of individuals within several 
or many organizations, contextual analysis becomes 
possible, The simplest form of this is an examina- 
tion of the relationship between individual attitudes 
or behavior and the attributes of the organization 
of which the individuals in question are members. 
Thus, students report more cheating at colleges that 
are large, have many students per faculty member, 
have most students living off campus, have easy 
admissions policies, and are coeducational (Bowers 
1964). Moreover, it can be shown by means of 
three-variable “contextual tables” that the relation- 
ship between two individual attributes is different 
in different organizational contexts, and thus that 
there is an organizational factor influencing the 
processes of individual behavior. For instance, 
Lipset and his colleagues, in a study of a large local 
in the International Typographical Union, surveyed 
two strata: chapel chairmen and rank-and-file 
members. Over-all, the chairmen were more inter- 
ested, knowledgeable, and active in union politics 
than the rank and file were. However, when union 
members were separated according to size of the 
shop, it was found that this difference between 
chairmen and rank and file appeared only in the 
larger shops (Lipset et al. 1956, pp. 176-182). 
Similar effects were obtained in a study of three 
prisons by Berk (1966). 

Contextual analysis of linked sets. Relationships 
between characteristics of pairs of role partners 
or between those of whole role sets may be modi- 
fied by their organizational context. To study this 
requires comparing relational surveys in a number 
of organizations—for example, examining the rela- 
tion of informal contact to consensus between role 
partners in organizations of different degrees of 
bureaucratization. 

Contextual analysis of organizations. The larger 
setting—type of community, industry, or society— 
may affect the characteristics of organizations or 
the interrelations among them ( Udy 1965). Cross- 
national comparisons show, for instance, that the 
ratio of supervisors to workers may be 1 to 15 in 
an American steel plant and 1 to 50 in a similar 
German steel plant (Harbison et al. 1955; see 
also Evan 1963). Presumably the relationship be- 


tween plant size and the supervisor—worker ratio 
is also different in each case. A complex con- 
textual effect is exemplified by the finding that 
the relationship between secular orientation and 
effectiveness is positive for YMCAs at nondenomi- 
national colleges, but negative for those at de- 
nominational colleges (Lucci 1960). 

Time-series analysis. When information is avail- 
able on several variables within one organiza- 
tional unit for many points in time, it is possible 
to search for causal relationships by examining the 
sequence of changes in these variables. Brown 
(1956) examined union records covering a period 
of 50 years. Indicators of intraunion conflict— 
number of resolutions for constitutional change, 
challenges to convention delegates, failure of offi- 
cers to be re-elected, jurisdictional disputes between 
locals—were found to be inversely related to size 
year by year during a cycle of growth, decline, and 
new growth. Tsouderos (1955) used time series 
of income, expenditure, capital, membership, and 
number of administrative employees to study the 
process of growth, bureaucratization, and decline 
in ten voluntary organizations. His finding was that 
loss of members after a period of growth led to 
increased administrative staff, which led to main- 
tenance of income and activities but further loss 
of members and to eventual decline. 

Time series could also be provided by survey data 
on several organizational strata repeated over many 
periods. Some large corporations collect data of 
this kind as a “morale barometer,” but it does not 
seem to be systematically analyzed in the way that 
public-opinion time series have been. 

Panel analysis of individuals. A design that ob- 
tains information on a sample of organization 
members at two or more points in time permits @ 
much clearer isolation of causal relationships 
among individual characteristics than does a one- 
time survey. For instance, Stouffer and his associ- 
ates found that higher-ranking enlisted men were 
more often in good spirits, accepted the soldier role, 
were more satisfied with their army job, and 
thought the army was well run. To test whether 
these conformist attitudes were a cause of promo- 
tion or only a result of being promoted, samples of 
privates were surveyed when newly recruited and 
subsequently followed up after several months dur- 
ing which some had been promoted. Those with 
more conformist attitudes early in their army C&- 
reers were more likely to be promoted later on. The 
difference held good when various background fac- 
tors related to both promotion and attitudes were 
held constant (Stouffer et al. 1949, vol. 1, PP: 
147-154). 
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Another study dealt with workers who became 
foremen or shop stewards, repeating on them and 
on matched control groups a set of attitude ques- 
tions given a year previously to all workers in the 
plant. A third wave was done two years later, when 
some of the foremen had been demoted because of 
layoffs and some of the stewards had been replaced 
in office. Many attitudes had changed considerably 
in response to changes in status (Lieberman 1956). 

A panel study that included many organizations 
would permit comparison of social processes in 
different settings. Such studies do not appear to 
exist. (See, however, Miller 1958 for an example 
of a “pseudo-pane]” study which compares students 
in different years of college for different types of 
institutions, suggesting major differences in the so- 
cialization process. ) 

Panel analysis of linked sets. If, as is often 
said, organizations are systems of interacting parts, 
the best design for studying them is a relational 
panel, since this would permit analysis of changes 
among related individuals and groups over time. 
But studies that make use of such panels are re- 
markably rare. We have mentioned the qualitative 
analysis by Stanton and Schwartz (1954) of pa- 
tient disturbance and staff disagreement in a men- 
tal hospital. One quantitative panel study of staff 
and patients found no such relationship (Wallace 
& Rashkis 1959), but another found not only that 
Fete disagreement was associated with subsequent 
Patient disturbance but also that patient disturb- 
ance led to staff disagreement (Rashkis & Wallace 
1959), 

Walter Wallace (1964; 1965) obtained a com- 
re sociogram of contacts among students at one 
se He also obtained panel attitude data from 
“ae io at entry, at midsemester, and at the end 
sie rst semester, and attitude data on all other 
‘Bike S at midsemester. He constructed measures 
“inten dominant attitude of each freshman’s 
ae sonal environment,” which he related to 
ier in freshman attitudes. Segments of the 

Personal environment, classified by sex and 


coll 7 
ege class, were shown to have differential 
effects, 


en panel analysis would be particularly 
since it studying role relationships as a system, 
mene would permit examination of such phe- 
i the effects on the whole network of 
nets tol ns and behavior of a change in one part- 
appear E expectations. But such studies do not 
4 0 exist. 

nea analysis of organizations. If standardized 
over td gathered for a sample of organizations 

me, the determinants of organizational 


change could be analyzed by the same methods 
that are used in analyzing panels of individuals. 
The basic data might be from any of the sources 
we have discussed—qualitative studies, surveys, or 
institutional records—provided they measure the 
same characteristics of a large number of organ- 
izations at several comparable points in time. A 
near example of this type of analysis is the finding 
by Stouffer and his colleagues that companies 
which before D-day had the highest willingness for 
combat subsequently had the lowest rates of non- 
battle casualties. The relationship was much higher 
for veteran regiments than for nonveteran regi- 
ments (Stouffer et al. 1949, vol. 2, chapter 1). 

A comparable approach was taken by Lipset and 
his colleagues, who analyzed time-series data on 
party voting in the printers’ union, broken down by 
large versus small locals. They were able to show 
that a rising opposition normally gains strength 
first in the large locals, while the small ones sup- 
port the party in power; when the party supported 
by the large locals wins, these locals are again the 
first to go into opposition (Lipset et al. 1956, 
pp. 373-382). But, once again, such studies are 
hardly ever undertaken. 

For purposes of developing theories of organiza- 
tional processes at the microlevel, relational panel 
studies obtaining data on role expectations and 
behavior would appear to be particularly appro- 
priate. They would provide factual data on which 
to base development of simulation models and 
mathematical models, just as panel studies of vot- 
ing behavior (including data on the perceived 
norms and behaviors of friends and associates and 
of various social groups) did for the development 
of simulation models of electoral processes [see 
SIMULATION, article on POLITICAL PROCESSES; see 
also VOTING]. 

For testing theories of organizational change, 
panel data on organizational characteristics for 
large samples of organizations would be highly de- 
sirable, We now have a good many examples of 
studies of large samples of organizations, but be- 
cause they deal with the organizations at only one 
point in time, they are severely limited in establish- 
ing causal relations. 

ALLEN H. BARTON 


[Directly related are the entries EXPERIMENTAL DESIGN; 
FIELD WORK; OBSERVATION; PANEL STUDIES. Other 
relevant material may be found in ADMINISTRATION, 
article on ADMINISTRATIVE BEHAVIOR; CouNTED 
DATA; INDUSTRIAL RELATIONS, article on THE SO- 
CIOLOGY OF WORK; INTERVIEWING, article on SOCIAL 
RESEARCH; MULTIVARIATE ANALYSIS; and in the 
biographies of Mayo; STOUFFER.] 
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ORTEGA Y GASSET, JOSÉ 


José Ortega y Gasset (1883-1955), Spanish 
writer and philosopher, was born and died in 
Madrid, where both sides of his family were closely 
connected with the worlds of politics and journal- 
ism. Ortega attended various Jesuit schools and 
studied at the University of Madrid, where he re- 
ceived his PH.D. in 1904. His postgraduate work was 
done in Leipzig, Berlin, and Marburg, during which 
time he was greatly influenced by the ideas of Georg 
Simmel, Hermann Cohen, and Paul Gerhard Natorp. 
Between the years 1910 and 1936, Ortega was pro- 
fessor of metaphysics at the University of Madrid. 

Ortega began writing for Spanish and South 
American newspapers and journals in 1902. In 
1923 he founded the Revista de occidente, which 
he directed until its activities were interrupted in 
1936. As a liberal interested in social problems, 
Ortega opposed the dictatorship of General Primo 
de Rivera, which lasted from 1923 to 1930, and in 
1931 he founded, together with G. Marañón and 
Ramón Pérez de Ayala, the Agrupación al Servicio 
de la República, a political group. Soon afterward, 
he was elected a deputy in the Constituent Assem- 
bly, a position he held from 1931 to 1933, At the 
outbreak of the civil war in 1936, Ortega, unable to 
support either side in the conflict, chose instead to 
live abroad—in France, Holland, Argentina, and 
Portugal. Upon his return to Spain in 1945 he kept 
aloof from both the government and the University 
of Madrid. In 1948, with Julián Marías, Ortega 
founded the Instituto de Humanidades in Madrid. 

For forty years Ortega was one of Spain’s lead- 
ing intellectual figures and the center of a broad 
intellectual movement that has been called the 
“school of Madrid.” As a writer, lecturer, educator, 
political guide, philosopher, and creative sociologist, 
he made a deep mark on twentieth-century Spanish 
thought and literature. 

Although influenced at first by Neo-Kantianism 
and phenomenology, Ortega soon criticized these 
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doctrines and evolved a metaphysics according to 
vital reason that went beyond philosophical ideal- 
ism without slipping into realism. This metaphysics 
states that the ultimate or radical reality from which 
every other reality draws its roots cannot be re- 
duced to any idea or theory but is “my life” in the 
biographical, not the biological, sense. To Ortega, 
“Life is what we do and what happens to us.” 
“To live is to deal with the world, aim at it, act in 
it, be occupied with it.” “I am myself and my cir- 
cumstance, and if I do not save my circumstance, 
I do not save myself (Meditations on Quixote 
1914). Things and the ego are only abstract ele- 
ments in the primary reality—my life—which con- 
sists in activity, My life is given to me, not made 
for me: I have to make it with things. The ego is a 
vital project and the circumstance is a repertory of 
possibilities that require choice and the justification 
of choice, In order to live, I must imagine, think, 
and reason. Reason is a condition of human life, a 
requisite for living, and it is life itself that makes 
it possible to understand whatever has a function 
in life, Insofar as life apprehends reality it is reason 
(vital reason); in its concrete form, it is historical 
reason, since individual life includes its own past 
and all of history, Ortega had to re-elaborate the 
categories and forms of philosophical thought in 
order to arrive at a form of circumstantial thought 
according to which each individual life (as well as 
each society) is a point of view on the universe— 
a perspective that is an ingredient in, and an or- 
ganizing principle of, reality. 

In a strict sense, human life is individual; when 
several lives interrelate, the interindividual is 
formed. The collective, or social, individual (no 
longer an inter-individual) is impersonal, unspon- 
tancous, and without responsibility. 

Society consists of those usages or customs to 
which the entire “people” is subject. Customs are 
what is done, thought, believed, and said. By means 
of customs, society exerts pressure on individuals, 
regulates conduct, and automates a large part of 
life. At the same time, customs transmit the heri- 
tage of the past and create progress and history. 
Customs put man in a position to discover what is 
properly his and make it possible for him to deal 
with the unknown. Language, beliefs, manners, 
and laws are diverse forms of customs, By far the 
strongest are law and the state. Authority is nec- 
essary for society to exist, since man’s social im- 
Pulse fights with his drive toward disassociation and 
insociability. Authority's goal, however, is to permit 
each individual freely to live his personal and irre- 
Placeable life. 

Society is composed of the masses and the minor- 


ity—not in the sense of social classes, but of cate 
gories of men, or social functions. The masses con. 
sist of an unqualified majority and a minority 
composed of qualified men with leadership ability 
(who may, in most aspects and activities of thelr 
lives, be part of the unqualified masses), The revolt 
of the masses is not a positive attempt to gain 
access to superior forms of life, but the pathological 
striving of the unqualified to act as if they were 
qualified. Unlike the man of responsibility and 
effort, the mass-minded man demands nothing of 
himself at the same time that he attempts to judge 
and evaluate everything around him. 

Historical variation takes place within each sock 
ety according to the rhythm of 15-year generations 
which create, even though it may be small, a total 
change. Those who belong to a generation are the 
men born within a “region of dates,” who have to 
meet the same problems, in the same world, at the 
same level, At cach moment there coexist at least 
three active generations, partially overlapping and 
producing historical change. 

The enormous wealth of themes in Ortega’s 
thought can only be indicated In this extreme abbre- 
viation of his theories of human, individual, bi» 
torical, and social life. 

JuuiAn Manias 


[For the historical context of Ortega’s work, see Hie 
TORY, article on THE PHILOSOPHY OP HINTORY, ond 
the biographies of Buncxnannr, Din THEY; SIMMEL 
for discussion of the subsequent development of hit 
ideas, see Curativrry, article on social AsPReT® 
GENERATIONS; Mass socirry.| 
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OSSOWSKI, STANISŁAW 


+ Stanistaw Ossowski (1897-1963), Polish soci- 
—_ a scholar of wide interests; his writ- 
studies of the theory of soclal struc- 

ture, the methodology of the social sciences, social 
. aesthetics, and the sociology of art. 

i Was also active in the promotion of sociology 
à profession in Poland, organizing the Polish 
Kuh al Association after World War n; he was 
‘under of the International As- 
Wina, and served as its vice president from 1959 


amns works are marked by the combination 
ë broad historical and cultural perspective with 
— conceptual apparatus and expository 
tah po 2d he taught his students to maintain 

perspective without any sacrifice of 
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rigorous inductive analysis, He sought to tackle 
the focal social issues of his day, striving for un- 
ambiguous results that were firmly grounded em- 
pirically. He was deeply convinced that the social 
scientist must stick to his scientific opinions re- 
gardless of the pressures exerted on him by so- 
ciety, and he communicated this conviction to his 
students. As he put it, “The scientist is a man for 
whom disobedience of thought is a professional 
obligation, His social function is to question. In 
this respect he must obey neither the synod, nor 
the committee, nor the cabinet, nor Caesar, nor 
God” (1964a, p. 22). 

Ossowski was born in Lipno, the son of a phy- 
sician. He studied philosophy at the University of 
Warsaw and in Paris and Rome. The Lvov-Warsaw 
school of philosophy was principally interested in 
the logical analysis of the language of science, pre- 
ceding the Vienna Circle in this respect by a few 
years. The influence of the Polish school, above 
all the influence of Lukasiewicz and Kotarbiński, 
shaped Ossowski's interest in methodology and his 
conviction that a precise formulation of the prob- 
lems and the conceptual apparatus of the social 
sciences is indispensable to successful sociological 
study, His doctoral thesis (1926), an outgrowth 
of his combined interest in the logic of concepts 
and the theory of culture, was an analysis of the 
concept of sign and discussed the twofold function 
of culture elements; that of expressing and that 
of denoting. 

His next major work was a book, U podstaw 
estetyki (1933; “On the Foundations of Aesthetics”), 
written while he was working as a secondary school 
teacher, The book includes a review of past work 
on aesthetics and analyzes in precise terms the 

ive roles of the beauty of nature and the 
beauty of man-made objects in man's spiritual life 
and in human culture. On the basis of this book, 
Ossowski was awarded the venia legendi by the 
University of Warsaw in 1933. 

During the ten-year period that Ossowski worked 
on aesthetics, he was developing the interest in 
sociology that was gradually to become dominant. 
In the 1930s his concer with social issues and 
with the problems of socialism grew, and he be- 
came a member of the socialist-minded intelli- 
gentsia of pre-World War n Warsaw. Both his 
sociological and his social Interests were strength- 
ened by a period of study that he and his sociologist 
wife, Maria, spent in England, from 1933 to 1935. 
While there, Ossowski attended Malinowski's semi- 
nar on cultural anthropology and came into con- 
tact with the political and cultural problems of 
non-European peoples. This led him to try to avoid 
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European ethnocentrism in posing and solving so- 
ciological problems and to search for syntheses 
that would be valid for broad cultural areas. 

His contact with cultural anthropology also in- 
tensified his interest in the phenomenon of racial- 
ism, and in the late 1930s he delivered lectures 
on the social functions of racialist theories and 
myths in contemporary societies (1938). In his 
preface to the second edition, Ossowski wrote, 
“Today, racialist ideology seems an anachronism. 
. .. But racialism was an anachronism even before 
the war. That war demonstrated fully how vital 
such anachronisms can be when they encounter 
favourable conditions, and how insufficiently mod- 
ern civilization safeguards man against the possi- 
bility of mass folly” ([1938] 1948, pp. 5-6). It is 
hardly surprising to find also that Ossowski took 
an active part in the campaign against anti-Semi- 
tism in Polish academic life before 1939. 

During the war Ossowski was active in the 
Polish resistance. At the risk of his life, he lectured 
in the then-underground University of Warsaw. 
He also continued to write, but most of the works 
he wrote during this period were burned in 1944. 
The only work that survives is a small but im- 
portant book (1943), first published clandestinely, 
that outlines the ideal socialist democracy: It 
proposes that a planned economy and individual 
freedom be combined,. permitting social equality 
without the loss of cultural diversity. The new 
democratic culture would be marked by the co- 
existence of universalization and differentiation 
processes on a large scale. 

After the war Ossowski lectured at the University 
of Lodz and then was appointed to the chair of 
sociology at Warsaw. He held the chair until 1952 
when, with increasing Stalinization and the sys- 
tematic elimination of centers of independent re- 
search, the chair was abolished. Although he was 
thus deprived of contact with students, Ossowski 
continued his research with a group of his closest 
followers. During this period he wrote Class Struc- 
ture in the Social Consciousness (1957), analyzing 
the various ways that social stratification has been 
handled in different social theories and ideologies. 
He paid particular attention to differences in the 
conceptual apparatus of the theories of social 
structure in Marxist and non-Marxist sociology. 
At the International Sociological Congress in Am- 
sterdam in 1956, he explained that his chief con- 
cern in writing the book was to develop conceptual 
categories that would enable sociologists to com- 
pare changes in social structure in socialist and 
capitalist countries (1956). 

With the political changes that took place in 


Poland in 1956, Ossowski was restored to the chair 
of sociology at Warsaw. Thereafter he was one 
of those principally responsible for the rebirth of 
sociology in Poland. In 1957 his initiative led to 
the founding of the Polish Sociological Association; 
he was elected its president and remained in that 
office until his death. He devoted most of his en- 
ergy to teaching, but he also wrote a number of 
methodological essays, later collected in a single 
volume (1962a). He concentrated on the necessity 
of preserving humanistic perspectives and psycho- 
logical intuitions at a time when the theoretical 
structure of the social sciences was gaining in pre- 
cision and when modern standardized methods of 
empirical research were becoming increasingly 
effective. 

In the last years of his life Ossowski was pre- 
occupied with new problems, and he was hoping 
to write yet another book, on the dynamism of 
“higher values” in social processes. His work on 
these problems was interrupted by his death in 
1963. 

In one of his articles, written in Nazi-occupied 
Warsaw, Ossowski wrote: “Life in itself is not so 
important, and the transition to a state of inorganic 
matter is not so important either. Life is important 
as an opportunity for experience and for doing 
things which make life worth living” (1964a, 
p. 27). These words are a key to the force and 
charm of Ossowski’s personality. 


STEFAN Nowak 


[For the historical context of Ossowski’s work, see 
Marxist socioLocy and the biography of Matr 
NOWSKI.] 
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OSTROGORSKII, MOISEI IA. 


Moisei Ostrogorskii (1854-1919), also trans- 
literated as Ostrogorski or Ostrogorsky, was a 
Russian political scientist who studied the Ameri- 
‘an and British party systems around the turn of 
the century, He is generally regarded as the first 
major modern student of comparative party or- 
Sanization, and much of what has since been done 
Mm the United States and Britain, as well as in other 
‘ountries, stems from his work. In large measure 
n Weber’s and Robert Michels’ analyses of 
Patties followed directly from Ostrogorskii’s writ- 
yes (Weber [1906-1924] 1946, esp. pp. 104-111; 
chels 1911; Roth 1963, pp. 252 ff.). More re- 
cently, the works of such scholars as David Butler, 
ace Catlin, Maurice Duverger, Robert McKenzie, 
istin Ranney, and W. G. Runciman have attested 
N continuing significance of Ostrogorskii’s 
stt As Butler has put it, Ostrogorskii was the 
Seri to point to “the possibility of separating de- 
Ption from analysis” (1958, p. 44): 
Wag trogorskii was born in Grodno in Russia. He 
educated in law at St. Petersburg and worked 
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for some years in the ministry of justice. Subse- 
quently, while in his late twenties, he went to Paris, 
where he studied at the Ecole Libre des Sciences 
Politiques under Emile Boutmy. His work there re- 
sulted in a volume on the rights of women in public 
law (1892), which won a prize from the Paris 
faculty of law in 1892. Then he turned to the study 
of parties in the United States and Britain, spend- 
ing many years in both countries during the 1880s 
and 1890s and publishing a number of articles on 
American parties in French and American jour- 
nals. His two-volume treatise on the subject, De- 
mocracy and the Organization of Political Parties, 
was published in English in 1902 but, curiously, 
did not appear in French, the language in which 
it was written, until a year later (see Macmahon 
1933). 

Although Ostrogorskii lived for 17 years after 
the publication of these volumes, he did not follow 
them up with any other major scholarly works. He 
returned to Russia after the revolution of 1905 
and was elected to the first Duma in 1906 as a 
representative of the liberal Constitutional Demo- 
cratic party (Cadets). He played an important role 
in the party caucus during the first parliament but 
withdrew from active politics when it was dis- 
solved. He returned briefly to the United States to 
revise and condense his volume on American poli- 
tics. That version appeared in 1910. Just after 
World War 1, he published some articles in Russia 
on British constitutional history. His role in, or 
reactions to, the Revolution of 1917 has not been 
recorded. 

Ostrogorskii made several distinct contributions 
to political analysis. First and most important is 
the main theme of his two-volume work, the anal- 
ysis of the general traits of democratic mass par- 
ties. Second, his work represents the first important 
attempt at systematic comparative analysis of po- 
litical systems. He sought the causes of political- 
party behavior in the elements inherent in a 
democratic party with universal suffrage. The 
American party system was a crucial case for 
the political scientist to study in depth, precisely 
because the United States was the first mass- 
suffrage democracy: the hypotheses suggested by 
such a case study could be tested in various other 
countries then undergoing democratization. In the 
course of his analysis, Ostrogorskii attempted to 
explain the various structural attributes that dif- 
ferentiated the two major English-speaking soci- 
eties from each other. Third, in connection with 
his research he presented a detailed account of 
the party systems in the two countries; although 
modified by subsequent research, this account re- 


348 OSTROGORSKII, MOISEI IA. 


mains remarkably authoritative. Finally, his de- 
tailed reportage on parties and elections contains 
a large number of propositions concerning the de- 
terminants of voting behavior and public-opinion 
formation that are relevant to recent voting re- 
search. It is impressive to see how many of Ostro- 
gorskii’s generalizations about the effect of a 
campaign on the participation and partisan choices 
of the electorate have been verified by public- 
opinion surveys of samples of electorates (Lipset 
1964, pp. xxxix—xliii). 

Structure and function of parties. The key 
aspect of modern mass-suffrage parties, from 
which all else follows, is their dual nature: al- 
though extraconstitutional and presumably created, 
in Edmund Burke's words, to promote “the national 
interest upon some particular principle” on which 
the members agree, parties necessarily form perma- 
nent organizations staffed by professional politi- 
cians. The need to maintain the party apparatus 
inevitably leads parties to modify both the prin- 
ciples on which they are based and the activities 
that promote these principles, in such a way as to 
increase financial and electoral support for the 
organization. From being a means to an end, par- 
ties—that is, the perceived interests of the party 
elite—become ends in themselves. Starting as the 
organized spokesmen of a section of the electorate, 
they seek to transform that section into unwaver- 
ing supporters, since mass-suffrage parties clearly 
could not exist if they had to re-form their ranks 
at every election. Since access to political office is 
the basic raison d'être for a political party, parties 
will tend to subordinate any worthy ends that con- 
flict with gaining or retaining power. 

Much of Ostrogorskii’s analysis of American 
politics consists of a detailed specification of the 
behavior of politicians and organizations. He noted 
that at the heart of each political machine are a 
number of little machines which work through a 
system of mutual obligations. Since party unity is 
a precondition of electoral strength, those who 
want to participate in the party organization must 
exhibit strong loyalties, be “regular.” In an analy- 
sis that predated Michels’ comparable one (see 
Michels 1911), Ostrogorskii suggested that a party 
machine tends to be bureaucratized much like an 
army: as an organization perennially engaged in 
combat with an opposite number, it emphasizes 
obedience and chain-of-command logic. 

Ostrogorskii showed how structural factors fa- 
cilitate machine domination. In a mass democracy 
individuals, no matter how prominent, cannot fight 
those who control a party machine, without joining 
or forming another organization. The sheer prob- 


lem of numbers makes any form of politics utopian 
unless it is rooted in organization. Hence, an effort 
to revolt against a machine “must have recourse 
to the men of the Machine itself, must ‘fight the 
devil with fire, by allying itself with the rival 
Machine or even with a section of the Machine of 
the predominant party which it sought to over- 
throw” (1902, vol. 2, p. 439), This means that the 
success of reformers almost invariably results in 
the triumph of yet another machine, not in a 
cleansing or democratizing of politics. 

Among the factors which strengthen machine 
or bureaucratic domination is the indifference of 
the masses to the interests of collectivities, The 
typical citizen of an egalitarian and competitive 
society (which the United States already was and 
Britain was becoming) is motivated to press his 
personal concerns rather than collective interests. 
This indifference on the part of the electorate in- 
creases the need for parties effectively organized 
to win votes. Since such organizations serve pri- 
marily to mobilize votes, they necessarily minimize 
ideological differences. To win elections, machines 
seek to offend as few and to please as many as 
possible. [See POLITICAL MACHINES. ] 

In his analysis of the machine, Ostrogorskii em- 
ployed what has come to be known in sociology 
as the functional approach. He asked what the 
political needs are of different groups in a society 
and how they are met. The machine exists because 
it fulfills certain needs; it is clearly not prescribed 
by the formal political structure (see Merton [1949] 
1957, pp. 72-83). The fact that party organization 
assumed rather similar forms in the two leading 
mass democracies suggests that a strong party ap- 
paratus is inherent in, and required by, a system 
of universal suffrage in a complex society. How- 
ever, the decentralized government of the United 
States gives greater scope to party machines than 
the English system. 4 

The machine also satisfies the need of private 
interests to deal with a small, stable group of 
leaders. For the masses the party machine acts 48 
a go-between; it provides a more personal way of 
dealing with the impersonal bureaucratic structure 
of government. i 

Comparative party systems. Although Ostrogor 
skii stressed the similarity of the party systems © 
the United States and Britain, he was also con 
cerned with the differences between them. Amer 
can machines were much more corrupt and boss- 
ridden than the British; American politicians came 
from lower socioeconomic strata; British parties, 
unlike American ones, were disciplined at the na 
tional level. Seeking to explain these variations, 


he suggested a number of hypotheses about the 
two societies. 

In analyzing the reasons underlying English 
middle-class participation in politics at the turn 
of the century, he repeatedly noted the importance 
of deference. The party associations depended on 
amateurs or zealots, but the presidents of the local 
associations were always men of great prestige in 
the local community who lent respectability and 
status to political activity. “The hierarchical spirit, 
the class spirit, still lives in English society, the 
English people has not yet ceased to be what 
Bagehot called ‘a deferential nation,’ ‘politically 
deferential ” (1902, vol. 1, p. 614). In the United 
States, or: the other hand, the absence of deference, 
“the leveling spirit,” militated against the formation 
of a political leadership stratum drawn from the 
“better element.” The populist anti-elitist values 
that emerged as an outgrowth of the Revolution, 
the absence of historical continuity with an earlier 
feudal-monarchical system, and the social condi- 
tions of the frontier all served to make elites the 
subject of ridicule in American political combat. 
As Ostrogorskii observed, the participation and 
influence of the intellectuals and upper strata that 
existed in the early days of the republic had dis- 
appeared with universal suffrage. The absence of 
a traditional aristocracy with its emphasis on 
noblesse oblige meant that the American upper 
class felt less pressure than the English to partici- 
Pate in politics, The American social elite was 
willing to leave the work of politics to men from 
relatively lowly backgrounds. Ostrogorskii noted 
that the word “politicians,” which originally de- 
noted simply persons engaged in public affairs, be- 
came synonymous, as early as the 1820s, with self- 
Interested men seeking to gain personally from 
Political participation and power. 
cine the most significant difference between 
ae American and the British party systems is the 
age of parliamentary parties in England, 
Se precludes conflict between a parliamentary 
ithe and the executive, as contrasted with the 
Oh or party discipline in the United States, 
Talal allows for frequent conflict between the 

sident and Congress, sometimes between the 
2S sete and majorities formed in large part 

z aes of his own party. This difference 
Otis vah been explained, both before and after 
Ea as stemming from the varying con- 
eae structures of the two countries [see 
EHA GOVERNMENT and PRESIDENTIAL 
reject ee Ostrogorskii, however, tended to 
iffere effort to account in legal terms for the 

nces between the two types of government. 
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Following his general sociological approach, he 
looked for the source of variation in other, more 
basic elements of the social structure of the two 
nations. 

English society contained a less complex set of 
sources of party diversity than did American so- 
ciety, thus facilitating greater party unity on 
issues. The Conservative party, traditionally based 
on the aristocracy, the Church of England hier- 
archy, and their deferential followers, was strong- 
est among the rural population but also important 
among urban workers. These groups were linked 
to the Tory party less by their concern with par- 
ticular issues of self-interest than by their involve- 
ment in social relations that maintained traditional 
social values. The Liberal party drew its support 
from the new bourgeoisie, urban entrepreneurs, 
adherents of the dissenting churches, and the ma- 
jority of the urban working classes. (The Labour 
party, in turn, was to take over much of the Liberal 
support.) Thus, the groups supporting the two 
parties tended to represent two quite separate 
strands of opinion. 

The American parties divided along more varied 
class, ethnic, religious, and sectional lines. The 
Democrats tended to secure support from the 
less well-to-do, particularly in urban areas, from 
those of recent non-Anglo-Saxon, especially Roman 
Catholic, immigrant stock, and from the South; 
Republican strength reflected the opposite charac- 
teristics plus the vote of the Negroes and western 
farmers, Regional and urban-rural conflicts within 
given states added further complications. To ful- 
fill a party’s prime purpose of winning elections 
locally, any effort at ideological or national policy 
discipline had to be abandoned. 

‘Another fact which affected the character of the 
party system was the greater reliance of the British 
parties on amateurs. Agreement with the political 
views espoused by their party was an important 
element in uniting the amateurs who led the party 
organization. Members of Parliament, therefore, 
were under pressure to be loyal supporters of their 
party in Parliament if they were to maintain a 
strong party organization in their constituencies; 
a member of Parliament at odds with his national 
party would necessarily lose backing in his con- 
stituency. 

In the United States, on the other hand, those 
who did the grass-roots work in the parties did not 
care as much what policies officeholders supported, 
since this was not relevant to electoral victory or to 
securing spoils. Since the organized interests and 
significant sections of the electorate in different 
parts of the country differed greatly in what they 
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wanted from government, legislators might either 
be left free by the professional leaders of the ma- 
chine to vote as they saw fit, or they might be 
urged to vote for some specific local interest in 
opposition to the policy favored by the president 
as leader of his party or by the dominant opinion 
in the Congressional party. There was no national 
machine in the United States, and the local parties 
were primarily concerned with winning elections 
in their area, even if they had to support policies 
elsewhere opposed by their party. Hence, although 
the varying constitutional arrangements in Eng- 
land and the United States may have played a part 
in affecting the patterns of discipline at the level 
of the national party, Ostrogorskii argued that the 
differences in social structure were more important. 

Assessment. Much has happened in the politics 
of Britain and the United States since Ostrogorskii’s 
two volumes were published. While many of his 
insights about organization remain valid, some of 
his predictions have proved to be in error. To take 
only one example, his expectation that the con- 
siderable growth of the central government in the 
United States would be accompanied by a strength- 
ening of the position of the machines has not been 
fulfilled, However, Ostrogorskii’s most significant 
generalization about the workings of suffrage de- 
mocracy—that party organization is by nature self- 
aggrandizing—has not been challenged by events. 
As he noted in much detail, party organizations 
are not democratic reflections of popular will but 
powerful instruments for dominating the electorate 
and for imposing opinions, officials, and policies 
on the public. As he, and later Michels, noted, 
party functionaries and elected officeholders have 
interests of their own and use their positions to 
further them. 

As a reformer, Ostrogorskii recommended re- 
placing general parties by single-issue ones, but as 
he himself documented as a reporter and historian, 
the real solution lies in opposing one set of party 
bureaucrats with the organized power of other 
groups—an opposing party, organized factions 
within a party, other mass-based associations such 
as unions, farm groups, veterans’ leagues, and the 
like. It is evident from reading Ostrogorskii that 
there have been many situations in which power 
has restrained power, in which opposition groups 
have exposed or prevented malfeasance, or in 
which different groups have presented alternative 
policies to those fostered by incumbents. The struc- 
ture of mass organizations may inevitably produce 
a propensity to self-aggrandizement in the leaders, 
but as long as clear differences exist among or- 
ganizations, democracy also exists. The voters, or 


any given defined section of them, do have access 
to government decisions insofar as politicians 
worry about their reactions. 

The debate as to whether suffrage regimes char- 
acterized by competition between organized parties 
and heavily influenced by “interest groups” are 
democratic or not is, in some part, a matter of 
definition. By defining democracy as the pure, 
unmediated rule of the people, both Ostrogorskii 
and Michels proved to their own satisfaction that 
such systems are not democratic. As Ranney has 
pointed out, “Democracy, for Ostrogorskii, was in- 
conceivable save as a society of isolated individ- 
uals, all prepared to devote much of their time and 
energy to rational discussion of public affairs, all 
eager to form an association with others who hold 
identical views on a given issue, and all ready to 
dissolve the association as soon as the issue is 
settled” ({1954] 1962, p. 130). Ostrogorskii’s solu- 
tion to the problem of party domination is as un- 
realistic as his analysis of the operation of parties 
in his day was accurate. 

The naiveté of Ostrogorskii’s solutions to the 
problems of political machines and oligarchy 
should not obscure his enormous contributions to 
political analysis. In large part his success in pre- 
senting sophisticated hypotheses about organiza- 
tional behavior derived from the fact that he 
approached the study of parties in a comparative 
context. He did not ask, What are American pat 
ties like? or How does the English political system 
work? but rather, What are the general attributes 
of political parties under conditions of universal 
suffrage? He was, in other words, seeking to 
formulate a systematic theory of party organiza- 
tion, and it was this effort at systematic theory 
that so attracted Max Weber to his work. As po- 
litical science joins the other social sciences in ie 
lating substantive concerns to the formation a 
general theory, we may expect increased interes 
in Ostrogorskii. Seymour M. LIPSET 
nt of OS 
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Robert Owen (1771-1858), British socialist, 
was born at Newtown, Montgomeryshire, Wales. He 
began to influence social thought in 1799, when 
he acquired at New Lanark, Scotland, the cotton 
mills which he made famous. His mills became 
a showplace of enlightened management, and 
Owen's reputation as a philanthropist spread 
throughout Europe. 

When he entered upon the “government of New 
Lanark,” as he called it, Owen’s object was not to 
be a “mere manager of cotton mills, but to intro- 
duce principles in the conduct of the people.” These 
principles had to do with character formation. 
Owen rejected the competitive business system 
through which he had made his money and urged 
the merits of a cooperative system in which “one 
man’s gain” would not be “another man’s loss” 
([1813-1821] 1927, p. 124). In a cooperative com- 
munity, he believed, a healthy and happy environ- 
ment would shape individual character along the 
right social lines. Factories were nurseries of bad 
habits which only social controls, particularly edu- 
cation, could eliminate. It was necessary, more- 
over, to educate not only children but also adults. 

Having expressed these views even before the 
end of the Napoleonic Wars in 1815, Owen was all 
the more convinced of the urgent need to apply 
them when the years following the wars brought 
economic discontent and distress to Britain. From 
his original interest in the effects of industrial 
working conditions on character he moved nat- 
urally to a concern with the state of the unem- 
ployed. He advocated “villages of co-operation” 
where work would be carried on collectively; these 
he considered not only a necessary remedy for un- 
employment but also a contribution to “social re- 
generation,” a remodeling of society on cooperative 
lines with no reliance on the profit motive. 

When Owen failed to convince the rich and influ- 
ential of the wisdom of his plans and the practica- 
bility of his vision of a new order, he turned to 
other sectors of society, particularly the middle 
and working classes. He broke sharply with the 
churches in 1817, and thereafter his cooperative 
philosophy became markedly anticlerical in tone. 
He also began to formulate more definitely commu- 
nitarian ideals and, finding British opinion reluc- 
tant to support him, left Britain for the United 
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States in 1824, There he set up a system of com- 
munity living at New Harmony, Indiana, an 
attempt to realize “the new moral world.” The 
community fared ill from the first (Maclure 1820— 
1833) and soon swallowed up most of Owen's 
fortune. 

Owen returned to Britain in 1829 and again was 
forced to plot a new course. He discovered that 
many of his ideas on labor as a source and standard 
of value and on cooperative production had been 
accepted by working-class groups of Owenites, and 
he was increasingly drawn into British working- 
class politics. This phase of his experience reached 
its high point when in 1833 the Grand National 
Consolidated Trades Union was conceived. Although 
the Trades Union had over half a million members 
early in 1834, it disintegrated within a few months. 

Owen's greatest dream was even further from 
reality: it was a “Grand National Moral Union of 
the Productive Classes,” a pyramid of producers’ 
power with trade-union lodges at its base and a 
national labor exchange at its apex. Society would 
be quickly transformed—indeed, at a single blow— 
by the operation of the union. Like his previous 
schemes, this one also failed. Yet, still believing in 
the possibility of social redemption, Owen turned 
to the development of a secular religion and in 
1839 to the development of a new community 
experiment at Harmony Hall (Queenwood), Hamp- 
shire. 

As working-class cooperation developed in the 
1840s under the leadership of the Rochdale Pio- 
neers, it depended much on Owenite ideals, but 
soon thereafter became more practical in its em- 
phasis. Owen was frequently dismissed as a 
“Utopian socialist”; it was even denied that he was 


a socialist at all. In fact, however, he had done 
much to develop a constructive critique of indus- 
trialism, to fashion the socialist vocabulary (Bestor 
1948), and to stimulate working-class action by 
offering a new vision of society as it might be, 
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[See also ECONOMIC THOUGHT, article on SOCIALIST 
THOUGHT; COOPERATION; COOPERATIVES; UTOPIAN- 
IsM.] 
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PACIFISM 


Pacifism has had both a broad and a narrow 
connotation. In the former sense, it has often been 
used—particularly in continental Europe just be- 
fore and after World War 1—to designate those 
outlooks centering their attention on the need for 
international reconciliation and peacemaking ma- 
chinery, Thus many tendencies in thought were 
pacifist, the word being employed to describe a 
whole spectrum of ideas and activities that might 
conceivably be peacemaking in character. From 
this point of view, Jean Jaurès, the French socialist 
leader, and men like Woodrow Wilson were paci- 
fists; and organizations like the Carnegie Endow- 
ment for International Peace were pacifistic. 

While this broad usage of the term is still to be 
found, pacifism has generally come to be connected 
with all those ideas, attitudes, and movements 
which repudiate the use of violence, particularly 
cy under any circumstances. The pacifist in this 

se will argue that war, whether defensive or 
e is always ethically illegitimate and, in 
E run, inefficacious for the attainment of 
ee goals, Often, this conception also em- 
a ae notion that forms of nonviolent power 
een iscovered and ought to be used to counter 
P ee and internal exploitation and 
aE arious forms of nonviolent resistance or 
hee in coercion are thus associated with paci- 
SHa n we the viewpoint of political theory, it 
conse gned with those doctrines which stress 

nt rather than force as the basis for political 


authority, 
History 


sca pacifism, in the narrow sense, is a 
™ concept, the general outlook that it desig- 


nates has a long history. In the seventh and sixth 
centuries B.C., Jeremiah developed certain attitudes 
toward Babylonian aggression that can legitimately 
be described as pacifist. A little later, in China, 
Lao-tze advocated positions that were very close to 
a kind of anarchist pacifism. In Greek thought, cer- 
tain of the Cynics and Stoics came close to a paci- 
fist outlook; and some of the Essenes in the Hebrew 
tradition apparently repudiated the utilization of 
war and all violence, whether personal or group. 
The spirit of the Gospels, so some modern schol- 
ars, notably MacGregor ( 1936), have argued, can- 
not be reconciled with violence, whether personal or 
political. Evil is to be overcome only by good. True, 
the New Testament does not explicitly exclude war- 
making for the believer; but it does lay down prin- 
ciples that clearly point in this direction. 
Although the attitude of the early Christians is 
still a matter of controversy, many of the best 
modern authorities, for example, the Scottish schol- 
ar C. J. Cadoux (1940), contend that for at least 
the first century few, if any, Christians would enter 
the Roman army. In part, at least, their refusal was 
based on a religious repudiation of the violent acts 
that soldiers had to perform, whether in battle or 
as executioners. As late as A.D. 295, Maximilianus 
underwent martyrdom primarily because, as a 
Christian, he said he could not be a soldier. By the 
early part of the fourth century, however, the 
strong pacifist strain in Christian thought began to 
die out. Thus the Council of Arles, A.D. 314, de- 
creed that those Christians who gave up their arms 
in time of peace should be excommunicated, Dur- 
ing the Middle Ages, according to orthodox doc- 
trine, laymen could participate in “just” wars. Both 
regular and secular clergy, however, were supposed 
to refrain from the shedding of blood. This dual- 
istic ethic was challenged by certain heretics who 
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sought to revive the attitudes of early Christian 
apologists. 

The beginning of modern pacifism may be dis- 
cerned in a number of the early Anabaptists and 
in religious groups (like the Mennonites) having 
similar theological beliefs. During the sixteenth- 
century religious struggle in France, the Roman 
Catholic Etienne de la Boétie argued strongly for 
tactics of nonviolent resistance against tyranny. 
Since all rule depends on willingness of men to 
obey, he maintained, withdrawal of obedience could 
lead to the collapse of any tyranny. In the seven- 
teenth century, the British Quakers sought to im- 
plement pacifism in politics by establishing a dis- 
armed commonwealth in Pennsylvania; and for 
about two generations that colony, unlike the 
others, was free of Indian warfare. 

Eighteenth-century and early nineteenth-century 
pacifism was affected by the decline in the religious 
basis of social and political doctrine. Rationalistic 
and utilitarian near-pacifism came to the forefront 
in the writings of the early anarchist William God- 
win. Godwin stressed the irrational and immoral 
character of the appeal to force, even for the ends 
of justice. Revolutionary violence, he held, was a 
contradiction in terms. Similar views were reflected 
in the political poetry of Shelley, who in the Masque 
of Anarchy urged British workers to revolt against 
exploitation through a “folded arms” strike. After 
Shelley, the anarchist pacifist strain was reflected 
in men like Thoreau, Benjamin Tucker, Tolstoy 
(1908), and Barthélemy de Ligt (1934). Tol- 
stoy, however, grounded his pacifism primarily 
on an anarchist interpretation of the Christian Gos- 
pels. Writing with great literary power, he had a 
considerable impact on early twentieth-century 
thought. [See ANARCHISM.] 

Perhaps the greatest pacifist figure of the twen- 
tieth century has been Mohandas K. Gandhi, who 
combined in his views and practices both religious 
and utilitarian notes. Religiously, he was inspired 
by certain Hindu scriptures (for example, the 
Upanishads and the Bhagavad-Gita), the New 
Testament, and the writings of Tolstoy. His experi- 
ences as a lawyer and man of affairs seemed to 
suggest that sheer retaliation in group conflict does 
not in the long run accomplish desirable objectives. 
On the other hand, even adamant opponents will 
often respond favorably to acts animated by love 
and in which the actors willingly undergo suffering 
for a cause. Notions of this kind became the basis 
for what Gandhi called the doctrine of Satyagraha 
(“truth power” or “firmness in truth”). Gandhi in- 
sisted that Satyagraha was applicable in political 
as well as in personal relations; and he sought to 


demonstrate this proposition in his leadership, 
1919-1947, of the nonviolent resistance to British 
rule in India. His theory and example were also 
important in such movements as the struggle of the 
American Negro for racial integration, the South 
African natives’ battle for equality, and the achieve- 
ment of political independence by Ghana in 1957, 
[See INDIAN POLITICAL THOUGHT. ] 

The impact of World War 1 on the development 
of twentieth-century pacifism was considerable, 
Utter disillusionment with war as an institution led 
to such phenomena as the “Oxford oath” in which, 
during the 1920s and 1930s, thousands of young 
Britons pledged themselves never to fight again 
“for king and country.” The Peace Pledge Union 
sought to provide a register of all those who sub- 
scribed to the promise. Elsewhere, and particularly 
in the United States, there were similar tendencies, 
Pacifist attitudes were encouraged by public ex- 
posés of the role of munitions manufacturers in 
encouraging conflict and also by a widespread be- 
lief that economies based on private profit tended 
to make for war and violence. 

Political experience just before and during World 
War 11 led many pacifists to re-evaluate their views 
and some to change them. However, for not a few 
the period between 1939 and 1945 was primarily 
a challenge that demanded from them greater in- 
tellectual clarity and more nearly adequate pro 
posals for pacifist alternatives. Throughout the wat, 
Gandhi's personal view was that India should oppose 
any Japanese invasion only by nonviolent resist- 
ance; he also thought that Hitler and the Na 
tional Socialists could, and ought to be, counter- 
acted in the same way. For pacifists of Gandhi's 
type, World War 11 in its methods and results had 
the effect of strengthening their convictions. For 
leaders like Martin Niemoeller, who had been un- 
sympathetic before, war experiences helped creaté 
a postwar pacifist outlook. 

After World War n, potentialities for widespread 
destruction through nuclear arms led some to be- 
come nuclear pacifists. Certain scientists and engi- 
neers in Germany, the Soviet Union, Great Britain, 
the United States, and elsewhere, disturbed by the 
possible social consequences of some types of Tte- 
search, pledged themselves to refuse to labor in 
those areas. 

Pacifists have also been active in the effort 1 
gain legal recognition for conscientious objector 
to war and in the struggle against military CON: 
scription. Between World War 1 and World War 1h 
recognition was broadened in Great Britain and the 
United States. After World War 1, the constitution 
of the German Federal Republic recognized pi 
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right of objection; and in 1962, after a long strug- 
gle, some legal status was accorded to French 
objectors. 

While pacifism remained the ideology of a mi- 
nority in the 1950s and 1960s, the ubiquity of 
violence and the radical increase in the destructive 
power of modern weapons tended to dramatize its 
propositions by sheer contrast. In a world of monu- 
mental violence, its doctrine of nonviolent power 
took.on new meaning and added significance. 


Foundations and emphases 


Aside from the fact that all pacifists eschew vio- 
lence for any purpose, pacifism has no uniform or 
authoritative theory or ideology. To illustrate the 
range of pacifist views, a rough classification of 
schools of thought may be useful: 

(1) The scriptual text school bases its beliefs 
on certain texts of the New Testament, and since it 
believes that the Bible is divinely inspired, these 
texts become authoritative. Groups like the Men- 
nonites have tended to espouse this position. 

(2) Another viewpoint bases its attitude on 
what it regards as the spirit of Biblical or other 
scriptual ethics. Jewish pacifists cite the spirit of 
Hebrew prophecy, while those in the Christian tra- 
dition root their beliefs in their interpretation of 
the New Testament. For sects like the Religious 
Society of Friends, all religious tradition is subject 
to constant reappraisal through the Inner Light 
Possessed by each individual. Those of the Hindu 
Pacifist tradition claim support in the spirit of the 
Bhagavad-Gita and the Upanishads. 

(3) The just war school is largely, although not 
exclusively, confined to Roman Catholics. Using as 
its criterion the traditional Catholic definition of 
the “just war,” it concludes that no modern war 
can possibly satisfy the standards laid down by 
that definition—declaration by a public authority, 
Conduct by methods that discriminate between 
combatants and noncombatants, promise of pro- 
oe a better situation after the war than before 
in, Hence a pacifist position is not only justi- 
aa but required. Many “just war” pacifists cite 

ertain encyclicals and other messages of Pope 
1e Xu and Pope John xxm to buttress their 
Position, 
To The utilitarian school, in its several ver- 
har maintains that war and violence, on the 
ten, and in the long run, work against the imple- 
a aon of desirable social and political values, 
KA €ver the short-run effects may appear to be 
ie ae the proclaimed objectives of the 
be a e. Many anarchist pacifists are utilitarians, 
Most socialist pacifists. Often the views of 
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those who state their positions in traditional re- 
ligious language have a large measure of the utili- 
tarian in them: Gandhi is a good example. In the 
twentieth century the percentage of utilitarian pac- 
ifists has probably increased considerably. 

But no classification can do justice to the many 
elements that may constitute the groundwork for 
any individual pacifist outlook. 

An important distinction running through all 
pacifist thought is that between the personal and 
the political emphasis. Many think of the pacifist 
ethic as primarily relevant to personal relations or 
applicable to individual attitudes to the state (re- 
fusal of direct war service, for example) but not 
particularly relevant for group or state relations. 
Thus, some feel that they have a peculiar “voca- 
tion” for pacifism but do not think that the impera- 
tive is a universal one. Others think of political 
relations as inevitably bound up with violence and 
therefore largely “unredeemable.” 

By contrast, political pacifists, whatever the foun- 
dations for their beliefs (religious or utilitarian), 
tend to think not merely in personal terms but also 
in terms of nonviolence for group relations. Hence, 
political pacifism is concerned about such issues 
as principles of group conciliation, nonviolent co- 
ercion, the nature of a pacifist society, and pacifist 
methods for resisting military invasion. Generally 
speaking, the political pacifist emphasis has be- 
come central in modern discussion, and the re- 
mainder of this article will refer primarily to it. 


Implications and problems 

Pacifist thought has been much affected by de- 
velopment of the sociology of conflict, penology and 
criminology, and psychological approaches to the 
study of human behavior. C. M. Case’s researches 
into the phenomena of nonviolent coercion (1923) 
helped direct attention to alternatives to violent 
power; and in the 1930s Richard Gregg (1930) 
commented on such questions as nonviolent treat- 
ment of criminals and the mentally ill, as well as 
on the phenomena of nonviolent action in general. 
Both Case and Gregg envisioned the possibilities 
of a largely nonviolent police system. Pitirim Soro- 
kin (1954) has surveyed exemplifications of what 
he calls “altruistic love” and has appeared to con- 
firm the judgment of pacifists that violent conduct, 
whether personal or political, is not only immoral 
but also “impractical.” 

Studies such as those of Case, Gregg, and Soro- 
kin served as a challenge to examine sociological 
and political particulars in the pacifist case. They 
also called attention to certain implications and 
problems for application of the doctrine. 
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One such problem is to define more clearly the 
circumstances under which physical force might 
be legitimate; for while some pacifists repudiate 
the utilization of physical force altogether, many 
recognize a distinction between force (applied dis- 
criminately and with no irremediable injury) and 
violence (forms of force, or contexts within which 
force is used, that result in indiscriminate and irre- 
mediably injurious action). 

Another question involves the attempt to under- 
stand psychological and social factors involved in 
conflict resolution, Galtung (1959), in a sociolog- 
ical interpretation, contends that for the pacifist 
the doctrine involves two fundamental norms: (a) 
act so that a solution acceptable to all parties can 
be attained; and (b) act so that the short-run and 
long-run application of violence will be reduced. 
As an instance of (b), the pacifist eschews direct or 
indirect use of violence. To implement norms of 
this kind obviously requires understanding based 
not only on enlarged academic knowledge but also 
on broadened experience. 

Pacifists, guided by thinkers like John Dewey 
and Aldous Huxley, tend to stress the interpenetra- 
tion of means and ends in politics. Specifically, po- 
litical pacifism emphasizes the possibilities of using 
nonviolent power against both domestic and foreign 
tyranny and exploitation. Nonretaliatory, nonvio- 
lent resistance, it is maintained, can either change 
the attitude of a tyrant or an invader or deprive 
him of his instruments and resources (and there- 
fore of his power). 

Although the systematic analysis of nonviolent 
resistance as a technique is relatively recent, util- 
ization of such methods as the strike, boycott, and 
civil disobedience is very old. In ancient Rome the 
boycotts of the plebs resulted in revolutionary 
changes in Roman life. Indian history reveals 
numerous instances of noncooperation with rulers 
or even complete withdrawal of protesting groups 
from the territory of the prince (thus depriving him 
of an economic base for his government). Modern 
examples include the experience of colonial Penn- 
sylvania with nonviolence; the Hungarian struggle 
for autonomy under the leadership of Francis 
Deak, which led to the dual monarchy in 1867; 
Norwegian nonviolent resistance against the Ger- 
mans in World War 11; eastern European demon- 
strations against communist control (in some 
cases Russian soldiers refused to fire on their un- 
armed opponents, even when ordered to do so); 
and the American Negro struggle for equality and 
integration through sit-ins and other forms of 
nonviolent direct action. 

The use of nonviolent resistance is subject to a 


number of hazards, as Case and others have point- 
ed out. One is that in the atmosphere of tension 
and conflict, violence might break out. Another is 
that the hardships entailed by nonviolent, non- 
retaliatory action could lead to discouragement and 
abandonment of the campaign. Modern pacifist 
thinkers, aware of these difficulties, have encour- 
aged more thorough analysis of nonviolent resist- 
ance movements with a view to discovering under 
what circumstances they are most likely to be effec- 
tive. All stress the need for careful planning, 
organization, and discipline. 

The pacifist outlook strongly supports develop- 
ment of world organization and law but, in general, 
distrusts those views which would conceive of 
peace as being imposed by force. States cannot be 
coerced by violent power, it is argued, without risk 
of war; and police action for implementation of 
law must confine itself to discriminate and non- 
violent coercion of individuals. Even in a world 
state, moreover, nonviolent resistance would still 
need to be employed on occasion; for imbalances 
of power, exploitation, and tyranny would still be 
more than mere possibilities. Some thinkers call 
for greater imagination in sketching out the insti- 
tutions of a pacifist world order; and most would 
argue that it cannot simply duplicate the national 
state, which has been molded to so large a degree 
by domestic and international violence. x 

In seeking to cast light on the political implica- 
tions of their doctrine, pacifists have frequently 
called for more research. While recognizing that 
general academic, political, and social research 
might prove helpful, they have remained unsatis- 
fied by it. In World War u, therefore, they organ- 
ized the Pacifist Research Bureau in Philadelphia, 
which published studies on such themes as the 
political theories of pacifism and the problem of 
coercion of states. More recently, at the University 
of Michigan, the Center for the Study of Conflict 
Resolution has engaged in similar types of te 
search. The Institute for Peace Research at the 
University of Oslo has undertaken some parallel 
studies. Pacifist research activity in India, and in 
the United States at Harvard University under the 
direction of Pitirim A. Sorokin, might also be 
mentioned, 


Evaluation 

Pacifism has been subjected to criticism on sev- 
eral different grounds. Some contend that in the 
world as it is, it is an impossible and perhaps even 
an immoral principle, So long as evil men exist an“ 
power politics flourishes, it is maintained, right 
eousness may have to be vindicated through US 


of violence, including war. About the time of World 
War n, John Lewis (1940) developed a criticism 
from the Marxist point of view. And after World 
War 1, it was frequently contended that pacifist 
influence had led to British military unprepared- 
ness in the 1930s and hence to the impossibility 
of deterring Hitler from war. Critics like Reinhold 
Niebuhr (1940) stress the reality of power in 
human affairs and accuse pacifists of blindness to 
it. According to such critics, justice always requires 
power. 

Countercriticisms include the following argu- 
ments: While the full implementation of pacifist 
goals is indeed difficult, the statement of the ideal 
is itself a factor in helping to attain the goal. Non- 
violent treatment of the mentally ill is now no 
longer deemed to be impossible, although severe 
restraint used to be regarded as indispensable. 
Modern war must always defeat the ends of justice 
because of its utterly indiscriminate character. 
Pacifists by no means ignore power but rather seek 
to develop nonviolent forms of it. Britain, more- 
over, did not follow a pacifist policy before World 
War 11 but rather one based on indifference, partial 
financial assistance to Hitler, preservation of the 
status quo at almost any cost, and insensitivity to 
the economic basis for acceptance of National So- 
cialism by Germany. Finally, apologists claim that 
tyrannies arise not because men have accepted 
pacifism—with its stress on economic justice, ab- 
sence of a psychology of subordination, repudia- 
tion of violence, and planning for nonviolent resist- 
ance—but rather because they have rejected it. 


MULFORD Q. SIBLEY 


[See also CIVIL DISOBEDIENCE; MILITARISM; PEACE.] 
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PAIN 


The relief of pain and suffering has been a con- 
tinuing human endeavor since the dawn of record- 
ed history (see Keele 1957). Yet despite centuries 
of observation and study, we are only beginning 
to achieve an understanding of the subtleties and 
complexities of pain. Even though pharmacologists 
have provided effective “painkillers,” we know little 
about where and how these drugs act. Surgical pro- 
cedures that are usually effective in relieving pain 
can sometimes produce dismal failures, often 
enough to convince us that we are far from under- 
standing the neurological mechanisms that sub- 
serve pain perception. 

Part of the difficulty of understanding pain 
mechanisms lies in the divergent empirical ap- 
proaches to the problem. Sensory physiologists, 
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anatomists, and psychologists (see Ruch & Fulton 
1960, pp. 300-368) have studied pain as a sensory 
phenomenon and have tended to neglect its moti- 
vating aspects, Learning theorists (Miller 1951; 
Bindra 1959) have dealt with pain primarily as a 
drive producer and negative reinforcing agent but 
have generally ignored the other facets of the prob- 
lem. Finally, medical clinicians, such as anes- 
thetists (Beecher 1959) and surgeons (White & 
Sweet 1955), have regarded pain as indicative of 
tissue pathology that has to be treated and abol- 
ished and have often had to postulate hypothetical 
neural mechanisms to account for the complex 
phenomena they observe, These three approaches 
are so different that it is not surprising that “pain” 
has never been satisfactorily defined (Beecher 
1959). 

The major obstacle to understanding pain, how- 
ever, has been the perpetuation of a number of 
theories that have had a powerful influence on the 
field. The persistence of these theories has resulted 
in heated controversies that have endured since the 
beginning of this century. Consequently, one of the 
most difficult tasks in this field is to separate fact 
from theory. For this reason the psychological and 
clinical phenomena of pain, which must be ac- 
counted for by any satisfactory theory, will be 
described before physiological theories and experi- 
ments are discussed. 


Psychological aspects of pain perception 

The obvious biological significance of pain has 
led to the general belief that it must always occur 
after injury and that the intensity of pain perceived 
is proportional to the amount and extent of the 
damage. The positive aspect of pain is universally 
recognized: it warns us that something biologically 
harmful is happening. Reports (Sternbach 1963) 
of people who are born without the ability to feel 
pain provide convincing testimony to its value. 
Such a person sustains extensive burns and bruises 
during childhood and learns only with difficulty to 
avoid inflicting severe wounds on himself, Never- 
theless, there is convincing evidence that pain, in 
higher species at least, is not simply a function of 
the amount of bodily damage alone. Rather, the 
amount and quality of pain perceived are also de- 
termined by past experience and attention, by the 
ability to understand the cause of the pain and 
to grasp its consequences. This fact, supported by 
a large body of evidence, presents a challenge to 
pain theorists. 

Cultural factors. Cultural values are known to 
play an essential role in the way a person perceives 
and responds to pain. In Western culture, for ex- 


ample, childbirth is considered by many to bi 
of the worst pains a human being can experi 
Yet the practice of couvade (Kroeber [1923] 19: 
pp. 542-543) in cultures throughout the world 
dicates the extent to which culture contributes { 
the intensity of pain. In some of these cultures ; 
woman who is going to have a baby continues t 
work until the child is about to be born, Her hi 
band then gets into bed and groans as though 
were in great pain while she bears the child. In 
more extreme cases, the husband stays in bed witl 
the baby to recover from the terrible ordeal and 
the mother returns almost immediately to 
household work. Dick-Read (1944) has stresi 
the great extent to which culturally determi 
fear enhances the amount of pain felt during lal 
and birth and points out how difficult it is to 
such fear. 

Role of anxiety. The effect of anxiety 
intensity of perceived pain is further demonstra 
by studies on the effectiveness of placebos. Bet 
found that severe pains (such as postsurgical 
can be relieved in about 35 per cent of pati 
by giving them a placebo, such as sugar or sa 
solution, in place of morphine or other anal 
drugs. As Beecher has pointed out, only about 
per cent of patients experiencing severe pain 
satisfactorily relieved even when given large doses 
of morphine; the placebo effect thus accounts: 
about 50 per cent of the drug effectiveness (1959 
p. 169). This in no way implies that people 
are helped by a placebo do not have real pain; 
one will deny the reality of postsurgical 
Rather it illustrates the powerful contribution 
anxiety to pain perception, since the physician mayti 
often relieve pain significantly by prescribing place 
bos to lower the patient’s anxiety as well 
treating the wounded areas of the body. Similarly 
experiments by Hall and Stride (1954) have sho 
that the anticipation of pain raises the level ‘ 
anxiety and, consequently, the intensity of pet 
ceived pain. Experiments by Hill and his associi 
(1952a; 1952b) have shown that a given le 
of electric shock or burning heat is perceived 
significantly more painful when anxiety is exper” 
imentally induced than it is after anxiety has be& 
dispelled. These studies also show that morphine i 
diminishes pain if the anxiety level is high but na 
no demonstrable effect under conditions of E 
anxiety. <I 

Role of attention, Attention to stimulation also 
contributes to pain intensity. It is frequently 29 
that contestants in a fight or in the heat of & 
can receive severe wounds without being a" OAN 
that they have been hurt. Indeed, almost any S 


tion that attracts a sufficient degree of excited, pro- 
longed attention may provide the conditions for 
other stimulation to go by unnoticed, including 
wounds that would cause considerable suffering 
under normal circumstances. Hypnosis, a trance 
state in which attention is focused intensely on a 
person or an object, is perhaps the best-known con- 
dition in which people can be cut or burned with- 
out their reporting any perception of the event 
(Barber 1959). Failure in attention may also ac- 
count for the fact that dogs raised in sensory isola- 
tion in specially constructed cages from infancy 
to maturity (Melzack & Scott 1957) show a fre- 
quent failure to respond to normally painful stim- 
ulation, such as a flaming match or pinprick, after 
they are released from their cages. Since these dogs 
exhibit a remarkably high level of excitement, it 
is reasonable to suppose (Melzack & Burns 1963) 
that they fail to attend selectively to these noxious 
stimuli when they are presented in an unfamiliar 
environment in which all stimuli are equally atten- 
tion-demanding. [See ATTENTION and HYPNOSIS.] 

Role of meaning. Finally, there is striking evi- 
dence to show that the meaning associated with a 
pain-producing situation is extremely important in 
determining the degree and quality of pain that 
are perceived, Beecher observed soldiers who were 
severely wounded in battle and found that only 
One out of three claimed that he had enough pain 
to require morphine. Most of the soldiers denied 
having pain from their wounds or had so little that 
they did not want medication to relieve it. In con- 
trast, four out of five hospitalized civilians who had 
Surgical incisions matching the wounds received 
by the soldiers claimed that they were in severe 
pain and demanded a morphine injection. Beecher 
concluded that in the wounded soldier, the response 
yy. injury was relief, thankfulness for his escape 
alive from the battlefield, even euphoria (his wound 
Was a good thing); to the civilian, his major sur- 
gery, even though essential, was a depressing, ca- 
lamitous event; there is no simple direct relation- 
ship between the wound per se and the pain 
experienced (1959, p. 165). 

The importance of the meaning associated with 
a pain-producing situation is made especially clear 
io conditioning experiments by Pavlov (1923). He 
ound that if electric shocks administered to a dogs 
in are followed consistently by the presentation 
ae they eventually fail to elicit signs of pain 
ema an entirely different response: the dog 
dah ie Wags its tail, and turns toward the food 
DE K pores (1950) has carried these experi- 
ies further. After cats had been taught to 

Pond to electric shock as a signal for feeding, 
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they were trained to administer the shock to them- 
selves by walking up to a switch and closing it. 


Medical aspects of pain 

Theories of pain are satisfactory only to the ex- 
tent that they are able to account for all of the 
relevant phenomena. There are many forms of 
pathological or clinical pain that present bizarre 
features that are difficult to explain; yet they must 
fit into the framework of pain theory, Two pain 
syndromes, phantom-limb pain and causalgia, have 
been studied in detail and represent the most ter- 
rible of human pain experiences. 

Phantom-limb pain. The presence of a painless 
phantom limb is reported by the majority of ampu- 
tees almost immediately after amputation. About 30 
per cent, however, have the misfortune to develop 
pains in the phantom limb, and in about 5 per cent 
the pain is severe. These pains may be occasional 
or continuous, but they are felt in definite parts 
of the phantom limb (Livingston 1943; Feinstein 
et al. 1954). The pain tends to decrease and even- 
tually disappear in most amputees. There are a few, 
however, in whom the pain increases in severity 
over the years and may even spread to other re- 
gions of the body, so that merely touching these 
new “trigger zones” will provoke spasms of severe 
pain in the phantom limb (Cronholm 1951). Un- 
fortunately, the conventional surgical procedures 
for controlling pain usually fail to bring permanent 
relief; thus, these patients may undergo a series of 
such operations without any decrease in the sever- 
ity of the pain (Livingston 1943). Phenomena such 
as these defy explanation in terms of our present 
physiological knowledge. Attempts have been made 
to label these unfortunate people as “neurotic” 
(see Kolb 1954), but there is convincing evidence 
that argues against such an explanation for all 
cases (Livingston 1943). 

There are a number of features of phantom- 
limb pain that provide clues toward understanding 
the mechanisms underlying it. 

Peripheral factors. It is known (Livingston 
1943) that the neuromas ( small nodules of regen- 
erating nerve tissue) in the stumps of amputated 
patients contribute to phantom-limb pain, since 
pressure on them can trigger bouts of unbearable 
pain. Yet excision of neuromas or reamputation at 
a higher level usually fails to relieve pain for more 
than a few weeks or months (Livingston 1943; 
Cronholm 1951). Indeed there is almost unanimity 
of opinion that peripheral operations are likely to 
fail and that other procedures should be sought. 

Role of sympathetic nervous system. The sym- 
pathetic nervous system also plays an important 
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role, because cutting or temporarily anesthetizing 
the sympathetic ganglia entering the spinal cord 
(Livingston 1943) is capable of dramatically re- 
moving the pain for variable periods of time. Yet 
it is clearly not the sole cause of phantom-limb 
pain because pain often returns after the sym- 
pathetic ganglia are surgically removed. The con- 
tribution that the autonomic nervous system as a 
whole makes to phantom-limb pain is clear, more- 
over, from observations (Henderson & Smyth 
1948) that pain is triggered in many patients at 
the start of urination or defecation. Similar sudden 
increases of pain may be triggered by sexual ex- 
citement and orgasm (Kolb 1954). 

Emotional factors. Phantom-limb pain is greatly 
enhanced by emotional factors. Seeing a disturb- 
ing movie (Kolb 1954), having an argument with 
wife or husband (Livingston 1943), and other 
emotionally disturbing situations are capable of 
initiating or increasing the intensity of phantom- 
limb pain, 

Role of sensory input. Either increasing or de- 
creasing the sensory input from the stump or re- 
lated areas is capable of providing relief from 
phantom-limb pain. Feinstein, Luce, and Langton 
(1954) have demonstrated that injection of the 
vertebral tissues of amputees with 6 per cent salt 
solution produces severe pain at the site of injec- 
tion, which then radiates into the phantom limb. 
After this initial onset of pain, there is usually a 
decrease of the phantom-limb pain. Occasionally 
the pain vanishes completely following a single in- 
jection. Similarly dramatic results may occur after 
an injection of anesthetic procaine into the verte- 
bral tissues in the attempt to decrease sensory in- 
put from these regions, Comparable findings are 
reported when stimulation is increased or de- 
creased at the peripheral level. Injection of the 
tender neuromas of the stump with procaine solu- 
tion often brings about sudden and dramatic relief 
for variable periods of time. On the other hand, 
stimulation of the stump, by massage or by hitting 
it with a small rubber mallet, often produces the 
only possible relief from phantom-limb pain in a 
large number of patients (Russell & Spalding 1950). 

Spread of pain and trigger sites. Finally, there 
is the spread of pain and of trigger sites beyond 
the segments directly involved in the limb. Thus 
Cronholm (1951) found that touching the small 
of the back or the forehead may induce spasms 
of pain in a phantom leg. These trigger zones 
spread in unusual, seemingly random patterns and 
are not related in any apparent way to the seg- 
mental distribution of the somatic afferent nerves. 

Causalgia. Causalgia is a severe, unremitting, 
burning pain that occurs in about 2 per cent of 


people who have sustained a peripheral-nerve in. 
jury. The pain is felt in the affected limb but may 
spread to other parts of the body. It exhibits many 
of the features of phantom-limb pain as well as 
other even more bizarre characteristics. Surgical 
procedures have only limited success in the treat- 
ment of causalgic pain. Section of the peripheral 
nerve at a higher level, amputation of the limb, 
and cutting the dorsal sensory roots that enter the 
spinal cord have all produced as many failures as 
successes, Indeed, operations have been performed 
for causalgic pain at nearly every possible site in 
the pathway from the peripheral receptors to the 
sensory cortex, and at every level the story is the 
same: some encouraging results but a dishearten- 
ing tendency for the pain to return (see Livingston 
1943). 

Nonspecific triggering stimuli. A further re- 
markable feature of causalgia is that a variety of 
stimuli that can hardly be called “adequate pain 
stimuli” can produce increases in pain. Sudden 
noises, the sound of airplanes, the scraping of a 
shoe on the floor, emotional disturbances, almost 
any stimulus that elicits a startle response, touch- 
ing the damaged leg or arm or even blowing lightly 
on it are all capable of making the pain worse 
(Livingston 1943). 

Sympathetic nervous system in causalgia, The 
involvement of the sympathetic nervous system in 
causalgia, as in phantom-limb pain, is obvious. The 
skin becomes dry and cool, and sweat may drip 
from a single finger (Livingston 1943). Moreover, 
injection of anesthetic procaine into the sympa 
thetic ganglia may dramatically abolish the pain 
for variable periods of time. But the fact that pag 
may return after surgical removal of sympathetic 
ganglia (Livingston 1943) indicates that sympa- 
thetic-nervous-system activity is not the primary 
cause of the pain. 

Role of sensory input in causalgia. Similarly, 
an abnormal sensory input from the site of the 
nerve lesion is clearly implicated as an important 
cause of causalgic pain. Procaine blocks proximal 
or distal to the lesion may abolish pain for hours 
or days, and on rare occasions it never returns. 
Livingston (1943) reports, moreover, that the pain 
can be abolished if the patient is trained to tol- 
erate sensory stimulation of the affected limb and 
is encouraged to use it normally. But the frequent 
failure of peripheral-nerve surgery to abolish pain 
indicates that more is involved than simply 4? 
irritating peripheral lesion. 


Psychophysiology of pain 


The psychological and clinical phenomena a 
pain that have been described above must be take 


into account in any satisfactory theory of pain. 
Since physiological evidence on the sensory mech- 
anisms of pain is intimately bound up with the 
theories in vogue at the time, it is necessary to 
consider the physiology in terms of theoretical 
orientation. 

Orthodox specificity theory. The orthodox the- 

ory of pain, still the most widely held, was first 
proposed by Max von Frey in 1895 (see Melzack & 
Wall 1962) and was subsequently extended in a 
vast literature on pain mechanisms (see Bishop 
1946; White & Sweet 1955). Von Frey's theory, 
also known as specificity theory, proposed that 
there are specific pain receptors (the free nerve 
endings) which, when stimulated, give rise to pain 
and only to pain. Following this idea, physiologists 
proposed that pain is carried by peripheral-nerve 
fibers of particular diameter (the A delta and C 
fibers), a distinct spinal-cord system (the spino- 
thalamic tract), and a particular projection area 
in the thalamus, which is presumed to be the seat 
of pain sensation. Specificity theory has been the 
subject of heated debate and controversy since it 
was first proposed, and an attempt has recently 
been made (Melzack & Wall 1962) to analyze the 
features of this theory that make it both attractive 
and repugnant. 
k Von Frey's specificity theory has three underly- 
ing assumptions. The first is physiological: the 
theory assumes that each receptor in the skin has 
a specific irritability, that is, a lowest threshold 
for some particular stimulus energy. There is con- 
vincing evidence to indicate that this assumption 
is valid, and it has been restated by Sherrington 
(1906) as the law of the adequate stimulus. The 
second assumption concerns the morphological 
receptor that is associated with pain experience. It 
ìs now certain (Weddell 1955) that the free nerve 
endings transmit information not only about pain 
but also about warmth, cold, touch, itch, tickle, 
and the myriad other experiences that derive from 
cutaneous stimulation. The third assumption of 
Specificity theory is psychological: it assumes a 
one-to-one relation between skin receptor and psy- 
chological experience. 

Inadequacy of specificity theory. It is the as- 
opion of a one-to-one relation between skin 
; ceptor and psychological experience that has led 
‘0 attempts at outright rejection of von Frey's 
te The theory implies a direct transmission 
Aes in which there is an invariant, one-to-one 
teks ionship between stimulus intensity, peripheral 
Eon central-nervous-system pathway, and in- 
ae of pain perceived. Almost all of the psy- 
i Sical and clinical phenomena described above 

gue against this simple one-to-one relationship. 
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The fact that a light puff of air on the skin, emo- 
tional disturbance, or arousal of the autonomic 
nervous system can elicit bouts of excruciating 
causalgic and phantom-limb pain indicates that 
there is more to pain mechanisms than a straight- 
through system from specific peripheral receptors 
to a pain center in the brain. 

Pattern theory. Alternative theories have been 
proposed to replace specificity theory. Their history 
dates back to the time of von Frey’s theory, and 
each is characterized by complex physiological 
mechanisms that are postulated to account for the 
complex psychological and clinical phenomena of 
pain. Collectively, these alternative theories may 
be brought together under a single conceptual 
name: pattern theory. It proposes essentially that 
information at the skin is coded in the form of 
nerve-impulse patterns, which provide the basis of 
our sensory perceptions. These patterns, moreover, 
can undergo modification during their transmis- 
sion centrally, that is, the quality and intensity of 
pain can be modulated by events in the central 
nervous system, such as memories, emotions, and 
attention. The most recent formulation of a pattern 
theory for cutaneous perceptions (Melzack & Wall 
1962) proposes that skin receptors have specialized 
physiological properties for the transmission of par- 
ticular kinds and ranges of stimuli into patterns of 
nerve impulses, rather than modality-specific in- 
formation, and that every discriminably different 
somesthetic perception is produced by a unique 
pattern of nerve impulses. 

The concept of patterning of nerve impulses, 
together with three recently discovered features of 
the skin sensory system, provides the basis for a 
new theory of pain (Melzack & Wall 1965). First, 
there is now abundant physiological and anatomi- 
cal evidence of efferent fiber systems that run from 
the brain down to the afferent pathways and are 
capable of modifying or inhibiting the afferent 
pattern in the course of its transmission centrally 
(see Livingstone 1959). Second, the dorsal-column 
and dorsolateral systems of the spinal cord have 
properties (see Melzack & Wall 1965) indicating 
that their function may well be that of arousing 
the central processes subserving memories of prior 
experience, attention, and so forth, which are 
then able to act downward on the afferent impulse 
patterns. 

The third line of evidence derives from the re- 
cent work of Wall (1962), which shows that a 
sensory input arriving at the spinal cord has two 
effects. First, it transmits information from the 
peripheral nerve to spinal-cord cells whose fibers 
go to the brain, and, second, it influences the prop- 
erties of the substantia gelatinosa, a diffusely in- 


362 PAIN 


terconnected band of tissue lying throughout the 
length of the spinal cord in the dorsal horn. Men- 
dell and Wall (1964) have shown that the sub- 
stantia gelatinosa can both inhibit and facilitate 
the transmission of the coded sensory information 
from peripheral fiber to central cell. They point 
out that there is a continuous tonic input from 
the periphery to the substantia gelatinosa, so that 
continual inhibitory control is exerted over the 
transmission of nerve impulses across the synapses 
from peripheral fibers to central cells. 

This tonic inhibition can be increased or de- 
creased by the size of the fiber stimulated. Thus 
the largest A fibers increase the tonic inhibitory 
effect of the substantia gelatinosa, while the small- 
diameter C fibers decrease the inhibitory influence, 
that is, actually facilitate the transmission of in- 
formation in such a way that there is a greater 
likelihood of all inputs, from the peripheral and 
autonomic nervous systems, as well as from the 
brain, summating and thereby producing the char- 
acteristic pattern of high-frequency bursts of im- 
pulses that signals pain. The substantia gelatinosa, 
moreover, is a functionally continuous unit, so 
that different parts of the body are connected in 
a way that permits the spread of trigger zones ob- 
served in phantom-limb and causalgic pain. 

Gate control theory. These three features of 
the skin sensory system provide the basis for a gate 
control theory of pain (Melzack & Wall 1965). 
The theory proposes that (1) the substantia gela- 
tinosa functions as a gate control system that 
modulates the amount of input transmitted from 
the peripheral fibers to the dorsal horn transmis- 
sion (T) cells; (2) the dorsal column and dorso- 
lateral systems of the spinal cord act as a central 
control trigger, which activates selective brain 
processes that influence the modulating properties 
of the gate control system; and (3) the T cells 
activate neural mechanisms that constitute the 
action system responsible for both response and 
perception. 

Figure 1 provides a schematic diagram of the 
gate control theory, showing the large-diameter and 
small-diameter peripheral fibers and their projec- 
tions to the substantia gelatinosa ( SG) and T cells 
in the dorsal horn. The inhibitory effect exerted 
by the substantia gelatinosa on the afferent fiber 
terminals is shown to be increased by activity in 
the large fibers and decreased by activity in the 
small fibers. The central control trigger is repre- 
sented by the heavy line running from the large- 
fiber system to the central control mechanisms: 
these mechanisms, in turn, project back to the gate 
control system. The T cells project to the entry cells 


Central 
control 


Gate 
contro] 
system 


Figure 1 — Schematic diagram of the gate control theory 


Source: From Melzack & Wall 1965, Copyright © 1965 by the 

American Association for the Advancement of Science, 

of the action system, Excitation is represented 
by +; inhibition by —. 

The theory proposes that pain phenomena are 
determined by interactions among these three sys- 
tems. For example, a marked loss of the large 
peripheral-nerve fibers, which may occur after 
traumatic peripheral-nerve lesions or in some of 
the neuropathies (Greenfield 1958), such as post- 
herpetic neuralgia (Noordenbos 1959), would de- 
crease the normal presynaptic inhibition of the 
input by the gate control system. Thus, the input 
arriving over the remaining large and small fibers 
is transmitted through the unchecked, open gate iets 
duced by the small-fiber input. This, together with 
the Opportunity for summation of inputs into the 
substantia gelatinosa from other parts of the body 
and from the brain, provides the basis for the trig- 
gering of pain by a variety of stimuli that are 
normally not noxious. 


Affect and motivation 


Pain has generally been considered primarily a 
Sensory experience somewhat similar to sight or 
hearing. In one important respect, however; pee 
differs from vision and hearing: it has a unique 
distinctly unpleasant quality that wells up in con- 
sciousness and obliterates anything we may Bo 
been thinking or doing at the time. It become 
overwhelming and demands immediate anea 
Besides its sensory component, then, pain also s 
a strong emotional quale that drives (or moa 
the organism into doing something about it. oe 
ing responses are such as to stop the pain quickly 
by whatever course of action is possible. £ the 

Introspectionist psychologists at the turn 0 Hi 
century made a sharp distinction between 
sensory and the affective dimensions of the P 


experience. Titchener (1909) was convinced that 
there is a continuum of feeling in conscious ex- 
perience, distinctly different from sensation, that 
ranges through all degrees of pleasantness and 
unpleasantness. These two dimensions, the sensory 
quality and the affective quale, are brought clearly 
into focus by clinical studies on prefrontal lobot- 
omy (Freeman & Watts 1942), a neurosurgical 
operation for intense pain in which the connections 
between the prefrontal lobes and the rest of the 
brain are severed. Typically, these patients report 
after the operation that they still have pain but 
that it no longer bothers them; they simply no 
longer care about the pain and often forget it is 
there, It is certain that the operation does not stop 
pain perception entirely, since the sensory com- 
ponent is still present. The predominant effect of 
the operation seems to be on the affective coloring 
of the total pain experience; the terribly unpleas- 
ant quality of the pain has been abolished. 

Brain areas involved. Recent experiments sug- 
gest that there are portions of the brain that are 
particularly concerned with the motivating aspects 
of behavior. Miller (1957) has recently found sub- 
cortical areas in the brain that produce vigorous 
escape reactions, cries, and other emotional be- 
havior characteristic of pain perception when they 
are stimulated electrically. It seems possible that 
the activities in these areas provide the neural 
substrate for the affective, “driving” component of 
pain perception. [See NERVOUS SYSTEM, article on 
BRAIN STIMULATION.] 


Toward a definition of pain 


5 In recent years the evidence on pain has moved 
in the direction of recognizing the plasticity and 
modifiability of events occurring in the central 
nervous system. In the lower part of the brain at 
least, the patterns of nerve impulses evoked by 
noxious stimulation travel over multiple pathways 
going to widespread regions of the brain and not 
along a single path going into a “pain center.” The 
Psychological evidence lends strong support to the 
consideration of pain as a perception determined 
by the unique past history of the individual, by 
the meaning the stimulus has to him, by his “state 
of mind” at the moment, as well as by the sensory 
nerve patterns evoked by a physical cause. In this 
Way, pain becomes a function of the whole indi- 
vidual, including even his thoughts and hopes for 

e future, 

Pain, then, refers to a category of complex ex- 
E not to a kind of stimulation. Clearly, 
e te are many varieties and qualities of experi- 
nce that are simply categorized under the broad 
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heading of pain because they defy more subtle 
verbal description. There are the pains of a scalded 
hand, a stomach ulcer, a sprained ankle; there are 
headaches and toothaches. But there is also the 
heartache of the scorned lover, the pain of losing 
a dear friend. It may be argued that the pain of 
bereavement is very different from the pain that 
follows surgery. But the pain of a coronary occlu- 
sion is just as uniquely different from the pain of 
a scalded hand. 

Why is it so difficult to achieve a satisfactory 
definition of pain? The answer appears to be that 
pain is not a single, specific experience that can 
be analyzed and manipulated. It may be agreed 
that pain, like vision and hearing, is a complex 
perceptual experience. But the numerous, diverse 
causes of pain prevent the specification of a par- 
ticular kind of environmental energy as the specific 
stimulus for pain, in the way that light can be 
specified as the adequate stimulus for vision, and 
air pressure waves for hearing. Pain is a category 
of experiences, signifying a multitude of different 
unique events having different causes and char- 
acterized by different qualities varying along a 
number of sensory and affective dimensions. 


RONALD MELZACK 


[Directly related are the entries NERVOUS SYSTEM; 
SENSES; SKIN SENSES AND KINESTHESIS. Other rele- 
vant material may be found in DRUGS; HYSTERIA; 
PsycHoLoGY, article on PHYSIOLOGICAL PSYCHOL- 
OGY; PSYCHOSOMATIC ILLNESS.] 
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PAINE, THOMAS 


Thomas Paine (1737-1809) was by all odds 
the most influential revolutionary pamphleteer of 
the Anglo-American world of the late eighteenth 
century, a figure of international importance. His 
pamphlet Common Sense, published in 1776, 
13 months after he arrived in Philadelphia from 
his native England, crystallized the sentiment that 
led to the Declaration of Independence. His series 
of papers, The American Crisis, stiffened patriot 
resistance during the war, in “the times that ty 
men’s souls.” The Rights of Man, published in 1791 
and 1792, was written after his return to England 
in 1787 in response to Edmund Burke's attack on 
the French Revolution. This pamphlet stimulated 
the radical reform movement in England and the 
Jeffersonian Republican party in the United States 
and led to the prosecution of Paine in England for 
sedition and to his election to the French National 
Assembly. In France from 1792 to 1802, his role 
in the revolution was minimal; allied to the Giron- 
dins, he was imprisoned by the Jacobins doas 
1794, barely escaping execution. His chief wor 
in France, however, The Age of Reason, which ap- 
peared in 1794 and 1795, was probably the most 
popular deist attack on revealed religion ever 
written. A 

Paine’s mind was “speculative but unscholarlyt 
(Clark [1944] 1961, p. xxix), and he was almos 
completely self-educated. His thought was to i 
great degree shaped by the experiences of his firs 
37 years in England, where as a poor staymaket 
(his father’s craft) and excise collector he devel: 


oped an impassioned hatred of a “system of gov- 
ernment” he held responsible for “the mass of 
wretchedness” in society (Complete Writings, 
vol, 1, pp- 404-405). The Quakerism of his father 
contributed to his humanitarianism and noncon- 
formism, Newtonian science, which he learned 
from popular lectures in London, gave him con- 
fidence in natural law and rationalism as well as 
a lifelong interest in scientific improvement. (His 
most notable scientific contribution was the per- 
fection of an iron bridge. ) Thereafter he absorbed 
the ideas of the Enlightenment that were preva- 
lent among his acquaintances in the advanced 
political groups of his day: in the United States 
the Philadelphia circle of Franklin, Jefferson, and 
Rush; in England the reform circle of Tooke, 
Godwin, and Priestley; in France the moderate re- 
publican circle of Condorcet and Brissot. 

Paine is generally classified neither as a political 
theorist nor as an original thinker but as a popu- 
larizer who brought the ideas of the Enlightenment 
to the ordinary man in a lucid and trenchant style. 
His originality lay in pushing the critique of es- 
tablished unnatural institutions and ideas beyond 
the immediate consensus of enlightened opinion 
and in a radical direction. Thus, Common Sense 
rejected not only King George 1I and any colonial 
tie to England but all monarchy, arguing for a 
broadly based republican form of government. The 
Rights of Man, in defending the French Revolu- 
tion, advocated a republic for England, irreverently 
rejected the constitutionalism that was the basis 
of the Whig rationale of reform, and exposed the 
government as a class instrument through which 
the rich profited from war and the poor were ex- 
Ploited by taxes. In the famous fifth chapter of 
Book 2 of The Rights of Man and in Agrarian 
Justice, published in 1797, Paine proposed a pro- 
gram—poor relief, public education, old-age pen- 
sions, unemployment projects, etc., all to be fi- 
nanced by progressive inheritance and income 
taxes—that anticipated the welfare state. The Age 
of Reason, with its book-by-book exposé of the in- 
consistencies of the Old Testament and the New 
Testament, indicting the obscurantism of Christi- 
anity as “an engine of power” which “serves the 
purpose of despotism” (Complete Writings. vol, 1, 
P. 600), transformed deism from an aristocratic to 
a democratic creed. 

; Paine returned to the United States in 1802, and 
fa his last years was unjustifiably denounced as a 
2 ee and an atheist, which opprobrium long 
i aracterized orthodox American political and re- 
Jea opinion of him. His ideas had a brief re- 

in American labor and reform movements, 
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but his fame was kept alive primarily by agnostics 
and freethinkers. In England, by contrast, The 
Rights of Man became “a foundation text of the 
English working class movement” until about 1880 
(Thompson [1963] 1964, pp. 31, 90). From the 
point of view of conservatives, Paine, because of 
his optimistic view of human nature and his failure 
to grapple sufficiently with the kinds of political 
institutions necessary to guarantee the rights of 
man, epitomizes the “naiveté” of the Enlighten- 
ment. He has had a continuing appeal to anarchists 
for his antistatist tendencies and to liberals and 
socialists for his radical egalitarianism. 


ALFRED YOUNG 


[For the historical context of Paine’s work, see the 
biographies of BURKE; CONDORCET; JEFFERSON; 
RusH; TooKE.] 
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PAN MOVEMENTS 


A pan movement is dedicated to the unification 
of a geographic area, linguistic group, nation, race, 
or religion. The term pan is so broad that it can 
be, and has been, used to designate a vast variety 
of disparate phenomena. Thus, in Italian, pan- 
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cristianesimo is used for Christian ecumenicism. 
Pan-Europa was a utopian plan of European fed- 
eration. Pan-Americanism, an attempt to create 
hemispheric solidarity, was a government-spon- 
sored enterprise. Pan-Germanism, Pan-Slavism, 
and Pan-Turkism were militant nationalist move- 
ments of a racist and imperialist type. Pan-Islam- 
ism was a reaction against European colonialism. 
The environments out of which they arose, their 
aims, forms of organization, methods, and ideolo- 
gies have been so diverse as to make valid gen- 
eralizations about the pan movements almost im- 
possible, 

Attempts at unification along religious, geo- 
graphic, or other lines had been made and the 
desire for unity of groups of people had existed 
long before the term pan, used in this particular 
sense, became current. The memories of the Ro- 
man Empire persisted for many centuries after its 
fall. The ideal of a Christian commonwealth sur- 
vived the Middle Ages. The Arabs continued to 
dream of Dar ul-Islam, the home of all Muslims. 
However, modern pan movements did not originate 
in the distant past. Rather, they were the result 
of the impact of romantic nationalism upon the 
divided peoples of Europe, Asia, and, to a lesser 
extent, America and Africa. 

Pan-Americanism. Pan-Americanism cannot be 
properly considered a movement. Originally, it 
was a policy promoted by Latin American states- 
men anxious to preserve the independence of their 
new nations; it was later taken up by the United 
States. As early as 1826 Simón Bolivar convened a 
Pan-American conference in Panama. The United 
States did not participate; its delegation arrived 
after the conference was over. Since 1890, how- 
ever, the United States has been the chief sponsor 
of Pan-American conferences, which have had a 
strictly official character as gatherings of represent- 
atives of all American states except Canada. Ques- 
tions of law, arbitration and conciliation, economic 
and cultural exchanges, public health, and tech- 
nical assistance have filled the agendas of these 
conferences. 

The conference of 1890 founded the Pan Ameri- 
can Union, with headquarters in Washington, D.C. 
The Union’s governing body was composed of the 
U.S. secretary of state and the representatives of 
all Latin American states. The Union promoted 
the sentiment of hemispheric solidarity and good 
will, disseminated commercial and tourist informa- 
tion, encouraged cultural exchange, and main- 
tained a library. Pan-Americanism never had an 
ideology. To some it seemed a screen for U.S, im- 


perialism, while others saw in it an expression of 
international understanding based on common 
hemispheric interests. 

Pan-Slavism. In marked contrast to Pan-Ameri- 
canism, Pan-Slavism was originally a nongovern- 
mental, if not antigovernmental, movement, The 
awakening of the Slavs in the Austrian and Otto- 
man empires produced a feeling of unity that led 
to the idealization of the common distant past of 
the Czechs, Slovaks, Serbs, Croats, Bulgars, and 
others. German—Hungarian and Turkish domina- 
tion nurtured the conviction that the liberty and 
even the survival of the Slavs depended upon their 
union. 

The Slovak poet Jan Kollar, 1793-1852, under 
Herder’s influence, attributed to ancient Slavs 
idyllic qualities of purity, peacefulness, humani- 
tarianism, and piety. Czech, Serbian, Croatian, 
Polish, and Ukrainian intellectuals responded with 
enthusiasm. There appeared a profusion of poems, 
historical studies, and grammars of the various 
Slavic languages. The Ukrainian poet Taras Shev- 
chenko, 1814-1861, and the Polish poet Adam 
Mickiewicz, 1798-1855, ardently advocated the 
unity of all Slavs. 

Under the impetus of the revolutions of 1848 
and in response to the menace of German nation- 
alism, Austrian Slavs shifted from purely literary 
and historical preoccupations to political activity. 
A Pan-Slav congress was convened in Prague in 
June 1848. The divergent, and often irreconcilable, 
interests of the various Slavic peoples manifested 
themselves at once, Austrian Slavs tended to be 
loyal to the Hapsburg multinational state, to fear 
both the Germans and the Russians, and to have 
but little concern for the liberation of Ottoman 
Slavs. The Poles saw their principal enemy in 
Russia and sought support against the tsar, while 
the Serbs and the Bulgars looked to St. Petersburg 
for help in overthrowing the Turkish yoke. The 
congress ended in complete disagreement. Pan- 
Slavism remained a_philosophical-literary phe- 
nomenon of minor importance until it entered its 
Russian phase in the second half of the nineteenth 
century, 

Russian Pan-Slavism was a lineal descendani 
of Slavophilism. However, where Slavophilism hai 
been theological, philosophical, and literary, P an; 
Slavism was militantly political. The Ukrainian 
historian Nikolai I. Kostomarov, 1817-1885, mej 
flecting the interests of a minority nagonki 
advocated a federal Slavic monarchy under a 
Russian tsar, but the Russian publicist Mikhail i 
Pogodin, 1800-1875, as early as 1838 glorifie 


Russia as the rising giant destined to unify the 
Slavs. To him, as to the next generation of Pan- 
Slavists, the future of Slavdom could be guaran- 
teed only by the tsar. 

Defeat in the Crimean War increased anti- 
Western sentiments among the more conservative 
Russians. Hostility to Europe and faith in the 
superiority of Russia were expressed in pseudo- 
scientific terms in the writings of the biologist 
Nikolai Ia. Danilevskii, 1822-1885. Danilevskii’s 
cyclic theory of historical development purported 
to show that the West was in decline and that the 
future belonged to the new, higher cultural-his- 
torical type, the Slavs. Like Pogodin, Danilevskii 
claimed for Russia the leading role in the unifica- 
tion of the Slavs. 

Hatred of the West, the Poles, the Catholics, the 
Jews, the socialists, and the bourgeoisie was char- 
acteristic of Fédor M. Dostoevski, 1821-1881, who, 
like most of the later Pan-Slavists, was essentially 
a Great Russian chauvinist. 

Russian Pan-Slavism was not a mass movement 
and had no centralized organization. Committees 
had come into existence in Odessa, Moscow, St. 
Petersburg, and elsewhere. These were originally 
benevolent societies to provide aid to Slavic refu- 
gees from the Ottoman Empire as well as to Slavic 
students in Russian schools. During the war be- 
tween Serbia and Turkey in 1876, committees in 
Russia recruited volunteers, collected money, and 
spread anti-Turkish propaganda. 

In its early stages Pan-Slavism was a move- 
ment of nationalist intellectuals, who were joined 
by some members of the bureaucracy and a few 
Wealthy merchants. However, the ideologists grad- 
ually gave way to politicians and generals, among 
them Mikhail G. Chernyaiev, 1828-1898, one of 
the conquerors of Turkestan and commander of 
the Serbian army in 1876; Nikolai P. Ignatiev, 
1832-1908, ambassador to Turkey and later min- 
ister of the interior; Mikhail D. Skobelev, 1843- 
1882, the Russian general infamous for the massa- 
cre of the Turkomans at Geok-Tepe. After 1878, 
Pan-Slavism rapidly lost much of its modest follow- 
ing and most of its appeal. It was absorbed into the 
reactionary ideology of Russian imperialism pro- 
mulgated by Alexander mı and Nicholas 11. 
be War 1 brought about the liberation and 
oo of all Slavic peoples except the 
eed and the Belorussians. It also demon- 
ie the unreality of Pan-Slavism and the 
an of nationalism. For Poland, Czechoslo- 

a, Yugoslavia, and Bulgaria, Pan-Slavism had 
come utterly useless. 
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During World War m the Soviet government 
attempted to resurrect Pan-Slavism and to use it 
as a weapon against Germany. A committee was 
hastily formed in Moscow in August 1941, two 
months after Russia had been invaded. While 
nineteenth-century Pan-Slavism, even in its Rus- 
sian form, had been an independent movement, 
often at odds with the more cautious government 
policies, Stalin’s Pan-Slavism was the product of 
the state propaganda machine. The Moscow com- 
mittee published a magazine and organized con- 
gresses in Russia, Britain, and the United States 
with the purpose of mobilizing public opinion 
among Slavs in favor of the Soviet Union. After 
the war the committee was briefly used by the 
Russians to consolidate their position in eastern 
Europe. Yugoslavia’s withdrawal from the Soviet 
camp broke up the already moribund Pan-Slav 
organizations. Their value to the Soviets had proved 
negligible. Stalin now preferred to stress Russian 
nationalism rather than Pan-Slavism, which had no 
appeal at home and, after the defeat of Germany, 
very little appeal abroad. 

Pre-Soviet Pan-Slavism had two distinct stages: 
the nonpolitical, cultural, democratic one, in which 
Czech and Slovak intellectuals played the dominant 
part; and the conservative, nationalist, political 
one, led by an unusual combination of intellectuals, 
bureaucrats, and the military. As a political move- 
ment Pan-Slavism was a failure. It never developed 
a unified organization, nor was it able to create a 
mass movement anywhere. Its most significant 
effects were in the realm of culture. Pan-Slavism 
stimulated scholarly interest in Slavic antiquity, 
spurred linguistic and archeological studies, en- 
couraged the collection of folklore, and gave the 
Czechs, Slovaks, Serbs, Croats, and Bulgars a sense 
of worth and importance through membership in 
a vast and glorious community. Pan-Slavism meant 
but little to the Poles and the Russians. The latter 
used its phraseology but transformed the . move- 
ment into an instrument of extreme nationalism 
and imperialism. 

Pan-Germanism. Pan-Germanism was a politi- 
cal movement similar to the second stage of Pan- 
Slavism. Unlike the latter, however, it confined 
its activities to one nationality—the German—and 
was very well organized. The Pan-German League, 
the heart and soul of the Pan-German movement, 
was founded by a small number of extreme na- 
tionalists interested in overseas expansion. Its first 
leader was Carl Peters, 1856-1918, who had 
achieved fame by acquiring several African terri- 
tories for Germany. Lust for colonies, chauvinism, 
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militarism, and an almost hysterical Anglophobia 
were the chief characteristics of the organization, 
which from 1891 to 1893 bore the name Allge- 
meiner Deutscher Verband and from 1893 on, the 
name Alldeutscher Verband. 

Under the leadership of Ernst Hasse, 1846— 
1908, the league became a vigorous and active 
organization. Membership was open to every Ger- 
man regardless of citizenship. Annual dues were 
low, as was the subscription price of the league's 
organ, the Alldeutsche Blätter. The league was 
made up of local branches and district organiza- 
tions subordinate to an executive council, which 
determined policy, amended the league's constitu- 
tion, chartered the local branches, and elected the 
executive and business-managing committees. In 
fact, the president of the league, who also headed 
the business-managing committee, concentrated 
authority in his own hands. 

The Pan-German League did not become a 
mass organization. Its membership fluctuated with 
changes in the international situation, reaching 
its peak during the Boer War (with 21,924 mem- 
bers in 1901) and dropping slightly thereafter 
(to 17,000 members in 1912). The league's organ 
constantly complained of the indifference of the 
German public. The membership was drawn largely 
from the middle classes, with teachers, business- 
men, and physicians heavily represented. Indus- 
trial workers and artisans accounted for about 15 
per cent of the membership in 1901. In 1914, 24 
per cent of the officers of local branches were 
teachers, 31 per cent businessmen, 12 per cent 
government officials, 8 per cent physicians, 9 per 
cent technicians, 5 per cent lawyers, 3 per cent 
clergymen, 2 per cent writers and editors. Signifi- 
cantly, only 1 per cent were nobles and only 1 per 
cent were military men. In the same year almost 
10 per cent of the officers of local branches held 
PH.D.S. 

Since Pan-Germanism had official spokesmen 
and a central organization, its purposes were for- 
mally stated on many occasions. In its Handbuch 
(quoted in Wertheimer 1924, p. 95) the league 
proclaimed that it “strives to quicken the national 
sentiment of all Germans and in particular to 
awaken and foster the sense of racial and cultural 
kinship of all sections of the German people.” The 
league also proposed to strive for the preservation 
of the German people, to combat “all forces which 
check the German national development, to further 
German interests throughout the world and par- 
ticularly the colonial movement.” 

The Germans as the Herrenvolk were entitled 
to consideration and respect. It was the task of the 


league to arouse their patriotism through agitation 
and propaganda and to indoctrinate the masses in 
“hundred-per-cent Germanism,” in hatred of Poles, 
Jews, or any foreigners, in militarism, and in the ` 
need for expansion. At its 1898 convention the 
league proclaimed 26 aims, among them reorgan- 
ization of the navy; acquisition of coaling stations 
in the Red Sea, the West Indies, and Singapore; 
increased financial assistance to German schools 
abroad; employment of only German labor in im- 
perial and state domains; prohibition of immigra- 
tion of “less worthy elements”; prohibition of the 
use of foreign languages in meetings and clubs; 
and purification of the German language through 
the substitution of Germanic derivatives for bor- 
rowed words (Landshauptmann for Gouverneur, 
Befehlshaber for Kommandant). 

The league carried on its activities through meet- 
ings, lectures, newspaper publicity, the dissemina- 
tion of the Alldeutsche Blätter, the publication of 
occasional books and pamphlets, and participation 
in politics. It is difficult, if not impossible, to 
measure the impact of the league’s activity on the 
German people. The circulation of the league's 
literature was small, as was the number of people 
directly exposed to its propaganda at public meet- 
ings. Most of the principles and attitudes of Pan- 
Germanism had been part of the intellectual and 
emotional baggage of a large segment of the popu- 
lation before the league was formed, and they 
would continue to exist, finding embodiment in 
other supernationalistic organizations, 

In politics the league did not function as a party. 
Its members belonged to the National Liberal 
party, the Conservative party, the Reichspartei, 
the Wirtschaftliche Vereinigung, the Deutsche So- 
ziale Partei, and other extreme right-wing, often 
anti-Semitic, parties. In 1898, of the 397 members 
of the Reichstag, 15 belonged to the Pan-German 
League; in 1907, 34; in 1912, 15. 

Although most of its views were shared by vast 
numbers of people, the league was unable to com- 
pete with other organizations that advocated parts 
of its program, while not subscribing to all of it 
The league's ideology was too radical and t00 
brutal. Only a few thousand were capable of swal- 
lowing the unsavory mixture of colonialism, mili- 
tarism, navalism, anti-Semitism, Anglophobia, 
anti-Slavism, and antisocialism. Yet the league 
undoubtedly helped to bring all these ideas into 2 
sort of system and bequeathed them to later eX 
tremists, among them Hitler. d 

Pan-Islamism. The failing fortunes of Muslim 
states, and their inability to withstand the pres- 
sure of European imperialism gave rise to the Pan- 


Islamic movement. Its founder and principal ide- 
ologist, Jamal al-Din al-Afghani, 1838-1897, was 
probably born in Iran and lived in Afghanistan, 

- India, Egypt, Iran, and Turkey. Together with an 
Egyptian disciple, Muhammad *Abduh, 1849-1905, 
he briefly published a weekly newspaper that called 
for the union of Muslim peoples and states against 
Western aggression and domination. Jamal al-Din’s 
influence quickly spread throughout the Middle 
East and among the Muslims of Russia. He spent 
some time at the court of the shah of Persia, 
Nasir al-Din, but the shah discovered that Jamal 
al-Din’s propaganda had revolutionary implications 
and exiled him. However, Pan-Islamists were active 
both in the agitation directed against an English 
tobacco monopoly in Persia in 1890 and in the re- 
sulting revolutionary movement, Nasir al-Din was 
assassinated in 1896 by a member of Jamal al- 
Din’s circle. 

Pan-Islamism never developed a strong organ- 
ization, It was rather a projection of Jamal al-Din’s 
ideas and personality. Like any traditional murshid 
(leader of a sect or religious brotherhood), he 
relied chiefly on his personal magnetism and con- 
siderable powers of persuasion. Being quite un- 
scrupulous as to means, he hoped to utilize the 
caliphate for Muslim unification and attempted to 
win the support of the reactionary and barbarous 
Ottoman sultan, ‘Abd al-Hamid 1m, who, in turn, 
saw in Pan-Islamism a means of strengthening his 
own hold over the vast but tottering Ottoman Em- 
pire. Although the brief collaboration between the 
radical Jamal al-Din and the traditionalist ‘Abd 
al-Hamid bore no fruit, Pan-Islamic ideas pene- 
trated the minds of a number of younger bureau- 
crats and soldiers in Constantinople. 

Pan-Islamism was utopian. It advocated the 
union of disparate elements and preached a su- 
Pranational doctrine at the exact moment that 
Western nationalism was beginning to stir up the 
Peoples of the Middle East; it appealed to diverse 
religious communities, which could not unite with- 
out abandoning Islam as they understood it and 
at paradoxically, destroying the basis of Pan- 
jive oa itself, Jamal al-Din did not bring about 
te Teconciliation of the Sunnites and the Shiites. 
ae to sense the growth of nationalism among 

ersians, the Turks, and the Arabs. For all his 
sed modernism he was not a pioneer, His 
feat although frequently mouthed, were not a 
sat Roe for action, and his clerical following was 
‘Pas t to lead a political movement. Pan-Islamism 

i aan by the Turks for anti-Russian and anti- 
ae purposes during World War 1. However, a 

Majority of Russian and Indian Muslims failed 
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to respond to propaganda from Constantinople. In 
Turkey itself, Pan-Turkism had much greater ap- 
peal to the intellectuals. After 1918 Pan-Islamism 
faded away. 

Pan-Turkism. Pan-Turkism was born in the 
last quarter of the nineteenth century among the 
Tatars of the Crimea and the Volga; influenced by 
Pan-Islamism, it was a reaction against the en- 
croachments of a virulent Russian nationalism. It 
was Pan-Slavism, however, that provided Pan- 
Turkism with an intellectual model. While study- 
ing in Moscow, Ismail Bey Gasprinski, 1851-1914, 
a Crimean Tatar, absorbed many Pan-Slavist views, 
such as the primacy of race and language in creat- 
ing a community; he combined them with certain 
elements of Pan-Islamism to produce an ideology 
that suited the needs of a nascent Tatar intelli- 
gentsia, Although theoretically he envisaged the 
union of all Muslim peoples, his activity was di- 
rected at Russia’s Turkic masses, whom he wanted 
to provide with a common language and a modern 
education. 

The younger generation of Tatar intellectuals 
further secularized Pan-Turkism, breaking with 
Pan-Islamism and emphasizing Turkic nationalism 
and racism, even at the expense of other Muslim 
peoples and states. One of the chief theorists of 
Pan-Turkism, the Russian Tatar Yusuf Akçura, 
1876-1935, was exposed during his student days 
in Paris to the thought of Joseph Renan, Auguste 
Barrés, Joseph Gobineau, Houston Chamberlain, 
Hegel, and the social Darwinists. To him Islam 
was a thing of the past. Only language and race 
could provide the basis and the inspiration for the 
political unity of Ottoman and Russian Turks. 

During and after the Russian revolution of 1905, 
Pan-Turkism gained considerable influence among 
the Turkic peoples of the empire, although it never 
achieved exclusive control or undisputed leader- 
ship. The differences of sect, language, history, 
circumstance, and aspirations between the Tatars, 
Kazaks, Uzbeks, Turkomans, and Azerbaijani were 
far too great to permit the effective adoption of a 
uniform ideology or the predominance of a single 
organization. 

In the years 1905-1908, Pan-Turkism expressed 
itself through its press and through several Muslim 
congresses, which served it as sounding boards. 
The virtual restoration of Russian absolutism after 
1908 drove the Pan-Turkist leaders out of the 
country. They found refuge in Constantinople, 
where they influenced a number of Young Turks, 
who had recently come to power. The Constanti- 
nople circle evolved an elaborate myth of Turkic 
racial superiority. Ahmad Bey Agaev, 1865-1939, 
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an Azerbaijanian journalist, repudiated Islam and 
the Ottoman past and sang the glories of Attila and 
Genghis Khan. Ziya Gökalp, 1875-1924, himself a 
Kurd, found the original home of the Turkic race 
in the legendary Taran, a mythological land of 
Persian epic poetry. Pan-Turkist appetites grew 
steadily. On the eve of World War 1 Pan-Turkists 
talked of an empire stretching from the Mediter- 
ranean to the Pacific. 

With the outbreak of the war Pan-Turkist agita- 
tion reached its high point. The introduction of the 
military draft among the previously exempt Mus- 
lims of central Asia provoked rebellions among the 
Kazaks, the Kirghiz, and the Uzbeks. Pan-Turkist 
propaganda attempted to use these for its own pur- 
poses but met with little success. Only a complete 
breakdown of the Russian Empire could make pos- 
sible the realization of the Pan-Turkist dreams. 
However, the collapse of Russia was followed 
shortly by the collapse of Turkey. From then on, 
Turkish patriots were much more interested in 
Saving their own country than in engaging in 
adventures abroad. The new Turkey of Kemal 
Atatiirk tolerated Pan-Turkist intellectuals but did 
not follow their lead. Without Turkish support the 
movement lost its importance, while in Russia it 
was effectively eradicated by the Soviets. However, 
Pan-Turkism contributed certain ideas to modern 
Turkish nationalism. The attempt to “purify” the 
Turkish language of Arabic and Persian words, the 
glorification of the supposedly great Turko—Mon- 
golian civilization in central Asia, xenophobia, the 
neglect of the Ottoman cultural heritage, all were 
the dubious inheritance Pan-Turkism left to modern 
Turkey. 

Other pan movements. The modern world has 
seen several other pan movements, of which Pan- 
Arabism, Pan-Iranism, and Pan-Africanism are 
worth mentioning. 

Pan-Arabism. Like Pan-Turkism, Pan-Arabism 
was closely related to Pan-Islamism, Influenced both 
by Jamal al-Din al-Afghani and by Western nation- 
alism, Egyptian intellectuals such as Mustafa 
Kamil, 1874-1908, and Saad Zaghlal, 1860?-1927, 
preached the resurgence of the Arab nation, While 
Turkish and Persian national sentiment could exist 
independently, Arab nationalism was impossible 
without Islam, since Arabism itself was defined in 
terms of classical Arabic and Muslim faith, Like 
Pan-Islamism, Pan-Arabism failed to become a 
popular movement. Its potential impact was broken 
by the diversity of the Arab world, the rapid growth 
of nationalism in Egypt and Syria, and the dispar- 
ity in the cultural, economic, and political levels 
between the various Arab States (for instance, 
Lebanon as compared to Yemen, or Tunisia as 


compared to Saudi Arabia). The Pan-Arab idea and 
the awareness among the intellectuals of the great 
potential of Arab unity have often been exploited 
for national ends. The Arab League, which pur- 
ports to express on the governmental level the real- 
ity of Pan-Arabism, has not had much influence 
and seems to owe its continued existence to the 
irritation provided by the state of Israel. The for- 
mation of the United Arab Republic in 1958 and 
its rapid dissolution demonstrated the weakness of 
Pan-Arabism in relation to Egyptian and Syrian 
nationalism. 

Pan-Iranism. Pan-Iranism was a belated echo 
of Nazi wartime propaganda, an ephemeral inven- 
tion of a few uninfluential individuals who pro- 
claimed themselves a political party in 1946. It 
needs to be mentioned only because the term was 
used in the Soviet Union during the last years of 
Stalin's rule whenever the Soviet government con- 
ducted campaigns of repression against Tajik in- 
tellectuals. The latter's passive resistance to Russi- 
fication was frequently attacked as a manifestation 
of Pan-Iranism, just as the national tendencies 
of the Uzbeks and Azerbaijani were labeled Pan- 
Turkist, 

Pan-Africanism. Pan-Africanism also fails to 
qualify as a movement. It is rather an affirmation 
of a unity that does not exist. Pan-Africanism is 
used for political purposes by the governments of 
the various African states, but, as yet, it exercises 
no appreciable influence on their policies. 


F. KAZEMZADEH 


[See also COLONIALISM; IMPERIALISM; INTERNATIONAL 
RELATIONS, article on IDEOLOGICAL ASPECTS; NA- 
TIONALISM; SOCIAL MOVEMENTS. Other relevant ma- 
terial may be found in IDEOLOGY; INTERNATIONAL 
INTEGRATION. ] 
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PANEL STUDIES 


The potentials of panel analysis were first de- 
yeloped at Columbia University under the aegis of 
Paul F. Lazarsfeld. The first major panel study 
carried out under the techniques pioneered by 
Lazarsfeld was a study of voter decision making 
during the 1940 presidential election campaign 
(Lazarsfeld et al. 1944). Since then there have 
been numerous panel studies in a variety of socio- 
logical, political, and economic areas. Panel studies 
have been made of other elections in the United 
States and other countries, of the socialization of 
medical students to professional norms, and of 
social climates in high schools. 

The idea behind panel analysis is deceptively 
simple. Instead of comparing aggregates over time, 
panel analysis compares repeated observations of 
individuals, Insofar as the study of social change 
is limited to net changes in a social aggregate, the 
absence of such net changes is often assumed to 
be indicative of social stability. But constancy in 
the aggregate may obscure considerable compensa- 
tory change among individuals. For example, the 
distribution of income in a society may show no 
net change over a ten-year period, but various 
processes of economic mobility may be at work. 
In short, panel analysis gives rise to the study of 
an aspect of social change that tends to be neg- 
lected in studies of aggregate trends. 

d Origins. The technique of using repeated inter- 
views with a constant sample of people was first 
attempted by Stuart Rice (1928) during the 1924 
Presidential election campaign. Rice’s panel con- 
sisted of students of sociology in the three upper 
classes at Dartmouth College. His purpose was to 
find out how preferences for candidates shifted 
during the campaign. Though his research design 
included few variables, Rice’s analysis was quite 
Modern in that he did differentiate between net 
change and gross change. Theodore Newcomb 
(1943), in his classic study of Bennington College 
raent; conducted repeated interviews for the 
our-year period 1935 to 1939. His main interest 
Was in learning what effect a liberal college en- 
vironment had upon the attitudes of girls coming 
tom well-to-do, conservative families. 
oe, both of these early efforts at panel analysis, 
a were interested in identifying stu- 
ihe mee had changed attitudes and in discerning 
Sih ve for the changes. Neither Rice nor New- 
nique k owever, developed an appropriate tech- 
stribed or handling repeated data. Newcomb de- 
shifet in detail the girls who showed a marked 
ing ae a conservative to a liberal outlook dur- 

eir stay at Bennington and, conversely, those 
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who were impervious to the liberalism of the fac- 
ulty. But although he collected a wealth of statis- 
tical information in his interviews with students, 
Newcomb tended to rely on qualitative analysis of 
the extreme cases of change. 

The first systematic statement on the technique 
of panel analysis was made by Lazarsfeld and 
Fiske (1938). These researchers reported their ex- 
periences and problems with the new panel tech- 
nique. 

Applications. Formally, panel analysis is a re- 
search technique for collecting and analyzing data 
through repeated interviews with a sample of indi- 
viduals in a natural, rather than a laboratory, set- 
ting. Before examining the characteristic modes of 
panel analysis, it is useful to compare panel analy- 
sis with other research techniques in social science 
that involve over-time data. 

Econometrics, for example, makes use of quan- 
titative time series data. Monthly, quarterly, or 
annual series may cover hundreds of time points, 
whereas panel studies seldom exceed six inter- 
views. In econometrics, the data typically involve 
a single geographic or political entity. Though 
data may be collected from cross-sectional samples 
of individuals, families, or firms, the interest is 
usually in the aggregate dynamics of a single eco- 
nomic system. Panel studies typically range from 
300 to 3,000 cases, and the interest is in the 
molecular changes of these hundreds or thousands 
of individuals. 

In experimental psychology, learning studies in- 
volve longitudinal data covering hundreds of ob- 
servations, but the focus is usually on the change 
in one criterion. The research tends to be highly 
experimental and theoretical, and involves the use 
of mathematical models. Panel analysis, on the 
other hand, is nonexperimental and descriptive. It 
emphasizes the interrelationships of many chang- 
ing variables and is statistical, though not highly 
mathematical. In such panel studies, a set of inter- 
locking variables may be investigated because they 
are thought to be theoretically fruitful, and there is 
considerable theoretical improvisation during the 
analysis of the data. 

In educational research, follow-up studies in- 
volve over-time data. The interest, however, is not 
in periodically reobserving the same variables but 
in correlating a set of predictors with a criterion, 
as when a battery of aptitude tests is correlated 
with subsequent vocational success. 

In social psychology, research design usually 
takes the form of controlled experiment involving 
pefore—after data obtained from subjects in a lab- 
oratory setting. The researchers decide which sub- 
jects will be exposed to what stimulus. In panel 
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studies, by contrast, the researchers have no con- 
trol over the individuals in the study, nor do 
they manipulate any stimuli. Experiments, unlike 
panels, are not intended to uncover new concepts 
or descriptively explore new terrain but, rather, to 
verify or nullify hypotheses dictated by psycho- 
logical theory and formulated before the start of 
the experiments. However, a type of panel study 
known as the “impact” panel does parallel con- 
trolled experiments; the similarities between these 
two techniques will be considered at greater length. 


The turnover table 


The starting point, or central concept, of panel 
analysis is the turnover table, showing a categorical 
variable at time 1 cross-tabulated with itself at 
time 2. For a variable with n possible response 
categories, a turnover table is an n x n table sum- 
marizing the responses of each individual in the 
panel on a particular item at two successive inter- 
views. In essence, such a table shows not only net 
change but also gross change. 

Since the turnover table is basic, let us first ex- 
amine the simplest case: a dichotomy (n = 2) ob- 
served at two time points. An example comes from 
a study by the Bureau of Labor Statistics, which 
in February 1963 carried out a nation-wide survey 
of men between the ages of 16 and 21 who were 
no longer enrolled in school. Two years later the 
bureau resurveyed them to analyze their early 
work experiences and problems. Table 1, which 
includes only those who were in the labor force at 
both times, shows the employment status of the 
youths at the two times. 


Table 1 — Employment status in February 1963 and 1965 


FEBRUARY 1965 


FEBRUARY 1963 Employed Unemployed 1963 totals 
Employed 76.6% 4.7% 81.3% 
(1,628,000) (101,000) (1,729,000) 
Unemployed 14.0% 47% 18.7% 
(298,000) (99,000) (397,000) 
1965 totals 90.6% 94% 100.0% 
(1,926,000) (200,000) (N = 2,126,000) 


Source: Perrella & Waldman 1966, table A, 


From the totals, we observe that the employ- 
ment rate for these youths rose from approximately 
81 per cent to almost 91 per cent. Although there 
was a 10 per cent net change, there were actually 
19 per cent whose employment status changed 
from working to being jobless, or the reverse. The 


Table 2 — Net change equals gross change 


FEBRUARY 1965 


FEBRUARY 1963 Employed Unemployed 1963 totals 
Employed 1,729,000 0 1,729,000 
Unemployed 197,000 200,000 397,000 

1965 totals 1,926,000 200,000 2,126,000 


Source: Perrella & Waldman 1966, table A. 


information that the rate rose from 81 to 91 per 
cent is not enough to predict how many individuals 
experienced a change in employment status. Table 
2 shows one possible situation—where net change 
and gross change are identical. Table 3 shows a 
contrasting possibility—where the gross change is 
almost three times as much as the net change. 
Thus, tables 1-3, with the same aggregate em- 
ployment trend, represent very different economic 
situations. 


Table 3 — Gross change equals almost three times net 


change 
FEBRUARY 1965 
FEBRUARY 1963 Employed Unemployed 1963 totals 
Employed 1,529,000 200,000 1,729,000 
Unemployed 397,000 0 397,000 
1965 totals 1,926,000 200,000 2,126,000 


Source: Perrella & Waldman 1966, table A. 


I will return to this example in more depth when 
I come to discuss qualifier analysis. Meanwhile, it 
is useful to examine more complex multicategor- 
ied turnover tables. 

The data in Table 4 come from a panel study 
of the 1948 presidential election campaign between 
President Truman and Governor Dewey ( Berelson 
et al. 1954). The study was carried out in Elmira, 
New York, which in 1948 was a predominantly 
Republican community. Before examining the in- 
terior cells, we first examine the trend, as shown 
in Table 5. 


Table 4 — Vote intention, August and October 1948 


ION st 
AUGUST OCTOBER VOTE INTENT! hy 
VOTE INTENTION. Republican Undecided Democrat a 
Republican 369 54 17 a 
Undecided 22 102 12 184 
Democrat 6 27 151 
60 
October totals 397 183 180 y 
p. 23. 


Source: Adapted from Berelson et al. 1954, 


Table 5 — Trend in vote intention, August to October 
1948 (per cent) 


VOTE INTENTION AUGUST OCTOBER NET CHANGE 
Republican 57.9 52.2 —5.7 
Undecided 17.9 24.1 +6.2 
Democrat 24.2 23.7 —0.5 


Source: Adapted from Berelson et al. 1954, p. 23. 


Thus, as the campaign progressed, instead of 
more would-be voters reaching a decision, both 
parties appear to have lost adherents, with a corre- 
sponding increase in the undecided category. This 
information could of course have been obtained 
from two polls of separate samples. When we ex- 
amine the interior cells of the turnover table, we 
see that there was more than a 6 per cent change. 
Dividing all of the entries in Table 4 by the grand 
total, we can obtain the distribution of changer 
types (see Table 6). 


Table 6 — Distribution of vote intention (per cent) 


OCTOBER VOTE INTENTION. 


AUGUST August 
VOTE INTENTION Republican Undecided Democrat totals 
Republican 48.5 72 2.2 57.9 
Undecided 29 13.4 16 17.9 
Democrat 0.8 3.5 19.9 24.2 

October totals 52.2 24.1 23.7 100 = 760 


Source: Adapted from Berelson et al. 1954, p. 23. 


The percentages on the diagonal running from 
Upper left to lower right represent those whose vote 
intention remained constant, the total being 81.8 
Per cent. The gross change is thus about three 
times the net change. The individual changes, 
largely compensatory, can be classified into three 
types: (1) crystallizers (4.5 per cent), or those 
who were undecided but reached a decision in 
October; (2) waverers (10.7 per cent), or those 
who were decided but became doubtful in October; 
(3) converters (3 per cent), or those who switched 


Ti 
able 7 — Changes in vote intention (per cent) 


AUGUST 

ina OCTOBER VOTE INTENTION Number 

: N Republican Undecided Democrat Totals* of cases 

ee 83.9 12.3 3.9 100 440 

pene 16.2 75.0 8.8 100136 
mocrat 33 147 82.1 ODM. 84: 


* Detail: 
N's May not add to totals because of rounding. 


Source: Adapted from Berelson et al. 1954, p. 23- 
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from Republican to Democrat or from Democrat 
to Republican. 

Turnover tables are also frequently examined by 
dividing the entries in each row by the row total; 
these percentages, which provide estimates of 
“transition probabilities,” enable us to examine 
change, controlling for initial position (see Table 
Te 

To predict what decisions will be made by those 
who are undecided in August, we might advance 
one of the following four theories. (1) In a predom- 
inantly Republican community, those who are un- 
decided might tend to lean toward the Democrats, 
since those leaning toward the Republicans would 
not hesitate to say so. (2) There are presumably 
equal forces pulling the undecided toward the Re- 
publican camp and toward the Democratic camp. 
Hence, those who do decide for whom to vote will 
split their votes 50-50 for Republicans and Demo- 
crats. (3) The undecided will vote in the same 
proportion as the rest of the community—2.4 Re- 
publicans to 1 Democrat. (4) Each undecided will 
be exposed, on the average, to 2.4 Republican 
stimuli to 1 Democratic stimulus and will vote for 
Republicans in most cases—or, at any rate, having 
no better criterion, most of the undecided will vote 
with the majority. Conjectures of this kind can best 
be tested by panel studies. 

In Table 7, the transition probabilities on the 
main diagonal show that the undecided group is 
the most volatile: 16.2 per cent switched to Repub- 
lican and 8.8 per cent switched to Democrat, dis- 
tributing their votes to Republican and Democrat in 
a ratio of 1.8 to 1. This is less than the August 
ratio of Republicans to Democrats ( 440/184 = 
2.4); it is also less than the October ratio (397/ 
180 = 2.2). 

Examination of Table 4 revealed that of 138 in- 
dividual changes, all but 23 involved either switch- 
ing from undecided to a party preference or from 
a party preference to undecided. Analysis of turn- 
over using the Republican—Undecided—Democrat 
trichotomy, as in Table 7, tends to focus on the 
voters who are least involved politically. As was 
shown by an earlier panel. study of the 1940 presi- 
dential campaign (Lazarsfeld et al. 1944), the so- 
called independent voter who listened to both 
candidates and judiciously weighed the soundness 
of their respective programs before deciding on his 
vote was largely a mythical being. For the most 
part, those who oscillated from either Republican 
or Democrat to indecision, or who did not decide 
whom they would vote for until late in the cam- 
paign, were the politically uninterested who often 
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Table 8 — Change of vote intention from August to 
October (per cent) 


VOTE INTENTION August totals 

VOTE INTENTION IN OCTOBER 1948 Per Number 

IN AUGUST 1948 RE RILE DEF er estes 
Strong Republican 75 13 10 2 1 100 241 
Moderate Republican 32 47 16 4 2 100 199 
Undecided 6 10 75 7,49 100 136 
Moderate Democrat 0 4 16 59 2 100 122 
Strong Democrat 1 0 13°15 71 100 62 
October totals 254 143 183 102 78 760 


(number of cases) 
* Details may not add to totals because of rounding. 
Source: Adapted from Berelson et al. 1954, p. 23. 


were persuaded to join one or another political 
camp for the flimsiest of reasons. Most of the 
changers were politically uninvolved and unin- 
formed. Their shifts were more easily explained by 
social determinants than by political considera- 
tions. As a result of these findings, some political 
scientists charged the authors with supplanting 
traditional concepts of the political process with 
a type of sociological determinism. Accordingly, 
when the 1940 election study was replicated in 
1948, the researchers sought a deeper analysis of 
the decision-making Processes of politically in- 
volved voters. By combining the variable of party 
preference with that of strength of party prefer- 
ence, a five-point vote intention scale was con- 
structed. The turnover table with this as the cri- 
terion shows much more of the dynamics of the 
campaign than the trichotomous turnover tables 


intraparty change and, hence, more study of the 
political issues. One further feature of the table is 


ate Republicans and the moderate Democrats are 
the least constant. They tend either to become 


the campaign progresses, there is an increase in 
the number of undecided, together with an increase 


Qualifiers 

Turnover tables, by showing all of the gross 
change with respect to some criterion, serve the 
function of raising questions about the different 
types of changers. What are their characteristics, 
the processes that impel them to change, and the 
psychological and social effects of different pat- 
terns of change? In order to answer such ques- 
tions, researchers have stratified turnover tables by 
other variables, which are sometimes called quali- 
fiers. In panel studies, analysis by means of 
qualifiers constitutes the great bulk of the work 
performed. 

Qualifiers can be classified according to whether 
they are changing or constant. Constant qualifiers 
can be further classified by precedence; those 
which occur prior to the first interview are called 
antecedent qualifiers, and those which occur be- 
tween interviews 1 and 2 are called intervening 
qualifiers, 

The constant qualifiers that precede interview 1 
are the conventional demographic characteristics, 
such as age, education, nationality, sex, and social 
class. Of course, in some instances such charac- 
teristics might change; whether or not they are 
categorized as constant depends on the interval 
between interviews in the particular study. An in- 
dividual can change his marital status or primary 
group attachments or socioeconomic status when 
the period between interviews is sufficiently long. 
The function of such qualifiers is to elaborate the 
original turnover table by showing the conditions 
under which there is more or less change, just as 
in survey analysis the researcher starts with the 
association between two variables and introduces 
other variables to find conditions under which the 
original relationship is heightened or diminished. 

Tables 1-3 above showed changes in employment 
status over a two-year period. In order to learn 
more about the effects of different characteristics 
on employability, the same researchers examinis 
the employment turnover by the educational leve! 


Table 9 — Employment status of high school graduates 
(1963) in 1963 and 1965 (per cent) 


1963 totals 
FEBRUARY 1965 Number 
FEBRUARY 1963 Employed Unemployed Percent of cases 
Employed 98.4 1.6 100 saa 
Unemployed 88.6 11.4 100 132,01 
1965 totals 1,065,000 30,000 1,095,000 


ble A- 
Source: Adapted from Perrella & Waldman 1966, tol 


Table 10 — Employment status of high school dropouts 
(1963) in 1963 and 1965 (per cent) 


1963 totals 
FEBRUARY 1965 Nonik 
FEBRUARY 1963 Employed Unemployed Per cent of cases 
Employed 88.8 11.2 100 766,000 
Unemployed 68.3 31.7 100 265,000 
1965 totals 861,000 170,000 1,031,000 


Source: Adapted from Perrella & Waldman 1966, table A, 


of the youths. The results are shown in tables 9 
and 10. 

The trend is the same for both groups. Dropouts 
increased from 74 per cent employed to 84 per 
cent, and high school graduates increased from 
88 per cent employed to 97 per cent. The transition 
probabilities reveal more of the situation. Among 
graduates who were employed at the time of the 
first survey, only 1.6 per cent were jobless at 
the time of the resurvey, but of those who were 
unemployed at the time of the first survey, 11.4 per 
cent were still unemployed at the time of the re- 
survey. Among the dropouts, of those employed at 
the time of the first survey, 11.2 per cent were 
jobless at the time of the resurvey, whereas of 
those dropouts who were unemployed at the time 
of the first survey, fully 31.7 per cent were jobless 
at the time of the resurvey. These findings strongly 
Suggest that there exists a hard-core group of job- 
less youth. It should also be noted that 26 per cent 
of the dropouts underwent some change in employ- 
ment, compared with 12 per cent of the graduates. 

To discover why some dropouts were employed 
at both times while some graduates were unem- 
Ployed at both times, we might qualify the employ- 
ment turnover table by race. There are income 
Statistics which show that there is a considerable 
8ap between the earnings of whites who did not 
graduate from high school and nonwhite high 
School graduates. In the course of a lifetime the 
eee nonwhite high school graduate can expect 

earn approximately $50,000 less than the white 
fropout, It would not be surprising to find that 
ace accounts for some of the anomalous cases. 
ae might study other qualifiers. For example, 
cee the graduates, did those who pursued a 

Y academic program do better or worse than 
ti A Who pursued a business education or voca- 
o kagem Were married youths more likely 
eN their jobs? They might be more diligent or 
ce A to work to pay for furniture or to 
ei ulate savings for a prospective family. And 
iy ee be a tendency among employers to 

Unmarried workers first when work slackens. 
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Undoubtedly, employment and marital status are 
related. In fact, more dropouts than graduates 
were unmarried at the time of the first survey. If 
we were interested in the social effects of employ- 
ment, we could use marital status as the criterion 
and examine turnover of marital status as qualified 
by employment. Were those who were employed 
and unmarried in 1963 more likely to become mar- 
ried by 1965 than those who were unemployed? 
And were those unemployed and married in 1963 
more likely than the employed to become separated 
or divorced? We could go further and analyze the 
joint change of employment and marital status, 
which, with dichotomous variables, would involve 
analysis of a sixteenfold table. 

Impact panels. Frequently the panel technique 
is used for evaluating the effects of information 
campaigns. A public health agency, for example, 
might be interested in learning the effects of an 
information campaign on the recognition and re- 
porting of cancer symptoms; the federal govern- 
ment might want to determine the effectiveness 
of a consumer education program among the poor; 
a state commission might want to determine the 
effectiveness of a campaign aimed at reducing dis- 
crimination against minority groups. These ques- 
tions and others like them can best be answered 
by a type of panel study called an impact panel, 
in which the qualifiers intervene between the first 
and second series of interviews and refer to ex- 
posure to some stimulus or succession of stimuli. 
The impact study thus represents one equivalent 
of the controlled experiment when observations 
must be made in a natural social setting and the 
individuals being studied cannot be randomly as- 
signed to experimental or control groups. [See 
EXPERIMENTAL DESIGN, article on QUASI-EXPERI- 
MENTAL DESIGN; see also EVALUATION RESEARCH.] 

One general observation should be made con- 
cerning the place of theory in experiments as com- 
pared with panel studies. In experiments virtually 
all of the theoretical thinking must be done in ad- 
vance of the field work. In panel studies, however, 
theorizing takes place at both ends of the research 
process. A small set of interlocking variables must 
be decided upon at the outset; these variables de- 
termine the boundaries of the analysis. Even the 
most empirically oriented researcher must have 
some implicit theory of the relative fruitfulness of 
different variables. After the data have been coded 
and put on punch cards or tape, theory serves an 
organizing function, since without some notion 
about which tabulations to run, it would be possible 
to keep a computer busy for months before ex- 
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hausting the astronomical number of possible cross 
tabulations. 

Some problems connected with impact studies 
may be worth pointing out. The experiment seeks 
to measure the effects of a stimulus when the indi- 
vidual is exposed to it. The panel, on the other 
hand, seeks to explain what happens under non- 
laboratory conditions; it aims to assess effects of 
exposure under conditions where audience self- 
selection is operative. Under experimental condi- 
tions a captive audience is exposed to some pre- 
pared stimulus, such as a lecture, and often shows 
significant before—after differences in information 
or attitude. Usually, the shift is among the least 
educated and least interested individuals. When 
the same program is transferred from the labora- 
tory to amass medium, a study of effects frequently 
reveals little or no change: those who are least 
educated, least interested, and least in agreement 
with the communication rarely expose themselves 
to the stimulus in the first place. The panel, unlike 
the experiment, deals with a noncaptive audience; 
hence, not only the effects of exposure but also the 
nature of the audience self-selection are studied 
[see COMMUNICATION, Mass, article on EFFECTS; 
PERSUASION]. 

Determination of exposure to mass media or to 
personal influence is difficult, for it generally in- 
volves the use of retrospective questions. This, of 
course, violates the central conception of panel 
analysis, because we cannot be sure to what extent 
selective recall is operative. For example, in the 
study of political behavior during a national elec- 
tion campaign, it would be important to assess the 
role of local political parties. What, for instance, 
is the effect of receiving campaign literature? What 
influence does a personal visit by a party worker 
have on the turnout and vote intentions of poten- 
tial voters? But if we ask respondents whether they 
received literature or were contacted by a party 
worker, how can we be sure that we are not dealing 
with memory biases—that is, the voters were con- 
tacted randomly by workers but those interested in 
politics tended to remember the contact? If re- 
sponses to the queries on being visited or on re- 
ceiving political literature are analyzed, we do find 
that the more interested respondents report greater 
personal and impersonal contact. 

To some extent this result may reflect selective 
contact. Party workers may have approached would- 
be voters who seemed more receptive to political 
discussion, or they may have selected names of 
contacts from previous voting lists or made their 
contacts in various other nonrandom ways. 

One safeguard that can be built into panel stud- 


ies to check against the bias of selective recall is 
Stratification. When effects of party contact are 
analyzed, for instance, it is necessary to control by 
level of political interest. Another safeguard is to 
try to measure exposure independently by finding 
out from party workers whom they contacted. 

Inherent in panel study design is the possibility 
of examining the differential processes through 
which the stimulus is related to its effects, Experi- 
ments ordinarily do not permit such detailed study, 
for the simple reason that the number of cases is 
usually not sufficient, although there are occasional 
exceptions. In panel studies, more refined analyses 
of the relations between the stimulus and its effects 
are carried out by determining the characteristics 
of those respondents who are most influenced by 
the stimulus—whether they are men or women, 
old or young, educated or uneducated, and so on. 
The stimulus may have had a marked effect in the 
desired direction on certain groups of respondents 
and no effect or a “boomerang” effect on other 
groups. 

Particularly important in this respect is the con- 
sideration of effects in terms of the respondents’ 
initial position on the criterion. In the impact panel 
study, unlike the experimental study, the respond- 
ents who have been exposed to the stimulus are 
quite likely to differ from unexposed respondents 
on the criterion prior to the exposure. If they do, 
it is necessary, in order to impute any effect to the 
stimulus, to control the criterion prior to the cam- 
paign, for there is every reason to expect that the 
effects of a stimulus will not be the same for re- 
spondents who are initially favorable, indifferent, 
or unfavorable on the criterion. By controlling for 
initial position on the criterion, we do not alto- 
gether approximate experimental matching, al- 
though we do reduce its urgency. By holding initial 
Position constant, the subgroups to be compared 
are made more homogeneous on everything except 
exposure to the stimulus. 

For other material bearing on impact panels and 
controlled experiments, see the discussions by Hov- 
land (1959) and Cohen (1964, chapter 9). 


Mutual effects . 

The most interesting part of panel analysis 1 
the analysis of mutually interacting variables. 
Some examples of two interacting variables are 
the following. (1) In the investigation of psycho- 
somatic disorders we find that worry and a 
produce hypertension or ulcers, and these aa 
ties in turn raise the level of anxiety. (2) sua 
have been carried out relating chronic illness wit 
poverty. Chronic illness drains a family’s resources, 


while low socioeconomic status reduces access to 
appropriate medical care, proper diet, and a con- 
genial working environment. (3) In studies of fam- 
ily life, we often find that problem children and 
lack of parental love tend to be associated. But 
which causes which? Does lack of parental love 
induce aberrant behavior, or does aberrant behav- 
ior corrode parental affection? It seems likely that 
each reinforces the other. (4) The social psycholo- 
gist may find that friendship and similarity in val- 
ues feed into each other. People tend to select 
friends among those with similar values, and val- 
ues tend to become convergent in the course of 
friendship. (5) The sociologist may observe that 
individuals who reveal greater conformity to com- 
pany goals tend to have higher rates of promotion. 
Correspondingly, promotion tends to reinforce con- 
formity with company goals. In each example, one 
cannot say which variable is the dependent variable 
and which the independent; each influences the 
other, The problem in research is to analyze their 
relative influence. Let us look at an example of the 
analysis of relative influence. 

Suppose there is to be a referendum in some 
community on the issue of fluoridating the water, 
and we are studying how people influence each 
other on public health matters. We twice ask a 
panel of married couples: “As things stand now, 
how do you intend to vote on this issue?” The 
resultant sixteenfold table (Table 11) shows the 
hypothetical joint distribution of vote intention at 
two interviews of 671 hypothetical couples. Assum- 
ing these figures were empirical, what could we 
infer from them about the relative influence of the 
Spouses? 

Consider, on the one hand, those couples who at 


Table 11 — Opinion of spouses at si 
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time 1 were in agreement but at time 2 were in 
disagreement. Altogether there were b+c+n+0= 
1+4+8+2= 15 such changes. In 12 of them, 
it was the husband who “defected,” so to speak, 
while in only 3 did the wife “defect.” Thus, as far 
as preserving agreement between the spouses, the 
husband was about four times more influential 
than the wife. 

Consider, on the other hand, the couples who 
were initially in disagreement but as a consequence 
of one spouse changing opinion were subsequently 
in agreement. Altogether there were e +i+h+1= 
25+5+10+25=65 such cases. To what extent 
did each spouse influence the other to change in 
accord with his own opinion? Of the 65 cases, 50 
agreements were generated as a result of the wife 
changing her opinion to accord with her husband's; 
only 15 agreements resulted from the husband 
changing to harmonize with his wife’s opinion. 
Thus, as far as generating agreement, the husband 
was more than three times as influential as the 
wife. From the four agreement-to-disagreement 
cells of the sixteenfold table and the four disagree- 
ment-to-agreement cells, an index of relative influ- 
ence might be devised. During the last twenty 
years, Lazarsfeld has proposed several indices of 
relative influence, all based on the single-change 
cells of the sixteenfold table. If the eight single- 
change cells are labeled as in Table 11, one of 
Lazarsfeld’s indices is the following: 


jp old ee aN pe a SO 
nw \erbp oT ifc nth 
The terms in the first pair of parentheses represent 


H’s influence on W; the terms in the second pair 
of parentheses represent W's influence on H. 


successive interviews on fluoridation issue* 


TIME 2 
HUSBAND Favor Oppose 
WIFE Favor Oppose Favor Oppose Time 1 totals 
HUSBAND WIFE 

a b c d 
Favor 300 1 4 3 308 

Favor e i g h 
Oppose 25 40 1 10 76 

TIME 1 i j k l 
Favor 5 2 45 25 77 

fo} m n o P 
et Oppose 3 8 2 197 210 
Time 2 totals 333 51 52 235 671 


Hypothetical date. 
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In the hypothetical husband—wife sixteenfold 
table (Table 11), 


25 2 5 8 
m= (ear) (3 = 
(in favor of husbands). 

A second method of measuring relative influence 
is based on the relative magnitude of the cross- 
lagged correlations: fu,w, versus Tw,n, (Campbell 
1963; Pelz & Andrews 1964). If Yule’s Q is used 
as a measure of correlation, the cross-lagged cor- 
relations in the husband—wife example are Quw, = 
93 > Qu,n, = 83. Since Qu,w, > Qw,m, the infer- 
ence would be that the husband is more influential. 
Pelz and Andrews (1964) also consider as a meas- 
ure the relative magnitude of the partial cross- 
lagged correlations: Taw, -w, VETSUS Tw,y,.4,. Gener- 
ally, the Lazarsfeld index, the simple cross-lagged 
correlations, and the partial cross-lagged correla- 
tions will agree as far as indicating which of two 
variables is more influential, [For a discussion of 
Yule’s Q, see STATISTICS, DESCRIPTIVE, article on 
ASSOCIATION. ] 


In conclusion, it should be stated that much of 
the potential value of panel analysis for sociologi- 
cal inquiry remains untapped. The analysis of 
mutual effects has been directed almost entirely at 
two-variable interactions. The burgeoning literature 
on causal analysis in recent years attests to the 
readiness of the field to move beyond this. More- 
over, panel methodology has developed around 
change of individuals, principally opinion change. 
Problems in which the unit of analysis is not the 
isolated individual, but individuals interlocked socio- 
metrically or in a formal organization, have rarely 
been the subject of panel research, 


BERNARD LEVENSON 


[See also COUNTED DATA; DIFFUSION, article on INTER- 
PERSONAL INFLUENCE; ERRORS, article on NONSAM- 
PLING ERRORS; REASON ANALYSIS; STATISTICS, DE- 
SCRIPTIVE, article on ASSOCIATION; SURVEY ANALYSIS: 
and the biography of RıcE.] 
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PANTALEONI, MAFFEO 


Maffeo Pantaleoni (1857-1924), Italian econ- 
omist, was born at Frascati, near Rome, the son of 
an English mother and an Italian father. He attend- 
ed the classical Gymnasium at Potsdam, near Berlin, 
where his father, a physician, resided for a period, 
and then took his doctorate in law at the University 
of Rome in 1881, He soon became passionately in- 
terested in economic and financial studies, which 
at that time were somewhat neglected in Italy, 
despite a traditional interest in these subjects that 
Went back to the studies of monetary theory in the 
sixteenth and seventeenth centuries. Pantaleoni 
received his license to teach finance in 1884. He 
taught at several Italian universities, and even- 
tually, from 1901 on, at the University of Rome, 
Succeeding Angelo Messedaglia. 

Pantaleoni attracted many disciples but never 
formed a school; indeed, he was actively opposed 
to the formation of schools, declaring in 1897 at 
Geneva that he knew of only two schools—“the 
School of those who know economics and the school 
of those who do not” ([1897] 1925, p. 158). This 
Stand was a salutary one in the Italy of his day. 
ae the economics faculties tended to take fierce- 
Y Partisan positions. He went too far, however, 
When in later writings he maintained that so far 
td the economist is concerned, the history of (eco- 
ee doctrines should not include those doc- 
a es that have been shown to be erroneous, but 
sae those that can be shown to be true or which 
ke a least still in dispute. Such a conception of 
ane istory of economic doctrines was not, of 
oe generally accepted; but its very rejection, 
tal cially Schumpeter’s critique, did stimulate in 

Y the study of the history of economic thought. 
A though Pantaleoni refused to be identified with 

Particular school of economics, his first contribu- 


(1947) 1957 Say It With Figures. 4th ed.,- 
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tions to pure economics were definitely based on 
the hedonistic premise that men are moved to act 
only to obtain the utmost satisfaction of their wants 
with the least individual sacrifice. He made use of 
this premise because of its appropriateness to his 
theory, declaring that it matters little whether or 
not theoretical abstractions exactly reflect complex 
concrete phenomena. 

While Pantaleoni asserted that pure economics 
and applied economics differ in their relationship to 
empirical reality, in fact many of his own contri- 
butions lie between these two alleged types of 
economics; this is true of his studies of price fluc- 
tuation, of “political” (discriminatory) prices, of 
industrial cartels, of the functions of banking, and 
also of his attempt to substitute for a general static- 
equilibrium analysis a dynamic economic theory 
based on value theory. 

Pantaleoni’s theory of value is a theory of the 
exchange of given components of wealth. These 
components are the objects of exchange, he af- 
firmed, insofar as there are differences in the de- 
gree of ultimate utility of each component. In 
considering the phenomenon of utility, Pantaleoni 
obviously started from the analyses of the theorists 
of marginal utility—in particular, those of Gossen 
and Jevons—but their analyses did not satisfy him, 
so he tried to combine them with the classical anal- 
yses of Ricardo and Ferrara. He thus gave a rather 
complete explanation of the value of goods that is 
similar to Marshall’s, but he refused to accept 
Marshall’s case studies on demand and supply 
functions. 

Pantaleoni’s study of the problems involved in 
determining the value of goods was in part influ- 
enced by the close attention that he gave to statis- 
tical data and empirical information. For many 
years, the standard study on the subject was his 
Dell'ammontare probabile della ricchezza privata 
in Italia (“The Probable Amount of Private Wealth 
in Italy”; 1884); the data he compiled played an 
important part in his rejection of the macroeco- 
nomic theses of the marginal utility theorists. An- 
other of his researches that is still noteworthy is 
his inquiry into the vicissitudes of the 1906 depres- 
sion (1925a); it gave rise to a heated controversy 
on cyclical fluctuations, in the course of which 
Pantaleoni eventually took the position that eco- 
nomic cycles have essentially exogenous causes. 

Pantaleoni’s work as a theoretical economist had 
a bearing on his analysis of problems of finance. 
Despite his frequent criticisms of Ricardo, he was 
often a Ricardian in finance, as for example in his 
studies of transfer and of the incidence of taxes 
and the public debt. However, Pantaleoni’s greatest 
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originality in the area of public finance lies in his 
efforts to rethink the work of the marginalists, 
whether he was analyzing the distribution of public 
expenditures and the problems involved in taxa- 
tion burdens or studying the system of “political 
prices.” (According to Pantaleoni, a political price 
is a price which can vary as a function of the par- 
ticular characteristics of a buyer or a seller; the 
political price par excellence is the tax, because, 
according to the progressive principle, rich men 
can be taxed more heavily than poor, even though 
all obtain the same services from the government. 
A political price is not subject to the indifference 
law of prices [1911]. ) 

Pantaleoni took an active part in political life. 
In 1901 he entered the Chamber of Deputies as a 
member of the Radical party; he soon left the 
Chamber and the party. Later, especially during 
and after World War 1, he devoted himself seri- 
ously to political journalism. His articles, a large 
number of which are collected in four volumes pub- 
lished between 1917 and 1922, were written in a 
keen and effective style, marked by lucidity and 
polemical power (see 1917a; 1917b; 1918; 1922). 
During this period he participated directly in 
Gabriele D’Annunzio’s venture in Fiume. In 1923 
he entered the Senate. He often showed himself 
favorable to fascism, although he did not refrain 
from criticism of some of the views and decisions 
of the fascist regime. 

Luict FREY 


[For the historical context of Pantaleoni’s work, see the 
biographies of Gossen; JEVoNs; MARSHALL; Ricar- 
DO; for discussion of the subsequent development of 
his ideas, see Uritiry.] 
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PARANOID REACTIONS 


The term paranoia is one of the oldest in the 
history of psychiatry. It appears in the writings of 
Hippocrates, as well as of Plato and Aristotle, and 
was apparently used by the ancient Greeks and 
Romans to refer to a variety of mental disorders, 
more or less as the term insanity is used today. 
The term fell into disuse after about the second 
century and does not reappear prominently in the 
psychiatric literature until the latter part of the 
eighteenth century, when, again, it was used with 
a variety of meanings. The much more specific 
sense in which paranoja is used today was formu- 
lated explicitly in 1893 by Kraepelin, who reserved 
the term for a syndrome characterized by insidi- 
ously developing fixed, systematized delusions, U 
accompanied by hallucinations or by general per- 
sonality deterioration. 


Diagnosis 

The essential feature in the diagnosis of Pa 
noid reactions is the presence of delusions— Er- 
roneous judgments not subject to correction T 
experience,” Kraepelin called them (Waelder 19 A 
P. 167). In addition, symptoms which figure per 
nently in other disorders, such as hallucinations, 


deterioration, and inappropriate . affect, must be 
absent or else the diagnosis (e.g., paranoid schizo- 
phrenia, depression with paranoid tendencies) will 
reflect the more complex symptom picture. 

Depending mostly upon the chronicity, rigidity, 
and degree of systematization of the delusional 
structure, it is current practice to distinguish two 
diagnostic categories, paranoia and paranoid state, 
together constituting the superordinate category 
paranoid reactions. ( Paranoid schizophrenia is not 
included in this classification. ) 

Paranoia is a psychotic disorder that occurs 
relatively rarely. The delusional structure develops 
slowly, over a prolonged period, and reaches an 
extreme degree of organization and rigidity. Delu- 
sions are highly systematized and intricately elab- 
orated, Details of the delusional system are often 
worked out with considerable respect for logic, and 
the patient may provide a persuasive account of 
his plight if only the initial premise is conceded. 
The delusions resist attempts at refutation and 
alteration and may persist for years. Outside the 
arena of the delusional system the personality re- 
mains relatively intact. There are no hallucina- 
tions, there is no general personality disorganiza- 
tion or deterioration, and intellectual functioning 
is well preserved. The psychotic features seem, 
superficially at least, to be isolated from the rest 
of the personality, and many patients have been 
able to continue in business or professional ac- 
tivities with unimpaired efficiency. Paranoia is con- 
sidered to be relatively unresponsive to therapy. 

Paranoid state is a less severe psychosis than 
Paranoia. It occurs more commonly and has a 
more favorable prognosis. The delusional system 
lacks the close-knit logic and elaborate systemati- 
zation seen in paranoia. Delusions are briefer and 
vaguer; they are more in flux and less resistant to 
change. On the other hand, they are not so bizarre 
or fragmented as those found, for instance, in 
Schizophrenia, The delusions in paranoid states 
often develop more suddenly than they do in para- 
noia, perhaps in response to a specific situational 
Stress. They are usually of shorter duration and 
may even clear up spontaneously with the passage 
of an emotional crisis. Sometimes, though, the de- 
lusions become chronic. Again, as in paranoia, 
there is no general personality or intellectual de- 
terioration. 
oe article, a distinction will be made be- 
on and paranoid state where data are 
mie: e for the two conditions separately. For the 

part, the two forms of the disorder will be 


di 5 $ 
Scussed collectively as paranoid reactions or para- 
noid disorders. 


PARANOID REACTIONS 381 


Symptomatology. The content of paranoid de- 
lusions varies widely and it is not uncommon to 
find several types of delusions in a single patient. 
By far the most frequent are delusions of persecu- 
tion. The patient may believe himself to be pur- 
sued and plotted against by a person or persons 
who intend him a variety of harms. He may be 
concerned that his food has been poisoned or his 
mail tampered with. In carrying out their evil de- 
signs his adversaries may enlist the aid of numer- 
ous technological devices, which, incidentally, tend 
to keep pace with scientific progress: what used 
to be blamed on telescopes and magnets is cur- 
rently attributed to television and radar. Political 
ideologies, too, may be involved in the delusional 
system, again often reflecting contemporary polari- 
ties; in the United States, persecutors are fre- 
quently held to be communist agents. 

Delusions of grandeur are much less common 
than delusions of persecution and usually occur in 
the more severe forms of the illness. They are more 
typical of paranoia than of paranoid states and are 
characteristically stable, persistent, and well or- 
ganized, The patient may consider himself a per- 
son of great importance, noble birth, unique and 
superior endowment, or divine appointment. He 
may attempt (sometimes successfully) to obtain 
a patent on a most remarkable invention, or he 
may have discovered an elusive cure for a crippling 
disease, Religion, science, and politics provide the 
principal sources of subject matter for delusions 
of grandeur. 

Other types of delusions have been described in 
paranoid patients, most notably erotic delusions 
and delusions of jealousy. At one time, it was cus- 
tomary to classify paranoid reactions in terms of 
the type of delusion manifested. This practice has 
proved ineffective because of the great diversity of 
delusions and the many ways in which they can 
combine in a single patient. Delusions also occur 
as secondary symptoms in a wide variety of psy- 
chotic disorders, and a differential diagnosis is not 
always easy to achieve with confidence. They ap- 
pear frequently in schizophrenia and manic—de- 
pressive states and in such organic illnesses as 
general paresis, senile dementia, and alcoholic psy- 
chosis. 

In addition to the prominence of delusions, the 
symptom picture in paranoid reactions is rounded 
out by a number of other traits which may be 
viewed as reinforcing and maintaining the general 
delusional structure. The patient displays exag- 
gerated self-reference, perceiving himself as the 
object of what goes on around him. He assigns 
special meanings, consistent with his delusions, 
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to the remarks and gestures of others, or to items 
in newspapers and on the radio, and interprets 
them as directed toward him personally. Past in- 
cidents, too, may be reinterpreted in the light of 
present beliefs (“retrospective falsification”). There 
is, however, a considerable degree of selectivity in 
the patient’s response to his environment. He is 
reaction-sensitive (Cameron 1959a, p. 513), sin- 
gling out those events, no matter how trivial, 
which fit into his system of preconceived ideas, 
while ignoring all disconfirming evidence. He is 
especially sensitive to attitudes and tendencies of 
others which coincide with his own impulses and 
conflicts. His perception of his environment re- 
flects the fact that the paranoid person is almost 
invariably an intensely hostile person. He is char- 
acteristically suspicious and vigilant, living in a 
world teeming with potential dangers and implied 
threats. His delusions may lead the patient to out- 
bursts of verbal hostilities and accusations and, 
much less frequently, to direct physical attacks 
on others. 


Distribution 


Incidence. Paranoid reactions are relatively rare 
in the mental hospital population, currently con- 
stituting about 0.5 per cent of first admissions and 
less than 1.5 per cent of resident patients in mental 
hospitals in the United States. Most likely, these 
figures greatly underestimate the incidence of the 
illness in the population at large; many patients 
are able to control the more blatant, socially dis- 
ruptive manifestations of their delusions and are 
never hospitalized; others, especially where the 
disorder is less severe, are tolerated at home and 
at work as eccentrics, Also, many psychiatrists tend 
to avoid diagnoses of paranoia and paranoid state, 
preferring, where possible, another category, such 
as paranoid schizophrenia. 

Prevalence. Paranoid reactions are chiefly dis- 
orders of middle age. The majority of first admis- 
sions to mental hospitals occur in the 35-50-year 
age range, though, especially in cases of paranoia, 
the illness may have been a number of years in 
developing. A large percentage of patients hospi- 
talized with paranoid reactions have never married, 
and many are divorced or separated. Little is 
known about the sex distribution of paranoid States, 
and the opinion of most earlier writers that para- 
noia is more common in men is not supported by 
more recent statistical evidence indicating that the 
disorder occurs almost twice as frequently in 
women (Tyhurst 1957, p. 47). Again, there are no 
conclusive data regarding intelligence and educa- 
tional level in these disorders; it is generally be- 
lieved, however, that the paranoid patient is more 


intelligent and has had more formal education 
than the average hospitalized patient. 

Social isolation. Paranoid reactions appear to 
be more prevalent among groups who are to some 
extent isolated from the larger societal setting, 
Relatively higher rates of the disorder have been 
reported for displaced persons and refugees and for 
migratory and minority groups. Paranoid reactions 
are more common, too, in “lingually isolated per- 
sons,” whose language differences interfere with 
their effective communication (Tyhurst 1957, p. 61). 


Psychodynamics 

Freud’s explanation. Alongside Kraepelin’s con- 
tributions of classification and description must be 
placed Freud’s contributions of interpretation and 
reconstruction. In 1911 Freud published the first 
and still most influential attempt to work out in 
detail the interplay of psychological forces leading 
to the development of paranoid symptomatology. 
Freud based his thinking mostly on the autobio- 
graphical account of a psychosis with prominent 
delusions written by Daniel Paul Schreber, a Ger- 
man jurist. Freud concluded: “what lies at the 
core of the conflict in cases of paranoia among 
males is a homosexual wishful phantasy of loving 
aman...” (Freud [1911] 1958, p. 62). According 
to this view, the paranoid individual has been 
fixated at the psychosexual stage of primary nar 
cissism but has been able during his prepsychotic 
lifetime to keep repressed the homoerotic impulses 
characteristic of this stage. When, perhaps because 
of a series of frustrations or because of overstimu- 
lation, the repressive process fails and permits 4 
reactivation of homosexual impulses (“return of 
the repressed”), a regression occurs to the stage of 
primary narcissism. Seeking desperately to repait 
the breach in his defensive structure, the patient 
draws excessively upon the defense mechanisms 
characteristic of this stage, most notably denial 
and projection. Within this framework, Freud dem- 
onstrated how the various forms of delusions may 
be understood as contradictions of the proposition 
“I (a man) love him (a man).” In delusions of 
Persecution, for example, this proposition is con” 
verted, by denial and reaction formation, into 
do not love him—I hate him.” By the mechanism 
of projection, the second proposition is eke 
transformed into “He hates (persecutes) me, Whi 
will justify me in hating him.” “Observation leaves 
room for no doubt,” Freud wrote, “that the pers¢ 
cutor is some one who was once loved” ([191 E 
1958, p. 63). In a later paper, Freud (1915) gie 
onstrated that homosexuality played a significa 
tole in the development of paranoia in a fem ts 
patient, too, In essence, then, the paranoid patien 


delusions represent his way of coming to terms 
with unacceptable homosexual feelings [see DE- 
FENSE MECHANISMS]. 

In addition, Freud emphasized that delusions 
also serve a restitutive function. They represent an 
attempt at reconstruction, at re-establishing, albeit 
in an unrealistic manner, the object relations re- 
linquished in the initial withdrawal and regression. 

Evaluation of Freud’s explanation. Experimen- 
tal and clinical attempts to evaluate Freud’s work 
have, for the most part, focused on comparing 
paranoid and nonparanoid patients on indexes of 
homosexuality. In a typical experimental approach 
to the problem, Zamansky (1958) gave male men- 
tal hospital patients an opportunity to examine 
paired pictures of men and women; he found that, 
on the average, paranoiacs and paranoid schizo- 
phrenics looked longer at pictures of men, while 
nonparanoid schizophrenics (and, in another study, 
a normal control group) looked longer at pictures 
of women. Not all the delusional patients pre- 
ferred the male pictures, however. Clinical studies 
(e.g., Klein & Horwitz 1949), too, have typically 
uncovered evidence of homosexuality in some, but 
by no means all, paranoid patients. Other workers 
have reread Schreber’s autobiographical material 
and have disagreed with Freud about the source of 
Schreber’s difficulties. For example, MacAlpine and 
Hunter (who published the first English transla- 
tion of Schreber’s memoirs) attribute Schreber’s 
psychosis to a reactivation, not of unconscious ho- 
mosexual tendencies, but of archaic pregenital fan- 
tasies of procreation and self-impregnation (Schre- 
ber [1903] 1955, pp. 369-411). They suggest, 
furthermore, that Schreber’s regression was to an 
earlier stage, “absolute ambisexuality,” than Freud 
postulated. 

Two issues must be considered in examining 
Freud's explanation of paranoid disorders. There 
is first the question of whether a homosexual con- 
flict is invariably a part of the patient's psycho- 
dynamic make-up. The failure to uncover a homo- 
Sexual concern in some paranoid patients suggests 
that conflicts other than homosexual ones may 
also serve as a focus for the illness. There is fur- 
ther the question of whether homosexuality plays 
Precisely the role specified by Freud in the develop- 
ment of the psychosis—whether it is the primary 
etiological agent or merely a link (perhaps itself 
serving a defensive function) in a chain of psycho- 
dynamic factors leading ultimately to the delu- 
sional structure. Interpretations favoring this latter 
view are reviewed in the following section. 
wie enn the importance assigned to homosexual 
aig in paranoid disorders, the broad outlines 

the psychodynamic picture sketched by Freud 
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have never been seriously challenged. The illumi- 
nation of such concepts as regression, restitution, 
projection, repression, and return of the repressed 
remains a hallmark in the history of psychiatry. 

Some later psychoanalytic contributions. Later 
theorists have differed most sharply with Freud in 
their emphasis on the primacy of hostility in the 
development of delusions. The frequent presence 
of a homoerotic conflict is not denied, but it is 
assigned a position of much less central impor- 
tance. For some theorists, homosexual stirrings 
serve merely to emphasize and reinforce gener- 
alized feelings of inferiority and inadequacy. For 
others, the relationship posited by Freud between 
homosexuality and hostility is turned around, and 
homosexuality is seen as a defensive maneuver 
against more basic destructive wishes. Knight 
(1940), a spokesman for this latter notion, be- 
lieves that the homosexual wish of the paranoid 
patient is in actuality an intense and desperate 
attempt to neutralize and erotize a tremendous un- 
conscious hate, The powerful need to keep the 
homosexual urges from awareness is based not on 
cultural pressures, which prohibit their expression, 
but on the fact that the least approach to the love 
object arouses intense anxieties that both the ob- 
ject and the patient will be destroyed by the hostil- 
ity in the patient and the consequent hostility 
aroused in the object. 

The British psychoanalyst Melanie Klein (1948) 
also singles out aggression as the central problem 
in paranoid disorders. In addition, she stipulates 
that the related fixation is at a much earlier age 
than Freud believed: at the phase of “maximal 
sadism” occurring during the first year of life. The 
infant in this phase is said to be dominated by 
intense primitive feelings of hostility, and it is the 
projection of these feelings that leads the infant 
(and the adult psychotic who regresses to this 
phase) to attribute hostile designs to the people 
around him. Again, homosexuality, where it oc- 
curs, is considered to serve a secondary, defensive 
function against the hostile and destructive fan- 
tasies. 

Prepsychotic personality and breakdown. Para- 
noid symptomatology may be most profitably re- 
garded as an exaggeration and intensification of 
personality patterns characteristic of the individual 
during much of his life. As children and throughout 
their prepsychotic adult years, paranoid patients 
are mostly described as having been suspicious, 
secretive, hostile, resentful, and seclusive. A crav- 
ing for praise and recognition, coupled with a 
hypersensitivity to criticism, signifies the presence 
of profound feelings of inadequacy and insignifi- 
cance (Schwartz 1963), though not necessarily at 
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a conscious level. Undoubtedly for some, a vulner- 
able self-esteem is further threatened by the border- 
line awareness of homosexual interests or by the 
guilt and anxiety that accompany destructive 
fantasies. The refusal to admit their own short- 
comings and the tendency to blame others for their 
own difficulties are obvious antecedents of the pa- 
tients’ later more unrealistic use of the mechanisms 
of denial and projection, suggesting that repres- 
sion alone had never been a wholly successful de- 
fense. Other traits which frequently characterize 
the lifelong behavior of the paranoid individual— 
aloofness, airs of superiority and self-importance, 
arrogance, pride, disdain—may be regarded as re- 
lated maneuvers to maintain self-esteem. For many 
persons such an array of defensive traits provides 
a successful adaptation to potentially threatening 
impulses and fantasies—successful at least in that 
the individual is not pressed to the development of 
a full-blown delusional system. 

For the paranoid patient, however, something 
goes wrong. He encounters an experience or, more 
likely, a series of experiences which overburden his 
defensive structure, and anxiety-laden impulses 
threaten to break through to consciousness. Per- 
haps doubts about masculinity are fanned by an 
unsuccessful love affair or, as Freud suggested in 
Schreber’s case, by the failure to have a son; per- 
haps excessive hostility is provoked by rivalry and 
competition in business or by a social rebuff. In 
any event the patient presses into use the well- 
practiced, overlearned, often-reinforced modes of 
reaction at which he has become expert; only now 
they are applied excessively and with greater 
intensity and rigidity. He becomes more suspicious 
and vigilant, desperately probing the motives and 
acts of others for an explanation of the unfamiliar, 
disturbing impulses that are at the outposts of 
awareness. Not uncommonly, his hostility and sus- 
piciousness evoke rebuffs from family and business 
associates, and a vicious circle between the pa- 
tient’s perceptions and the behavior of others is 
set into motion. By an excessive application of 
such mechanisms as denial, reaction formation, 
rationalization, and projection, ego-alien impulses 
are disowned and attributed to sources outside the 
self. Ultimately, it becomes necessary to construct 
a new version of reality, and the patient’s misinter- 
pretations are woven together into a full-blown 
delusional system. Self-esteem is once again safe, 
but the price is insanity. 

Cameron’s notion (1959b) of the paranoid 
pseudo community is a useful one in this context, 
The development of normal, socially organized be- 
havior, Cameron points out, depends heavily on 


the maintenance of effective channels of commu- 
nication with others. Through a constant inter- 
change of ideas, the normal person acquires con- 
siderable skill in role taking, in shifting his social 
perspective so that he can see things from other 
points of view than his own. As a result, he comes 
to perceive himself and other people relatively 
objectively, since he can check and modify errone- 
ous ideas before they develop to any considerable 
extent, In the paranoid person, Cameron believes, 
these skills have been inadequately developed, in 
part because of a lifelong tendency to seclusiveness, 
A realistic appraisal of his beliefs about the atti- 
tudes and motives of others is limited by a chronic 
deficiency in role-taking skills and therefore in 
social-reality testing. In an emotional crisis this 
lack of corrective experience leaves the patient vul- 
nerable to his inner promptings and fantasies, and 
he is pushed toward an increasingly delusional in- 


the delusions are chiefly persecutory, the process 
may culminate in the conviction that the patient 
“is the focus of a community of persons who are 
united in a conspiracy of some kind against him” 
(Cameron 1959b, p. 53), a paranoid pseudo com- 
munity. This imaginary organization is made up of 
actual and imagined persons and usually resembles 
in form such undisputedly menacing groups as spy 
rings and the secret police. The paranoid pseudo 
community provides the patient with an explana- 
tion of his strange, altered world, and it represents 
as well a restitutive attempt to re-establish a rela- 
tionship with social reality. 


Prognosis and treatment 


The prognosis for paranoia is generally unfavor- 
able, especially if treatment is not undertaken until 
the illness has progressed beyond its early stages: 
Complete recovery is not the rule, though a num- 
ber of patients achieve a “social recovery” and do 
not require hospitalization, The outlook is much 
brighter for the more transitory paranoid states: 
spontaneous recovery is not uncommon, and the 
majority of cases respond partially or completely 
to treatment. 

The differential prognosis for paranoia and 
paranoid states may be directly related to the 
different history of the delusional structure in the 
two disorders. In paranoia, a complex, highly oF 
ganized delusional system has developed aon 
over a prolonged period of time, establishing itee 
firmly, after countless reinforcements, as an effec- 
tive defensive network against a variety of E 
ful, anxiety-producing situations. Little won E 
that it is so stubbornly resistant to change! In 


contrast, the delusions in paranoid states are much 
more recent and lack the organization and struc- 
ture to prove wholly effective in binding anxiety 
and in preserving self-esteem. One might say that 
these patients have less to lose in discarding their 
delusions. 

The psychotherapy of paranoid reactions, 
whether on an individual basis or in groups, re- 
quires that effective channels of communication 
be established between the patient and the thera- 
pist. It is first necessary to gain the patient's trust 
and confidence, a formidable task when confront- 
ing someone who believes himself to be surrounded 
by spies and assassins who are plotting his down- 
fall. The problem of establishing a productive rela- 
tionship is accentuated by the patient’s extreme 
sensitivity to even the slightest tendencies toward 
hostility in the therapist. Probably it is best if the 
therapist is a sincere, sympathetic, and accepting 
person who indicates that he is interested in under- 
standing the patient's ideas without necessarily 
subscribing to them. Reasoning and logical refuta- 
tion are generally ineffective and tend only to 
increase the patient's vigilance. Ultimately, the 
patient must experience a sufficient decrease in 
anxiety to be able to relax his defenses and re- 
examine his views of the world. 

Paranoid reactions are also treated with one or 
more of the somatic therapies. The recovery proc- 
ess, especially in paranoid states, may sometimes 
be accelerated by the use of insulin-shock and 
electroshock therapy. More recently, tranquilizing 
drugs have been widely used in treating paranoid 
reactions, Despite conflicting reports, a number of 
studies have indicated that such drugs as chlorpro- 
mazine and reserpine may be helpful in reducing 
or clearing up delusions, perhaps because they tend 
to lower the general level of anxiety. Again, the 
chances for recovery are much better if the patient 
is treated early in the illness, and the improvement 
is more likely to be maintained if somatic treat- 
ments are supplemented by psychotherapy [see 
MENTAL DISORDERS, TREATMENT OF, article on 
SOMATIC TREATMENT]. 

Haroxp S. ZAMANSKY 


[Other relevant material may be found in AGGRESSION: 
MENTAL DISORDERS; MENTAL DISORDERS, TREAT- 
MENT oF; NEUROSIS; PSYCHOANALYSIS: ScHIZzO- 
PHRENIA; and in the biographies of FREUD; KLEIN; 
KRAEPELIN.] 
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PARAPSYCHOLOGY 


Parapsychology, or psychical research, is the 
area of science which studies such topics as telep- 
athy and clairvoyance, precognition, psychokinesis, 
and survival after death. Its data seem to show that 
in some circumstances it is possible for some per- 
sons to make direct responses to events not present 
to their senses as well as to future events, and prob- 
ably also to influence directly the movement of ob- 
jects by “willing.” 

These abilities have no known physical corre- 
lates. The research from which the abilities are 
inferred must therefore be scrutinized with great 
care, If we accept the data, we will come to the 
radical conclusion that there are forces or relations 
which physics has not yet explored. We will also 
enlarge our concept of human potentiality (and 
presumably the potentiality of other animals) to 
include very broad contacts with other individuals 
and with the outer world, because information that 
apparently is not registering upon the normal sen- 
sory mechanisms is nevertheless influencing be- 
havior. 

Such concepts can be exciting. However, we all 
know that a high level of excitation does not always 
lead to adaptive behavior. Psychic research has 
probably excited more outraged protest than any 
scientific contention since Darwin’s, but it has also 
activated such constructive work as critical exami- 
nation of its data and a search for further informa- 
tion, Perhaps each person’s choice between flat 
a priori denial (or acceptance) and critical interest 
reflects the balance between his need for simple, 
immediate certainty and his tolerance for ambi- 
guity. 

Definitions 


Parapsychology has developed a distinctive vo- 
cabulary, in part because some of its research prob- 


lems are unique. For an example of one semantic 
difficulty, let us compare the paradigm of para- 
psychological research to the paradigm of research 
in verbal learning. 

To study verbal learning, the experimenter ordi- 
narily designates certain test material as the “stim. 
ulus,” presents this material to the subject, records 
the subject’s responses, and scores these responses 
by comparing them with the original material. In 
parapsychology the experimenter similarly desig- 
nates certain test material, does not present this 
material to the subject, records the subject's re- 
sponses or “calls,” and scores them by comparing 
them with the original material. If the scores are 
high, is it semantically correct to call the nonpre- 
sented, concealed material a “stimulus” for the sub- 
ject? The question is debatable. To bypass this 
issue, the neutral word “target” is substituted for 
“stimulus”; and when the subject's response corre- 
sponds to the target, this response is called a “hit.” 

Some term is needed to designate an extrachance 
number of hits when subjects call concealed, ran- 
dom targets. ESP, standing for extrasensory per- 
ception, is the term most commonly used, and it 
designates both the putative effect (as, “We must 
ask whether this is evidence for ESP”) and the pu- 
tative process which underlies the subject's behav- 
ior (as, “She was showing strong ESP ability”). 
Some find the term ESP objectionable because it 
prejudges that the psychological process is percep- 
tual. “Psi” or “psi gamma” is sometimes used in- 
stead of ESP. 

The three conventional subclasses of ESP are 
defined by their experimental arrangements. When 
the target is someone’s thoughts, the ESP is called 
telepathy. When the target is an object or (nonsub- 
jective) event, the ESP is called clairvoyance. When 
the target does not exist until after the subject calls 
it, the ESP is called precognition. There are also 
various combinations, subclasses, and ambiguous 
experimental arrangements. For example, an ob- 
ject which is known to someone could be either a 
clairvoyant or a telepathic target. Success at calling 
such a target is often designated GESP for (‘general 
ESP). An object known to no one at the time of the 
subject's call, but later known to someone, could 
be either a target for clairvoyance or a target for 
precognitive telepathy. 

A markedly different experimental arrange 
requires the subject to hope or to “will” that a 
moving object (like a die or a coin) will come i 
rest in a certain position. The hypothesis examine 
by this arrangement is that the subject can influ- 
ence the movement of the object without using any 
known physical means. The term most commonly 


ment 


used to identify this hypothesized event or ability 
is PK (“psychokinesis” ); “psi kappa” is occasionally 
used as a more neutral designation. 


History 

Two events are landmarks in the history of para- 
psychology; each marks the onset of a period of 
increased rigor in investigations. The first occurred 
in 1882, when a group composed mainly of schol- 
ars from Cambridge University founded in London 
the Society for Psychical Research (SPR). Its pur- 
pose, printed on the inside cover of its Journal, was 
“to examine without prejudice or prepossession and 
in a scientific spirit those faculties of man, real 
or supposed, which appear to be inexplicable on 
any generally recognized hypothesis.” 

The SPR followed three major lines of investiga- 
tion: collection and critical analysis of “sponta- 
neous cases” of telepathy, hauntings, and so on; 
studies of “sensitives” or mediums to find if they 
gave veridical information, and if so, what their 
modus operandi was; and experimentation. Critical 
standards were high, and men distinguished in 
other areas have been active in the society. Its past 
presidents include Henry Sidgwick, the earl of 
Balfour, William James, Sir William Crookes, Sir 
Oliver Lodge, Charles Richet, Andrew Lang, Henri 
Bergson, F. C. S. Schiller, Gilbert Murray, Lord 
Rayleigh, Hans Driesch, C. D. Broad, and Gardner 
Murphy. It publishes a Journal and Proceedings, 
and occasional books and pamphlets. Similar groups 
have been formed in such countries as Holland, 
Sweden, France, and Greece. In the United States, 
for example, the American Society for Psychical 
Research was founded in 1885, with Simon New- 
comb as its first president. 

What was to prove a second major event 0c- 
curred in 1927, when J. B. Rhine joined William 
McDougall at Duke University to work in this field. 
Rhine’s basic procedure required subjects to guess 
at the order of concealed, randomized cards; he 
then evaluated the hits statistically to find if they 
were extrachance. Rhine performed experiments, 
trained a cadre of research workers, published 
books and articles, and stimulated both scientific 
and popular interest. The parapsychology labora- 
tory which he headed (as of August 1965, the 
Foundation for Research on the Nature of Man) 
on the Journal of Parapsychology, where most of 
Bi laboratory's research was published, changed 
e; emphasis of research so that variants of the 
te technique became the method used by 
ae investigators, Rhine introduced a new vocab- 
erat including the terms parapsychology, extra- 

nsory perception (ESP), psychokinesis (PK), and 
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psi-missing (extrachance low scores). He also in- 
troduced the ESP cards: a deck of 25 cards marked 
with a circle, a cross, a square, a star, or a set of 
three parallel wavy lines. (For a summary of 
Rhine’s work, see Rhine & Pratt 1957.) 


Evidence for ESP 


Unsystematic evidence. Evidence on ESP breaks 
rather sharply into two categories. One is material 
that is unsystematically collected, as in a spontane- 
ous case where a mother hears her son's voice call- 
ing her at the time her son is in an accident, miles 
away. Investigation shows many such reports to 
be memory distortions or exaggerations but others 
to be veridical in that the event (e.g., the accident) 
happened as described and the “percipient” de- 
scribed it before information was available to him 
normally. Some descriptions include many unusual 
and accurate details. Any case may be dismissed 
as coincidence. An accumulation of many authen- 
ticated cases may or may not seem intuitively to 
be stronger than the single case (Tyrrell 1942). 

Reports of apparent telepathy during psycho- 
analysis are frequently published and represent a 
subclass of the qualitative material. Analysts usu- 
ally follow Freud (1922) in ascribing the telepathy 
to strong emotional needs or bonds, such as uncon- 
scious identification of percipient and “agent” and 
use of a repressed, archaic mode of communica- 
tion, or follow Jung in considering ESP a normal 
function. 

Laboratory evidence. The other category of evi- 
dence comes from controlled laboratory work in 
which evaluations are made through statistical 
analyses instead of intuition. Routine controls in- 
clude randomized targets; the absence of sensory 
cues to guide the subject; a permanent record of 
the targets that is made without knowledge of the 
subject’s calls; and a permanent record of the calls 
that is made without knowledge of the targets. Only 
two examples will be cited in detail: one of research 
with a pretested individual, the other of research 
with groups of unselected fifth-grade and sixth- 
grade children. 

A selected, pretested case. S. G. Soal, a math- 
ematician at the University of London, tested many 
preliminary subjects in card calling. None scored 
above the chance level, but at a colleague’s insistent 
suggestion Soal re-examined the records to see if 
the calls corresponded to the cards preceding or 
following the designated target. Two subjects’ calls 
showed this pattern. Soal called back those two for 
a formal experimental series, and both made ex- 
traordinarily high scores in this later work (Soal & 
Bateman 1954). We shall describe one: a house- 
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wife, Mrs. Stewart. In her 130 formal sessions, over 
a period of four years, she called the order of 
37,100 randomized cards. There was one chance 
in five of each call’s being right by accident, Mean 
chance expectation was 7,420 hits; her score was 
9,410. The probability of obtaining this score or 
one more extreme by chance is approximately one 
in 10%. 

For this or any experiment it is proper to inquire 
how these statistically impressive results compare 
with similar published experiments with null re- 
sults and other null data which presumably lie 
unpublished in file cases. Are the data still signifi- 
cant when corrected for selection? G. E. Hutchin- 
son, a Yale biologist, has worked out an answer. 
He computed that if Soal’s experiment had been 
performed once a minute for the number of min- 
utes in the history of the earth (estimated at about 
a thousand million years, or 1.57 x 10™ minutes ) 
and had yielded chance results every other time, 
the results of Soal’s work would still be highly sig- 
nificant when pooled with the others. 

Unequivocally, then, Mrs. Stewart's results were 
extrachance, But can they be explained away as 
due to sensory cues or other poor experimental 
conditions? We must next examine the conduct of 
the experiment. 

The targets which Mrs. Stewart tried to call 
were determined from random numbers, which by 
an elaborate but clearly stated procedure were 
translated into randomly ordered letters. Targets 
were exposed in one room; Mrs. Stewart wrote her 
calls in another. The room door was usually ajar, 
but invariably there was a screen between doorway 
and targets; an observer in Mrs. Stewarts room 
watched her, and another person in the other room 
made sure that agent and target remained behind 
the screen. Details of procedure varied somewhat 
over the years. Perhaps the most striking subseries 
was the 108th through 113th sessions, when Mrs. 
Stewart was vacationing near Antwerp. Targets 
and experimenter were in London. At a predeter- 
mined time, targets were exposed in London, and 
Mrs. Stewart made her calls (200 per session) in Bel- 
gium. Instead of the 240 hits that represent mean 
chance expectation, she made 345 hits, an average 
of 28.75 per cent correct. The probability of at 
least so many hits being a chance event is one in 
10". Since the average of the other 124 sessions 
was lower (22.92 per cent correct), the contrast 
implies that there had been no sensory leakage 
from room to room during the other sessions. 
Clearly, in this subseries no sensory cues were avail- 
able; its extrachance results were obtained from 
what seems an impeccably controlled procedure. 


Unselected subjects. We next turn to an ex- 
ample of work with subjects who had not been 
selected on the basis of previous testing. J. G. Van 
Busschbach (1956), an inspector of schools in 
Amsterdam, conducted a series of experiments on 
children in the fifth and sixth grades, Targets were 
randomized lists of five symbols or five colors or 
five words. Each class’s teacher sat at the back of 
the room behind a screen and looked at the targets; 
an observer watched the class and set the timing; 
the children wrote their guesses. Out of 20,190 
calls there were 174 more hits than mean chance 
expectation. The probability (p) of obtaining this 
score or one more extreme by chance alone is 5 in 
1,000. Van Busschbach subsequently replicated the 
procedure in Amsterdam and Utrecht, with sub- 
stantially the same results. He then journeyed to 
the laboratory at Duke University and conducted 
a third series in North Carolina, where he again 
found similar scores for the fifth-grade and sixth- 
grade children. The grand total for these grades 
was 83,320 calls with a deviation of 579, signifi- 
cantly above mean chance expectation (p < .001). 

Suggestive sidelights are cast on the data, indi- 
cating attitudinal factors that were crucial to the 
above-chance scores, by the consistently lower 
averages of seventh-grade and eighth-grade stu- 
dents (whose feelings toward their teachers are 
typically more detached than those of fifth graders 
and sixth graders); the lower scores when teachers 
were not adequately informed about the experiment 
and had less interest in it; and the higher scores 
on colors than on words. 

A great many other experiments, in which tar- 
gets are concealed from teachers or experimenters 
as well as from subjects, have yielded extrachance 
results under well-controlled conditions. The data 
seem to show that although ESP occurs, it usually 
Operates at a low level; information obtained from 
ESP is scanty and undependable. Van Busschbach’s 
results are typical: 5 out of 25 calls would on aver- 
age be correct by chance; and his obtained average 
was only 5.17. To examine an ability which is 80 
inefficient, the experimenter must use extreme oa 
in maintaining tightly controlled conditions an 
must also accumulate a large body of data. 


Variables associated with ESP 

Physical variables. There is no good evidence a 
any relation between physical variables and r 
The following negative findings are of speci 
interest. SP 

Radiation. Vasiliev (1962), in a report of E i 
research in several Soviet laboratories, wrote nr 
his subjects responded to telepathic suggestion: 


equally well under normal conditions and in a lead 
chamber. The chamber shielded them from electro- 
magnetic waves in the range between X rays and 
waves of one kilometer, and attenuated both longer 
and shorter waves. [See RADIATION.] 

Distance. Mrs. Stewart's results, no worse at 
two hundred miles than at a few feet from the 
target, are typical of what has been found in other 
comparisons, with distances varied up to about 
four thousand miles. 

Target size. A corollary of the absence of dis- 
tance effects (with terrestrial orders of magnitude) 
is that size of target should be unimportant; this 
was found by Pratt and Woodruff (1939), who 
tested the same subjects on targets which were 
large, medium, and very small. Scores were similar 
for all target sizes, and for all sizes the subjects 
made significantly higher scores on the first few 
calls than on later calls. 

Time. No clear relation to time has been found: 
when random precognition targets were selected 
either a short or a relatively long time after sub- 
jects made their calls, extrachance scores were sim- 
ilar (Anderson 1959a). 

Summary. It is of course impossible to prove 
a negative, but what evidence we have indicates 
that ESP success is independent of space-time and 
radiation variables. 

Physiological variables. Several types of phys- 
iological variables have been investigated. Some of 
the interesting but fragmentary findings deserve 
discussion. 

Age, sex, race. High ESP scores have been 
found in children, adolescents, and adults; in both 
sexes; and in different races. There have been no 
systematic comparisons, and comparative research 
Would be difficult to perform, since it would be 
necessary to control attitudes. The incidence of 
ESP ability in the population is unknown, but in- 
ferences may be drawn from the many experiments 
with unselected subjects that have given extra- 
chance results. Probably the best working hypoth- 
esis is that the incidence of ESP ability is like that 
of visual imagery. Some seems present in almost 
every normal person; it is markedly stronger in 
Some persons than others; and there are a few 
(with eidetic imagery, or the special “sensitives” ) 
Whose ability is so strong that it seems qualitatively 
different from the usual one. 

r Body changes. Research on body changes as- 
ieee with differences in ESP scores is scanty. 
ek is some evidence (Huby & Wilson 1961) that 
oat drugs like sodium amytal lower the 
aoe rate, and also that scores change when 

are differences in general body relaxation 
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(measured by galvanic skin response). A single 
experiment testing subjects under abnormal condi- 
tions of oxygen and temperature showed signifi- 
cantly lower ESP scores than with the same 
subjects under normal conditions (Woodruff 1943). 
The abnormal conditions were experienced as dis- 
agreeable, and the difference in ESP scores may 
reflect mood rather than physiological change. A 
single experiment demonstrated that patients hos- 
pitalized for cerebral concussion had significantly 
higher ESP scores than did accident patients with- 
out concussion (see Schmeidler & McConnell 
1958); but because interviews and a personality 
test showed the concussion patients with high 
scores to be more passively, uncritically acceptant 
of incoming impressions, it was tentatively con- 
cluded that the high scores might result from the 
mood induced by the concussion and hospitaliza- 
tion rather than from the injury itself. 

Body changes as ESP responses. In five care- 
fully conducted experiments with blind scoring, 
a finger plethysmograph was attached to the sub- 
ject to compare changes in his vasoconstriction 
when an agent in another room quietly examined 
neutral material (such as an unfamiliar name) or 
material which, if the subject had contemplated it, 
would probably have markedly changed his vaso- 
constriction level (such as the name of a close 
relative or of a political enemy). In four of the 
experiments there was significant correspondence 
between vasoconstriction and exposure to the crit- 
ical targets; in one there was not (see Dean 1966). 
One of these experiments also reports a significant 
correspondence between the target and certain 
electroencephalogram (EEG) changes; but there 
was none between the target and galvanic skin 
changes, and only a chance number of hits when 
the subject tried to call the target (Tart 1963). 

Several earlier, less rigorous experiments have 
also reported ESP changes using plethysmographic 
or EEG techniques. Additional unreplicated experi- 
ments describe significant ESP (or PK) changes in 
the movement of paramecia, in plant growth, and 
in the healing of artificially produced wounds in 
caged mice handled by a man who claimed to be a 
“healer” (as compared with the wounds of mice in 
cages handled by nonhealers ). These latter claims 
are so novel and striking that replications with very 
carefully controlled procedures are required before 
they can be absorbed into the body of parapsycho- 
logical findings. 

Summary. In short, several experiments report 
biological changes associated with ESP (or PK). 
The findings are provocative and seem to demand 
replication and further investigation. 
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Psychological variables. Only a few examples 
can be given here of the many well-conducted ex- 
periments showing significant relations between 
ESP scores and psychological variables. Findings 
Seem to converge on the generalization that extra- 
chance high ESP scores are likely to result from a 
mood of comfortable, relaxed, friendly interest, 
while extrachance low ESP scores (psi-missing ) 
are likely to result from a mood of negativism, dis- 
trust, resentment, or reserve. 

Three difficulties beset research on this topic. One 
is that moods are almost impossible to measure 
directly; they must be inferred. A second is that 
the operative “mood” for ESP seems to be a conse- 
quence of the attitude of the experimenter as well 
as of the subject. Traditionally, experimenters do 
not report their own feelings; thus, in much re- 
search some of the necessary information is lack- 
ing. The third is that there seem to be individual 
differences in optimal mood for psi-hitting: some 
subjects score higher when challenged, alert, on 
their mettle; others, when they feel accepted and 
approved. Experimental designs that are simple 
may be inadequate. 

Teacher—pupil relations, Anderson and White 
(1957) report a series of classroom experiments. 
Targets were lists of randomized ESP symbols, each 
list separately determined from a table of random 
numbers and stapled into an Opaque container with 
a response sheet on top. The classroom teachers 
who acted as experimenters never saw the targets. 
Teachers gave their own pupils instructions for 
taking the test, then distributed and later collected 
both the ESP material and a questionnaire assess- 
ing the pupil’s attitude toward the teacher. Teachers 
were not permitted to examine these questionnaires 
but were asked to state for each pupil whether he 
would be chosen as a member of an ideal class. In 
this series, because of the excellent introduction 
and indoctrination from Anderson and White, we 
can feel confident of the mood of the experimenter: 
teachers were Cooperative, interested, and willing 
participants, 

Anderson and White provide a summary of re- 
sults obtained from 18 teachers, 546 students, and 
65,275 ESP calls. Its most striking datum is the 
difference between ESP scores of students who 
liked their teachers and whom the teachers would 
include in an ideal class and scores of students who 
disliked their teachers and whom the teachers 
would exclude from an ideal class. The former had 
a mean of 5.31 (out of a possible 25); the latter 
had a mean of 4.54; and the difference between 
them was significant (p < .001). Later replications 


by these and other investigators were sometimes 
consistent and sometimes inconsistent with this 
pattern, and ad hoc interpretations are of course 
available for the inconsistent data. 

Intelligence and school grades. A secondary, 
related finding of the Anderson—White research 
came from the division of the same ESP scores 
according to the marks received in schoolwork 
(Anderson 1959b). Students with grades of A or B 
had higher mean ESP scores than students with 
grades of D or E, and the difference between groups 
was significant (p < .001). 

If we tentatively conclude that high classroom 
morale (inferred from questionnaire answers or 
from high grades) is associated with above-chance 
classroom ESP scores and that low morale is as- 
sociated with below-chance ESP scores, we have a 
ready explanation for otherwise anomalous data 
on ESP and intelligence, Repeated studies have 
shown a low positive correlation between intelli- 
gence-test scores and ESP scores when the ESP 
tests are given in a school or college. This pattern 
is superficially contradicted by the high ESP scores 
reported for a class of retarded children. But if we 
infer that morale in a retarded class may, with the 
right teacher, in fact be high, the results appear 
consistent with each other and with the Anderson- 
White data, 7 

Open-mindedness versus disbelief. Attitude 
toward the task at hand is one component of mo- 
rale; and if the task is considered impossible, mo- 
rale is likely to be low. This factor was examined in 
a prolonged series of experiments where subjects 
were categorized (without experimenters’ knowing 
their ESP scores) as either flatly rejecting all pos- 
sibility of paranormal success under the conditions 
of the experiment or as not rejecting it (Schmeid- 
ler & McConnell 1958), Results of the 448,575 ESP 
calls showed that the rejecting group had a mean 
score of 4.92, while the other had a mean of 5.12. 
The difference is significant (p < .001). e 
certain subseries were sometimes inconsistent ye: 
the general tendency, as were some replications by 
other investigators; but the majority of replications 
have reported the same tendency. ae 

Personality traits. Personality tests pear 
tered to ESP subjects by Nicol and Hume 
(1955) showed significant positive ear 
(P < .03) between ESP scores and happy-g0" vee 
disposition, emotional stability, calm mee 
low irritability level, and freedom from depress! di- 
or nervous tension. Most investigations have ue i 
cated similar relationships. On the basis ae 
ministering Rorschach cards to 1,004 subj 


(Schmeidler 1960), for example, significant inter- 
action was shown between social adjustment and 
open-mindedness—disbelief (p < .001). Higher ESP 
scores were found for those subjects whose Ror- 
schachs had many movement and color responses, 
ie., those who (in Rorschach terminology) had a 
rich inner life and emotional responsiveness. In 
addition, the data indicated a low level of ESP suc- 
cess for subjects who were overintellectual, ex- 
tremely reserved, or somewhat overimpulsive. This 
seems consistent with the general patterns found 
by Nicol and Humphrey, as does Shields’s finding 
(1962) of low mean ESP scores for the withdrawn 
children she tested and high mean ESP scores for 
the others. [See PROJECTIVE METHODS, article on 
THE RORSCHACH TEST.] 

Target and testing conditions. In many experi- 
ments the independent variable has been the 
nature of the target or such changes in test condi- 
tions as illumination versus darkness or knowledge 
of scores versus no information. Variables denoted 
objectively, “from the outside,” such as sex of ex- 
perimenter or target content, have until now given 
inconsistent results from one research project to 
the next, It may, however, be premature to draw 
a negative conclusion about the ineffectiveness of 
“objective” conditions; recently reported research 
from Czechoslovakia points in the opposite direc- 
tion. Ryzl (Ryzl & Pratt 1963) hypnotized a subject 
and found not only a high rate of ESP success 
(p<.001) but also (with Pratt, a careful, experi- 
enced American visitor, acting as coexperimenter ) 
that the subject’s correct calls came on certain of 
the sealed cards but not on others (p < .001). There 
have as yet been no attempts at replication with 
other subjects and other experimenters. 

s Where variables are described “from the inside,” 
in terms of the subject’s feelings about them, 
higher ESP scores are usually found to be associ- 
ated with lively, spontaneous interest and feelings 
of social warmth, Typical are the Pratt and Wood- 
tuff data (1939), in which highest ESP scores were 
obtained on the first calls of each new size of tar- 
get, while the experimenter’s friendly challenge 
was still ringing in the subject’s ears. Another ex- 
ample is Scherer’s results (1948) with his “marble 
machine,” where subjects try to call the next color 
in a randomized series of marbles. When Scherer 
Placed the machine in the meeting room of the 
Duke laboratory, instructed the staff and visitors 
to call only if they felt a strong hunch, and restrict- 
ed them to a maximum of two calls a day, ESP 
Scores were high. However, when Scherer required 
Tepeated calls in the same session, ESP scores were 
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significantly lower (p < .001). High ESP scores 
apparently were related to the mood evoked by that 
machine in that situation but did not relate to the 
machine itself. 

Where two types of targets or testing conditions 
that differ in preferability are used in a single 
experiment, subjects are likely to score above 
chance on one and below chance on the other. Rao 
(1965), summarizing his own research and that of 
others, cites more than a dozen experiments at a 
high level of significance in which this preferen- 
tial effect appears. He follows Rhine in suggesting 
that dislike of one condition “drains off the psi- 
missing” and focuses psi success on the other con- 
dition. As in other research, internal evidence 
indicates that preferences are determined by sub- 
tle motivational factors rather than by the objec- 
tive nature of the target. In North Carolina, for 
example, young women scored higher on familiar 
English words than on synonyms in Telugu, an 
Indian language unfamiliar to them; but young men 
scored higher on the exotic Telugu words than on 
the English ones. This seems consistent with the 
findings of masculinity—femininity studies in the 
southeastern United States and with other research 
on attitudes in this area, which indicate marked 
conservatism and a shrinking from the unfamiliar 
as typical of women but a more adventurous atti- 
tude as more typical of men. 

Summary. In general, it would seem that high 
ESP scores are associated with the kind of mood 
and personal interaction which make for high mo- 
rale and good productivity in other tasks, while low 
ESP scores are associated with negativism and 
withdrawal. 

Replicability of ESP experiments. There is no 
ESP experiment that has been replicated frequently 
and has shown similar results on all replications. 
This fact has received much attention. Interpreta- 
tions fall into three main categories. (1) Any 
apparently significant finding is an accident of mal- 
observation or of coincidence. (2) The subtle 
variables determining whether ESP ability will re- 
sult in psi-hitting or psi-missing have been inade- 
quately identified; thus, the key psychological con- 
ditions cannot yet be adequately controlled and 
results therefore vary from one project to the next. 
(3) The experimenter is an intrinsic part of the 
ESP situation; spontaneous, lively interest helps 
ESP success; no experimenter can maintain spon- 
taneous interest indefinitely; therefore, replications 
eventually fail for the same experimenter (and 


` would not be adequate as replications if a markedly 


different person substituted for him). 
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Psychokinesis (PK) 

Modern research in PK, initiated by Rhine, ex- 
amines whether a subject can influence how a 
moving object comes to rest. In a typical procedure 
subjects are required to release dice and to hope 
that a specified die face will turn up. Necessary 
precautions include a control for instrument bias: 
with dice, all six faces should be called equal num- 
bers of times; or, with the placement technique 
introduced by Cox, an equal number of calls should 
be made for the right and left sides of the surface 
on which the moving objects fall. It is also neces- 
sary to control for motor skill and manipulations: 
a typical procedure with dice is to use a mechan- 
ical release and to arrange that the dice fall through 
a series of baffles which make them bounce re- 
peatedly. 

As in ESP experiments, the pattern of success 
and failure (i.e., the difference between conditions) 
rather than the total number of successes is some- 
times specified as the dependent variable. In PK it 
is frequently found that when a subject knows he 
is to hope for a certain die face 24 times in suc- 
cession, he will have more success on the first six 
tries than on the last six (“decline effect”). 

Evidence for PK. Although a routine control in 
ESP research requires that data be recorded by 
someone who does not know the target (and of 
course that the target be recorded by someone who 
does not know the subject’s response to it), in most 
PK experiments the records are made by the ex- 
perimenter, who knows what the subject had hoped 
for. Repeated extrachance data from such experi- 
ments can be explained away as recording errors 
caused by experimenter bias; however, the accu- 
racy of these results can be defended when inde- 
pendent records are made by two individuals. Many 
such experiments report significant PK effects with 
p levels ranging to .00005. 

In four experiments, data were recorded by a 
person who did not know the targets (i.e., did not 
know which position of the die would be scored 
correct). Two of these studies used the ingenious 
procedure of randomizing and recording target 
order but concealing this order from the subject. 
The subject was instructed to hope while throwing 
the die that the uppermost face would correspond 
to the (unknown) targets and to record for each 
throw which face was up. Records were returned 
to the experimenter for scoring (see Fisk & West 
1958). Results were highly significant (p< .001) 


and perhaps can best be interpreted as evidence for ` 


PK. A strained alternative explanation is that the 
subject learned clairvoyantly which faces would be 


correct and made autistic scoring errors, One ex- 
periment with photographic recording showed 
chance results and another showed total successes 
near chance but a significant predicted decline 
effect, p = .002 (see McConnell et al. 1955). 

Relation to physical variables. No clear relation 
has been observed between PK success and distance 
or weight of moving objects. The most recent work 
on distance was an experiment by Fahler (1959) 
with nine subjects, each of whom made sixty throws 
(each throw with six cubes) for a left-right place- 
ment test, i.e., subjects were instructed in half the 
throws to hope that the cubes would roll to the left 
of the medial line of a table, and in the other half 
to hope that the cubes would roll to the right. Sub- 
ject and experimenter were in a room 27 yards 
from the cubes and the placement table; throws 
were made by electromagnetic release. Independent 
records of the cubes were made by Fahler and a 
coexperimenter, both of whom knew the target; 
discrepancies were checked by re-examination of 
the cubes. Results were significantly high (p < .001) 
and were substantially higher than in another series 
when the same subjects were in the room with the 
cubes and table. Various other experiments have 
compared light with heavy dice and a single die 
with many dice. Scores are usually lower with a 
single die, and experimenters usually conclude that 
conditions which interest the subject are likely to 
be associated with higher scores. 

Relation to psychological variables. Experi- 
menters often comment that excitement and chal- 
lenge are conducive to PK success and that bore- 
dom militates against success. A single exploratory 
experiment by Van de Castle (1958) suggested 
that “expansive” subjects scored higher than “com- 
pressive” ones (p = .02) and that when these pema 
subjects were separated by Rorschach scores into 
a spontaneous group and an inhibited group, the 
spontaneous subjects showed above-chance PK suc- 
cess and the inhibited ones scored below chance. 
These indications are consistent with the other 
experimenters’ casual observations. No major re- 
search has been performed on psychological var- 
iables in PK, 


Telepathy 


When a percipient tries to call randomized, pe 
recorded targets which an agent is trying to “send, 
extrachance scores may be interpreted as responses 
either to the agent's thoughts (telepathy) or to the 
physical targets (clairvoyance). Several Bie 
ments with successful results have examined “pure 
telepathy.” McMahan (1946), for example, used 4 
Private code that was never written or spoken to 


translate the digits of a random-number table into 
ESP symbols. Subjects called the symbols with re- 
sults significantly better than chance (p = .003). 
Data were checked by an associate to whom Mc- 
Mahan conveyed the code by allusion (e.g., “The 
number of us who went swimming last Saturday 
stands for ‘star’”). Even with this procedure, how- 
ever, the metaphysical criticism was made that 
clairvoyance could not be excluded; subjects might 
have been responding “clairvoyantly” to the experi- 
menter’s brain changes rather than “telepathically” 
to her thoughts. At this level it is clear that pure 
telepathy is not demonstrable; indeed, by the can- 
ons of operationism the distinction between telep- 
athy and clairvoyance is scientifically meaningless. 

At a less rigorous level, there is substantial evi- 
dence for telepathic interaction. Mrs. Stewart, for 
example, showed consistently high ESP scores with 
some agents and chance scores with others, al- 
though she had no normal knowledge of who the 
agent was. Stuart (see Rice & Townsend 1962) 
found significantly higher GESP scores in twins 
and in engaged couples than in control pairs; Rice 
and Townsend discovered significantly higher 
scores in married couples than in control pairs; 
Schmeidler (1961) found significantly higher 
scores in subjects who were predicted (from ex- 
amination of their paired Rorschachs) to be 
friendly if they knew each other than in subjects 
predicted to be hostile or reserved if they knew each 
other (these latter giving significantly low GESP 
scores), Such data fit well into the conceptual 
framework of telepathy; they support the hypoth- 
esis although they do not prove it. 

Current research on dreams, using quantitative 
methods, may soon provide a means of manipulat- 
ing the variables and testing the interpretations of 
telepathy or GESP, as illustrated by an ongoing 
Series of experiments by Ullman and his co-workers 
(1966). Targets in these experiments are repro- 
ductions of pictures; each reproduction is put into 
an opaque envelope by someone not otherwise con- 
nected with the experiment. The subject is prepared 
for sleep with apparatus for recording rapid eye 
movement (REM) and EEG. The agent then re- 
tires to a distant room, where he randomly selects 
from the pile of envelopes the one that will contain 
Ue tateet for the night. The agent opens the en- 

pe and studies the picture; he has no normal 
means of communicating with the experimenter or 
Subject until the next morning, after all data have 
ae recorded. The experimenter in the meantime 
aS es the subject when REM and EEG give evi- 
ae of dreaming, asks the subject to report his 

m, and makes a tape recording of all state- 
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ments. Further reports and associations are record- 
ed the following morning. A typical experiment 
consists of 12 such sessions, each with a different 
picture and a different subject. Judging is blind, 
and consists of both ranking and rating the corre- 
spondence between coded pictures and the coded 
record of the entire night’s dreams and the subject’s 
responses in the morning. Each experiment thus 
serves as its own control. 

Data have shown significantly higher scores with 
an agent who was so interested in the research that 
he continued with the project than with an agent 
who resigned as soon as the experiment was com- 
pleted; and they have demonstrated that an out- 
standingly successful subject, when called back for 
an experiment of seven nights’ dreaming, continued 
to provide significantly high correspondence. Inter- 
esting qualitative data, and indications that color 
reports may be particularly good indicators of telep- 
athy, provide leads which can be pursued further. 

Dean (1964) developed a clever technique using 
REM as the dependent variable in a telepathy or 
GESP experiment. When REM and EEG records 
indicated that the subject was beginning to dream, 
the experimenter looked either at a picture with a 
horizontal composition or at a blank picture (tar- 
get order had of course been randomized and pre- 
determined). Subjects’ eye movements were scored 
blind. Significantly more horizontal eye movements 
occurred when the experimenter looked at the hor- 
izontal pictures than when he looked at the blanks. 
This tidy procedure has not yet been replicated. 
[See DREAMS; NERVOUS SYSTEM, article on ELEC- 
TROENCEPHALOGRAPHY. | 

There are also some scraps of evidence about the 
dynamics of telepathy. Warcollier (1948) and 
others, in qualitative research with drawings, de- 
scribe “contamination”: several individuals tried 
to draw a target selected by a stranger but instead 
apparently responded to each other’s ideas. Soal 
similarly reports that when Mrs. Stewart was con- 
sciously trying to respond to an agent with whom 
she usually failed, the accuracy of her responses 
was at a chance level; at the same time, she showed 
extrachance correspondence to the simultaneous 
messages of a different agent with whom she usu- 
ally succeeded. Among Schmeidler’s subjects there 
was a low but significant negative correlation be- 
tween GESP scores and scores when the agent, 
ignorant of target content, hoped the percipient 
would fail. These data imply that the percipient is 
often unable to shield out telepathic messages. 
Similar indications come from spontaneous cases, 
as when leading men in a Dutch town tried to hold 
a séance where they would receive messages from 
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the dead. A schoolboy across the street yearned to 
be one of their group but instead occupied himself 
with an English poem from a school assignment. 
The planchette under the hands of the leading cit- 
izens wrote an excerpt from the boy’s poem (see 
Murphy 1961, pp. 190 f.). 


Clairvoyance 


Extrachance results when a subject calls targets 
known to no one may be considered evidence for 
clairvoyance; but if someone later knows the tar- 
gets, the results can be interpreted as (precog- 
nitive ) telepathy and, although it seems farfetched, 
any clairvoyance results can perhaps be interpreted 
as evidence of PK. One recent experiment which 
could preclude precognitive telepathy (Schmeidler 
1964) required a computer to select random digits 
as targets, then to record the subjects’ calls, to 
score the hits, and to print out the total hits but 
not the targets. With this procedure, no one knew 
what the targets were, and if the digits had been 
truly random, no one could ever have known what 
they were. The digits in fact were part of a pseudo- 
random series, and the possibility now exists that 
the series could be reconstituted from the computer 
tapes. If Schmeidler destroys the tapes and other 
records before the series is reconstituted, precog- 
nitive telepathy is ruled out, but Schmeidler will 
open herself to allegations of fraud (Price 1955; 
Hansel 1966). Subjects were college undergrad- 
uates. Hits were significantly above chance 
(p = .001) for those runs where the computer’s 
program was correct, In a second series where con- 
ditions were slightly altered, the total scores were 
insignificantly below chance, but subjects who had 
no confidence that any of their calls were correct 
scored significantly below chance (p = :003), and 
subjects who thought that a few of their calls were 
correct scored significantly higher (p = 002). Thus 
there is some indication that “pure clairvoyance” 
occurs (if we disregard the counterhypothesis that 
the computer’s choice of random digits was influ- 
enced by PK), but the results have not been repli- 
cated. 


Precognition 


Extrachance results when subjects call random 
targets not yet selected may be considered evidence 
for precognition; but an alternative explanation is 
that the selection of the targets is influenced by 
PK, We must therefore examine how precognition 
targets are chosen. Let us consider nine precogni- 
tion experiments publishéd in 1959-1964 (see 
Journal of Parapsychology). In two of them, tar- 
gets were selected by a computer programmed to 


select random (presumably pseudo-random) num- 
bers; in six, by the experimenter’s throwing dice or 
coins as the directive for entering the random-num- 
ber table which would designate targets; in one, 
by using Dow-Jones averages of a specified future 
date as the directive for entering the random-num- 
ber table, All experiments yielded significant re- 
sults (p ranged from .02 to .0001). The most com- 
mon procedure involved throwing ten-sided dice, 
performing complicated prespecified computations 
on the results of the throw, and using the outcome 
of the computations as the directive for page, col- 
umn, and row of entering the random number 
table. For PK to be effective here, it would have to 
have a pinpoint accuracy far greater than any PK 
research has indicated. Although in theory PK can- 
not be ruled out, the more conservative interpreta- 
tion of the results seems to be that they demon- 
strate precognition. 

Comparisons of psychological variables in pre- 
cognitive and in simultaneous ESP have been 
scanty. Experimental evidence gives some indica- 
tion that similar mood and attitudinal factors af- 
fect both (Schmeidler 1964). However, several 
independent compilations of spontaneous cases in- 
dicate that more precognitive impressions come in 
dreams than while awake, and more telepathic and 
clairvoyant impressions come while awake than in 
dreams. 


Survival after death 


In a recent symposium on survival research (Roll 
et al. 1965/1966) participants agreed that there 
is no adequate research design to test the hypoth-, 
esis of survival after death, but they proposed sev- 
eral research methods which should yield data that 
could tend to confirm or disconfirm the hypothesis. 
None of these novel methods has yet been tried. 
Information which bears on the survival hypoth- 
esis comes from five major lines of investigation. 
Brief summaries and examples follow. 

Spontaneous cases. Apparitions of the dead 
sometimes give information known to no living per 
son which, when checked, is found to be correct 
(e.g., Tyrrell [1942] 1953). Many cases have been 
carefully studied. An example is that of the Chaffin 
will. Chaffin, a farmer, died in 1921, and his will 
left the bulk of his property to the third of his a 
Sons. In 1925 one of the disinherited sons He 
that his father, wearing an old coat, pulled the coa! 
back and said, “You will find my will in my over 
coat pocket.” A search located the coat, and ae 
into its lining at the pocket was a slip of ae 
specifying a verse in the Bible. Folded into ‘ll 
family Bible at the designated page was a later gil 


verified as being in the father’s handwriting and 
legally valid. 

Although there seems no doubt that these events 
occurred, at least three interpretations of them are 
available: that the surviving spirit of the father 
gave a belated message to his son; that the son had 
repressed memories of the stitching of the will into 
the coat, which he dramatized in his dream; and 
that the information came through clairvoyance. 

Counterhypotheses of repressed memories or of 
ESP or PK can similarly be formulated to “explain 
away” any case which superficially looks as ifa 
surviving spirit had evidenced itself. It is sometimes 
stated that until we find limits for ESP and PK 
abilities, no data can preclude ESP or PK as an 
alternative explanation to messages from the dead. 

Mediumistic sittings. Mediums (i.e. “sensi- 
tives” who claim to receive messages from the 
dead) ordinarily hold “sittings” for a person who 
asks for messages, and it is impossible to estimate 
what feedback from the sitter guides the medium 
to give accurate information. “Proxy sittings” are 
a research technique in which the person who will 
receive the messages (the “absent sitter”) is in 
touch with an experimenter. The experimenter 
transmits minimal information about the absent 
sitter (e.g., only the given name) to still another 
person, the “proxy sitter.” Only the proxy sitter sees 
the medium, asks for a message for the absent 
sitter, and records the messages. In modern re- 
search, the proxy sitter holds successive sittings for 
several absent sitters. All messages are coded and 
are scored blind by all absent sitters. It is then easy 
to evaluate the accuracy of the messages statisti- 
cally, using the absent sitters’ scores of messages 
not intended for themselves as a control. 

Earlier work with proxy sitters did not employ 
such elaborate precautions, but some is still im- 
pressive, Thomas (1935), for example, received a 
letter from a stranger, asking that Thomas make 
Contact with the spirit of a grandson, Bobbie, who 
had recently died. Thomas acted as proxy sitter 
and held 11 sittings with a medium. Many mes- 
Sages described toys, activities, etc., which the 
gtandfather confirmed. It must be noted that after 
the first reply from the grandfather, the strict con- 
ditions of a true proxy sitting had been violated. 
ie feature of the case remains interesting, 
on sa gives information known to no {iving per- 
Debole reply to the grandfather’s question of why 
4 aan had died, there were frequent references to 
thats with pipes where Bobbie had taken poison 
the ones his death. The grandfather searched in 
von gion indicated by the messages (past a cross- 

and bridge, to the right, with cattle, etc.), 
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found a hill which fit the description, and inquired 
of Bobbie’s friends, who confirmed that Bobbie had 
played there. Two springs where water issued from 
pipes were on the hill. At the grandfather's request 
a sanitary inspector examined the piped water and 
found it so contaminated that it was likely to cause 
infection in a child who drank it. Thus, the infor- 
mation from the sittings was in part known by the 
grandfather (the absent sitter), in part unknown 
by him but known by Bobbie’s playmates, and in 
part unknown by anyone until after the inspector's 
examination. The results can be interpreted as 
telepathic and clairvoyant, or as coincidence. In 
these and earlier sittings, Thomas’ interpretation is 
that the surviving spirits of his dead father and 
sister purportedly spoke to him or to the medium, 
and drew accurate inferences from information 
conveyed to them by Bobbie’s spirit, then commu- 
nicated the inferences to him through the medium. 

Cross correspondences. Cross correspondences 
are considered by many to be the strongest evi- 
dence for survival. Shortly after the death of dis- 
tinguished early members of the SPR, messages 
purporting to come from them were written or 
spoken (in trance) by sensitives in India, the 
United States, and Great Britain. The messages 
were collated in England, and it was found that 
classical allusions or similar odd items in one often 
overlapped with items in another (hence the desig- 
nation “cross correspondences”). Put together, the 
fragments conveyed a more complete version of 
some obscure story or reference than did the 
messages from a single sensitive. Some messages 
indicated that two of the deceased scholars were 
collaborating in the intricate interweaving of the 
information (see Murphy 1961, chapter 7). Fraud 
seems ruled out as a counterhypothesis to survival, 
but coincidence, optimistic scoring, telepathy, and 
clairvoyance are possibilities. 

Modern methods of content analysis have never 
been applied to this mass of material, so it 
is impossible to evaluate whether the apparent 
correspondences were extrachance. [See CONTENT 
ANALYSIS.] 

Similarity of apparitions. An ingenious statistic 
to support the survival hypothesis was adduced by 
Hart (1956), a sociologist. Apparitions which con- 
vey veridical information that is new to the persons 
who see them are frequently reported, as in the 
Chaffin will case. Sometimes these are appari- 
tions of living people (who often, when later ques- 
tioned, say that they had not been thinking of the 
person who “saw” them); sometimes of the dead. 
Hart listed 165 cases which seemed “good” on the 
basis of five criteria: oral or written statement of 
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evidential details before confirmation; later con- 
firmation; investigation by a competent research 
worker; fully documented record; and short time 
between the event and its report. From these 165 
cases he made a frequency count of 22 charac- 
teristics of apparitions (luminosity, attempt to 
speak, etc.). Frequencies of these characteristics 
were correlated for apparitions of the living and 
the dead, and the correlation was significant. Hart 
takes this to imply that the living and the dead are 
similar, and therefore that the dead still live. The 
obvious counterhypothesis is that cultural or per- 
sonal similarities among those who see apparitions 
account for similarities among the apparitions that 
they see. 

Reincarnation. Evidence is being collected, no- 
tably by Stevenson (1966), which shows that a 
living person, usually a child, has information 
which would not normally be known to him but 
which was part of the intimate, daily knowledge 
of some dead person. The living person sometimes 
also has special abilities, tastes, habits, or even 
physical stigmata characteristic of the dead person. 
This type of material is cautiously interpreted as 
consistent with a hypothesis of reincarnation. 
Since the careful research is very recent, there has 
not yet been time for the legalistic, hypercautious 
sifting of evidence, the attack and rebuttal, that 
is characteristic of the authenticated spontaneous 
cases. All we can do now is to note that for the 
first time an able investigator is making a serious, 
continued effort to collect what evidence there is 
on reincarnation and that on the face of it the 
evidence looks good enough to deserve critical 
study rather than casual a priori dismissal. 

An earlier claim for reincarnation should per- 
haps be cited here, since it received wide publicity 
(see Ducasse 1960), An amateur hypnotist at- 
tempted hypnotic regression and suggested to his 
subject that she would remember details of an 
earlier life. She accepted the suggestions, and 
claimed to have lived as Bridey Murphy in Ireland 
two or three generations ago. She described many 
details of the Ireland of that period. The later dis- 
covery that as a child she had spent many hours 
in the company of an old Irish woman destroyed 
general interest in the case. Stevenson's more so- 
phisticated investigations examine all data care- 
fully for such contamination. 


“Modus operandi” of sensitives 

Many sensitives produce their messages in a 
normal state, but more commonly the messages 
come during dissociation, as in trance or automatic 
writing. Messages are typically attributed to two 


sources: “communicators,” i.e., spirits of the dead, 
who drop in briefly, as it were; and “controls,” 
usually Indian princesses, Arabian physicians, ete, 
who visit frequently and often act as masters of 
ceremonies, introducing or transmitting messages 
from the communicators. Communicators are fre- 
quently identifiable from factual details or speech 
characteristics. No controls have ever been identi- 
fied, and some research indicates that they repre- 
sent secondary personalities of the sensitives, Mes- 
sages not attributed to communicators or controls 
are usually given as direct telepathic or clairvoyant 
knowledge, but occasionally a sensitive claims to 
have traveled “out of the body” to some distant 
place and to have made observations there. Some- 
times the sensitive holds a token object and gives 
messages about its present owner or its past own- 
ers. Such “object reading” is called “psychometry,” 
a word unfortunately similar to “psychometrics,” 
which of course has a completely different meaning, 

Physical phenomena. Such manifestations as 
table lifting and the production of ectoplasm were 
frequently reported in the past. Since the inven- 
tion of infrared photography, reports have been 
markedly less frequent, and none have been au- 
thenticated with modern research methods. This 
naturally casts strong, but perhaps unfair, sus- 
picions on the earlier accounts. 

Spontaneous cases. Spontaneous physical phe- 
nomena, such as the stopping of a clock or the fall 
of an apparently stable object when a distant per- 
son undergoes a crisis or dies, are often reported 
and seem well substantiated. Physical effects are 
also often reported at periods of strong emotions, 
as in Jungs account of the loud noises he pro- 
duced psychically in Freud’s office (see Fodor 1963). 
As with other spontaneous cases, there is no guide 
except one’s own judgment as to whether the ac- 
cumulated number of verified cases is or is not too 
large to dismiss as random coincidences. r 

Poltergeists. Recurrent spontaneous physical 
phenomena that occur in the same general loca- 
tion or are associated with one individual are called 
poltergeists. The effects are various, but most fre- 
quently they involve the dropping, breaking, o 
movement of objects; raps; odors; violent disturb- 
ances of beds or bedding. They are often associated 
with a child or an adolescent. An outbreak usually 
ceases in a particular house when the occupants 
leave, although occasionally the disturbances ac- 
company a person to another location. Many A 
ports are clearly fraudulent. Others can be keee 
to shifts in the underground water level whic 
cause movement of a house. There is a sizable 
residue of cases which some critics attribute t0 


undetected fraud but which others think are spon- 
taneous phenomena resulting from unconscious 


hostility (Owen 1964). 


Applications of parapsychology 

Claims are often made that sensitives have 
helped police track criminals, or have located lost 
objects, or have identified underground water or 
minerals (dowsing). Adequately controlled re- 
search on these claims is difficult to perform. 

Rose’s investigation (1955) of psychic healing 
indicates high rates of success for diseases which 
are often diagnosed incorrectly or for symptoms 
likely to be psychogenic, but only the rate expected 
in spontaneous recovery for other diseases. West 
has come to similar conclusions in an analysis of 
healing at Lourdes (1957). [See PSYCHOSOMATIC 
ILLNESS. ] 

Taetzsch (1962) points out that if ESP gives 
some information, the techniques of quality con- 
trol can sort the chaff from the wheat and can 
produce enough information to be usable. Repeated 
calls of the same target can be pooled until even- 
tually there is near certainty about the target. The 
method requires segregating the psi-missing calls 
from the psi-hitting calls; thus, if a subject shows 
other evidence of psi-missing and avoids a given 
target, this would be used to confirm data from 
another subject who shows evidence of psi-hitting 
and who calls the same target that the first avoids. 
At present our identification of the variables which 
correlate with psi-hitting versus psi-missing is so 
uncertain that the method seems too risky to use 
Without careful pretesting. 


Theories of parapsychology 


In general, even “miniature” theories of ESP and 
PK have not been stated rigorously enough for de- 
cisive testing. A notable exception is Scott's list of 
Several models of how information can be conveyed 
by psi (1961). Examples are the models “Psi 
occurs only occasionally, but when it occurs it 
Sives complete information” and “Psi occurs all the 
pe and consists of a multiplication of the per- 
“ipient’s chance of success by a constant factor.” 
Scott has developed the mathematical implications 
of his numerous models and has described experi- 
hehe methods for testing them. Although the 
Reece are clear and the problems important, 
egos was published dealing with them in 
a ree years after Scott’s article appeared— 
teats because parapsychology has more en- 

£ problems than it has experimenters. 
hdi e simplest theory to explain parapsychological 
ings has been put forth by Price (1955) and 


PARAPSYCHOLOGY 397 


Hansel (1966), who claim that all extrachance 
results in well-controlled research are fraudulent: 
the experimenters either lied about their data or 
were psychotic and hallucinated the data. As Price 
and Hansel develop this thesis, both argue that 
further controls could refute it. Price suggests that 
a jury of reputable men should certify that the 
data are correct, but he does not face the possi- 
bility of a new critic who could claim that the jury 
which Price considered reputable was composed of 
liars or psychotics. Hansel suggests that a com- 
puter be programmed to select the targets and to 
record and score the data, but disregards the possi- 
bility that the computer could be programmed to 
give false results. This theory of parapsychology 
seems untestable. The contention that a single man 
was dishonest can perhaps be disconfirmed if he 
is witnessed by others; the contention that those 
others were mistaken or dishonest (a contention 
made by both Price and Hansel in referring to col- 
laborative or witnessed research) would lead to an 
infinite regress. 

The argument is interesting historically, since 
in 1882 Henry Sidgwick, the philosopher who was 
first president of the SPR, wrote, “We have done 
all that we can when the critic has nothing left to 
allege except that the investigator is in the trick. 
_. . We must drive the objector into the position 
of being forced [to accept the data] or to accuse 
the investigators of lying or cheating or of a blind- 
ness or forgetfulness incompatible with any intel- 
lectual condition except absolute idiocy” ( 1882, 
p. 12). Sidgwick’s goal has apparently been 
achieved. 

No single theoretical approach is widely ac- 
cepted. In general, the ones which accept the data 
may be summarized as (1) those which call for 
new topological or other mathematical concepts 
of space-time relations, so that targets which seem 
distant in a three-dimensional or four-dimensional 
system may be conceptualized as adjacent; and 
(2) those which call for laws of psychic events 
which are separable from laws of physical events. 
The latter is not necessarily a dualism, since the 
two sets of laws may eventually be reconciled, but 
it lends itself most readily to an interactionism. 
Eccles (1953), among others, has argued that 
when we “will” to move a part of the body, the 
mind affects the body by PK. The parallel argu- 
ment is that we become conscious of our own brain 
processes by ESP. Interactionism fits easily into a 
survival hypothesis, because autonomous psychic 
events could continue after the death of that body 
with which they have chiefly been interacting. 

Theories of survival range from a denial of it, 
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through C. D. Broad’s thesis (1958) that stray 
memories or similar fragmentary psychic processes 
continue for at least a short time after bodily death, 
to concepts held by F. W. H. Myers (1903) and 
William James (see Murphy & Ballou 1960) of a 
surviving personality so little altered as still to be 
recognizable, and to theories of reincarnation. 
Theories of precognition are numerous and di- 
verse. They have been characterized as falling into 
two classes: those which are unbelievable and 
those which are incomprehensible. None is stated 
in clearly testable form. This may be remedied by 
a new approach as yet stated only in general out- 
line (Zink 1965) or circulated in unpublished 
memoranda. It is based on Feynman’s equations 
showing that a positron is an electron traveling 
backward in time, Mathematically trained “para- 
physicists” are developing further equations to find 
if they can both account for previous data and 
formulate hypotheses to put to experimental test. 


Rigorously controlled research shows extra- 
chance relations between random, concealed tar- 
gets and responses. These relations, designated as 
ESP, are as yet unexplained by physical theory, 
Evidence for either telepathy or clairvoyance is 
excellent; evidence for PK is less strong; evidence 
for precognition is good unless PK is accepted as a 
counterhypothesis; evidence for survival after death 
is inconclusive because it can be “explained away” 
as due only to ESP. 

GERTRUDE R. SCHMEIDLER 
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I 
CONTRIBUTIONS TO ECONOMICS 


Vilfredo Pareto (1848-1923), Italian economist 
ee Sociologist, was born in Paris. His father, 
a ee Pareto, a follower of Mazzini, had been 
xiled from Genoa in 1836 by the ruling house of 


Savoy and had gone to France and taken a French 
wife. It was only in 1858, ten years after Pareto’s 
birth, that an amnesty enabled his father to return 
to Italy. 

Pareto began his education in France, but he 
accompanied his father back to Italy and continued 
his schooling there; his secondary studies were 
mainly mathematical and classical. His formal edu- 
cation was completed at the Polytechnic Institute 
in Turin, where he finished his engineering studies 
at the age of 21 with a thesis entitled “Principi 
fondamentali della teoria dell’ elasticità . . .” (1869). 
From 1870 to 1892, Pareto worked as an engineer. 
He also served as a director of two Italian railways. 

His career as an economist stemmed from a 
chance meeting with Pantaleoni. Studying Panta- 
leoni’s Pure Economics led him to reread Walras. 
Although he had at first been rather indifferent to 
Walras’s work, on rereading it he was much im- 
pressed by the theory of general economic equi- 
librium. In 1891 he met Walras, who was con- 
sidering resigning his professorship at Lausanne 
University. Walras was very pleased to have at last 
found someone capable of understanding the scope 
and importance of his work and suggested to Pareto 
that he might become his successor. In 1893, 
Walras did retire, at the age of 58, and Pareto, 
who was 45, succeeded him. 

This date marks the beginning of Pareto’s sci- 
entific career, in the course of which he produced 
a number of books, all of remarkable quality: the 
Cours d'économie politique (1896-1897), the Sys- 
tèmes socialistes (1902-1903), the Manuale di 
economia politica, which appeared in 1906 and was 
published in French, with various improvements, 
as Le manuel d’économie politique in 1909, and 
the Trattato di sociologia generale, which appeared 
in Italian in 1916 and in a French version in 
1917-1919. 

In 1898, on the death of an uncle, Pareto in- 
herited a substantial fortune and moved to the 
town of Céligny, in Switzerland. From 1900 he led 
a reclusive life there, devoting himself wholly to 
his work and leaving Switzerland only rarely. 
Shortly before his death he was appointed a mem- 
ber of the Italian senate by the fascist government. 

Until he was about 50 years old, Pareto was 
inspired by the liberal approach to political econ- 
omy. Democracy, liberty, free trade, and humani- 
tarianism were his panaceas for the plagues of 
militarism, protection, and religion. As time passed, 
experience, growing objectivity, and a broader view 
of history brought changes in his point of view. 
From about 1900 his impassioned and intransigent 
partisanship slowly gave way to a more somber, 
pessimistic, and skeptical attitude. Perhaps the best 
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description of this intellectual change is Pareto’s 
own: “I was totally unaware that my reasoning was 
only an attempt to give logical clothing to beliefs 
which were fundamentally of an emotional nature” 
(letter to Antonucci, December 7, 1907, in Bous- 
quet 1960, p. 26). In 1900 he wrote to Pantaleoni 
that there had once been a time when his desire 
was to straighten the limbs of the halt but now he 
laughed at their infirmity. 

This change of viewpoint had no effect on Pareto’s 
style or manner. His mind had room for two very 
different personalities, the lucid, cold, precise sci- 
entist and the eager, incisive polemicist who was 
given to sarcasm and did not hesitate to subject his 
Opposition to merciless disdain. 

The determining influences on Pareto’s scientific 
career were his knowledge of mathematics, his 
immense erudition in matters connected with an- 
cient Greece and Rome, and his twenty-year prac- 
tice as an engineer. (A proof of his excellence in 
mathematics is the paper “Sur les fonctions géné- 
ratrices d’Abel,” which he submitted to Leopold 
Kronecker and which was published in 1892.) His 
contact with Walras’s theory of equilibrium served 
to trigger his subsequent brilliant accomplishments. 
Pareto always frankly acknowledged Walras’s de- 
cisive influence on the direction of his thought, 
even after their personal relations had been strained 
by differences over ideology. 

Pareto’s fundamental contributions to economics 
are contained in three publications, the two-volume 
Cours, the Manuel, and the article “Economie mathé- 
matique” (1911a) in the Encyclopédie des sciences 
mathématiques. The Systèmes socialistes and to 
an even greater degree the Trattato, although 
primarily sociological in emphasis, contain a con- 
siderable amount of analysis that extends and 
completes the material in the basic economic 
works. 


Pareto’s economic writings 


“Cours.” Pareto’s aim in the Cours was “to pro- 
vide an outline of economic science considered as 
a natural science based exclusively on facts” 
(1896-1897, p. iii). Two ideas dominate the book: 
that of successive approximation and that of the in- 
terdependence of economic and social phenomena. 

The Cours consists of the material on which 
Pareto based his lectures at the faculty of laws of 
the University of Lausanne. It is divided into two 
parts: the first (75 pages) is devoted to the expo- 
sition of the principles of pure political economy, 
and the second (780 pages) covers applied eco- 
nomics. The section on pure economics provides a 
first-approximation treatment of the phenomena 
studied which permits the general conditions of 


economic equilibrium to be set forth, These con- 
ditions provide the foundation for the successive 
approximations developed in the remainder of the 
book. For Pareto the theory of general equilibrium 
was the key to understanding the interdependence 
of economic and social phenomena. 

His exposition of the principles of pure econom- 
ics is in the tradition of Walras, but Pareto’s text 
is far superior to Walras’s in clarity and pertinence, 
The outstanding feature of the Cours is the way 
it combines theoretical analysis with discussion of 
a large amount of statistical and factual material 
and the use (mainly in notes) of mathematical 
techniques. 

The Cours appears to have been the first text in 
political economy to be so richly garnished with 
facts and statistics. Pareto’s erudition and the 
wealth of knowledge he had accumulated are 
striking: he was able to illustrate his theories not 
only with contemporary examples but also with 
numerous other pertinent examples, particularly 
from Greek and Roman antiquity. 

Nevertheless, Pareto’s Cours has some major 
defects. In the first place, it is an improvisation. 
Coming to economics at the age of 45, Pareto had 
not had time to refine his thinking when he wrote 
the Cours. The analysis of some subjects—for ex- 
ample, credit—is inadequate. Second, many inter- 
esting suggestions, such as the illustration of 
dynamic growth by a “pursuit curve” (courbe de 
poursuite, sec. 41) are not followed through. Third, 
the Cours, like all of Pareto’s other books, is badly 
organized; for example, it treats monetary ques- 
tions before studying production phenomena. 
Finally, Pareto was somewhat prone to argue a 
normative point of view. In the Cours he was still 
too often the committed liberal, although it is clear 
that the change to his later attitude was under Way: 

Yet if the Cours is an improvisation by a begin- 
ner, it is by a beginner of exceptional ability. It can 
still be read with profit for the insight it gives into 
many interesting questions. his 

“Manuel d'économie politique.” Among _ 
economic writings the Manuel d'économie politique 
is Pareto’s magnum opus. It is better pee ote 
and, more important, much better thought pess 
than the Cours. It is basically a work of Dies 
in which Pareto presented a general theory of re 
nomic equilibrium which is considerably MO 
refined than Walras’s. ful 

The first two chapters are devoted to a power = 
analysis of the nature, scope, and limits of eer 
in the social sciences. The next four chapters af 
with economic equilibrium. Their central ae ; 
is that individuals try to satisfy their needs as s 
they can, given the limitations imposed by 
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scarcity of available resources and the boundaries 
of existing knowledge. Two further chapters con- 
tain a special analysis of the three factors of pro- 
duction—labor, natural resources, and capital— 
and a discussion of the role of money. In the last 
chapter, entitled “The Concrete Economic Phe- 
nomenon” (“Le phénomène économique concret”), 
the aim is to examine the relation between theory 
and reality, with emphasis on protection, economic 
cycles, and the sociological implications of life in 
society. The basic sociological idea is that human 
activity can take either of two different paths, the 
production or transformation of economic goods 
or the appropriation of goods produced by others. 
Pareto refused to commit himself on the question 
of protection: he considered it destructive of re- 
sources but also wrote that “in some cases protec- 
tion could be compensated by other factors” ([1906] 
1966, p. 682). He viewed economic cycles as merely 
a particular case of the general rhythm of economic 
and social phenomena and contended that they 
cause less damage than is generally attributed to 
them and that the phenomena observed during 
crises are wrongly considered to be their cause. 

The main text is 539 pages long, and it is fol- 
lowed by an appendix of 132 pages, in which 
mathematical techniques are used to throw further 
light on the theories presented in the body of the 
book. To the modern economic theoretician the 
appendix is Pareto’s fundamental contribution, 
especially from the standpoint of the theory of 
general economic equilibrium and the theory of 
situations of maximum efficiency. 

“Économie mathématique.” Only the first part 
of Pareto’s encyclopedia article on mathematical 
economics was published; World War 1 prevented 
publication of the remainder, and all trace of it 
has been lost. The article generally covers the same 
ground as does the mathematical appendix to the 
Manuel but is more concise and carries certain 
analyses much further, and the superiority of the 
article to the Manuel makes the loss of the second 
part especially regrettable. 


Contributions to economic thought 


There are at least five fundamental areas in 
economic thought in which Pareto’s work repre- 
Sents a breakthrough: the definition of economic 
Science, the integration of economic phenomena 
into the whole of social phenomena, the definition 
s index functions and the theory of demand, the 

tablishment of the foundations of the theory of 
maximum efficiency, and the law of distribution 
of income, 
me nec asascience. Pareto’s greatest achieve- 

nt is his illuminating presentation of the meth- 
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odological basis of economics as a science. First, 
he enunciated a general theory of scientific the- 
ories, with economic theory and sociology as special 
cases. His work here is fully comparable to Henri 
Poincaré’s in the physical sciences but has the 
added merit of covering the social as well as the 
physical sciences. For Pareto the only source of 
knowledge is observation of facts. Abstraction is 
necessary for the analysis of facts, and theory pro- 
vides a simplified image of reality which preserves 
its essential features. Mathematics is merely an 
extraordinarily perfected extension of logic, to be 
used when required, but only then. The whole of 
Pareto’s work, from the Cours to the Trattato, is 
studded with shrewd observations on scientific 
methodology, the philosophy of science, and the 
science of economics. Unfortunately these observa- 
tions are dispersed throughout his work; an over- 
view of them can be found in the first two chapters 
of the Manuel, but this gives only a limited im- 
pression of the scope of his thought, The essential 
features of Pareto’s philosophy of science can per- 
haps be found in the Trattato. 

Second, Pareto applied his philosophy of science 
throughout his work, both in developing theories 
and in relating theories to facts. Thus, he clearly 
explained the link between the static and the dy- 
namic and showed that the static is only a phase— 
but a necessary phase—of the dynamic. The study 
of facts and the development of theories to rep- 
resent and explain facts were his dominant pre- 
occupations. Each argument in the Cours is illus- 
trated by many historical and statistical references. 
Irving Fisher, writing in the Yale Review, said of 
it quite accurately: “No other work contains such 
a compact, varied and comprehensive collection of 
statistical data” (1896, p. 327). In his later work 
Pareto’s interest shifted from the analysis of statis- 
tical data to the analysis of historical and socio- 
logical phenomena. 

Last, Pareto critically examined an immense 
quantity of pseudo-scientific theories. His Systèmes 
socialistes, in which he subjected the logical incon- 
sistencies of socialist economic theories, particu- 
larly Marxist theory, to a rigorous analysis, is an 
excellent illustration of this aspect of his critical 
work, which received its fullest expression in the 
Trattato. 

Economics as part of a larger structure. An 
important feature of Pareto’s work is his integra- 
tion of economic theory into the wider framework 
of the social sciences; his main objective was to 
generalize Walras’s theory of general economic 
equilibrium to cover the entire range of social phe- 
nomena. Pareto’s guiding principle was the inter- 
dependence of economic and social phenomena. 
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For him economic theory was but part of a much 
greater whole; its study was necessary, but not 
sufficient, for understanding society. 

Index functions and the theory of demand. 
Pareto was the first to make a clear distinction 
between the concepts of cardinal utility and ordinal 
utility, which he designated respectively by the 
terms “total ophelimity” (ophélimité totale) and 
“index function” (fonction indice). He showed that 
the theory of economic equilibrium can be devel- 
oped without recourse to a cardinal index of utility, 
using only the concept of the index function (or 
ordinal utility). He was the first to present a gen- 
eral theory of demand that showed how the em- 
pirical laws of demand can be derived theoretically 
from material furnished by introspective reasoning. 

Pareto’s exposition is sometimes difficult to fol- 
low because he used different methods of presenta- 
tion and systems of notation at different times; he 
also occasionally used the same notation for both 
ordinal and cardinal utility. In the following dis- 
cussion the notation used corresponds as closely as 
possible to that of Pareto’s encyclopedia article, 
with modifications to avoid confusion. For clarity 
a bar is used to distinguish quantities relating to 
cardinal utility from quantities relating to ordinal 
utility. Thus, # denotes ordinal utility, and $ de- 
notes cardinal utility; $, and $, are the partial 
derivatives of ġ and 4 with respect to x. 

Ordinal index. Pareto’s point of departure in 
the Manuel, Appendix, section 1, is the fact that 
to each set x, y, z, +--+ of quantities consumed, an 
infinite number of psychologically equivalent sets 
can be made to correspond, whence the concept of 
the index function, 


a) b= $(x, y, Z, +++), 


defining an indifference surface. Then ¢ is defined 
by the simple condition that the consumption pat- 
tern (%2, Y2,Z2, °° -) is preferred to (%4 Yis Z °° >) 
if ġ:>ġı. Under this definition, any increasing 
function of ¢ only, 


(2) F=F(9), 


also satisfies this condition (1906, p. 541). It is 
clear that is nothing other than ordinal utility 
as the term is understood and employed by English- 
speaking economists. Pareto observed that knowing 
the function œ provides full awareness of the eco- 
nomic psychology of the individual concerned. 

As Pareto pointed out, Edgeworth’s starting point 
was the notion of ophelimity (cardinal utility), 
which was assumed to be known although its 
definition presented considerable difficulty. Pareto 
attacked the problem from the other end, Starting 


with the concept of indifference surfaces (“a con- 
cept directly derived from experience”). He wrote 
in “Économie mathématique” (1911a, sec. 15, note 
25, p. 609), “The term ‘line of indifference’ was 
introduced into the literature by FY Edgeworth 
(Mathematical Psychics, p. 21), who assumed the 
existence of ophelimity and deduced the lines of 
indifference from this. V. Pareto (Manuel @éco- 
nomie politique, p. 540) inverted the problem, 
deducing the unknown from the known.” 

Today this approach to the problem is consid- 
ered almost self-evident, but at the time it repre- 
sented a considerable step forward in the exposition 
of economic theory, 

Cardinal index. Pareto went on to show that if 
cardinal utility, $, exists, $ is necessarily one of the 
functions F(¢), since the function ¢ = a constant 
should represent an indifference surface (1911a, 
sec. 15). 

Pareto derived properties of the cardinal in- 
dex, 6 = (x, y, z,- ), by psychological introspec- 
tion and distinguished three cases: (a) If $a =0, 
the goods x and y are independent. (By definition, 
ey = 0°4/8x dy.) (b) If bz, > 0, the goods are de- 
pendent and the dependence is of the first type 
(complementary goods). (c) If $, <0, the goods 
are dependent and the dependence is of the second 
type (substitutable goods). 

In a rather obscure and not particularly con- 
vincing way, Pareto deduced that in the case of 
independent or complementary goods the second 
differential, 


E$ = Freda? + $y dy? + -= 


3 i 
(3) + 26,,dxdy + +++ + 2$udydz+', 


is negative regardless of dx, dy, dz, --- (1906, Ap- 
pendix, secs. 47—49; 1911a, sec. 17). From this it 
should follow that (shown here in the case of 
three goods) 


Pez < 0, 
Gre dey >0, 
Gey dy 
(4) 
uc be 
ee pe ao oe 
ation dia 


In fact, the negative sign of d’ can be deine 

from the hypothesis of “diminishing psychologic: 

returns” (Allais 1943, p. 170). waatiboe 
Pareto never tried to establish a definition 
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complementary and substitutable goods that would 
be free of the arbitrary element involved in trans- 
formation (2). and he failed to specify what hap- 
pens to the conditions (4) when they are written 
in terms of the ordinal utility index, ¢. (This has 
since been done by Allais, in 1943, pp. 137-152.) 
Nevertheless, he must be credited with having in- 
troduced the use of second differentials and definite 
forms into economics; all subsequent research 
bears witness to the interest in this contribution. 

General laws of supply and demand. In the 
Appendix to the Manuel and in the encyclopedia 
article, Pareto gave a general theory of demand 
which enables the various elasticities of demand 
to be calculated on the basis of the equilibrium 
equations 


ie aig, ee 
be of, i , 
(5) 
(x= xo) + P(Y — Yo) + P:(2— Z) + = = 0, 
where 1, p,, P+, -+ represent the prices of the good 


x (money) and the goods y, z, *** (1906, sec. 52, 
p. 579; 1911a, sec. 32, p. 628). 

By differentiating these equations Pareto ob- 
tained a linear system of n equations from which 
the unknowns dx, dy, dz, «++ can be calculated as 
functions of given changes dp,, dp:, +: in prices. 
He showed that the quantities dy/dpy, 02/0p:, `" 
can easily be expressed as a function of the de- 
terminant 


Oo 1l Py Pe 


1 der Gry $z 
(6) M=-—| Py m $w dv 
Po Pez Guz bee 


and its minors and set out the expressions which 

he had already given in 1892 in the Giornale degli 

economisti, 

afeto showed what happens to these relations 

E the goods are independent (1911a, sec. 33; 
6, sec, 53); in this case the conditions 


(7) aco OF 


Raa that if a good y is demanded, the demand 

or it falls when its price rises. 

Fae showed that contrary to Marshall's asser- 

price the marginal utility of money, m, varies when 

a ne change, and thus it is clearly wrong to con- 

doce constant (1906, Appendix, sec. 56). He 
showed, in a rather elegant demonstration 
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(19114, sec. 23), that if the elasticity of demand 
with respect to different goods is constant, then its 
value is unity. This is evidently a very restrictive 
condition. Pareto added, “There is no sign of cor- 
responding research in Marshall’s Principles of Eco- 
nomics . . . and consequently the results at which 
he arrives are incomplete, and in part erroneous” 
(ibid., note 31, p. 620). 

Pareto further showed, in a subsidiary analysis 
(1896-1897, sec. 83), that Marshall and his suc- 
cessors were wrong in estimating the consumer's 
gain from exchange—that is, consumer’s surplus— 
by the curvilinear triangle formed by the demand 
curve. He showed very simply that this procedure 
is exact only if the marginal utility of money is con- 
stant, a condition that is generally not satisfied. 

What Pareto failed to perceive was that when 
the conditions (4) are satisfied the consumer’s equi- 
librium is stable but that stable consumer equilib- 
rium does not necessarily imply that at the equi- 
librium point the conditions (4) hold (see, for 
example, Allais 1943, pp. 468—469). 

Pareto also did not perceive that production func- 
tions can be defined analogously to indifference 
surfaces; had he seen this, he could have devel- 
oped a production theory along the same lines as 
the theory of consumers demand and supply. 
Nevertheless, his theory of the laws of supply and 
demand is a remarkable accomplishment. 

Theory of maximum efficiency. Of all Pareto’s 
contributions to economic thought the most im- 
portant is his rigorous construction of the founda- 
tions of the theory of maximum efficiency of eco- 
nomic management, or maximum ophelimity for a 
society. (English-speaking economists know this 
concept as optimum resource allocation. The ex- 
pression is not a particularly happy one, for the 
word “optimum” carries an inappropriate impli- 
cation. ) 

Pareto’s thinking on the theory of efficiency 
gradually became more precise. The theory was 
foreshadowed in the Cours (1896-1897, vol. 2, 
note 721, pp. 92-94) but appeared in its definitive 
form only in the appendix to the Manuel ([1906] 
1966, pp. 655-656 ) and in the 1911 article in the 
Encyclopédie (19114, pp. 624-625) and was given 
its final expression in the Trattato (1916, secs. 
2128-2131). 

Pareto defined a situation of maximum efficiency 
(1906, chapter 6, sec. 33, and Appendix, sec. 89; 
1911a, sec. 28) as one in which it is impossible to 
increase the index function of one individual with- 
out decreasing that of some other individual. Ac- 
cording to this definition, a situation of maximum 
efficiency is one in which any index function is a 
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maximum subject to (a) the condition that the 
index functions of the other consumers be main- 
tained at given levels and (b) the ruling production 
functions. 

This definition of a situation of relative maxi- 
mum had, in fact, already been given by Edgeworth 
in his Mathematical Psychics. Defining equilibrium 
for n participants in exchange, Edgeworth wrote: 
“The state of equilibrium may be considered as 
such that the utility of any one contractor must be 
a maximum relative to the utilities of the other 
contractors being constant, or not decreasing . . .” 
(1881, p. 27). 

However, having defined situations of relative 
maximum perfectly, Edgeworth used his definition 
only in the analysis of stable equilibrium, without 
perceiving the contribution it could make to the 
study of situations of maximum efficiency. Had he 
taken this additional step, he would have demon- 
strated the fundamental theorem of the equivalence 
of a state of economic equilibrium and a state of 
maximum efficiency, 

It is difficult to know whether Pareto consciously 
made use of Edgeworth’s definition. He read Edge- 
worth’s book in 1892 (see Lettere a Maffeo Panta- 
leoni, letter dated January 31, 1892), but a careful 
study of Pareto’s successive texts suggests that he 
arrived at his definition of maximum efficiency 


(maximum dophélimité pour la société) by his » 


own route, 
Pareto also gave a rigorous definition of surplus. 
He observed that if the index functions are not 
comparable, the quantities 56/¢, are, since they rep- 
resent the quantity of the good x which would pro- 
duce an increment ôb over the initial situation. 
Thus, the gain in terms of x corresponding to any 
change in the economy as a whole is given by 


1 1 

(8) ôT, þa 56, + a 
where the ¢,, #2, +- are the index functions of the 
different individuals and ôs, is the corresponding 
equivalent gain or distributed surplus in terms of 
good a (1906, Appendix, sec. 127; 1911a, sec. 28). 

It is clear that there can be maximum efficiency 
in the sense of the definition only when 


(9) ôs, <0 


õpet, 


for any virtual displacement compatible with the 
constraints. 

It was this approach to the definition of the max- 
imum of ophelimity which, for the first time in the 
history of economic thought, enabled the problem 
of efficient management of an economy to~be 
posed correctly—i.e., independently of the price sys- 
tem or the social structure of the economic system 


being considered, which could be based on private 
or collective property. This simple and natural 
definition contains the seeds of all the subsequent 
developments in the field. It provides a rigorous 
foundation for the general theory of economic 
optima—the management optimum, the population 
optimum, and the capitalistic optimum. It elimi- 
nates from the argument what was a major ob- 
stacle, the arbitrary nature of the distribution of 
income (1896-1897, vol. 2, pp. 91-92), 

Pareto went on to develop a line of argument, 
somewhat lacking in rigor (ibid., secs, 720-726; 
1906, chapter 6, secs. 33-61, and Appendix, secs, 
145-152), which showed that a state of maximum 
efficiency and a state of equilibrium under perfect 
competition are one and the same thing (theorem 
of sec. 723 of the Cours and secs. 146 ff. of the 
Manuel). This led to his deduction that the prob- 
lems to be solved in realizing a situation of maxi- 
mum efficiency, as well as the solutions to these 
problems, were the same for a collectivist economy 
as for an economy based on private property. 

Pareto’s demonstration in the Manuel is incom- 
plete and in part erroneous. Average cost and mar- 
ginal cost are confused; no distinction is made 
between differentiated production sectors, in which, 
from a physical point of view, the best production 
technique consists of n distinct production units, 
and the nondifferentiated sector, in which the best 
production technique involves a single production 
unit; no account is taken of time and therefore of 
interest; second-order conditions are neglected; and 
the exact conditions for validity of the theorem ate 
not stated. Nevertheless, this analysis is the foun- 
dation for all future developments in the field. 

Walras believed, mistakenly, that he had dem- 
onstrated the equivalence of a situation of maži- 
mum efficiency and a situation of equilibrium in 
an economy under perfect competition. Actually, 
he had not even succeeded in developing a rigorous 
presentation of the problem of maximum a 
for a society. In the Trattato, Pareto identifie 
Walras’s error in a failure to distinguish between 
the maximum ophelimity obtainable by an indi- 
vidual undertaking transactions in the market wi 
given resources and the problem of the ee 
of ophelimity for the collectivity (1916, sec. 2128, 
note 1). lus 

Having developed expression (8) for Smg à 
Pareto naturally deduced its first differential, 


1 


(10) do. = -dht dost" 
1a 2a k 
but he did not attempt to calculate the second A 


ferential, although it must be considered in ; 
study of the stability of equilibrium and in 
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study of the second-order conditions for situations 
of maximum efficiency (the general expression 
for the second differential, d’o,, appears in Allais 
1943, pp. 612-616). 

Unfortunately, Pareto dodged the main issues in 
the admittedly extremely complex question of the 
distribution of income. He stated in the Cours 
(1896-1897, vol. 2, sec. 720, p. 91) that his sole 
focus of interest was the conditions of production 
which “produce the maximum of ophelimity,” it 
being taken for granted that the goods produced 
“are distributed according to whatever rule it is de- 
sired to adopt.” This assumption, however, neglects 
an essential aspect of the analysis: even if the 
conditions of maximum efficiency are realized in 
the production system, it is not true in general that 
optimum efficiency in distribution is realized under 
any system of distribution whatsoever. This is 
clearly shown in the line of reasoning developed by 
Edgeworth (see Myint 1948; Little 1950; Samuel- 
son 1950). In fact, Pareto simply evaded the prob- 
lem of income distribution. 

In the Trattato, Pareto generalized the maximum 
of ophelimity for a collectivity for the most general 
case (1916, secs, 2128, 2131-2139). The most im- 
portant part of the text appears in a note, extracted 
from an article of Pareto’s in Giornale degli econo- 
misti (1913), “Il massimo di utilità per una col- 
lettivita in sociologia.” This note is of extreme sig- 
nificance both for social theory in general and for 
the theories of collective choice and planning in 
Particular. Pareto wrote: 


The quantities ôd, , ô+, ++- are heterogeneous. They 
cannot be added together, for such an addition would 


meaningless. . . , The aim of considering the 
quantities 
(11) ee a 1s 
F, ips 5 JE pe» 


A to avoid the difficulties which arise from the fact 
at the ophelimities 5h, , 52, ::* are heterogeneous, 
le tendering them homogeneous and their summation 
pa 10] meaningful. . . . Were there another method 
pene the heterogeneous quantities 56, , 5:, °"* 
À mogeneous . , . for example by multiplying them by 
ertain positive quantities œ, , œ», -> it is evident that 
Consideration of the sum 
1 
a2) 8V = aô, + a2dh2 + °° 
Kan give results analogous to those obtainable by 
adering equation (8)... . ¥ 
tone remote, in this way there are as many equa- 
S (12) as there are individuals, i.e. 
BV, =a, öp, +a, 5, +a, 56, t 
BV, = ay òp, + ay Bp, + a Bhs + 
BV, = ary’ öp, + ax’ BG, + ag’ BaF 
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. .. In order to make these quantities homogeneous, 
they must be multiplied in their turn by certain co- 
efficients B',, 8, 8’, ++- determined with a specific 
objective in mind such as, for example, the prosperity 
of the collectivity. . . . Now, using these coefficients, 
the quantities corresponding to equations (13) have 
been rendered comparable; they may be added after 
multiplication by £, 8y, «+ to give 


1 
(14) 8W=M,8¢, + M:ôġp: + M,6p; +. 


(1916, vol. 1, pp. 1341-1342, sec. 2131, note 1; to clar- 
ify the exposition, Pareto’s text has been altered slightly 
by the introduction of the indexes V,,V., V;,°°+,W, 
and equation numbers have been changed to follow 
the sequence in this article) 


It is then possible to consider W the collective 
preference function as it is seen by the govern- 
ment, whereas the V; are the collective preference 
functions as they are seen by individual citizens. 

In the text of the same section Pareto asserted: 


.. . the public authorities have necessarily to compare 
the different utilities; for present purposes the criteria 
on which they do so may be ignored. In imprisoning a 
thief, for example, the authorities weigh the sufferings 
imposed on him against the utility accruing to honest 
citizens, estimating that the utility will at the very 
least compensate the suffering. If it were not so, they 
would let him stay free. . . . It goes without saying that 
the authorities bring into the comparison as best they 
can—often, it is true, not a very good best—all the 
utilities of which they have awareness. (ibid., pp. 


“ 1342-1343) 


These remarks are relevant not only to social 
issues but also to all economic decisions made by 
the public authorities, including taxation and plan- 
ning. As Pareto suggested, it is incorrect to dis- 
tinguish economic from social issues; what holds 
for one holds for the other (ibid., sec. 2131, note T): 

Pareto then added: “The aim of the definition 
[of optimum collective ophelimity] is to substitute 


‘rigorous considerations in place of the vague and 


imprecise expressions normally used, whose inde- 
terminacy renders them fallacious” (ibid., sec. 
2132). 

Pareto’s analysis (ibid., secs. 2121-2139) illu- 
minates the nature of governmental economic 
decisions, particularly planning, and is the best 
introduction to the modern analysis of collective 
decisions. He showed with admirable clarity that 
there is no such thing as the general interest or, 
for that matter, a social optimum, since the in- 
dexes V; and W are not and cannot be identical. 

If, to use the above notation, the collective indif- 


~ ference function can be written 


(15) WERV V2." Viz >") 
and if it is assumed that, as is generally the case, 
W is an increasing function of the Vi, it can be 
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seen that whatever the function F is, W cannot be 
a maximum unless each of the V; is at a maximum 
for fixed values of the other V;. It thus follows 
that study of situations of maximum efficiency in 
the Paretian sense is useful and necessary even 
though the function W is not specified. 

Pareto apparently did not see the crucial signifi- 
cance of his contribution. He devoted only 7 out of 
the 586 pages in the Cours to this question, 15 out 
of the 691 in the Manuel, 2 out of the 49 in the 
encyclopedia article, and 11 out of the 1761 in 
the Trattato. 

Pareto’s law. Elements of Pareto’s law of the 
distribution of income and wealth appeared in sev- 
eral publications in 1895, 1896, and 1897; these 
have been collected by Busino (see Pareto, Ecrits 
sur la courbe de la répartition de la richesse). An 
over-all statement was presented in the Cours in 
1897 (1896-1897, vol. 2, secs. 957-965, pp. 304- 
326). Extensive verbal comments are given in the 
Manuel (1906, chapter 7, secs. 3-31, pp. 381-393). 

The mathematical expression of Pareto’s law, 
according to his own formulation, is 


(16) A 


N= o 
where N is the number of incomes above a certain 
value R, and A and a are constants. Income, x, is 
assumed to remain above a minimum, h, the cor- 
responding value of N being 


A 
aa 


N _(h\« 

O 
Thus, Pareto’s law is nothing else than the ordinary 
negative exponential distribution, truncated at the 
left to logh. In his controversy with Edgeworth 
(see Busino’s Introduction to Pareto’s Écrits sur la 
courbe de la répartition de la richesse), Pareto 
strongly underlined the necessity of this truncation. 

A considerable number of distributions of income 
are represented with a fair degree of accuracy by 
this law. The value of is stable or varies only 
slightly over time in the same country; it has gen- 
erally remained between 1.5 and 2 during the past 
few centuries. 

Some distributions are satisfactorily fitted only 
after the introduction of two further constants, 
a and Bp: 


N, = 


Hence, 


Aeb: 
(17) NE 


a formulation which was proposed by Pareto as 
early as 1896. 


The constant a, which is generally very small, if 
not zero, may be interpreted as compensating for 
the earned income allowance (abattement à la 
base) characteristic of income taxation statistics, 
The constant £ is also generally very small, 

Pareto supplied many examples for which the 
law (eq. 16) appears to provide an adequate fit 
and gave values of a of similar orders of magni- 
tude. He commented that these were very remark- 
able results and that it was absolutely impossible 
to accept them as merely chance results. 

In a formula that is perfectly clear when stated 
mathematically but is rather obscure in its verbal 
form and has therefore given rise to erroneous 
interpretations (1896-1897, vol. 2, secs. 964, 956; 
1906, chapter 7, sec. 24), Pareto indicated that the 
inequality diminishes with an increase in the ratio 
of the number of persons whose income is below # 
to the number of persons whose income is above x. 
According to Pareto, this definition involves declin- 
ing inequality of incomes with the growth of the 
quantity 


(18) Vgc) = Nie) ae £ Le =l 


where h is the minimum income, If Pareto’s law 
(16) is exact, we have 
h a 

(19) V(x) = (5 Li 
and since x> h, it can be seen that income in- 
equality declines as æ rises. Contrary to an opinion 
held by some writers, Pareto’s definition is not at 
all inconsequent. Although he calculated the total 
value, R, of incomes above x (1896-1897, vol. 2, 
sec. 961, note 1), Pareto failed to see the very 
suggestive interpretation attaching to his law of 
income distribution (eq. 17); in fact, it can be 
shown that if m(x) represents the average of those 
incomes which exceed a given income x, then, 
applying Pareto’s law (eq. 16), 
mix) a 

emits ad 

If it is assumed that individuals’ assessments of 
the degree of income inequality vary as m(*)/* 
the fact that the ratio is constant can in pe 
interpreted as meaning that the assessment of s 
degree of inequality is the same whatever the lev 
of income, x. The coefficient 


(21) B= 
and 


can then be taken as an index of inequality, a 
Pareto’s law is open to a very simple interpreta as 
Pareto tried to prove in several of his wí 


(20) 


a 
a= 1 


‘fi 
Uture of our science. . 
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(ibid., sec. 962: 1906, chapter 7, sec. 15) that the 
distribution of incomes is not random, but the argu- 
ment he used is inexact. He submitted that the 
law of income stribution does not reduce to an 
error distribution and so cannot be regarded as a 
chance outcome Unfortunately, although it is true 
that the normal law cannot be used to fit the dis- 
tribution of income, a lognormal law generally fits 
quite adequately. In other words, the logarithm of 
income follows the error distribution. 

Several authors (Gibrat, among others) have 
suggested use of the lognormal distribution to rep- 
resent the distribution of income. This gives good 
results, particularly in those cases where Pareto’s 
law does not provide a good fit; conversely, Pareto’s 
law often gives good results where the lognormal 
distribution does not provide a good fit. Thus, there 
are many distributions which are well fitted either 
by Pareto’s law or by the lognormal distribution. 

Pareto's law has been subjected to a great deal 
of analysis. In general, the research has shown 
that Pareto’s law can be applied successfully to a 
considerable number of distributions, and it has 
confirmed Pareto’s basic result; namely the relative 
stability of the coefficient a over space and over 
time for the different societies studied. 

The constancy of income inequality which can 
be deduced from Pareto’s law has significant socio- 
logical implications. If inequality is independent 
of the economic system, the socialistic attempt to 
diminish it is irrelevant, and the only way to im- 
prove the lot of low-income groups is to increase 
the efficiency of production. Pareto was quite will- 
ing to exploit this result, a fact which goes a long 
way to explain the heated controversy to which his 
law has given rise. In fact, the coefficient a is not 
a constant but generally varies between 2 and 3, 
and thus the socialist thesis has some value. 
(aes Pareto’s law is only a minor part of 
i ec scientific contribution and, furthermore, 
the ead empirical finding, the demonstration of 
ae F stence of a quite invariant factor in the 
ee ure of human societies is of indubitable im- 
portance. Schumpeter commented very aptly: “Few 
road economists seem to have realised the pos- 

ilities that such invariants hold out for the 
realised . . nobody seems to have 
ieee that the hunt for, and the interpretation 
of Be se of this type might lay the foundations 
Si ntirely novel type of theory” ({1949] 1965, 

and note). 


Inf 
uence of Pareto’s economic ideas 


s oe influence on the development of eco- 
cs as a Science was felt only after considerable 
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delay and has largely been confined to Italy and 
France. His economics have influenced such Italians 
as Barone, Pietri-Tonelli, Pantaleoni, Amoroso, 
Demaria, and Fossati, but outside Italy few writers 
explicitly claim to be in his direct tradition. Allais 
is the only French author to locate himself directly 
in Pareto’s line of thought; Allais’s pupils Boiteux, 
Debreu, Malinvaud, Lesourne, Nataf, and Verhulst, 
among others, also show the influence of Pareto. 

His work has had little effect in the English- 
speaking world; according to Schumpeter, “This 
might seem surprising owing to the fact that sev- 
eral important developments in theoretical eco- 
nomics are now seen to stem from him. But it is 
not difficult to explain. Pareto was the product of 
a sector of the Franco-Italian civilisation that is 
far removed from English and American currents 
of thought” ([1949] 1965, p. 111). Pareto com- 
plained to Edgeworth in 1896 that his work was 
not duly appreciated in Britain. The Trattato 
(translated in 1935) is still the only one of his 
books to appear in English. (Walras was in a sim- 
ilar position for a long time.) Nevertheless, his 
ideas have been widely drawn upon, too often with- 
out explicit acknowledgment of their origin. No 
English-speaking economist acknowledges Pareto 
as his master, although Hicks, Hotelling, Lange, 
Lerner, Samuelson, Koopmans, Dorfman, Arrow, 
and others have been influenced by his work. 

Pareto’s influence in economics was considerably 
diminished by the ideological implications of his 
work, His vigorous attacks on democracy, his co- 
gent criticism of socialist systems and of socialist 
leaders in power, did not endear him to left-wing 
intellectuals, who by general, if tacit, consent seem 
to have chosen to ignore his work. 

At few points in his career could Pareto count 
on a sympathetic reception from the official circles 
responsible for Italian economic policy. The best he 
could expect was lack of understanding; more often 
he had to face hostility. 

Pareto’s erudition, critical faculty, creative imag- 
ination, and talent for synthesis were exceptional. 
He set himself the tasks of integrating the phe- 
nomena of economics into the fabric of social 
reality, generalizing Walras’s theory of general 
economic equilibrium, and establishing a general 
theory of social life based on the analysis of facts. 
He was never able completely to realize this ambi- 
tion, either formally or substantively. Each of his 
books consists of juxtaposed disparate elements, 
held together only by his constant desire to dis- 
tinguish the subjective from the objective, to base 
his examination solely on facts, to look everywhere 
for the regular patterns underlying the apparent 
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diversity of social phenomena. Yet his specific anal- 
yses—all of them suggestive—are excellent. His 
work can be compared to a palace whose general 
architecture is unsatisfactory but each of whose 
rooms contains some valuable artistic features. 

The economic universe in which Pareto worked 
was too vast to be exhausted by the efforts of one 
man. He did not have time to follow all the new 
paths he had opened up. Nor did he have time to 
polish his output, and the exposition of his thinking 
contains several glaring faults. A great deal of 
progress has since been made in the various fields 
he opened up, and apart from questions of scien- 
tific methodology, much of his work is out of date. 
But this fact does not diminish the exceptional 
importance of his contribution. 

With Walras and Irving Fisher, Pareto may be 
regarded as one of the three founders of modern 
economic science. The three, although very differ- 
ent, have much in common. Walras and Fisher, 
like the early Pareto, were ardent champions of 
normative ideas, and in all three the scientist 
struggled with the crusader. They were either 
ignored or hated, especially by their compatriots, 
but the reputations of all three are now rising. 

Pareto was a man of exceptional talent, able to 
master the most varied disciplines and to further 
the progress of science in each. Although he was 
intellectually isolated, his influence on economics 
is clearly identifiable. His thinking is continually 
becoming more relevant to the solution of current 
problems, and its contribution to one of the most 
powerful theoretical approaches in contemporary 
thought is increasingly appreciated. He was a first- 
class thinker, and his work constitutes a milestone 
in the history of thought. He has left us an im- 
perishable heritage. 

MAURICE ALLAIS 


[See also ECONOMIC EQUILIBRIUM; INCOME DISTRIBU- 
TION, article on SIZE; STATICS AND DYNAMICS IN 
ECONOMICS; UTILITY; WELFARE ECONOMICS; and 
the biographies of PANTALEONI and WALRAs.] 
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n 
CONTRIBUTIONS TO SOCIOLOGY 


Besides being one of the most important foun- 
ders of mathematical economic theory, Pareto was 
among the leading theorists of a generation par- 
ticularly crucial to the development of sociology. 
Although he enjoyed a short-lived vogue in the 
United States in the 1930s, it seems fair to suggest 
that his reputation and influence have not been as 
great as they intrinsically deserve. While Pareto’s 
contributions did not equal those of his great con- 
temporaries Durkheim and Weber, he should be 
ranked very high indeed—perhaps the highest 
after them—in his generation (Parsons 1937). He 
Stood somewhat outside the main intellectual tra- 
ditions of his time, particularly those dominant in 
the most important intellectual centers—English 
utilitarianism, German idealism and historicism, 
and French collective “solidarism.” And he was 
= with the brush of fascism—a stigma he de- 
ates » at most, only in part, but a serious one 
as the Western world’s profound crisis of the 
A The intellectual background of Pareto’s socio- 
eek thought lay in two influences. The first was 
a By physical science and engineering: he re- 
ae a his degree in mechanics and was long a 

icing engineer. The second was a Latin—hu- 


manistic orientation, by virtue of which he was 
steeped in the history and literature of both the 
ancient world and the Renaissance, particularly 
the latter. He was fluent in both Italian and French, 
since he had lived as a child in Paris, had moved 
to Italy when he was ten, and later lived in French 
Switzerland. Significantly, he had no knowledge 
of German and relatively little of English. His 
father was a Mazzinist exile, and politically Pareto 
was a disillusioned liberal, not a positive fascist. 

Social action. Pareto saw the theoretical task 
of sociology as an extension of that of economics. 
He defined economics as dealing with one par- 
ticular type of the components of social action 
(he used the French word action and its Italian 
equivalent), those constituting an abstractly con- 
ceived system treatable as a set of interdependent 
variables, on the methodological model of classical 
mechanics. He then defined sociology residually, 
as dealing with those components of action, or at 
least many of them, not handled by economics or 
by the other disciplines, stich as technology, mili- 
tary strategy, etc., which are concerned with what 
he called logical action (see 1916, vol. 1, especially 
chapters 1 and 12, which are concerned with de- 
fining this position). Focusing on the residually 
defined components, he attempted to integrate all 
these elements of action into his grand conception 
of the equilibrium of the total society as a social 
system—and of its disturbances. 

Pareto’s major point of reference in defining 
action was in certain respects very similar to that 
both of Max Weber and of the utilitarians. In com- 
parison with theirs, however, it was more residual 
and less directly determinate in its statements of 
problems. Pareto had no equivalent of the sharply 
defined problem of the status of “ideal factors” that 
Weber inherited from the idealist—historicist tradi- 
tion of German thought. Nor did he have the strong 
presumption of the “randomness of ends” that was 
so important in the utilitarian tradition and that 
became Durkheim’s major point of critical pur- 
chase. Pareto shared with many of his great pred- 
ecessors and contemporaries, particularly with 
the utilitarians, a kind of “Cartesian” background; 
that is, his essential paradigm for empirical scien- 
tific analysis was influenced by Cartesian episte- 
mology. But, as a framework for the analysis of 
social action, this paradigm involved two major 
complications. First, the “subject” was conceived 
by Pareto not only as “knower” but also as actor; 
second, this actor becomes the object of observa- 
tion by the social scientist. Thus there are in effect 
two paradigms—that of the action of observed 
objects and that of the action of the observer in 
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his relations to these objects—which must be dealt 
with simultaneously. 

Pareto’s approach to this problem was through 
the concept of logical action, which is essentially 
the limiting case where the actor is conceived of as 
a “good scientist,” whose choice of means for ends 
will, in fact, under specified conditions (with req- 
uisite probability), bring about the desired ends. 
As Pareto put it, the objective and the subjective 
ends will coincide in this case, so that the “theory” 
guiding the action can be said to meet the “logico- 
experimental” standard. To this he contrasted non- 
logical action, i.e., action that deviates from this 
standard in any way (1916, vol. 1, chapter 2). 

He repeatedly warned against the assumption 
that nonlogical is necessarily illogical. Further- 
more, he divided nonlogical action into two basic 
categories. The first category consists of such action 
as is determined by factors that are independent 
of its “subjective” aspects—most definitely, the 
needs, drives, and instincts of the organism. Had 
he taken into account the role of such factors 
alone, Pareto would have been a biophysical re- 
ductionist of a type familiar in Western intellectual 
history of this century. His second category of 
nonlogical action shows that he was no mere re- 
ductionist: it concerns action that is based on 
normative and cultural factors, insofar as these 
are not part of the logico-experimental knowledge 
attributed to the actor; the bases of commitment 
to the ends or goals of action are included among 
these factors, as well as the elements of cognitive 
and expressive culture—especially religious and 
ideological beliefs, patterns of ritual, and many 
types of expressive symbolism. 

“Theories” of action. Pareto followed a special 
procedure in going beyond this starting point in 
his formulations. Instead of subjecting the totality 
of social behavior to a formal analysis, he confined 
himself to analyzing the “theories” associated with 
it. In the case of logical action, he assumed that 
by definition overt action does in fact correspond 
to the “theory” behind it. But for nonlogical action, 
the relationship is directly problematical. Pareto 
chose to handle the problem pragmatically, treat- 
ing theories as indices of the forces determining 
social action, just as, for example, a thermometer 
reading is an index of the thermal state of the 
system with which the instrument is linked. Here 
Pareto’s analysis seems relatively simplistic com- 
pared to Durkheim's elaborate theorizing about the 
relationship between an actor's internalized orien- 
tations and the “exterior” constraints of his social 
environment. 

With respect to the Cartesian paradigm, Pareto 


was squarely on the subjective side, in contrast to 
Durkheim; he analyzed the “orientations” of actors, 
but not the objects in their situation or the inter- 
relations of actors as situational objects for each 
other, except insofar as their orientations are ob- 
jects to the social scientific observer. “Theories” in 
Pareto’s sense, then, are symbolic systems; in part, 
at least, they belong to what we would now call 
the cultural system, although Pareto himself never 
developed the systematic analytical distinction be- 
tween social and cultural systems. 

Types of nonlogical theories. Beginning with 
this point of reference, Pareto developed two 
cross-cutting, interdependent lines of distinction. 
The first distinction concerns the two ways in 
which theories can deviate from the standard of 
logico-experimental science. Theories may be dif- 
ferentiated into those which are “pseudoscientific” 
and those “which surpass experience” (see 1916; 
chapter 5 treats the former, chapter 4 the latter), 
The former are belief systems to which the scien- 
tific standard is applicable but which demonstrably 
fail to meet it. The latter are those to which the 
scientific standard does not apply, since their prop- 
ositions can be neither demonstrated nor refuted 
on the basis of scientific evidence (expérience— 
meaning empirical and often, though not always, 
experimental operations ). 

Since Pareto treated theories (what we would 
today call belief systems) as indices of the forces 
“determining social action,” one may assume that 
he considered the two classes of beliefs to be in- 
dices of different categories of forces. This indeed 
seems to be the case. In a broad, inexact way, the 
pseudoscientific theories reflect those forces which 
have been most emphasized by Pareto’s interpreters 
(Sorokin 1928; Homans & Curtis 1934), namely, 
needs and instincts at the level of individual psy- 
chology, the classical locus of the irrational de- 
terminants of behavior as they have been high- 
lighted in the present century. The second class, 
“theories which surpass experience,” points equally 
broadly to the cultural dimensions of human action 
or, in Weber's terms, to the “problems of meaning 
and their grounding in action orientations. These 
cultural dimensions, in turn, may be related to 
the vast realm of expressive symbolism in the arts, 
to the patterns of value orientation that have figured 
so prominently in the recent work of anthropolo- 
gists (e.g., Kluckhohn) and sociologists, and, gen- 
erally, to the normative components in social sys- 
tems, perhaps especially the legal levels. Pareto, uh 
a good classical humanist, was highly sensitive to 
the problems of these areas and was seriously con- 
cerned with subjecting them to scientific analysis. 
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Residues and derivations. The second basic line 
of distinction, that between residues and deriva- 
tions, has become better known than the distinc- 
tion between the two ways theories deviate from 
the standard of logico-experimental science. It is 
very important, but not very widely noted, that 
both residues and derivations are components of 
nonlogical theories, not of concrete behavior. The 
distinction, then, is simply that between the rela- 
tively constant and relatively variable elements 
that emerge when many such theories are induc- 
tively analyzed. Residues are “residual” in the 
simple sense that they constitute what is left over 
after the more variable elements have been ab- 
stracted from nonlogical theories. In a technical 
sense, this is the only strictly inductive part of 
Pareto’s scheme, since he subjected an immense 
mass of material to an early form of what we 
would now call content analysis. 

The most important connection between this 
dichotomy and the previous one lies in the fact 
that the category of residues is especially closely 
connected—inductively, Pareto would claim—with 
the category of “theories which surpass experi- 
ence.” The basis of this connection, Pareto would 
have it, is that central components of the residues 
constitute the “major premises” of such belief sys- 
tems (1916, vol. 1, chapter 9). For instance, one 
such component comprises the metaphysical as- 
sumptions on which the nonlogical, not necessarily 
illogical, “guides to action” are built. Indeed, con- 
trary to much of the Anglo-American intellectual 
tradition, Pareto seems not to have been concerned 
nearly so much with the discrepancies between 
what people say and what they do as with the 
variations in the intellectual grounding of what 
they say, particularly in the relations between the 
Scientific and nonscientific aspects of that ground- 
ing. Thus, in his discussion of both magical and 
religious ritual, he did not treat the problem of 
whether people in fact follow ritual prescription; 
he broadly assumed that they do. His problem was 
to account for the bases on which palpably non- 
logical action is considered important and mean- 
ingful. 

According to Pareto’s analytical method, the 
first step is to abstract derivations from non- 
logical theories. Derivations are the nonlogical 
devices of “argument” by which conclusions are 
a from the residues as premises. Pareto classi- 

them under four headings—simple assertion, 
appeal to authority, accord with sentiments, and 
Bi Soe proofs,” ie., direct pseudologic. The idea 
ie has often been compared with 
rationalizations. The main difference is 


that Pareto was not primarily concerned with the 
action of particular individuals, as was Freud, but 
with the currency of derivations as beliefs in a 
society. The difference of system reference is im- 
portant. 

At the core of the conceptual scheme is Pareto’s 
cross-tabulation of residues and derivations against 
the two types of nonlogical theories. His empirical 
investigations revealed that in the resulting four- 
fold table, attributes are not distributed randomly, 
that in fact there are two crowded cells: there is a 
strong correlation (which Pareto never expressed 
numerically) between “theories which surpass ex- 
perience” and residues, on the one hand, and “pseu- 
doscientific theories” and derivations, on the other. 
As a good empiricist, Pareto was careful not to 
close off any perspectives that might lend impor- 
tance to the other two cells, Nevertheless, he 
claimed that derivations are more closely linked 
with pseudoscience than with theories “surpassing 
experience” and that residues are related to the 
cultural dimension of human action. 

Society as a system. Pareto’s intention in work- 
ing with this crucial cross-classification was to de- 
velop a set of categories for delineating a social 
system—pre-eminently a total society—as a sys- 
tem and for understanding its processes. Indeed, 
as Henderson held, it may well be that Pareto’s 
greatest contribution to sociology is his use of the 
system concept (Henderson 1935). Pareto derived 
this concept from analytical mechanics and, before 
using it in sociology, applied it elaborately and, 
on the whole, successfully to economics. The con- 
cept of equilibrium is central to this kind of sys- 
tem analysis, and therefore Pareto has often been 
misunderstood as holding to a “static” conception 
of social phenomena. Actually, he was very careful 
to protect himself on this point by distinguishing 
between static, dynamic, and “moving” equilibria, 
and by allowing explicitly for structural change 
([1916] 1963, vol. 4, chapter 12, pp. 1433-1456). 
It is of course true that he used a natural science 
model and not, for instance, a “dialectic” one. 
However, the relevant critical comment is not that 
he misused the concepts of system and equilibrium 
but that his scheme can be substantially improved 
with newer theoretical resources. These include 
both more refined formulations of the social sub- 
ject matter itself and more detailed general scien- 
tific conceptions of systems, such as homeostasis 
(as developed in physiology), and, more recently, 
cybernetic control. 

Ophelimity and utility. Pareto’s most general 
and most notable statement about the interrela- 
tions of logical and nonlogical action and about 
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their relations to the social system is to be found 
in his discussion of the theory of social utility. He 
began with the conception generally known in eco- 
nomics as the “doctrine of maximum satisfaction.” 
Coining the term ophelimity to designate the eco- 
nomic aspect of satisfaction, as distinguished from 
the broader sociological aspect for which he re- 
served the term utility, Pareto stated the funda- 
mental proposition that ophelimity can be treated 
only distributively, as an aggregate of the satisfac- 
tions of individual actors in the system. Then he 
distinguished two types of change in an economy 
—one that affects all actors in the same direction 
and one that improves the economic situations of 
some while it injures those of others. Only the first 
type of change can be justified on economic grounds 
that are also scientific. Here Pareto presented the 
most important early formulation of the conception 
of the limitations of welfare economics (see [1916] 
1963, vol. 4, chapter 12, pp. 1457-1500). Unfortu- 
nately, most economists (cf. Arrow 1951; Boulding 
1956) have not followed Pareto’s example. They 
have accepted his view that ophelimity, as he called 
it, is a concept which assumes the incomparability 
of wants as between individuals, but they have not 
followed up his further view that utility (in Pareto’s 
sense) is a basis of establishing such comparability 
on two levels—that of distributive problems and 
that of the “welfare” of the system as a whole. 

Welfare for and of the collectivity. Pareto re- 
ferred to ophelimity as being for the collectivity, 
maintaining that a scientific judgment of the wel- 
fare of the collectivity or system (i.e., its welfare 
as a unity) has no meaning in economic terms. 
In “social” terms, however, he asserted that utility 
—as distinguished from ophelimity—is both for 
and of the collectivity. Both require, as we would 
now say, bases of integration which transcend the 
level of the economic interests of units. Pareto’s dis- 
cussion of utility in the distributive sense touches 
essentially the same considerations as Durkheim's 
treatment of the emergence of organic solidarity 
through the institution of contract. The second 
sense he attributed to utility concerns the society’s 
treatment as a total unit and hence the status of 
system members from the viewpoint of their con- 
tributions to the collective whole. In these formula- 
tions, Pareto sharply emphasized both the impor- 
tance of problems of integration at societal levels, 
in terms of values as well as norms, and the fact 
that economics can never be a general science of 
society precisely because it cannot deal with such 
problems. In these respects Pareto converged most 
significantly with his great contemporaries Durk- 
heim and Weber in clarifying the main focus of 
theoretical concern for sociology. 


The “foxes” and the “lions.” The core of Pareto’s 
theoretical interest—and the vital part of his con- 
tribution—was to develop the concept of “theories 
which surpass experience” and to relate them to 
residues rather than to the noncultural (partic. 
ularly psychological) factors impinging on social 
organization. 

His long discussion of the place of residues, der- 
ivations, and their interdependent relationships 
with other elements of action in the equilibrium of 
the social system was an effort to delineate this 
special interest. Here, for purposes of detailed dis- 
cussion, Pareto confined himself to two of his six 
classes of residues—the “instinct of combinations” 
and the “persistence of aggregates.” Very broadly, 
the first consists of the commitments or propensi- 
ties in social groups to adapt flexibly to environ- 
mental or situational exigencies, while the second 
consists of the proclivity in social groups to main- 
tain patterns of commitment once they have be- 
come institutionalized. The latter is something like 
what the present author has called the “pattern- 
maintenance” component of societies. 

Pareto made one of his most important empirical 
generalizations by combining this analytical dis- 
tinction with a conception of the elite element in 
social stratification systems. He confined himself 
to the simplest level of analysis of such systems, 
distinguishing only between elite groups, which 
combine control of great political power with the 
enjoyment of high prestige in various other re- 
spects, and the other groups which constitute the 
mass of the society and which have relatively little 
power, prestige, or wealth. He then developed the 
idea that the composition of the elites alternates 
cyclically—i.e., there is a “circulation of elites” be- 
tween those elements in a society more actuated by 
the “combinations” residues, or the “foxes,” and 
those more actuated by the “persistence” residues, 
or the “lions” ([1916] 1963, vol. 1, chapters 12 
and 13). Pareto showed that, in a political context, 
the lions’ commitment to belief systems and values 
is connected with a readiness to resort to force, 
while the foxes’ flexibility and adaptability mean 
that they are apt to have insufficient concern Wi 
the conditions of the stability of the political sys- 
tem in which they operate. During the Reformation, 
according to Pareto, the predominance of lions in 
the elite reached a high point, as witness the wats 
of religion. On the other hand, in the late nine 
teenth century in democratic countries, he oe 
tended to predominate in a way which contribute 
to the growing instability of that period. 

Circulation of elites. Pareto’s analysis of the 
circulation of elites led to a set of empirical gen 
eralizations. In modern terms, it is an analysis 
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of an important rhythm in the processes of change 
in dynamic societies (like that of the West, both 
ancient and modern), consisting of successive 
phases in which leadership is primarily in the 
hands of adaptive—innovative and then of conserv- 
ative-regressive groups. 

Pareto’s writings in this area bear reconsidera- 
tion in the light of the many developments of social 
science since those early years of the present cen- 
tury when his ideas took shape. Those who object 
to his political tendencies—who have dubbed him 
an “elitist”—tend to take exception to this aspect 
of his work above all, as well as to criticize his ac- 
ceptance of election to the Italian Senate early in 
the Mussolini regime. Politically, he certainly de- 
serves this disapproval and, to a certain extent, the 
label: he had indeed become highly skeptical of 
democratic idealism, particularly the kind that 
often borders on utopianism. But on the strictly 
scientific level, his theoretical procedures were con- 
servative in a positive sense. He used very simple, 
unexceptionable analytical distinctions and attempt- 
ed to proceed step by step from these to inductively 
based generalizations—mobilizing massive histor- 
ical evidence at each step. 


Assessment, Because of his time and his intel- 


lectual milieu, Pareto did not use the newer tech- 
niques of empirical research that have become so 
important for sociology; even his use of statistics 
was probably not as advanced as Durkheim's, and 
he conducted no specific empirical study compar- 
able to Suicide. (He was, however, an accomplished 
mathematician.) His important achievement for 
sociology was his modern, technical approach to 
the problem of general theory, an approach that 
was at once substantive and procedural, and, of 
course, the formulation of a long series of stimu- 
lating empirical generalizations. 
: Insofar as the substantive aspect of his theory 
is concerned, it is now apparent, to be sure, that his 
constant use of the residual method limited the 
usefulness of his more specific theoretic formula- 
tions. This is to say that important as his use of 
the concept of social system was, Pareto had only 
pragmatic criteria to define the boundaries of such 
a system in a theoretical sense; he had to “feel out” 
telations of interdependence. At the present stage 
of the development of sociological theory it has be- 
Come possible to formulate a complete set of con- 
ceptual components of such a system—of course, 
the formulations are subject to continual revision 
nena the points to look at for important “feed- 
ack relationships are defined in advance. Never- 
theless, Pareto’s formulations did highlight strate- 
oS Points for sociological analysis, and it would be 
warding to give them a systematic critical review 


in the light of subsequent developments. It may 
prove possible to reintroduce into sociological 
theory some of Pareto’s specific analyses of the 
various areas he treated residually. A pre-eminent 
candidate for serious reconsideration is his theory 
of social utility. 

Perhaps, however, Pareto’s most important con- 
tribution was, in a broad sense, procedural rather 
than substantive. His conception of scientific pro- 
cedure is based on his conviction of the central 
importance to science of the concept of system, 
From this point of departure he attempted to pro- 
ceed systematically: from the conception of the 
system in analytical mechanics, through that of 
system in economics, eventually to that of a total 
social system. His procedure follows the best tradi- 
tions of theory construction and, with all its sub- 
stantive limitations, can serve as an important 
model today. Most of the neglect of Pareto stems 
from the scientific limitations of subsequent gen- 
erations of sociologists rather than from his irrel- 
evance to their interests. 

TALCOTT PARSONS 


[For the historical context of Pareto’s contributions to 
sociology, see SocioLocy, article on THE DEVELOP- 
MENT OF SOCIOLOGICAL THOUGHT; and the biogra- 
phies of DESCARTES; DURKHEIM; Weser, Max; for 
discussion of the subsequent development of his 
ideas, see SYSTEMS ANALYSIS, article on SOCIAL 
SYSTEMS.] 
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PARK, ROBERT E. 


Robert Ezra Park (1864-1944), American soci- 
ologist, was a leading figure in the notable Univer- 
sity of Chicago “school of sociology.” He was born 
in Harveyville, Pennsylvania, but soon thereafter 
his family moved to Red Wing, Minnesota, and 
Park grew up in what Hamlin Garland called the 
“Middle Border,” where there was still much unset- 
tled land and where towns were new and few. He 
left home after graduating from the local high 
school, and after a year at the University of Min- 
nesota entered the University of Michigan, from 
which he received a bachelor of philosophy degree 
in 1887. John Dewey, his chief instructor at Michi- 
gan, intervened decisively in Park’s intellectual life 
by introducing him to Franklin Ford, a man with 
an original cast of mind. Ford and Park planned 
a new kind of newspaper, to be called The Thought 
News, which would register movements of public 
opinion in some exact manner, just as the market 
Price in the stock market expresses quantitatively 
the dispositions of individuals to buy or sell. The 
proposal was never carried out, mainly because 
techniques of polling and survey analysis were still 
rudimentary and did not command the financial 
support they enjoy today. In this respect Park was 
ahead of his times, but his imaginative view of 


public opinion as a measurable phenomenon found 
an outlet in his later writings and was an inspira- 
tion to the pioneers of survey analysis in the 1940s, 

In 1894 Park married Clara Cahill, daughter of 
a leading Michigan lawyer. From 1887 until 1898 
he was a reporter on daily newspapers in Minne- 
apolis, Detroit, Denver, New York, and Chicago, In 
those days newspapers could not draw upon syn- 
dicated material to balance the volume of Sunday 
advertising, and reporters were expected to supply 
columns of human interest and local color, Park 
was therefore constantly on the prowl for news and 
feature stories, and throughout his life the city 
remained for him a laboratory for discovering hu- 
man nature. 

In a brief “Autobiographical Note” (Collected 
Papers, vol. 1, pp. v-ix), Park recalled that he had 
been impressed with William James’s essay “On a 
Certain Blindness in Human Beings,” in which 
James spoke of “the personal secret” which makes 
life boring to one person and zestful to another. 
Park sought the “secret”; he interpreted his own 
intellectual life as an alternate absorption with the 
realities of social life and with the problem of de- 
scribing social types and processes in terms appli- 
cable to human association everywhere. Thus, he 
left the newspaper world to enter Harvard and 
earned his M.A. there in 1899, studying psychology 
with Münsterberg and philosophy with Royce and 
James. He then took his family to Germany, where 
he studied with Simmel—his only instructor in s0- 
ciology—and with Windelband. He took his PH.D. 
in 1904 at Heidelberg with a dissertation on public 
opinion, Masse und Publikum. Returning to Har- 
vard he became for a year an assistant in philos- 
ophy. 

Park next accepted the position of secretary of 
the Congo Reform Association, which permitted 
him to escape again from academic life. This was 
the time of great popular outcry against Belgian 
atrocities, and Park wrote a series of exposés of 
colonial brutality in Everybody's Magazine (19064; 
1906b). 

At the suggestion of the American Negro leader 
Booker T. Washington, Park began to familiarize 
himself with the condition of the Negroes in the 
southern states. So began years of exploring the 
lives of American Negroes. In his view, action 
alone would not solve “the race problem” and make 
emancipation a reality; increased understanding 
was also required. Park traveled to Europe with 
Washington, and he wrote most of Washington S 
book, The Man Farthest Down. Later he came t0 
see the mode of life of southern whites and Negroes 
as an instance of a universal historic process where 


by human nature and human communities take 
form through conflict; conflict is followed by ac- 
commodation that produces a caste society with 
de facto division of labor; and the caste structure 
eventually changes into one of classes (Park & 
Burgess 1921). 

Turning once more to scholarly pursuits, Park 
in 1914, at the age of 50, joined Albion W. Small 
and W. I. Thomas in the department of sociology 
at the University of Chicago, where he remained 
until he retired in 1929. Between 1929 and 1932 
he visited the world’s racial frontiers in South 
Africa, India, Malaya, and Brazil, and was a guest 
professor at the University of Hawaii as well as 
Yenching University in Peking. From 1936 until 
his death, he lived and lectured at Fisk University, 
a Negro institution in Nashville, Tennessee. 

Sociological theory. Seven years after he joined 
the sociology department at the University of Chi- 
cago, Park, with Ernest W. Burgess, brought out 
a major work, Introduction to the Science of So- 
ciology (1921). In brief, the book develops con- 
cepts for the analysis of group life. It presents a 
wide variety of historical, philosophical, scientific, 
and literary documents; relevant general and the- 
oretical formulations; and suggestions for empir- 
ical study. The illuminating combination of field 
observations of all sorts with a theoretical scheme 
induced from them had also characterized Thomas 
and Znaniecki’s The Polish Peasant in Europe and 
America, published shortly before. 

The Introduction begins with a discussion of hu- 
man nature, in which personality is described as 
the development in the human organism of self- 
consciousness and a moral nature, a process pos- 
sible only in group life. Next, Park and Burgess 
Considered the dominant social process, interac- 
tion. It is social attitudes (tendencies to act) that, 
ultimately, interact and constitute the elemental 
social forces: “They present us human motives in 
the only form in which we can know them objec- 
tively, namely, as behavior. Human motives become 
Social forces only so far as they are communicable, 
only when they are communicated. . . . The clearest 
way to think of attitudes is as behavior patterns or 
units of behavior” (1921, pp. 438-439). Interac- 
tion takes four typical forms: competition, conflict, 
accommodation, and assimilation: “The commu- 
nity and the natural order within the limits of the 
community ... are an effect of competition. Social 
Control, and the mutual subordination of individual 
Peers to the community have their origin in 

nflict, assume definite organized forms in the 
Process of accommodation, and are consolidated 
and fixed in assimilation” (p. 785). Social forces 
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may be informal (for example, custom, folkways, 
mores, public opinion) or institutionalized (law, 
the judiciary, the state), both kinds making for 
conformity to a common purpose; yet these forces 
also “are constantly re-creating the old order, mak- 
ing new heroes, overthrowing old gods, creating 
new myths, and imposing new ideals” (p. 35). 
Under the rubric collective behavior, the authors 
presented the behavior of individuals influenced by 
common (collective) impulses, as when following 
fashions or participating in crowds, sects, or mass 
movements. 

Park and Burgess found it necessary to warn 
students that the study of society cannot be both 
scientific and moralistic. The preceding generation 
of sociologists, many of whom had close ties to 
the Protestant clergy, had not always clearly dis- 
tinguished the description and analysis of social 
life from the normative view of it. Park and Bur- 
gess, therefore, declared at the outset: 


The first thing that students in sociology need to learn 
is to observe and record their own observations . . . 
to get facts rather than formulate opinions. The most 
important facts that sociologists have to deal with are 
opinions (attitudes and sentiments ), but until students 
learn to deal with opinions as the biologists deal with 
organisms, that is, to dissect them—reduce them to 
their component elements, describe them, and define 
the situation (environment) to which they are a re- 
sponse—we must not expect very great progress in 
sociological science. (1921, pp. v—vi) 


The first chapter of the Introduction contains 
what is perhaps still the best historical exposition 
of sociology and its relation to the other social 
sciences: 


Sociology . . . may be described as the science of col- 
lective behavior. . . . Historically [it] . . . had its origin 
in [the study of] history. History has been and is the 
great mother science of all the social sciences. . . . 
Anthropology, ethnology, folklore and archeology have 
grown up largely, if not wholly, to complete the task 
which history began and answer the questions which 
historical investigation first raised. . . . In the same 
sense that history is the concrete, sociology is the 
abstract, science of human experience and human 
nature. On the other hand, the technical (applied) 
social sciences, that is, politics, education, social serv- 
ice and economics—so far as economics may be re- 
garded as the science of business—are related to 
sociology in a different way. They are, to a greater 
or lesser extent, applications of principles which it is 
the business of sociology and psychology to deal with 
explicitly. In so far as this is true, sociology may be 
regarded as fundamental to the other social sciences. 
(1921, Introduction) 


For years the Introduction had few effective com- 
petitors in university classes all over the United 
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States. It was a sophisticated work that drew lib- 
erally from European thought and held no hint of 
provincialism; although it must have been a diffi- 
cult text for many students, its quality was appre- 
ciated. 

Other works. In his next book, The Immigrant 
Press and Its Control (1922), Park again used the 
method of presenting cases followed by theoretical 
statements. He interpreted the foreign-language 
press—whose prevalence then was considered by 
many to be alarming—as an instrument of even- 
tual assimilation. (A companion volume written by 
Thomas, but signed by Park and Herbert A. Miller, 
Old World Traits Transplanted, had been published 
in 1921.) 

Thereafter, Park’s writings were wide-ranging: 
he wrote articles, lectures, book reviews, and pref- 
atory notes to books written—for the most part— 
by his students. These have all been published in 
three volumes edited by Everett C. Hughes, Jitsuichi 
Masuoka, and others (see Collected Papers). Park 
followed his divers interests throughout his career, 
and the classification of his collected papers is top- 
ical, not chronological. Volume 1, Race and Culture, 
contains speculations on such subjects as race rela- 
tions, racial frontiers and attitudes, migrations, and 
the “marginal man”—the name Park gave to the 
man who moves in more than one social world and 
is not completely at home in any. His most famous 
student in this field was E. Franklin Frazier, who 
wrote authoritatively on the American Negro and 
on racial and cultural contacts in the modern 
world. Other prominent students in this field in- 
clude Charles S. Johnson, who studied the planta- 
tion Negro; Edgar T. Thompson, who published 
extensive studies of plantation and migratory labor; 
W. O. Brown, who became an authority on African 
nationalism; Louis Wirth, who began his career 
with an analysis of the ghetto and went on to play 
an active part in improving American race rela- 
tions; and Everett C. Hughes, who studied cultural 
conflict in French Canada and in industrial race 
relations. 

Volume 2, Human Communities, contains Park’s 
writings on the city and on human ecology. Park 
and his student R. D, McKenzie coined the term 
“human ecology,” and Park taught a course of that 
name in 1926. His students explored plant and an- 
imal ecology to appreciate the meaning of such bor- 
rowed concepts as symbiosis, invasion, succession, 
dominance, gradients of growth, superordination, 
and subordination. Park had used the concept of 
mobility as early as 1916 (see Collected Papers, 
vol. 2, pp. 13-51) to designate, for one thing, a 


process of changes in status, whether up or down; 
later he arrived at the notion of using Position, 
which is readily observable and measurable, as an 
index of status, which is more subtle: 


A person is simply an individual who has somewhere, 
in some society, social status; but status turns out 
finally to be a matter of distance—social distance. It 
is because geography, occupation, and all the other 
factors which determine the distribution of population 
determine so irresistibly and fatally the place, the 
group and the associates with whom each one of us is 
bound to live that spatial relations come to have, for 
the study of society and human nature, the importance 
which they do. It is because social relations are so 
frequently and so inevitably correlated with spatial 
relations: because physical distances so frequently are, 
or seem to be, the indexes of social distances, that 
Statistics have any significance whatever for sociology. 
(Collected Papers, vol. 2, p. 177) 


These last phrases are acknowledged in the produc- 
tive work of demographers and of those who con- 
duct opinion surveys. 

Park’s students in ecology were an able set of 
scholars. Wirth incorporated ecological ideas in his 
study of the ghetto. McKenzie published articles on 
the metropolitan community and world ecology in 
which he considered, among other things, the com- 
petition between one currency and another and 
between one language and another. A very strik- 
ing work was Clifford Shaw’s formulation of gra- 
dients of juvenile delinquency, interpreted as in- 
dexes of disorganized community life. Harvey W. 
Zorbaugh’s The Gold Coast and the Slum and Fred- 
eric M. Thrashers The Gang are minor classics. 
Ernest R. Mowrer and Ruth Shonle Cavan used 
spot maps of Chicago’s divorces and suicides to 
show that spatial distributions correspond to moral 
communities. 

The third volume, Society, presents Park’s wit 
ings on collective behavior. Just as Adam Smith’s 
economic man is a product of the division of labor, 
so social man develops through language and & 
conscious and purposeful sharing in a common life, 
that is, through social control. In an environment 
of custom, tradition, institutions, etc., the individ- 
ual becomes social—a person—and acquires status. 
Park maintained that the typical progression of the 
conflict group is from expressive behavior to polit- 
ical action; his ideas on this subject appeared in 
the introductions to books by his students—t0 
Pauline Young’s book on the communistic sect, i 
E. T. Hiller’s on the strike, and to Lyford Edwards 
on revolution. a the 

Volume 3 also includes essays on news and © ; 
newspapers. Park suggested that there is a contin 


uum of states of being informed, from unordered 
and intuitive “acquaintance with” to systematic, 
rational “knowledge about’—the phrases are bor- 
rowed from William James—and placed the news 
somewhere between the extremes, sharing qualities 
of both history and popular literature. “The Natural 
History of the Newspaper” (Collected Papers, vol. 3, 
pp. 89-104) describes how the readers’ demands 
for “news interest” transferred the power of the 
press from the editor to the reporter of news. Sev- 
eral of the essays on news and public opinion were 
written in the first years of World War n, stimu- 
lated by the urgency that the war, the most dra- 
matic of all collective enterprises, gave to opinion, 
intelligence, propaganda, and morale. Many of the 
ideas on social control expressed in these articles, 
particularly social control as seen in custom and 
institutions in time of change, were applied by 
Robert Redfield to folk society. 


Park stands out as a gifted writer in a field in 
which happy turns of phrase are not common. He 
was well-read, urbane, and precise. The allegation 
that he opposed statistics is unwarranted: he en- 
couraged quantitative description whenever it was 
convenient and would enlarge understanding, but 
he refused to limit his scientific curiosity to prob- 
lems which could only be dealt with by a single 
method. 

HELEN MACGILL HUGHES 


[For the historical context of Park’s work, see EcoL- 
oGy, article on HUMAN ECOLOGY; and the biographies 
of BURGESS; JAMES; MUNSTERBERG; SMALL; THOM- 
As. For discussion of the subsequent development 
of Park’s ideas, see ASSIMILATION; CITY, article on 
COMPARATIVE URBAN STRUCTURE; RACE RELATIONS; 
and the biographies of FRAZIER; REDFIELD; Wrta] 
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PARLIAMENTARY GOVERNMENT 


Parliamentary government, or cabinet govern- 
ment, is the form of constitutional democracy in 
which executive authority emerges from, and is 
responsible to, legislative authority. It differs from 
the arrangement of independently elected execu- 
tive and legislative agencies found in the United 
States. Developed in western Europe and particu- 
larly in Great Britain, parliamentary government 
provides the pattern usually assumed by demo- 
cratic experiments in eastern Europe, Asia, and 
Africa. In common usage the term “parliamentary 
government” is reserved for those political systems 
that not only are parliamentary but are based on 
free and competitive elections. This excludes one- 
party dictatorships exercising power within a 
formal parliamentary structure. 

The essential union of executive and legislative 
branches is accompanied by the constitutional 
principle that the legislative body, or parliament, is 
supreme. Usually the principal executive, the prime 
minister, is appointed by a monarchical or presi- 
dential head of state. The prime minister, in turn, 
chooses the executive heads of government depart- 
ments, the most important of whom are in the 
prime minister’s cabinet. Both the prime minister 
and his cabinet, known together as the govern- 
ment, are ordinarily members of parliament. They 
hold ministerial office only as long as they have 
majority support in parliament. In a bicameral 
legislature, this requirement usually means ma- 
jority support in the more popularly elected house. 
Occasionally a government may be made responsi- 
ble to both houses. In any case, the rule of continu- 
ous legislative confidence is regularly demonstrated 
in the government's submission of its program and 
record for parliamentary approval. A defeat for the 
government through an adverse legislative vote, on 
a plainly important issue, indicates a lack of con- 
fidence requiring the government either to resign 
or to attempt, by means of a general election, to 
secure a new parliamentary majority. A govern- 
ment can stay in office only temporarily without 
parliamentary support for its policies. Stalemate 
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between an executive of one persuasion and a legis- 
lature of another, as occurs with the American 
system of separated powers, is meant to be impos- 
sible in the parliamentary system. Instability of 
executive authority, on the other hand, is entirely 
possible. The accepted way to avoid it is by the 
development in parliament of a strong partisan ma- 
jority prepared to support a prime minister and his 
cabinet during the several years between parlia- 
mentary elections. The executive authority then be- 
comes the effective policy maker. 


History 


More than most working governmental forms, 
the parliamentary system is not so much an inven- 
tion as it is an evolutionary product. Its essential 
union of executive and legislative authority is not 
simply a deliberate constitutional design. It is more 
significantly the result of the process by which 
representative assemblies successfully challenged 
monarchs in the course of modern history. This 
process, it must be said, was European, although 
attempts have been made to transfer its result to 
other parts of the world. It can even be argued that 
the process was characteristically British rather 
than European, and that, therefore, parliamentary 
regimes in continental Europe represented institu- 
tional transfers. 

The outstanding feature of the historical experi- 
ence was the evolution of parliament from a 
monarch’s council to a supremacy of its own, As- 
sembled originally, as early as the medieval period, 
to provide advice and especially to give financial 
support to the monarch, parliament became in 
modern history the means by which first an estab- 
lished landowning oligarchy, then a commercial 
class, and finally representatives of the bulk of the 
population secured control of the machinery of 
government. The development was a long one, cov- 
ering three to five centuries, and it coincided with 
what now seems, by comparison with new nations, 
a most gradual change in European society. In 
particular, parliamentary government developed 
where there was a substantial historical interval of 
capitalist, middle-class ascendancy between the era 
of dominance by court and nobility and the era of 
mass democracy. Especially in the nineteenth cen- 
tury, parliament seemed to be the agency of the 
substantial middle class produced by commercial 
and industrial capitalism. 

In Britain, Parliament’s supremacy over the 
monarch dates from 1688, when Parliament as- 
serted its authority to determine the monarchical 
succession. This authority was made effective by the 
eighteenth-century development of a cabinet in 


which those who were nominally ministers of the 
crown became responsible in fact to Parliament, 
The monarch, losing control of his ministers, ceased 
to be the effective executive. In an increasingly 
rationalist and subsequently democratic age, the 
hereditary principle did not provide a likely basis 
for independent executive authority. The claims of 
a representative body were not successfully resisted 
by a monarch, Where the monarch resisted too 
stubbornly, he was likely to be dethroned in favor 
of another monarch or of a president. The latter, 
even if not popularly elected, might be a stronger 
claimant to independent executive authority, but 
usually his powers were cast in the mold of a con- 
stitutional monarch’s. The presidential innovation 
was a late one in the parliamentary system; the 
French only introduced the presidency after 1870. 
This change may make the parliamentary order 
less smoothly evolutionary, as indeed was the case 
in France, than the retention of the monarchy 
while its power is reduced. The parliamentary sys- 
tem with a president has been successful in a few 
nations and has been introduced in many new 
nations. [See MonaRcny.] 

Parliamentary government developed its more 
essential features in Britain before the emergence 
of mass democracy. The larger part of the British 
population did not even have the right to vote until 
about the last quarter of the nineteenth century, 
by which time parliamentary government, includ- 
ing the cabinet system, was well established. Else- 
where, however, the establishment of parliamen- 
tary institutions and the achievement of universal 
suffrage more nearly coincided. The results in such 
instances were not always so favorable for the sta- 
bility of the governmental system as the British 
phasing seems to have been. The evolutionary 
length of British experience bears emphasis be- 
cause it may not necessarily be typical of nations 
attempting to practice parliamentary government. 
This raises the important general question whether 
parliamentary government can be dissociated from 
conditions chiefly typical of Britain, the British- 
settled territories, and those smaller Continental 
nations which closely resemble them. 


Changing conceptions and institutions 

The parliamentary system has been differently 
conceived, not only over time but also from coun- 
try to country. The view that the parliamentary 
body itself, rather than the cabinet, was the effec- 
tive policy-making authority remained a pee 
French conception, in the form of “governam at 
assembly,” long after it had lost meaning iN T 
British system. By the middle of the nineteen! 


century, often viewed as the classical period of 
parliamentary government, the British cabinet had 
assumed a central importance. This was reflected 
in Bagehot’s famous appraisal (1865-1867). Still, 
Bagehot did not move Parliament off stage. The 
cabinet, although it exercised leadership, remained 
an agency of Parliament in the sense that the 
House of Commons decided whether to turn out a 
government after discussion of its policy. Bagehot 
regarded this elective function, rather than the 
legislative function, as the most important one that 
Parliament performed, In this way, Parliament re- 
mained the locus of power despite the cabinet’s 
admittedly crucial role in the whole system. 

In the twentieth century, however, the original 
British model changed as the cabinet increasingly 
had the support of a cohesive majority and thus 
stayed in office from one general election to the 
next. The British Parliament was no longer ex- 
pected to exercise its power to dismiss a govern- 
ment. The elective function was transferred from 
the Commons to the general public. Parliament re- 
mained to register the electorate’s decision as to 
which party's leadership was to form the cabinet, 
but parliamentary debates lost the impact they had 
formerly had on the life of the government. Accom- 
panying the increasingly direct relation of the cabi- 
net to the electorate was a strengthening of the 
prime minister’s role. As the leader of a majority 
party, he has come, in effect, to be chosen as a 
chief executive when voters elect parliamentary 
representatives of his party. He bears individually 
a responsibility to the country. His cabinet has 
tended to become more a changing team of min- 
isters carrying out the leaders program than a 
genuinely collegial policy-making body. The effect 
of this tendency, along with that which has made 
the government more directly responsible to the 
electorate than to Parliament, has been to give the 
British system an appearance that is less parlia- 
mentary and more presidential. It might even seem 
more presidential than the American system, since 
the prime minister is less restrained by a parlia- 
ment, in which his party has a cohesive majority, 
than is a president of the United States by the 
Congress. On the other hand, the prime minister's 
Cohesive majority, ordinarily so supportive, may de- 
cide to displace him in admittedly rare but impor- 
fant circumstances. Moreover, conceiving of the 
Parliamentary system in near-presidential terms is 
also somewhat hazardous in that a cohesive ma- 
ae party may not always exist as it has in 
fe in during the middle years of the twentieth 

ntury [see MAJORITY RULE]. 

Yet the tendencies that have changed parlia- 
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mentary government in Britain are observable elses 
where, and usually in ‘association with a strong 
parliamentary party possessing. a majority or a 
near-majority of seats. An important case in*point 
is the working of the West German syStem after 
World War u. Here the chancellor, as a German 

counterpart of the British prime minister, estab- 
lished an ascendancy based on his party leadership 

and his popularity with the electorate. Most other 

Continental nations in the parliamentary mold 

have not developed strong executive leadership to 
the same degree as West Germany; in the Fourth 

French Republic, parliamentary government broke 

down in the absence of a stable cabinet system. 

The smaller European nations have also been suc- 

cessful in strengthening their cabinets while re- 
taining the parliamentary system. The English- 

speaking nations of the overseas Commonwealth 

even more closely resemble the British pattern. 

Cohesion and party government. Political par- 
ties have not always provided a popular base for 
executive leadership. Earlier they were simply 
groups of representatives tending loosely to sup- 
port ministers or potential ministers. Although they 
began to have followings in the electorate before 
mass enfranchisement, modern, large-scale organ- 
ization of political parties dates only from the last 
decades of the nineteenth century. So does the 
crucial cohesiveness of the parliamentary party in 
support of its leadership. Such support represents 
a commitment to the electorate, and this commit- 
ment makes it possible for parliamentary govern- 
ment to be effective party government. 

The vital party is the one in parliament, rather 
than the extraparliamentary, mass-membership or- 
ganization. The latter may be divided in its support 
of a leadership and its policy, but as long as the 
parliamentary party is able to unite, it serves as the 
base of stable executive authority. The parliamen- 
tary party may be influenced by the external organ- 
ization and its apparatus, but decisions are made 
by the publicly elected members of the parliamen- 
tary party, who ordinarily support their leadership. 
This is a residual aspect of parliamentary suprem- 
acy, transferred from its operation in the body as 
a whole to operation within a party. As such, it has 
been strong enough in Britain and in other parlia- 
mentary nations to resist the pressures, especially 
of social-democratic political movements, to make 
parliamentary parties into agents of an outside, 
dues-paying, activist membership. Members of par- 
liament feel responsible to their party's broader 
electorate, not just to active party workers. This is 
consistent with the willing acceptance by parlia- 
mentary members of the policy positions of their 
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leaders, who also regard themselves as directly 
responsible to the broader electorate. 

Coalition and competition. Party cohesion alone 
is not the key to the achievement of executive sta- 
bility in the parliamentary system. Equally im- 
portant is the presence of a party, or perhaps a 
combination of parties, commanding a majority 
in the legislative body. The simplest case, barring 
the noncompetitive one-party arrangement, is the 
two-party competition that largely characterizes 
British parliamentary politics. Given only two 
major parties, the probability is high that one party 
will have a parliamentary majority, even a com- 
fortable working majority. This regularly happened 
in Britain during the three decades following 1931. 
Minor parties, as opposed to a third party or other 
substantial parties in a multiparty system, do not 
ordinarily win enough seats to reduce one of the 
large parties to only plurality strength. Even a 
third party, or any number of other competing 
parties, would not necessarily prevent one party 
from gaining a majority. The Christian Democratic 
parties of Germany and Italy secured narrow and 
fairly brief majorities in such multiparty circum- 
stances during the post-World War 11 years. How- 
ever, absent from these unusual successes was the 
other ingredient of the British model of party gov- 
ernment: a single opposition party large enough to 
be a potential majority and so an alternative gov- 
erning party, to the point of providing a leadership 
core of potential ministers called the “shadow 
cabinet.” 

Despite the practical success of the two-party 
model, it is not necessarily an essential feature of 
the parliamentary system. The British themselves 
had a three-party system, often without a Majority 
party, as recently as the 1920s, and they may have 
it again. Moreover, there are other nations that 
have almost consistently had more than two im- 
portant parties and yet maintained parliamentary 
government. The leading examples, it is true, are 
the Scandinavian countries and a few English- 
speaking nations. Their particular multiparty sys- 
tems have often produced a majority or near- 
majority party and have seldom been as fragmented 
as the French system, which provides the main 
instance of the difficulty of maintaining parlia- 
mentary government in a multiparty system. 

Without a majority party, various means are 
used to try to secure majority legislative support, 
as required by parliamentary government. A large, 
but not always the largest, plurality party can form 
a cabinet of its own members while seeking the 
votes of another party (or of other Parties) so as 
to produce a working majority. This was British 


practice in the 1920s. Elsewhere, it has been usual 
to form a cabinet that is itself a coalition repre- 
senting enough parties for a parliamentary ma- 
jority. Such a coalition might be headed by a prime 
minister acting as a leader either of the largest 
single party or of a smaller but key center party. 
The task of achieving stability is likely to be facili- 
tated if one party is fairly near a majority on its 
own and can therefore manage with one or two 
minor coalition partners. A broader coalition is also 
feasible, but if it includes all or most of the large 
parties, the effect is to eliminate the possibility of 
an opposition presenting itself as an alternative 
government. The very broad coalition, therefore, 
has usually been regarded as suited only to war- 
time or other special circumstances, when it is 
deemed appropriate in two-party as well as in 
multiparty systems. 

Special circumstances, however, can become in- 
stitutionalized, as in Austria after World War 1, 
Regularly the two major Austrian parties united 
in a coalition, excluding minor parties, and yet 
fought competitive elections against each other. 
In this novel arrangement, the chief function of 
elections was to decide which of the two parties 
would increase its share of cabinet positions, not 
to decide which of the two should form a cabinet. 
Elections may serve a similarly limited purpose in 
a more clearly multiparty nation when a coalition 
cabinet is established over a fairly broad political 
spectrum, embracing perhaps two-thirds but not 
all of a parliament. In this case the voters, by 
electing more or fewer representatives of a given 
party, help determine the relative strength of the 
several parties regularly composing the cabinet. 
It means a more limited form of electoral compê- 
tition than that between two parties, or two groups 
of parties, each contending to form a government 
of its own. But it is competition nonetheless, and 
of a kind that seems relevant where clear-cut al- 
ternatives are simply not available. The result is 
an operation of parliamentary government differ- 
ent from the standard British method, but it is not 
incompatible with democracy. 

The same cannot be said so surely for those 
systems in which there is one-party done 
Much depends on both the degree and the metho 
of domination, Clearly, when one party has a leg 
monopoly, as in communist states, there can be no 
parliamentary government in the Western ay 
But where party competition is legal, even if i 
cially discouraged, the problem is more elusive. A 
is conceivable that a nation, soon after indepen 
ence, for instance, could regard as legitimate Nees 
the party that led the national independence mo 


ment, This seems to be the case, in varying degrees, 
in Africa and Asia during the immediate post- 
imperial period. Parties other than the governing 
party often have too little support to furnish serious 
parliamentary opposition. Political competition 
takes place mainly within the one major party. 
Given freedom of expression, in and out of par- 
liament, and given free choice of parliamentary 
candidates at local party levels, intraparty com- 
petition can be substantial. But when, as often hap- 
pens, the leader of the single party regards open 
criticism as illegitimate, whether from within his 
party or from outside, the result tends to resemble 
the deliberately one-party dictatorships of com- 
munist nations. 

Another kind of difficulty about the role of the 
opposition arises when substantial competition 
comes from parties that are not in fact democratic, 
such as communist and fascist parties. Their oppo- 
sition raises the question of their legitimacy in any 
democratic system. This question stems from the 
assumption that such parties would, if in power, 
overturn the very parliamentary regime under 
which they had operated. This is exactly what the 
National Socialist party did to the Weimar Repub- 
lic, Even without actually coming to power, 2 
fascist or communist party can adversely affect the 
working of the parliamentary system by securing 
enough votes to become the principal opposition. 
The electorate then has no choice except to vote 
for the government, unless it is willing to support 
an opposition dedicated to a radical transformation 
of the democratic constitutional order. The French 
and Italian Communist parties came close to cre- 
ating such limiting alternatives after World War n. 

The executive and dissolution. Strengthening 
the cabinet and prime minister has changed par- 
liamentary government from its nineteenth-century 
character, but it has not violated the basic prin- 
ciple of executive responsibility to parliament. On 
the other hand, increasing the power of the head 
of state, monarchical or presidential, must be 
understood as a step away from parliamentary gov- 
ernment. An important and independent policy- 
making role for an elected president, for example, 
seems incompatible with the parliamentary system. 
The consequence of such an increase of power, as 
exemplified by the constitution of the Fifth French 
Republic and especially by the practices of Presi- 
dent de Gaulle, is not enough to create a full- 
fledged presidential system but is enough to produce 
a hybrid parliamentary system. The counterparlia- 
Mentary tendency is likely to be all the stronger 
a president, equipped with constitutional 

‘uthority, is popularly elected and so can claim a 
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popular mandate to rival parliament and its chosen 
cabinet. Such a president ceases to be the non- 
partisan, dignified head of state typified by a mod- 
ern constitutional monarch or by a president chosen 
by parliament to play the monarch’s role [see DELE- 
GATION OF POWERS]. 

Even a constitutional monarch, however, has a 
political function by virtue of his power to appoint 
the prime minister. In principle, this involves 
choosing a man whom a parliamentary majority 
would elect. And, in practice, the choice often falls 
automatically on the known leader of the majority 
party. Matters are more complicated if there is no 
clearly known leader of the majority party, perhaps 
because of a sudden death or resignation. Poten- 
tially still more complicated is the situation where 
there is no majority party. But even in this contin- 
gency, which regularly occurs in multiparty sys- 
tems, the head of state is effectively limited in his 
choice by the hard fact that the prime minister, in 
order to remain in office, must have the support of 
a parliamentary majority. For the head of state to 
insist on his own preference, rather than parlia- 
ment’s, would violate a principle of the system. 

The same applies to the power of dissolving par- 
liament. Technically, like the choice of prime min- 
ister, this is in the hands of the head of state. But 
for a head of state to dissolve parliament against 
the wishes of its prime minister and cabinet, or to 
refuse to dissolve when the prime minister advises 
him to dissolve, would interfere with the regular 
working of parliamentary government. Originally, 
as with so many other powers, dissolution was in 
fact the prerogative of the head of state, notably of 
the British monarch. But the exercises of authority 
over this important political matter is now the prime 
minister’s. In some systems, e.g., the Third French 
Republic, parliament retained the power to dissolve 
itself after a given number of years. This weakens 
the executive authority because the prime minister 
needs the power to dissolve parliament as a means 
of retaining the support of a majority. Members of 
the prime minister's majority, in this view, will be 
more likely to continue voting for him if they be- 
lieve that his parliamentary defeat might mean not 
his resignation but a new general election. Ordi- 
nary members will not want to risk their seats. 

This line of argument seems applicable in a 
multiparty parliament, where a prime minister 
might effectively threaten dissolution to keep coali- 
tion members in line. But the threat in relation to 
one’s own party seems irrelevant in a two-party 
multiparty parliament, where a prime minister 
unquestionably has the effective power to dissolve, 
but he does not use it as a means to discipline his 
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own followers. They generally have sufficient rea- 
sons to support him without the threat of dissolu- 
tion. Rather, the British prime minister arranges to 
dissolve Parliament at a time calculated to be of 
maximum party advantage—for him as well as his 
parliamentary followers. Therefore, the least desir- 
able occasion to dissolve would be just after a de- 
sertion by parliamentary members of his party. 
Going to the country with a divided party would 
only be to the advantage of the opposition. In sum- 
mary, the power of dissolution strengthens a prime 
minister’s position, as it is meant to do in a parlia- 
mentary system; but with a highly developed party 
structure, like Britain’s, it is used primarily to time 
a general election for party advantage and not for 
retaliation against a loss of parliamentary confi- 
dence, 

Individual members and the legislative function. 
Given the tendency of the parliamentary system, 
in its British form, to develop strong executive au- 
thority based on a cohesive party majority, it fol- 
lows that the legislature is not a policy-making 
body in the manner of the U.S. Congress—more or 
less coequal with the executive, Individual nonmin- 
isterial members of the House of Commons, for 
example, do not directly legislate, as do American 
congressmen when deciding whether to accept gov- 
ernmental proposals or to substitute proposals of 
their own. Members of the British Parliament may, 
especially in private party councils, influence what 
their leadership presents by way of policy, and they 
certainly question ministers (in a daily question 
hour) about policy particulars, in addition to de- 
bating policy generally. But they do not make pol- 
icy as members of a coordinate branch of govern- 
ment; they lack the legislative facilities to do so. 
British parliamentary committees are not independ- 
ent loci of power providing nonministerial members 
with opportunities to overturn the government's 
program. In this respect the British situation is 
extreme, since the House of Commons avoids offi- 
cial subject-matter committees altogether. Other 
parliamentary governments, even if otherwise in 
the British mold, do not usually go this far in 
guarding against a challenge to the theoretical 
supremacy of the whole house or to the practical 
supremacy of the government, whose authority 
rests on the confidence of a majority in the whole 
house. Nevertheless, wherever parliamentary gov- 
ernment has developed a strong and stable execu- 
tive, there is necessarily an important limitation 
on the exercise of independent policy making by 
the nonministerial membership. This means that 
parliament's public importance rests heavily on its 


public debates, but those may now fail to secure as 
much attention as is given party leaders on radio 
and television. 

Only where parliamentary government has not 
developed strong executive leadership do legislative 
activity and organization resemble the American 
congressional pattern. The extreme examples are 
provided by the Third and Fourth French republics, 
both of which had effective subject-matter commit- 
tees whose leaders could substitute their policies . 
for the government's. These committee leaders were 
often rivals of governmental leaders, who could be 
driven from office by adverse committee action and 
subsequent adverse parliamentary action. Respon- 
sibility, in this situation, is not firmly fixed in the 
cabinet, which usually has no stable majority in 
the legislature. Even on foreign policy questions, 
where greater executive authority has been usual 
in all systems, the French style of parliamentary 
government imposed limits on cabinet leadership. 
The absence of a coherent parliamentary majority, 
which enabled French representatives to play more 
active and more direct roles, is closely associated 
with the instability of governments in the Third 
and Fourth French republics. | 


Research and evaluation 


Parliamentary government, notably its British 
form, has long been carefully studied by historical 
and legal scholars. There are standard treatises 
describing the laws and customs—the constitution 
—of the British Parliament. Not all of the histori- 
cal and legal work has been by Englishmen, but it 
has generally expressed admiration for British in- 
stitutions. This was true of earlier Continental 
scholars, who hoped that their nations would emu- 
late the British system, and it was true also of 
late nineteenth-century and early twentieth-cen- 
tury American scholars, who saw much in the 
British system worth adapting to the United States. 
For instance, Woodrow Wilson admired the British 
system, in Walter Bagehot’s classic description, as 
superior to the American presidential-congressional 
system. Bagehot, it should be noted, was not a con- 
ventional historical or legal scholar but a most in- 
sightful journalist. Of a different order is the 
standard twentieth-century analysis by Sir Ivor 
Jennings. In his volumes Parliament (1939) and 
Cabinet Government (1936), Jennings uses 4 
wealth of historical material to illustrate the oper- 
ating constitutional principles. He presents these 
principles as greatly modified by democratic usages 
since Bagehot’s time. 

Since World War 1 there has been consider- 


able new research in the less formal elements of 
parliamentary government. The research, it is true, 
has not been aimed directly at understanding par- 
liamentary government as opposed to another form 
of government; but any effort to learn more gen- 
erally about legislative and executive behavior has 
added to the knowledge of parliamentary institu- 
tions, The new literature involves roll-call analysis, 
systematic accounts of backgrounds of legislators, 

effects of external party organizations, intraparty 
and intracabinet decision making, and the partici- 
pation by interest groups in the parliamentary 
process. The last of these subjects was hardly 
studied at all in most parliamentary democracies 
before the 1950s; indeed, until then interest groups, 
or pressure groups, tended to be regarded as only 
American political phenomena. Consequently, 
there was a significant gap in knowledge of even 
the otherwise much-studied British system. 

More can be learned by detailed studies of par- 
ticular elements in the established systems. But 
there is also a special need to understand, in com- 
parative perspective, the common factors associ- 
ated with stability and effectiveness of parliamen- 
tary government or with failure of such government. 
Western Europe and the English-speaking Com- 
monwealth nations provide the main laboratories, 
since their historical experience with parliamentary 
institutions is long and varied. From this experi- 
ence, there is some hope of developing hypotheses 
of more general applicability with respect to par- 
liamentary government in developing nations. For 
such hypotheses to have validity, however, they 
must also be examined in the environment of the 
developing nations themselves. This lies in the fu- 
ture, since so far there has been an understandable 
scholarly tendency to regard the new constitutional 
arrangements of developing nations as less durable 
and so less meaningful than the broader problems 
of nation-building and informal group processes. 

The evidence is not yet conclusive as to the 
adaptability of parliamentary government outside 
of its limited Western homelands. What is known, 
however, gives little cause for optimism, since no 
developing nation with a non-Western background 
has a long period of experience with parliamentary 
government. The record of parliamentary govern- 
Ment is somewhat discouraging even in the ad- 
vanced nations of western Europe. None of the 
three major Continental nations—France, Ger- 
many, and Italy—consistently maintained a par- 
liamentary system through the first six decades of 
this century, 

Yet the parliamentary system provided the pat- 
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tern for most new twentieth-century democratic 
governments. This held for the embryonic political 
institutions of the supranational European commu- 
nity and for the initial stages, at least, of constitu- 
tions in many non-Western nations. The new 
African and Asian nations emerging from British 
control generally adopted parliamentary constitu- 
tions without monarchs. So did several nations that 
had been under the rule of other imperial powers. 
Japan, the most developed of non-Western nations, 
also established a parliamentary system. 

Whether many of these nations, especially the 
new ones, would be able to remain for long within 
the rules of parliamentary government was uncer- 
tain despite the impressive scale of the Indian 
effort. French-speaking African nations, for exam- 
ple, just as they were becoming independent, 
tended to move away from the parliamentary form 
when France itself did so in changing from the 
Fourth Republic to the Fifth Republic. 

Maintaining a strong and effective executive, 
yet one responsible to a legislative body, has proved 
difficult, if not impossible, even in many European 
circumstances. And without such an executive— 
that is, without parliamentary government that is 
also cabinet government—the system seems un- 
able to cope with all of the domestic and foreign 
problems of a state in the modern world. 


Leon D. EPSTEIN 


[See also CONSTITUTIONS AND CONSTITUTIONALISM; 
GOVERNMENT; LOCAL GOVERNMENT; MONARCHY; 
PARTIES, POLITICAL. Other relevant material may be 
found in COALITIONS; DEMOCRACY; LEGISLATION; 
PRESIDENTIAL GOVERNMENT; and the biographies of 
BAGEHOT; BENTHAM; BURKE; DicEy.] 
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PARSONS, ELSIE CLEWS 


Elsie Worthington Clews Parsons (1875-1941), 
American anthropologist, was born in New York 
into a wealthy and socially prominent family, 
which both permitted and supported her intellec- 
tual endeavors. While her illustrious elder women 
colleagues, such as Zelia Nuttall, Alice Cunning- 
ham Fletcher, and Matilda Stevenson, like their 
male contemporaries, were largely self-trained and 
had institutional associations with museums rather 
than universities, Elsie Parsons was the first out- 
standing woman in anthropology whose career be- 
gan after formal instruction in the discipline be- 
came available. She insisted on entering the newly 
founded Barnard College—rather than enrolling in 
one of the established women’s colleges—and re- 
ceived her B.A, there in 1895. She entered gradu- 
ate studies in sociology at Columbia University, 
where her work was greatly influenced by Frank- 
lin H. Giddings. She received her M.A. in 1897 and 
her PH.D, in 1899, 

In 1900 she married Herbert Parsons, a success- 


ful New York attorney and, for part of his career, 
a congressman. While her children were still 
young, Elsie Parsons devoted herself to extensive 
writing on social problems. She allied herself early 
with the cause of feminism, but fundamentally 
and increasingly her concern was for the right to 
rational, individual expression, unconfined by such 
arbitrary limitations as those of sex, class, and race, 

Although her first writings reflect the influence 
of speculative historical concepts typical of nine- 
teenth-century sociology, they nevertheless give 
evidence of the perceptive insights and under- 
standings that were to typify her later works, Her 
views on post-Victorian society were set forth in 
1913 in The Old-fashioned Woman, a discussion 
of conventional behavior, including sex relations, 
which today seems innocuous but was sufficiently. 
controversial at the time to prompt her to publish 
it under the pseudonym John Main. This publica- 
tion was followed by others in a similar vein that 


appeared under her own name (Spier & Kroeber A 


1943, pp. 245-246; Reichard 1943, pp. 48-49). 
Elsie Parsons did not discover anthropology until 
she was about forty. The discovery came as a result 
of a trip to the American Southwest, where she 
became fascinated by the Pueblo peoples. Her sub- 
sequent search for guidance in the scientific, first 
hand study of cultures brought her into valuable 
intellectual association with Pliny E. Goddard and 
Franz Boas. Anthropology had come sufficiently of 
age to have developed empirical methods of in- 
vestigation for dealing with the kinds of problems 
that Parsons had wrestled with philosophically in 
her sociological studies in the 1890s, In 1915 she 
made her first field trip to Zuni and returned there 
in 1917 and 1918, when she also studied at La 
guna. In 1919 she worked at Laguna with Boas 
and until 1932 continued to make repeated trips 
to various pueblos, particularly Hopi, Acoma, 
the Tanoan-speaking towns, Ever conscious of the 
need for precision and detail, she produced studies 
that are not easy to read but that remain stand 
and invaluable references for students of the South- 
west, providing a solid methodological foundation 
for continuing research in the region. Although her 
interest centered on the Pueblos, she sought rela- 
tionships and cultural anetecedents among the 
neighboring Pima and in Oklahoma among the 
Caddo and Kiowa. These comparative studies have 
been criticized as limited in scope, in that Parsons 
was not familiar with the broader aspects of cul 
tural diffusion throughout North America. How 
ever, valid and useful syntheses of Pueblo datà 
did result from the tremendous collection 
detailed data which she amassed over the yeats 
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and on which, eventually, her Pueblo Indian Reli- 
gion (1939) was based, 

After 1929, her interests turned to Mexico, where 
she concentrated on problems of Indian—Spanish 
acculturation and the nature of interpersonal rela- 
tions within society. She worked principally among 
the Cahitans of Sonora and the Zapotecs of Oaxaca. 
The latter group was the source of her most 
famous and widely acclaimed work, Mitla: Town 
of the Souls (1936). Concern for culture-contact 
problems led Parsons to South America. Her death 
occurred only a few weeks after her return from 
field work in the Andes, and the results of her 
research there, Peguche, Canton of Otavalo, ap- 
peared posthumously (1945). 

In addition to studies among American Indians, 
she also did extensive research on the folklore of 
the New World Negro. She collected accounts in 
the vernacular and was a prime influence in fur- 
thering the recording of such data by other schol- 
ars, Her significance in the field of folklore is 
reflected by the fact that in 1943 an entire issue 
of the Journal of American Folklore was devoted 
to her as a memorial, 

Her career as a teacher was brief. She served 
as a fellow at Columbia University from 1899 to 
1902 and as a lecturer in sociology from 1902 to 
1905, In 1919 she held a faculty position at the 
New School for Social Research, which she helped 
to found. She disliked public lecturing but was 
vitally interested in the work of promising stu- 
dents, In addition to offering scholarly encourage- 
ment, she was spontaneously generous with finan- 
cial assistance, and the full amount of sums she 
gave to students in moments of need will probably 
never be known, Pleased to be able to help in 
furthering anthropology, she was deeply embar- 
rassed by personal thanks of any kind of aid, and 
she even objected to scholarly acknowledgments 
in print. 

Elsie Parsons did not confuse the pleasures of 
individual self-realization with meaningless sym- 
bols of personal prestige. Although eager that 
women be recognized for their intellectual worth, 
she did not feel compelled to be the standard- 
bearer of feminine scholarly accomplishments be- 
si the academic public. She never cared to take 
a organizational offices. Furthermore, because she 
3 oe independently wealthy, she believed that such 

ces should be accorded to active and responsible 
she in academic life who needed to further their 
SIRE Spier noted, however, that because of her 
“thruy and in spite of her preferences, offices were 
he St upon her,” She served as treasurer of the 

rican Ethnological Society from 1916 to 1922 
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and as its president from 1923 to 1925. She was 
vice-president of the American Folk-Lore Society 
from 1932 to 1934 and held the office of assistant 
editor of the Journal of American Folklore, a task 
she truly enjoyed, from 1918 until her death. At 
the end of her career she had the distinction of 
being the first woman president of the American 
Anthropological Association, not because she had 
made any effort to win equal professional oppor- 
tunities and respect for women, but simply because 
she was one of the most respected and outstanding 
anthropologists of her time (Spier & Kroeber 1943, 
p. 244). 

The large number and the varied nature of the 
entries in the bibliography of Parsons’ writings 
reveal a person who must have possessed both 
amazing energy and a tremendous capacity for 
concentrated effort. Yet recorded recollections of 
Elsie Parsons as a person depict a woman char- 
acterized by serenity, sociability, and contemplative 
impartiality. In an obituary essay A. L, Kroeber 
noted her gentle but determined opposition to 
schools, fashions, cults, and converts, and her 
rigorous honesty and courage of mind, adding that 
she “probably never really experienced the active 
gratification of hate” (Spier & Kroeber 1943, p. 
254). 

Anthropology seemed to her as much a calling 
as a profession, and she was deeply aware of the 
dangers of trusting anthropological expertise rather 
than the broad anthropological approach to society. 
Her concern over these dangers became the theme 
of her presidential address to the American An- 
thropological Association, which she did not live 
to deliver personally. The misuses of anthropology 
at that time, in the racist schemes of the Nazis, 
which horrified Parsons and her colleagues around 
the world, alerted her to the hazards attendant 
upon the utilization of anthropology to achieve any 
specific and immediate ends. When work is so 
directed, no matter how benevolently, she warned, 
“even the slow and patient searches for social 
laws may easily smack of divination which ac- 
cording to our definition is concerned not with 
process but with particular interests and is callous 
to scientific control” (1942, p. 344). However, she 
did see in the data and methods of anthropology 
a way to the liberation of thought whereby people 
might reach meaningful understandings and rea- 
sonable courses of action. She believed that an- 
thropologists had a primary obligation to “popular- 
ize” their science among laymen in order that 
everyone might be enabled to evaluate properly 
the facile, shallow, and ready-made opinions sup- 
plied in ever-increasing quantities through mass 
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media of communication. Parsons hoped that even 
those who expressed these opinions might be 
“educated away from their conceits” (1942, pp. 
337-344). 

Perhaps the melancholy circumstances under 
which her address was delivered prevented criti- 
cism of her ideas; they might well have been 
criticized as inconsistent. Parsons felt that social 
commentators often erred and that even when no 
fault could be found in their opinions, danger 
lurked in the fact that their views were promul- 
gated and absorbed on the basis of authority rather 
than reason. She admonished anthropologists that 
they too ran the risk of being oracular rather than 
scientific. Yet she argued for the popularizing of 
anthropology as the means of bringing about a 
desirable intellectual “revolution.” Now that more 
than two decades have elapsed since Parsons wrote 
her final address, it is possible to make more 
precise the concepts she was in the process of 
formulating. Applied anthropology, a new field in 
1941 and the subject of her pointed criticisms, and 
the even more recent development known as “ac- 
tion anthropology,” have been moving toward the 
kind of popularization of anthropology that Par- 
sons had advocated. In 1941 anthropologists knew 
full well that cultural change must be congruent 
with the existing values and patterns of the group 
concerned, but Parsons deplored the idea that those 
values and patterns were to be discovered and 
defined by experts as a basis of directing cultural 
change. Since World War 1 the fact-finding and 
decision-making processes that constitute directed 
change have received greater attention from an- 
thropologists, from nonanthropologists using the 
methods of anthropology, and from the groups 
served. There has been an emphasis on under- 
standing the broader political and social realities 
in which the interests of small groups and im- 
mediate social goals must be considered. Elsie 
Clews Parsons was thus not only a respected and 
influential scholar in her own day and a significant 
contributor to the sum of anthropological knowl- 
edge; she also pointed, in her final statements, to 
the pitfalls as well as the valid objectives along 
the route that anthropology was to travel. 


NANCY OESTREICH LURIE 


[For the historical context of Elsie Clews Parsons’ 
work, see the biographies of Boas; FLETCHER; GID- 
DINGS. For discussion of the subsequent development 
of her ideas, see FIELD WORK; and INDIANS, NORTH 
AMERICAN. | 
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PARTIES, POLITICAL 


Joseph A. Schlesinger 
Harry Eckstein 


1. Party Units 
um. Party SYSTEMS 


I 
PARTY UNITS 
The term “political parties” emerged in the nine- 
teenth century with the development of representa- 
tive institutions and the expansion of the suffrage 
in Europe and the United States. It designated or- 
ganizations whose goal was the capture of public 
office in electoral competition with one ori riore 
other parties. Subsequently the term “party” was 
extended to include political organizations not ar 
gaged in electoral competition: minor parties which 
had no realistic expectations of gaining “office 
through appeals to the electorate, revolutionary 
organizations seeking to abolish competitive elec- 
tions, and the governing groups in totalitarian 
States. 


Party theory 

The expansion of the term “party” to ine 
organizations with varying goals has resulted in a 
scarcity of viable party theory. Apart from the Ti 
neering works of Ostrogorskii (1902) and Mic E 
(1911), there exist mostly descriptive studies 


parties in individual countries (e.g., Neumann 
1956). The most ambitious attempt to carry party 
theory forward has been that of Maurice Duyerger 
(1951). But Duverger uses a historically deter- 
minist framework which makes the mass mem- 
bership party the inevitable product of universal 
suffrage and renders archaic many existing and 
seemingly durable electoral parties. In contrast, this 
discussion will focus upon the political organization 
which actively and effectively engages in the com- 
petition for elective office. This emphasis makes 
it possible to resolve at least one issue which has 
hampered the development of party theory: the 
function of the party within the political system. 
One consequence of the indiscriminate use of 
the term “party” has been preoccupation with the 
party's functions and goals. The literature on polit- 
ical parties is replete with classifications of parties 
according to their goals. The most common distinc- 
tion is that between the mass-based party, which 
is ideological, doctrinaire, programmatic, or issue- 
oriented, and the cadre or brokerage party, which 
is pragmatic and patronage-oriented. Although 
there is no logical barrier to mass-based parties’ 
being programmatic or cadre parties’ being doc- 
trinaire, the distinction persists because the ques- 
tion of the function of the party is fundamental: 
is the party the instrument of its membership, or 
is it a public agency, primarily responsive to the 
electorate? 
_ The:perception of the party as the instrument of 
its membership is characteristic mostly of Euro- 
pean writings and has led to an emphasis on party 
structure. The classic example is the work of 
Michels, who formulated the “iron law of oligarchy” 
to explain the triumph of the leaders’ ambitions 
for office over the membership’s revolutionary 
goals. Michels’ “iron law” is the product of his pre- 
occupation with the German Social Democratic 
party and his disdainful neglect of the party system 
in which that organization developed. In Duverger’s 
work the acceptance of the party as the tool of its 
members is basic to the conclusion that the entire 
electorate will inevitably be incorporated into par- 
ties, thereby rendering the restricted cadre party 
archaic, But Duverger’s structural scheme is con- 
ee at the expense of viable cadre parties, no- 
z he American parties and the Radical party 
rench Third Republic. 
t had of the party as primarily responsive 
À electorate has been mostly a product of the 
Le eee tradition. This perception has some- 
fits aeons in concern only with the total pic- 
Sua ae ae competition. Although McKenzies 
fin cs ritish parties (1955) does examine struc- 
relation to the parties’ office goals, Schum- 
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peter (1942) and Downs (1957) ignore the internal 
workings of the party and concentrate on its place 
in a competitive system. Most American writing 
about parties is ambivalent. Although there is ac- 
ceptance of the office-seeking definition of party, 
and the importance of party interplay, there is also 
the persistent longing to make the party’s function 
service to its membership. As a result, discussions 
of American parties are often fragmentary and con- 
fined to descriptions and criticisms of local ma- 
chines and formal institutions. 

The office-seeking definition of party allows us 
to recognize that in a competitive system the fun- 
damental issue of the party’s function is a matter 
of value judgment and a constant source of organ- 
izational tension within parties. As complex organ- 
izations dealing with matters of importance, major 
political parties attract participants with a variety 
of motives. Some are interested in public policy; 
others enjoy the social life of the party; still others 
participate because they are ambitious for office or 
for the rewards that come from association with 
public officials. While these motives need be neither 
exclusive nor conflicting, the tension between policy 
interests and office interests within a political or- 
ganization is increased within a competitive sys- 
tem. In a competitive system a party maximizes its 
chances for office by offering policy concessions to 
marginal voters and parties. What distinguishes the 
competitive political party from other forms of po- 
litical organization, therefore, is that the question 
of electoral tactics is ever present. The competitive 
party need not always make decisions in tactical 
terms, and errors in tactics are always possible. But 
over the long run, if the party is to remain an ef- 
fective competitor, the goal of office must be the 
crucial factor in the party's decisions. 

By choosing the office-seeking definition of party, 
therefore, we are able to distinguish the party from 
other forms of political organizations, The policy 
goals of parties change. The problem which has 
worried observers of parties since Ostrogorskii and 
Michels is not the rejection of office because of 
principle but the unseemly pursuit of office at all 
costs. The weight of the evidence indicates that the 
goal of office dominates in all parties whose expec- 
tations of office are high. In every European coun- 
try where socialist parties have achieved a realistic 
chance at office, socialist doctrine has been tem- 
pered. However, the extent to which the American 
Federalists and Whigs were predominantly office- 
seeking organizations is evidenced by the speed of 
their disappearance once they lost their chance at 
office. 

For purposes of analysis, the major implication 
of the office-seeking definition of party is that the 
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party must be viewed in relation to the offices which 
it seeks to capture. In other words, basic to the 
understanding of political parties is the awareness 
of the structure of political opportunities within a 
given political system: the public offices available; 
the rules, formal and informal, for their attain- 
ment; and the attitudes of politicians and voters 
toward these offices. A revolutionary organization 
which rejects existing institutions, a social club, a 
debating society, a pressure or propaganda group 
with independent goals, can all be studied as dis- 
crete entities, But political parties compete to con- 
trol a process which they did not establish and 
which could go on without them. Parties recruit 
leaders, educate the electorate, and even organize 
governments; but the state organizes elections. 
Therefore, everything of interest about a political 
party—its organization, its leaders, its policies, its 
income, and its capacity to attract voters—is af- 
fected by the structure of political opportunities 
within a given state. 

To assert that party organization reflects the 
structure of political opportunities is not to deny 
that parties in turn affect the opportunity structure. 
Certainly the developments which took place in 
British party organization at the end of the nine- 
teenth century greatly transformed the process of 
becoming prime minister. In the United States, 
parties early captured the Electoral College, which 
was to choose the president. At the same time, the 
differences between American and British parties 
reflect the different methods by which the two 
countries choose their chief executives. Clearly, 
parties and institutions affect each other. 

It has been fashionable, since the advent of 
Marx, Weber, and Freud, to reject institutional ex- 
planations of the nature and activities of Parties, 
in favor of social, cultural, or psychological expla- 
nations (e.g., Truman 1955), Recently, however, 
some students of parties have concluded that in- 
stitutions are not epiphenomena but critical var- 
iables (e.g., Epstein 1964; Lipset in Ostrogorskii, 
1964 edition). 

The present discussion treats parties as responses 
to the ways states structure the opportunities for 
elective office. This approach is especially useful 
because the structure of opportunities provides a 
framework for the comparative analysis of parties 
and party systems. It enables us to compare the 
relative standings of parties as office-seeking organ- 
izations, in their own countries and across national 
boundaries. 


Party organization 


The basic party unit is the nucleus, or the organ- 
ization aimed at capturing a single office. The 


broader structure of the party emerges from the 
relations among nuclei. Electoral nuclei develop 
within the constituencies of elective offices, Gov- 
ernmental nuclei form around those offices which 
popularly elected officials fill from among them- 
selves, as in the selection of legislative leaders or 
of the executive in parliamentary systems, The 
nuclei of a party are not necessarily discrete, either 
in personnel or in other resources. For purposes of 
analysis, however, we shall examine the basic units 
first and then the relationships among them. 

It is important to note that the development of 
a nucleus rests upon the expectation that it will be 
able to capture office—if not immediately, then in 
the foreseeable future. Seldom are a party’s chances 
for office evenly divided among all its nuclei. A 
party will have some safe constituencies, some 
where it competes, and others where it has no 
chance at all to win elections. Indeed, a party's du- 
rability will depend in great part upon safe constit- 
uencies, which assure some continuity in office re- 
gardless of the party’s general electoral fortunes. 

At the same time, parties often run candidates 
in constituencies where they have no hopes of 
winning office; but we must clearly distinguish 
these efforts from the party activities. The major 
British parties contest many hopeless constitu- 
encies, in part to provide training and trial runs for 
candidates who hope to advance to more favorable 
constituencies, and in part to accommodate the 
national character of British elections, which means 
that the campaign in one constituency may well 
affect the results elsewhere. In the United States 
during the period of Democratic dominance of the 
South, the Republican party maintained a network 
of organizations which drew federal patronage and 
exerted influence within the national nominating 
convention. But this network was in effect an ap- 
pendage of the presidential nucleus. Such efforts 
do not in themselves constitute party nuclei; they 
are significant only in relation to the organizations 
which have a chance to win office. ea 

Cooperation among party nuclei is determine 
at least minimally by the structure of political op- 
portunities, which gives substance to the party's 
goals. The American and the British political sys- 
tems present two very different opportunity dee 
tures and allow us to observe the varying impa 
of the opportunity structure upon party organiza- 
tion. The American presidential system, which oP 
erates within the federal framework and ëi 
the popularly elected bicameral legislature, me 
ages party nuclei to act independently. In A 
United States thousands of officials are nominata 
and elected, many on different occasions. Evag 
when and where the potential electorate 18 


same, different offices can attract substantially dif- 
ferent numbers of voters. While there is little pres- 
sure for nuclear cooperation to control the independ- 
ently elected executive, such pressure does exist in 
the legislature, where positions of leadership can 
be captured by multinuclear action. 

In contrast, the British structure of opportunities 
imposes tighter relationships among party nuclei. 
The British parliamentary system provides for a 
single popularly elected chamber, all of whose 
members are elected together at a time determined 
by the dominant party leader. The electoral nuclei 
in turn define the governmental nucleus. In such 
a system the structure of political opportunities en- 
courages multinuclear cooperation and party co- 
hesion. 

At the same time, within each political system 
the refinements of the opportunity structure mod- 
ify its general impact upon party organization. 
Beyond the number of offices in the opportunity 
structure, there is the arrangement or hierarchy 
of offices. In the United States, the federal system 
provides no long-range career outlets for state offi- 
cials. For the ambitious state governor there is only 
the national senate or the presidency and its sur- 
rounding administrative offices. Thus, the Amer- 
ican federal system provides a restraint upon the 
independence of state party nuclei as well as pres- 
sure for cooperation between the state nuclei and 
the nuclei for national office. The Canadian federal 
system, on the other hand, provides for miniature 
parliaments in its provinces, which make long- 
range careers possible for provincial leaders. In 
turn, indigenous provincial parties which have little 
hope of developing into national organizations are 
able to flourish. The Social Credit party in Alberta 
is a good example. 

In addition to the arrangement of offices, the op- 
portunity structure imposes upon the party the pro- 
cedures which it must follow to achieve office. In 
the United States the provisions for numerous in- 
dependent elections encourage the independence of 
party nuclei. But there are also provisions for the 
sharing of electorates which facilitate multinuclear 
Cooperation. Presidential candidates share the bal- 
lot with a host of other candidates for office; gov- 
ernors and senators frequently run for office at the 
eal ee and United States representatives, along 
ne mA state officials, appeal to the subelector- 
rake tl e higher officials. These arrangements 
ee a the “coat-tail” effect, the possibility 
cae eae ae! for one office will influence the 
fa dare other offices; they thereby create pressure 
RAT nuclei to consider the activities and the 
lene of other nuclei. On the other hand, in 

ntary systems which use the single-mem- 
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per district and avoid shared electorates, the basis 
for nuclear independence exists, as, for instance, 
in the French Third Republic. ‘ 

Within parliamentary systems, however, the sim- 
ilar behavior and attitudes of different electorates 
can exert pressure for cooperation among nuclei. 
Despite the major advances in our knowledge of 
electoral behavior, the impact of the party upon the 
voter remains largely unknown. England appears 
to have a disciplined electorate, receptive to party 
activity. Such an electorate encourages maximum 
nuclear cooperation by making possible the electoral 
destruction of a recalcitrant leader who has been 
denied his partys nomination. American and 
French electorates are rarely willing to behave in 
this manner. The Converse-Dupeux study (1962) 
shows that American voters identify with a party 
more closely than do French voters, but aggregate 
American election results show that in any given 
election enough voters will split their ballots to 
allow victories to both parties within the same con- 
stituency. 

Undoubtedly the reasons for which voters sup- 
port parties have important organizational conse- 
quences. The reasons are usually complex and 
include ethnic, religious, social, economic, ideolog- 
ical, and geographical factors in a variety of com- 
binations (Alford 1963). Within a party, shifting 
voter support can produce factions or clusters of 
nuclei contending with each other because their 
electoral bases are divergent. In the French Fourth 
Republic the rising social and economic status of 
the Radical voter brought the new Radical leaders 
into conflict with Radicals of the Third Republic, 
who wished to retain the party's prewar doctrinal 
concern for “the people.” In the United States the 
geographic support of the South for the Democratic 
party has increasingly caused conflict with those 
who support the party for economic and social rea- 
sons elsewhere. 

Another source of organizational tension within 
parties is provided by the party system, or the com- 
petitive relationship of parties, and by the modifica- 
tions which the structure of opportunities imposes 
upon the party system. Party systems vary in the 
number of parties which have a chance at office. 
Depending upon the structure of opportunities, 
party systems vary also in the distribution of the 
chances for office among party nuclei. More than 
one party must have a chance at office in order for 
the party system to be competitive. Yet the chances 
do not have to be, and often are not, equally dis- 
tributed, even when there are only two real com- 
petitors. Prior to 1932 the American Republican 
party won most national elections; since 1932 the 
Democrats have been the dominant national party. 
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From the standpoint of nuclear cooperation, the 
uneven distribution of strength among nuclei af- 
fects the flow of organizational resources and 
causes tension within the party. We have assumed 
that party nuclei will exist only around those offices 
for which there is the chance of victory. But even 
for these offices the party’s chances can range from 
perfect to doubtful. Differences in electoral strength 
will produce nuclear organizations with different 
needs and resources. The nuclear organizations 
with the strongest electoral support are likely to be 
in the best position to command the other resources 
of organization, money, and personnel. At the same 
time, they are under the least pressure to compete 
with other parties for votes. The nuclei most sub- 
ject to electoral pressures are those which face the 
strongest competition. Given the varied electoral 
needs of its nuclei, a party may well find itself with 
two or more nuclei or nuclear clusters framing dif- 
ferent appeals to the voters. 

Of course, a party is subject to additional ten- 
sions arising from its needs during the governmen- 
tal phase. The needs of the governmental nucleus 
may well conflict with the needs of the electoral 
nuclei. In government by coalition the fate of com- 
petitors who are also allies becomes the concern of 
the governmental nucleus and imposes further 
strains upon the electoral nuclei of the dominant 
party. 


Party activity 

The structure of political opportunities deter- 
mines not only the extent of party organization but 
also its quality or content. Offices foster nuclear 
organization, and electoral procedures and prac- 
tices provoke nuclear cooperation or dissension. 
Another consequence of the close relationship be- 
tween party organization and the structure of op- 
portunities is that the limits of party activity are 
unclear. Imbedded in the electoral and governmen- 
tal process, the party in its activities is often indis- 
tinguishable from voters and pressure groups, on 
the one hand, and from government, on the other. 
In the search for office, parties go outside their 
ranks for leaders; they accept an occasional vote 
as the basis for party identification; they allow gov- 
ernment bureaucrats rather than the party program 
to define party policy. 

In describing the content of party activity, there- 
fore, it is more useful to employ the concept of con- 
tribution to the party organization than the concept 
of membership. The term “membership” connotes 
an unwarranted clarity in the boundaries between 
the party and its environment, whereas, as has just 
been noted, even such critical contributions to the 
maintenance of the organization as voting, recruit- 


ment of candidates, and even candidacy itself may 
come from individuals not readily identifiable as 
formal members. Conceiving of party leadership, 
recruitment, money, and communications as con- 
tributions to party organization enables us to ask 
not only who makes them but also to focus on the 
flow of contributions from one nucleus to another, 
Thus, it is the transfer of contributions which cre- 
ates the multinuclear party. 

Leadership. In party organization, leadership 
is a contribution of first importance. The fluctuat- 
ing, ephemeral, and largely voluntary character of 
most contributions to party activity makes it most 
useful for someone to move people to participate, 
to make agreements with other leaders, and to 
bring together the materials of party combat. 

Yet in parties, more than in any other type of 
formal organization, the official lines of authority 
are suspect, and there is always implicit the ques- 
tion of who is the “real” leader. This is true in great 
part because a political party is a leader-producing 
organization, and the ambitious men attracted to 
parties find it advantageous at least to seem to be 
playing a major role. If the “real” party leader is 
often difficult to locate, it is because many compet- 
itors for leadership have a stake in keeping it so. 

The contribution of leadership is also difficult to 
assign because there are two broad categories of 
party leaders, which in fact may or may not merge. 
There are the public leaders, men who also repre- 
sent the party as its candidates for public office, 
and there are the associational leaders, men whose 
office is limited to the party organization. Often, 
but by no means always, there is no sharp distinc- 
tion in personnel between the two categories; men 
move from one to the other or hold both public and 
party office concurrently. 

The difficulties for analysis presented by these 
overlapping categories pertain especially to the 
American system. In American parties there is ae 
formal hierarchy of authority or delineation 0 
functions. The committees, chairmen, and conven- 
tions which range from the locality to the nation 
have no consistent authoritative relation tocas 
other or to the parties’ officeholders and nominees. 
At the same time, the actual conditions for omo 
holding in the United States, numerous public 0f- 
fices with a high rate of turnover, favor the eon 
tion in leadership. This situation contrasts Me : 
that of the parliamentary system, where the av! 
ability to the party of “safe” public offices or it, 
almost always assures the merger of both categori 
of leaders. Thus, whatever the original source T 
the party leader's strength, organization work f 
officeholding, party and public leadership ace 
combine and provide the necessary continuity in 


organization where most activity is transitory. [See 
LEADERSHIP, article on POLITICAL ASPECTS.] 

Recruitment and nomination. The major task 
facing the leaders of the nuclear organization is 
the choice of its candidate for office. Who contrib- 
utes to this task, and how is it accomplished? There 
are two aspects to the process. One is the recruit- 
ment—and discouragement—of candidates for the 
nomination, The second is the choice of the nom- 
inee from among the active seekers. The latter as- 
pect is more overt and is normally surrounded by 
rules of procedure which serve to make the nomina- 
tion authoritative. But the recruitment process is 
equally critical in defining the choices which can 
be made within the party. 

Although there are a few empirical studies of 


the recruitment process, those which exist place” 


the burden of the task with the aspirants them- 
selves. There is little evidence from which to con- 
jure the picture of an organization actively seeking 
candidates, Studies of candidates for lesser offices, 
such as the state legislatures of the United States 
(Wahlke et al. 1962), show relatively few who 
perceive themselves as recruited by party organi- 
zations. A high proportion of politicians come from 
families with active political experience. Party or- 
ganizations give evidence of actively recruiting 
candidates for offices where there is little expecta- 
tion of victory, but when the nomination is of 
value, the nuclear organization normally responds 
to choices presented to it by men actively seeking 
the nomination, who advance along various career 
lines. [See POLITICAL RECRUITMENT AND CAREERS.] 
: The opportunity structure, therefore, in defin- 
ing the paths of advancement, has much to do with 
the recruitment of party candidates. The loosely 
ordered American system provides multiple lines 
of advancement, so that parties have diverse 
sources for candidates. Parliamentary systems, on 
the other hand, channel most recruitment through 
legislative offices. 

The contribution of nomination, which follows 
recruitment, to party organization is equally com- 
plex and often obscure. In the United States, re- 
forms such as the convention and the direct pri- 
mary. have clarified some aspects, but some decisive 
activity still takes place in private. Much of the 
ee activity in a nomination consists of 
F ene other men’s ambitions and, if the 
a on is to succeed at the polls, tying their 
Eee ae to those of the candidate. Therefore, the 
pate ation ideally combines a maximum of dis- 
. <r Ser of all but one of the ambitious with 

TA mum of public disgrace for the others. 

a Ae discouragement process takes place within 
of rules according to which one man gains 
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the designation “nominee” of the party. In a sta- 
bilized party system, capture of this label is im- 
portant, and the state frequently acts to assure 
that only candidates selected according to the rules 
can appear in the general election under that label. 
This does not prevent officeseekers, however, from 
organizing their own campaigns to capture the 
label. As with recruitment, nomination is less a 
case of an organization’s selecting candidates ac- 
cording to qualifications than it is of providing 
the framework within which they contest for the 
nomination. 

The actual choice of a candidate may involve 
numerous individuals, and, although parties nor- 
mally establish their own procedures for nomina- 
tions, the state may intervene. In the United States 
the direct primary imposes upon the party maxi- 
mum participation in the nominating process. 
This imposition can produce irrational conse- 
quences for the party, for there is no assurance 
that the primary voter is imbued with the office- 
seeking drive which works in favor of the most 
effective electoral candidate. As a result, pre- 
primary conventions have developed in several 
American states as a means of controlling the 
nomination, Party conventions are, in fact, among 
the most common methods of nomination. They 
are usually made up of delegates apportioned ac- 
cording to party membership. In Norway the state 
intervenes in the convention process by paying the 
expenses of the nominating conventions if they 
follow prescribed methods (Valen & Katz 1964), 
Nominations by party caucus or by constituency 
committee are also common. 

Whether or not the state plays a role, the ability 
to control a nomination is one of the few observ- 
able tests of strength within a party. As a result, 
party leaders usually avoid taking the test unless 
forced to, or if they do, they seek to support a 
certain winner. The problem is particularly ger- 
mane to the delicate relationship between the nu- 
clei for higher offices and lesser offices. For this 
reason, open intervention by party leaders in the 
nomination contests for lesser offices is rare. Even 
in the disciplined British parties, control of con- 
stituency nominations by the central office is far 
less than has been assumed by those who would 
infer discipline as radiating from the center (Ran- 
ney 1965). Although empirical studies of nominat- 
ing procedures are rare, the evidence seems to be 
that constituency nuclei are in control in most 
systems. 

Whatever the contribution to nomination, 
whether it be the discouragement of ambitions or 
their open defeat, it is dysfunctional if it leaves 
the nominee too weak to win the general election. 
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Thus, although not all of the actors may be so 
motivated, all of the candidates who do hold office 
ambitions will be under a common restraint at 
least not to oppose, and preferably to support, the 
nominee, The most effective control the party has 
is the defeated candidate’s own hope for prefer- 
ment. It is peculiar to party organization that at 
one and the same time it encourages men to open 
conflict and then forces them to curb animosity 
to achieve a subsequent goal. Much of the two- 
facedness of political activity as it appears to ex- 
ternal observers comes from this inherent charac- 
teristic of nuclear organization, That burying the 
hatchet is not easy is evident from the frequency 
with which it fails or, if done, is done grudgingly. 
Still, the constant process of personal advancement 
through organized conflict either repels or weeds 
out personalities incapable of sustaining the ten- 
sion. In his earlier work, Lasswell ( 1948) empha- 
sized the dominance of power drives in the person- 
alities of political leaders. More recently, he has 
come to note that such persons do not rise to the 
top in democracies (Lasswell 1954; see also Lane 
1959, pp. 124-128). The pure power seeker, as 
distinct from the man with office ambitions who 
sees office as instrumental to other goals, can prob- 
ably find more satisfactory outlets in areas other 
than politics. An important aspect of the contribu- 
tion of nomination is that self-interest must control 
itself and create cooperation out of what might be 
the most divisive and corroding of ambitions, the 
drive for political power. [See PERSONALITY, PO- 
LITICAL.] 

Issue formulation. Once the nuclear organiza- 
tion chooses its candidate, it must seek support for 
reasons other than its power drives. Although it is 
known that the voters’ response to parties involves 
much more than a choice between policy stances, 
there is a good deal of issue content in electoral 
decisions, and to succeed, a party must devise a 
program or platform. The position of the party as 
seen by both its leaders and the voters is only par- 
tially covered by the formal “platform.” Therefore, 
one must ask who contributes the issues. To a 
great extent, the process of leader selection decides 
the policies for which the party will stand. But 
beyond the party leader there are “leading citizens,” 
Specialists in public affairs, scholars, and journal- 
ists on whom the party can call for contributions 
in devising policy. 

The intellectuals’ function, however, is severely 
limited by the way in which competitive governing 
parties are forced to devise policy. A party in con- 
trol of government must make choices, but its 
range of alternatives is usually narrow. The party 


is restricted by time, internal differences, the ad- 
ministrative apparatus of government, and other 
relevant elements of the political system, including 
foreign relations. A drastic reformulation of public 
policy, therefore, requires either massive continu- 
ous support or the elimination of the competitive 
rules. Otherwise, the party in government finds its 
position defined largely by the way it responds to 
issues which arise during its tenure. Thus, the 
formulators of the party's position must consist 
largely of its elected leaders and its administrative 
corps. Although parties out of power presumably 
have greater freedom in defining their stands, they 
also must respond to the government and to the 
issues as they arise, and allow public officials the 
major role. 

Parties which govern in coalitions, however, can 
more easily develop a refined ideological position 
than can parties which must govern alone. The 
extent to which a party can be held responsible 
for government affects its ability to retain or to 
define a stand independent of the government, In 
coalitions where the governing responsibility is 
shared or obscure, parties can govern and retain 
a doctrine which has little relation to the govern- 
ing experience. 

Communication. As in any organization, com- 
munication is critical to all aspects of party ac- 
tivity. Very little, however, is known about this 
contribution. Historically, newspapers were closely 
associated with the growth of parties, and party 
newspapers are still common, especially in Europe. 
But there have also developed independent news 
media which collect and disseminate information 
vital to the party. For example, public opinion polls 
provide information on voter attitudes far more 
reliable than the reports of party workers; yet more 
often than not, the polls are conducted and re- 
ported by media outside the party's control. An 
important party activity, therefore, consists of 
affecting the information which independent news 
media report. [See COMMUNICATION, POLITICAL] 

Technical services. The nuclear organization 
can use many technical contributions. Periodic con- 
tact with the electorate, quite apart from testing 
their sentiment, is useful in keeping track of the 
party’s supporters. Every campaign creates a PE 
of possible tasks for which volunteers must 
recruited. Each party affair, rally, or coffee kom 
involves bringing together as many people A 
possible to give the appearance that the party €n 
: 5 itself, there 
joys popular support. On election day itself, ee 
are many things which an organization can ae 
make sure that the committed voter gets to Fe 
polls and has his vote tallied. The British parti 


have career staffs which assign professional agents 
to constituencies; in this sense the agent is inde- 
pendent of the candidate. Characteristically, in 
American parties technical contributions are made 
by small permanent staffs which blossom during 
the campaign. Such professional staff members 
exhibit no tendency to develop a sense of direc- 
tion independent of party leaders. They gain in- 
fluence in their relations with associational or 
public leaders of the party, not as a separate bu- 
reaucracy. 

Money. Since useful services are by no means 
all voluntary, money is a prime contribution to the 
nuclear organization. Money represents “instant” 
organization by helping to satisfy the periodic need 
to expand activities rapidly. The active nucleus, 
therefore, puts much effort into gathering money. 

Money may come in small amounts from many 
people. In addition to providing funds, the small 
contribution also serves to reinforce popular iden- 
tification with the party. The small contribution 
may also be used to clarify party membership 
when it is regularized in the form of dues. This 
procedure is common in European parties and is 
not unknown in major American state organiza- 
tions, e.g., in Wisconsin and Michigan. Fear of the 
corrupting influence of money has led to the for- 
mulation of legal restraints in several countries on 
the amount and sources of money, as well as on 
the amounts that can be spent in a campaign. 
Nevertheless, the consequences of money, either in 
directing party policy or in affecting elections, are 
hard to evaluate, [See POLITICAL FINANCING. ] 


Party effectiveness 


It is difficult to assess the impact of any of the 
connective contributions upon the fate of the po- 
litical party or the outcome of the electoral process. 
The effects of the activities of parties are sur- 
rounded by an even higher level of uncertainty 
than those of other organizations. The only test 
of effectiveness is whether or not a party wins 
office. But it is seldom clear whether victory or 
defeat is due to the party’s decisions, the decisions 
of its opponents, or even to the decision of some 
foreign government over which the party can have 
no possible means of control. 
pit sites as long as a party remains a viable 
eee within the Party system, it attracts re- 
off s. Any party with a reasonable chance at 
ae will attract those with an interest in office. 
ae the expectations of victory, the more 
5 ie: the party becomes to individuals and 
ae groups concerned with the actions of 

ent. Thus, dominant parties often become 
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holding companies for competing political factions 
and interest groups; the Indian Congress party, 
the Israeli Mapai, and the Democratic party in the 
American South are good examples. 

On the other hand, parties whose chances for 
office are small tend to attract those whose goals 
are not the goal of office. The disaffected or groups 
whose limited objectives can be attained through 
the influence of a minor organization gravitate 
toward minority parties. Indeed, the major danger 
to minority parties is their attraction for partici- 
pants who have a vested interest in preserving a 
party’s minority status. In order for a political or- 
ganization to become or to remain a true political 
party, it must respond or adapt to the structure of 
political opportunities. 

JosEPH A. SCHLESINGER 


[See also ELECTIONS; INTEREST GROUPS; LEGISLATION, 
article on LEGISLATIVE BEHAVIOR; NONPARTISANSHIP; 
POLITICAL MACHINES; POLITICAL PARTICIPATION; 
REPRESENTATION, article on REPRESENTATIONAL BE- 
HAVIOR; VOLUNTARY ASSOCIATIONS; Votinc; and 
the biographies of Key; LOWELL; MICHELS; OSTRO- 
GORSKII.] 
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Il 
PARTY SYSTEMS 


The study of political parties has two major 
facets, One involves their characteristics as discrete 
entities—for example, their various social bases, 
histories, goals and appeals, formal organizations, 
and actual power structures. This is the study of 
party “units,” a subject discussed in the preceding 
article. The other involves the competitive inter- 
action patterns among party units or, as Duver- 
ger puts it, “the forms and modes of their coexis- 
tence” (1951, p. 203 in 1954 edition). This is the 
study of party “systems.” 


Scope and definition 


Party units and party systems are so closely re- 
lated that one cannot really deal adequately with 
either without reference to the other. Interactions 
among parties obviously are affected by the nature 
of the interacting units, while the interactions in 


turn have repercussions on the units themselves, 
These relations between units and systems will, of 
necessity, be alluded to frequently in this article, 
However, only party systems will be treated system- 
atically here, party units being referred to only in 
passing, literally by allusion. This is not simply a 
matter of arbitrarily limiting the scope of the dis- 
cussion, for, to an extent, party systems both can 
and must be treated independently of the units that 
constitute them. In no sense are they merely sums 
of their parts. The interactions of party units invari- 
ably have attributes not fully stated in the descrip- 
tion of the units, and while they are obviously much 
dependent on individual party characteristics, they 
also reflect other factors, not least the broader set- 
tings—society, political system, general structure 
of political competition—within which parties exist, 
Party units and party systems constitute sep- 
arable subjects also in two other senses, First, 
defining party systems as competitive interaction 
patterns among parties implies that political sys- 
tems could conceivably have parties without having 
party systems, Manifest cases in point are those 
monolithic parties that serve regimes as instru- 
ments of control, coordination, and permeation, 
rather than being structures of genuine political 
competition. Second, a party might stand outside of 
the party system of a polity even where such a 
system exists. A party will do so if it does not com- 
petitively interact with other parties in a substantial 
way—for example, if it is a very minor party exer- 
cising very little political influence, or if it is essen- 
tially a pressure group (or revolutionary organ- 
ization) that goes through the motions of party 
competition only to publicize or disguise itself or ie 
get advantages that party status often confers 
(such as free railway travel and free radio or tele- 
vision time), k 
One could, of course, make participation in a 
party system a condition of including an organiza 
tion in the universe of parties in the first place, 
regardless of what the organization calls itself. be 
probably should do so if the concept of party iS i 
delimit a coherent theoretical subject—that ae 
set of cases similar enough to one another, yet ik 
ferent enough from others, to permit the formu a 
tion of informative generalizations that pertain 3 
all of the cases and some that pertain to them a 
“False” parties, needless to say, must be ae 
but under rubrics other than that of parties. their 
may form special theoretical universes of aR 
own, and they may belong to larger univers E 
which genuine parties, as well as many other ee 
tures, also belong—for example, the very fate 
world of structures for seeking and mainta 


political power or the still larger universe of human 
organizations. 

To delimit the concept of party systems precisely, 
it is necessary to concentrate on, and to be more 
specific about, the patterns of competition that 
characterize the interactions of their units. Political 
competition takes, after all, a great many forms. 
The protagonists in a civil war are certainly en- 
gaged in a kind of political competition; so are 
lobbyists maneuvering for minor advantages, legis- 
lators attempting to constitute majorities or to at- 
tain sensitive positions in the law-making process, 
bureaucrats promoting pet projects, and mobs dem- 
onstrating or rioting in the streets. What sort of 
political competition, then, is peculiar to party sys- 
tems? Manifestly, it is electoral competition: overt 
participation in open, formalized, genuine elections. 
Not only is this a process that cannot possibly be 
confused with the activities of other political par- 
ticipants; not only does focusing upon it promise 
to yield a homogeneous and highly differentiated 
universe of organizations for theoretical study; but 
the association of party competition with elections 
happens also to be implicit in conventional usage, 
at least since the time when the parliamentary 
combinations of representative oligarchy, under 
pressure of democratization, first developed extra- 
parliamentary machines to provide them with new 
sources of power. 

When one speaks of party systems, then, one 
speaks at bottom of interactions in a particular kind 
of political process. That process, however, is closely 
tied to certain political structures and functions. 
The structures are representative governments 
(although a party system may well grow out of 
organizations operating in other contexts). The 
functions are those important ones that. elections 
perform in representative systems: political recruit- 
ment and relating the recruitment of leaders to the 
articulation and aggregation of political goals—in 
Neumann's words, “organizing the chaotic public 
will” (1956, p. 397) in order to shape choices that 
elections can resolve. The aggregation performed 
by party systems should not, however, be conceived 
Solely as the internal process of resolving conflicts 
between particular men and measures that always 
precedes electoral competition; the whole history 
of parties, which gradually turns their labels into 

images,” is involved as well. 

Š sa la is necessary not only for the sake 

tone but also to provide a foundation for 

tary a in representative, especially parliamen- 

ae hs Effective authority ina democracy 

he pibe rest on solid masses of steady votes 
epresentative assembly, and these must per- 
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force be provided by organizations that form a 
persistent legislative will out of the myriad possible 
shifting constellations that might arise in repre- 
sentative assemblies. But while in both cases the 
formation of collective will is involved, the two 
functions of aggregation—to represent bodies of 
opinions and to support authority—may very well 
conflict. The more faithfully divergent opinions are 
represented, the more likely it is that authority will 
be insufficiently supported; the reverse is true as 
well: the maximum support of authority would 
come from a party system so overarchingly aggre- 
gative as to offer no choice at all, In the final 
analysis, then, aggregation involves the search for 
an optimum between the conflicting values of unity 
and divergence, the one to support power, the other 
to provide choice. 

However vital the functions that party systems 
perform, and however closely they may be tied to 
the process of electoral competition, it is neverthe- 
less inadvisable to make these functions central 
to a delimitation of party systems. This is for two 
reasons. One is that many substantially different 
structures, in the same and different political sys- 
tems, recruit leaders and aggregate preferences. In 
representative systems, for example, aggregation 
of preferences is carried out not only within the 
legislative process but also by any moderately com- 
plex pressure group and, increasingly nowadays, by 
bureaucrats, who consult among themselves and 
with others both in and out of government. The 
same holds true for recruitment. The second reason 
is that recruitment and aggregation are certainly 
not the only functions of party systems. The units 
that constitute them serve a host of other ends. 
Parties continuously provide political education and 
inculcate political skills. Above all they do so in- 
ternally to their militants, by such means as dis- 
cussions in branches, study courses (most typical 
of Marxist parties), conferences and summer 
schools (as in Britain), the publication and dis- 
semination of research reports and handbooks, and 
training schools (like those of the Neo-Destour in 
Tunisia); most obviously, they provide, in their 
internal structure, facilities for the acquisition of 
political skills of every kind. In many countries 
they serve also as frameworks for their members’ 
social life, sometimes indeed all-embracing frame- 
works. This is the case mainly in some non-Western 
countries, where parties often furnish a badly 
needed cement for men suddenly cut off from their 
traditional ties and often perform, especially in 
urban areas, some of the functions discharged else- 
where by traditional structures, Sometimes this in- 
volves rather surprising activities; Hodgkin cites 
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the example of a section of an African party which 
combines the activities of “emancipating young 
women from family influences, assisting the proc- 
ess of matrimonial selection, providing on a con- 
tributory basis marriage and maternity benefits 
(including perfume and layettes for the newborn ), 
preserving the Dioula tribal spirit, and running an 
orchestra” (1962, pp. 144-145). Such activities 
are characteristic of sham parties that serve as 
adjuncts to dictatorial regimes, but one can also 
find them in less dubious cases; Scandinavian 
parties, for instance, often sponsor boy-scout 
groups and summer camps, provide adult educa- 
tion courses, and—less peculiarly—publish news- 
papers and journals. In some cases, the party 
system even directly supplies governmental serv- 
ices, although this is unusual and occurs mainly in 
new or newly independent countries where govern- 
mental frameworks have not yet hardened. More 
commonly, party systems perform the functions of 
directly controlling the executive, most often in 
colonial systems that permit party competition but 
sometimes also in highly developed countries like 
Britain, where informal legislative committees prob- 
ably do more to call the executive to account than 
formal parliamentary institutions. Finally, party 
Systems, quite apart from their constituent units, 
may serve perhaps the most basic function that 
must be performed in any sound polity: that of 
giving the polity an identity above the separate 
realities of its constituent units. The regular inter- 
play of competing electoral forces may in some 
incohesive new nations be the only central point to 
which loyalties that rise above parochial attach- 
ments may be tied, 

To summarize: The subject of “party systems” is 
concerned with the interaction patterns among sig- 
nificant and genuine electoral organizations in rep- 
resentative governments—governments in which 
such systems serve pre-eminently (whether well or 
badly) the functions of providing a basis for effec- 
tive authority and for defining choices that can be 
resolved by electoral processes, 


The study of party systems 


Until very recently studies of party systems al- 
most invariably consisted simply of discussions of 
the units constituting them and dealt chiefly with 
particular countries, while little attention was paid 
to general system characteristics. This applies par- 
ticularly to the early textbook and monographic 
literature on parties. But it even applies to com- 
parative and theoretical works like those of Weber, 
Ostrogorskii, and Michels, which, possibly excepting 
some highly value-laden passages in Ostrogorskii 
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(1902), deal entirely with party units. Today, de- 
spite some small advances toward the special the. 
oretical study of party systems, it is still this sort 
of literature that provides the main underpinning 
for the subject. No descriptive concepts for charac- 
terizing party systems have been settled upon 
(although various ones are used), and no general- 
izations about such systems have been adequately 
tested (although many have been proposed ), Hence, 
no settled body of ordered knowledge can be re- 


, Ported in this article, and much space must be 


devoted to quarrels, gropings, uncertainties, and 
tentative suggestions. 

The first works that accord party-system charac- 
teristics any sort of explicit treatment appeared 
during the late interwar and early postwar years, 
During this period scholars became increasingly 
aware of the links between patterns of party com- 
petition and the performance of representative gov- 
ernments. Representative government in the inter- 
war years was in a state of crisis, which often 
seemed to stem from the party systems, particularly 
the fragmentation and ideological intensity of party 
competition in certain countries. Critical fire was 
directed at representative systems because the facts 
of party life—bosses, oligarchies, cliques, interest- 
mongering, unscrupulous electioneering, insensitiv- 
ity to opinion or real problems—seemed to make 
a mockery of the ideals of liberal government. Early 
writings on party systems, consequently, concen- 
trated heavily on remedies and prophylaxis: on 
how party systems could be made more democratic, 
more responsible, less fragmented, and less dog- 
matic, so that antidemocratic criticism might be 
disarmed and crises averted. Typical of this genre of 
writings is F. A. Hermens’ Democracy or Anarchy? 
(1941), a book which argues with great vigor that 
proportional representation produces party systems 
inimical to successful democracy and recommends 
the single-member simple-majority system as a cure 
for practically all democratic ills. An even mote 
famous specimen is the special report, Tou 
More Responsible Two-party System, produced yy 
a committee on political parties of the Ae 
Political Science Association (1950), which fee 
an intimidatingly large series of interrelate H 
posals designed ‘to produce parties “able to ae 
forth programs to which they commit hen 
and possessing “sufficient internal cohesion to ¢ i 
out these programs,” parties able to provide ay 
integrated government and an integrated OPP 5 
tion, parties more able to resist pressure an At) 
formulate the public interest, parties TespO 
public will, and party leaders responsive to Pi 
members. 


This premature but useful emphasis on therapy 
was followed, in the 1950s, by some first efforts to 
systematize our knowledge and understanding of 
party systems: their anatomy and physiology, so to 
speak; how they respond to conditioning factors; 
and how in turn they affect their larger settings— 
all matters that therapy presupposes to be known. 

The main pioneering work in the systematic em- 
pirical study of party systems is Maurice Duverger’s 
Political Parties (1951). Duverger devotes about 
half of that work to party units (his term is party 
structure) and half to party systems, making him, 
to my knowledge, the first to raise the distinction 
between units and system to a highly explicit level. 
In both sections of the book he tries to construct 
general descriptive categories for characterizing and 
classifying cases, as well as empirical generaliza- 
tions about the cases and types. 

Duverger’s book does not pretend to furnish a 
finished theory; how could it as a first exploration? 
On the contrary, it is hedged at every turn with 
caveats and disclaimers. His theories, according to 
Duverger himself, are “vague, conjectural and ap- 
proximate” and intended merely to raise issues and 
provide tools for further inquiry. Few even of Du- 
vergers more mordant critics seem to have had 
greater doubts about his work than Duverger him- 
self; almost everything he says is presented as 
merely suggestive, indecisive, tentative, superficial, 
vague, preliminary, incomplete, and imprecise— 
the adjectives again being his. 

Unfortunately, few other scholars have so far tried 
to go beyond Duverger’s first attempt. A symposium 
volume, Modern Political Parties (Neumann 1956), 
did provide some well-summarized data on many 
national party systems, as well as a few well- 
formulated questions and concepts in its introduc- 
tion and conclusion; but it too remained very much 
on the level of preliminary concept formation and 
very tentative hypothesizing. Nor do works like 
Hodgkin's African Political Parties (1962) or the 
many textbook sketches of party systems in non- 
Western areas and countries go much further. They 
add a great deal to our data, but, as to theory, they 
either present ad hoc interpretations or apply un- 
critically Duverger’s formulations. 
i However, the main lines of inquiry to be pursued 
in the general study of party systems have by now 
emerged. First and foremost, one needs adequate 
descriptive concepts for characterizing, distinguish- 
ing, and classifying party systems; without such 
Concepts, theorizing in other forms can hardly pro- 
ell Second, there is the problem of finding the 

terminants of the various forms of party systems. 
Third, there is the problem of the functional impacts 
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of different party systems on the larger political and 
social systems in which they exist. 


Variations in party systems 


Number of interacting units. In thinking of dif- 
ferences in the patterns of interaction among party 
units, one variable has always come first to mind: 
the number of interacting units, or, in other words, 
the extent to which party systems are consolidated 
or fragmented. In fact, this is not just the foremost 
but the sole dimension on which most existing 
studies distinguish party systems. Much the most 
frequently used typology of party systems has been 
a purely numerical division into three classes: one- 
party systems, two-party systems, and multiparty 
systems; with, occasionally, one-party systems sub- 
divided, particularly to distinguish dictatorial from 
representative structures. Practically all generaliza- 
tions about the determinants and functional im- 
pacts of party systems are couched in terms of this 
typology. Examples of such generalizations are prop- 
ositions to the effect that the process of nation 
making, or a commitment to rapid social and eco- 
nomic development, leads to and/or requires one- 
party systems, that the single-ballot simple-majority 
system of elections leads to two-party systems, and 
that multiparty systems make parliamentary gov- 
ernments unstable and ineffective. 

No doubt, the number of parties is a basic and 
obvious dimension along which party systems vary. 
But despite appearances, it is not a dimension that 
lends itself to making distinctions clearly and easily. 
In fact, the numerical typology has seemed to act 
as a positive barrier to the discovery of trustworthy 
hypotheses. Writings on party systems are very rich 
in empirical generalizations yet extraordinarily poor 
in any that have withstood rigorous testing; so often 
does reality seem to confound propositions based 
on the traditional typology, that one has reason to 
think that the fault lies in the typology itself. 

The fundamental difficulty with the established 
numerical typology involves its application to con- 
crete cases: the typology cannot sensibly be taken to 
mean literally what it says, and itis difficult to use in 
anonliteral sense. Take, to begin with, the concept of 
a one-party system. Strictly speaking, there can be 
no such thing. If party systems involve interactions 
among party units in the process of electoral com- 
petition, then the idea of a one-party system is 
logically absurd, for one cannot have a competition 
or an interaction with only one actor. It follows that 
if the concept is to be used, there must be specific 
conditions under which the existence of all parties 
but one should be ignored for purposes of general- 
ization, even in genuinely competitive systems, or 
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under which a single party without competitors is 
nevertheless considered to be genuinely engaged in 
electoral competition. This in fact is the procedure 
in most writings on one-party systems. Duverger, 
for example, includes in the category—like many 
other writers—the American South as well as Nazi 
Germany, Fascist Italy, Portugal under Salazar, and 
Turkey from 1923 to 1950 (1951, p. 275 in 1954 
edition). Coleman has listed as one-party systems 
in Africa (as of 1959) northern and eastern 
Nigeria, Ghana, Somalia, and the Federation of 
Rhodesia and Nyasaland, despite the fact that in 
each case more than one party actually contested 
elections and managed to win representation. Blank- 
sten similarly regards Mexico as a one-party system 
(Almond & Coleman 1960). 

These writings raise the essential problem of 
finding sensible but unconventional criteria for in- 
cluding and excluding parties in a party system. 
In much of the literature, however, such criteria 
are not stated at all, and parties are ignored or 
taken into account practically at will, while works 
that do state their criteria usually arrive at them on 
an ad hoc, arbitrary basis. The result is that the 
universe of one-party systems depicted in the liter- 
ature, sometimes even in a single work, is extremely 
heterogeneous, including such very diverse phe- 
nomena as monolithic dictatorial parties that sup- 
press all opposition (for example, totalitarian 
“parties”), parties that do not use a high degree 
of coercion but nevertheless have a literal monop- 
oly (for example, the True Whig party of Liberia 
or the African Democratic Rally in the Ivory Coast, 
Niger, and the Sudanese Republic), “unified” na- 
tionalist and postnationalist movements that con- 
sist of many, sometimes not very cooperative, 
groups (for example, the Burmese Anti-Fascist Peo- 
ple’s Freedom League until 1958), and parties that 
simply greatly overshadow their competitors (like 
the Democrats in much of the American South, or 
the Northern People’s Congress in Nigeria, or the 
Congress party in India). That such heterogeneous 
cases do not yield valid generalizations is hardly 
surprising. 

Much more than a straightforward count of or- 
ganizations calling themselves parties is necessary 
to make sense of the category “one-party systems”; 
and even excluding all but genuine electoral organ- 
izations does not quite overcome the difficulty. The 
same is true of two-party systems. The category is 
not logically absurd, but two-party systems in the 
literal sense are extremely rare, if indeed they have 
ever existed. Not even Great Britain, usually con- 
sidered the paragon of bipartism, really fills the 
bill. For one thing, minor parties have always oper- 
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ated alongside major ones in Britain—parties like 
the Liberals of today, the early Labour party, the 
Irish Nationalists, the Liberal Unionists, National 
Labour, the Independent Labour party, the Com. 
munists, the Fascists, Furthermore, in the 77 years 
between 1885 and 1962, the era of modern parties 
and elections, Britain has had only 43 years of 
straightforward one-party majority government, It 
can be argued as well that the major British parties 
function cohesively only at election time, when 
much less is decided than is generally believed, and 
that at other times they are simply combinations 
of factions, despite their discipline in parliamentary 
voting, much in the manner of the major American 
parties. That at any rate seems a tenable position 
in regard to certain party functions, for example, 
the formulation of interests. Talking about two- 
party systems, therefore, also requires one to choose 
what units to include in and exclude from a party 
system, and such choices, if made without sensible 
and explicit criteria, will once again lump together 
a very heterogeneous set of cases, difficult or im- 
possible to generalize about. Take but one example: 
the familiar proposition that bipartism promotes 
stable government will almost certainly founder if 
tested against both British and postwar Italian ex- 
perience. The reason is that in Italy the opposing 
forces (until the recent, still inconclusive “break- 
through to the left”) were constituted very differ- 
ently from the British. One force, the Christian 
Democrats, has consisted of factions representing 
a far wider spectrum of far more antagonistic posi- 
tions than either British party, while the other, the 
Communist-Socialist alliance (formal until 1952, 
thereafter informal and to some extent uncertain— 
but little more so than the alliance of forces in the 
Christian Democratic party ), has been substantially 
an antidemocratic force, the existence of which, one 
may suspect, is all that has bound together i 
opponents. (There are, of course, other parties in 
Italy, but all extremely weak; in the 1958 election 
no other party got as much as 7 per cent of the 
vote. ) 

The concept of a multiparty system also poses g 
logical difficulty. Again, if taken literally, it is @ 
classification that does not classify, for virtually 
party systems fall under it. Even if we depen 
somewhat from literal meaning (for example, a 
discounting very minor or vastly overstate 
forces), the concept would still describe a ete 
ing variety of cases. On one extreme would be ene 
tries like Australia with its three major parties ( ) 
Australian Labour, Liberal, and Country parties) 
or Canada with its two large and two smaller P é 
ties (the Liberals, Progressive Conservatives, 


New Democratic party, and Social Credit); on the 
other extreme would be countries like Indonesia 
which has had an astonishingly large number of 
electioneering groups, more than forty having con- 
tested one election. The classic multiparty systems, 
so often held up to our dismay, like Weimar Ger- 
many (which generally had about nine or ten sig- 
nificant electoral forces) or the French Fourth 
Republic (which had six or seven), would only fall 
around the middle of the spectrum. Moreover, the 
category would comprise some systems in which 
single parties can win and have at times won out- 
right majorities (for example, Sweden and Nor- 
way); systems in which single parties consistently 
come close to that point (Denmark and Norway); 
systems in which no party even approaches a major- 
ity (the Weimar Republic, the Fourth Republic); 
systems in which the distribution of party strength 
is fairly even and systems in which it is remarkably 
skewed; systems in which effective party alliances 
are easy to form and maintain and systems in which 
they are hard to form and harder to preserve; sys- 
tems rent by dogmatic ideological conflicts and 
systems divided by less intense, sometimes pica- 
yune, policy disputes or by regional, religious, eth- 
nic, linguistic, or socioeconomic divisions; systems 
in which party cooperation is virtually nonexistent 
and systems in which parties collaborate on the 
parliamentary level almost as much as distinct party 
factions in some so-called two-party systems col- 
laborate in elections. Under these circumstances, 
again, a lack of tested generalizations is just what 
one would expect. 

The upshot is that if the categories one-party, 
two-party, and multiparty systems are taken at face 
value they yield a classificatory scheme that simply 
does not classify. Taken literally, all but a very few 
idiosyncratic cases fall into the third category, since 
the first is logically untenable and the second is 
empirically nearly empty. And if the categories are 
not taken literally, criteria for counting and dis- 
counting units must be specified, justified, and con- 
sistently applied, which is nowhere done and is 
very difficult to do. In addition, however, a further 
difficulty should be mentioned: The propensity for 
failure of generalizations about party systems that 
are couched in the traditional typology may be due 
$ the fact that many factors independent of num- 
; me shape the impacts of such systems and reflect 
F conditions that impinge on them; among these 

ctors the number of party units may be only a 
minor source of difficùlty. 
ne Aen an adequate set of concepts for 
baad erizing and classifying party systems thus 

y requires (a) technical precision about estab- 
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lishing the number of competing party units so that 
the numerical dimension can be precisely used and 
(b) proper attention to other dimensions of vari- 
ability. 

Distributions of strength. The two requirements 
just mentioned can be met to a considerable extent 
by distinguishing among party strengths. In regard 
to the number of party units in party systems, 
measures of strength can help one identify the num- 
ber of genuinely competitive units, those strong 
enough to play a role worth noting in electoral com- 
petition and in any combinative processes subse- 
quent to elections. If it is sensible to characterize 
a game by the number of players, then surely it is 
also sensible to count only those players who really 
make a difference. Beyond that, persistent (that is, 
patterned) differences in distributions of strength 
among party units merit consideration as a separate 
dimension in characterizing and classifying party 
systems. 

Party strength refers in the first instance, and 
most obviously, to quantities: the number of votes 
or representative offices that a party is able to ob- 
tain. One speaks of a two-party system in Great 
Britain chiefly because of the extreme quantitative 
weakness of all but two parties in that country and 
of a one-party system in northern Nigeria because 
of the overwhelming strength of the ruling North- 
ern People’s Congress, not because it has no com- 
petitors at all, Duverger, in this connection, classifies 
parties into four types: (1) parties with a majority 
bent, that is, capable of commanding an absolute 
parliamentary majority; (2) major parties, which 
cannot normally command absolute majorities but 
can govern with some external support; (3) medium 
parties, which can participate in government only 
by playing a subordinate role in coalitions and 
which cannot get an opposition to coalesce around 
them; and (4) minor parties, which are so small 
as to be incapable of playing any significant role 
at all in government or opposition (1951, pp. 283- 
290 in 1954 edition). One might add the concept 
of the dominant party—which Duverger uses in 
another connection and different sense—to denote 
a very strong party not confronted by any signifi- 
cant opposition. 

Party strength, however, is not a matter of quan- 
tities pure and simple; note that Duverger’s criteria 
for classifying parties according to their strength 
are not explicitly numerical but are concerned with 
the actual or potential role of parties in govern- 
ment and opposition. While that role is certainly 
determined substantially by the size of their repre- 
sentation and electoral support, it may also be 
affected by other factors. Some of these are them- 
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selves quantitative. It is necessary, for example, to 
consider the size of a party’s competitors. One must 
also consider the distribution of numerical strength 
among the larger parties; in some cases that dis- 
tribution might allow small independent parties 
persistently, not just under unusual circumstances, 
to call the tune in the formation of governments 
and in decision making. In addition, a small politi- 
cal force could be important because of its sheer 
capacity for entering coalitions of many kinds in 
rather fragmented party systems (that is to say, 
because of its opportunistic desire for offices) or be- 
cause its location in the political spectrum makes 
it somehow indispensable in coalition making, re- 
gardless of whether its leaders are opportunists or 
men of principle. The Weimar Republic’s Center 
party thus played a role in government far greater 
than its size might lead one to suppose (its vote 
varied between 14 and 19 per cent) precisely be- 
cause it was truly a center party in a very wide 
spectrum of parties and, by Weimar standards, a 
party conspicuously lacking dogmatic principles or 
monolithic class support. The case of the Radical 
Socialists in France is even more blatant. Never a 
massive force in sheer numbers they were neverthe- 
less the dominant party in the Third Republic, the 
payoff value of its representation (as measured by 
access to offices) being greatly enhanced by the 
quantitative weakness of, and deep divisions among, 
other parties, by its center position, and by its 
leaders’ quite extraordinary appetite for office. The 
Radicals usually controlled important positions, 
especially the Ministry of the Interior and the pre- 
miership itself, and this regardless of electoral 
fluctuations. 

Qualitative factors of this sort can also make a 
party less of a force than its size might indicate. 
Above all, a party may be weakened by lack of in- 
ternal cohesion. Indeed, the nominal unity of par- 
ties that are in fact very disunited is often best 
disregarded in the categorization of party systems. 
It is, therefore, possible to speak without violation 
of logic of a one-party system even where no nom- 
inally separate organizations fight elections. The 
term then denotes that genuine competition takes 
place among persistent factions within a formally 
unified party rather than among formally auton- 
omous parties. Usually, but not always, this is what 
is meant by a parti unifié as against a parti unique, 
terms first developed to distinguish between African 
party systems that are monolithic and those that 
display considerable intraparty pluralism (Schach- 
ter 1961, p. 306). Similarly, a nominal two-party 
system can consist of such weak structures that 
it might best be classified with certain multiparty 
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systems, a case sometimes argued in regard to the 
American party system—implicitly by V. 0, Key 
and explicitly by James McG. Burns, This is par- 
ticularly the case where factions belonging to the 
nominal parties frequently “collaborate across for- 
mal party lines, rather than merely representing, 
as in the British case, distinct intraparty tendencies 
that rarely, if ever, coalesce with kindred factions 
in other parties. On the other hand, certain for- 
mally independent parties can be counted as a 
single force because of their persistent and close 
alliance; such is the case of the communists and 
Nenni Socialists in Italy before the Hungarian up- 
rising (the parties then were “two-in-one and one- 
in-two,” according to Nenni). 

It is very difficult to specify precise, general, and 
easily applicable indicators of the capacity for, and 
strategic position in the process of, coalition mak- 
ing, of the degree of interparty identification, and 
of internal party cohesion. While some useful 
measures exist (for example, the amount of cross- 
party voting in legislatures or the frequency of 
significant participation in coalitions), these quali- 
tative factors must necessarily be gauged to a large 
extent by thorough acquaintance with particular 
cases; they are more matters of informed judgment 
than of exact measurement, For that reason they 
may be used to modify the results obtained from 
quantitative assessments of party strength, but the 
latter must necessarily be basic, if only because 
they produce the least equivocal results. 

It may be useful at this point to illustrate how 
quantitative indicators of party strength are and 
could be used to determine the number of units in 
a party system—although this exercise should be 
prefaced by saying that not enough has as yet been 
done along these lines to permit more than sugges- 
tions. In existing writings on party systems, two 
measures of party strength are used, usually im: 
plicitly, for this purpose. One involves the absolute 
size of party units, that is, parties with less than F 
certain percentage of votes or seats are disregarde 
or parties with more than a certain percentage ar 
considered to constitute single-party systems. The 
other involves relative sizes or ratios of forces, Be 
ties being disregarded if they fall far below the a 
of the principal party or parties. Both ne 
although they may lead to different results, ote 
help establish the number of genuinely compet! tly 
units in a party system, as do such less free 
used measures as the extent of division of com A 
over certain offices over time or the rate of altem 
tion in office of various parties. 

On consideration, it faa seem advisable 2 
combine measures of absolute and relative size 


determine the number of competitive party units. 
Suppose, for example, one classified as a one-party 
system any country in which a single party, over a 
number of elections, gets 60 per cent or more of 
the votes or seats, In that case one will end up 
placing in the same class systems which clearly are 
greatly dissimilar, for example, systems in which 
the remaining 40 per cent go to a second party, 
which has at least a chance of becoming the gov- 
erning party (precisely the situation in Dahomey 
as of 1959—before the cessation of party politics— 
and approximated in South Africa and Southern 
Rhodesia in 1958), and systems in which the 40 
per cent are divided among many inconsequen- 
tial parties, none of which has the slightest chance 
of governing (as in northern Rhodesia and the 
Cameroons in 1959). Such a questionable result 
can be avoided if one takes into account the ratio 
of forces between parties, but that criterion alone 
also seems unsatisfactory. For example, classifying 
as a one-party system any in which the ratio be- 
tween the largest and next largest party is more 
than 2:1 would make likely an even more question- 
able result: classifying as a one-party system one 
in which no party can govern by itself, because 
none gets even a bare majority, but in which one 
party is markedly stronger than the others (a con- 
dition approximated in the Weimar Republic until 
1924 and again from 1928 to 1930). Combining 
the two criteria—for example, defining as a one- 
party system one in which a single party over time 
gets more than, say, 60 per cent of votes and seats 
and in which the ratio between the largest and 
next largest party is more than 2:1—avoids both 
difficulties. Similar formulas could be worked out 
easily for party systems having any number of 
units, although distinctions beyond four or five 
units may not be important to make and may be 
adequately covered under some more general rubric. 
(For example, a two-party system might be defined 
as one in which two parties get about 75 per cent 
or more of votes and seats, and in which the ratio 
between the first and second party is less than 2:1 
and that between the second and third party more 
than 2:1—the 75 per cent minimum, rather than a 
lower one, being advisable because parties which 
poll less than 35 per cent of the votes are not 
likely to have a “majority bent.” 

The obvious advantage of such formulas is that 
ay can make the concept of a one-party system 
logically tenable and the concept of a two-party 
Sade empirically relevant, while introducing some 
oe into the very large, nearly all- 
a passing and very heterogeneous universe of 

party systems—distinctions that particularly 
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draw lines between very fragmented systems and 
those that display a considerable amalgamation of 
political forces. They thus solve the most basic 
problems that arise in a classification of party sys- 
tems by number of constituent units. But, to repeat, 
the results obtained by such formulas must still be 
modified by assessment of qualitative factors that 
affect party strength within the broad categories 
used, and such assessments may even lead to shift- 
ing a party system from one category to another. 
Usually one will want to make further quantita- 
tive distinctions to get more detail about the com- 
parative strength of parties within the various cate- 
gories, particularly when strength is treated as a 
separate dimension and not as a tool in the count 
of party units. For some theoretical purposes, for 
example, a distinction may have to be made be- 
tween one-party systems that barely satisfy the 
minimum quantitative requirements and those in 
which a single party virtually has a monopoly of 
votes and seats—no less than between dominant 
parties that are highly cohesive and those that 
are extremely pluralistic. In addition to the dis- 
tinction between cohesion and pluralism in the 
units of two-party systems, one may have to dis- 
tinguish between two-party systems that persist- 
ently and markedly favor a single party (like the 
Bonn Federal Republic so far) and those which, 
like Britain, consist of two rather evenly matched 
forces. So also three-, four-, or five-party systems 
may be evenly balanced or markedly skewed, and 
this difference may matter a great deal. The Scan- 
dinavian party systems, for example, are certainly 
pluralistic, but so skewed that they have not pre- 
vented effective authority in parliamentary govern- 
ment. The same might be said today of India. 
Party-system integration. To characterize a pat- 
tern of competition one needs to know not only the 
number of competitive units and their relative 
strengths but also how the units compete, This 
aspect of party systems has many facets, most of 
which are difficult to discuss in language at once 
general and precise. There is, for example, the 
large question of what sorts of appeals parties com- 
monly use—whether, among other things, their 
appeals tend to be emotionally intense or sober, 
intellectually dogmatic or pragmatic, a matter of 
policies or slogans, programs or platforms. There 
is the very large question of tactics—whether, for 
example, the parties want mainly to make new con- 
verts or to mobilize old supporters, try to exaggerate 
divisions or to minimize their importance, operate 
independently or in more or less close, more or less 
national, more or less open alliances. There is the 
related question of what sort of supporters the 
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parties can muster and toward whom precisely they 
direct appeals: for example, whether the parties in 
a system tend to have distinct social bases or tar- 
gets or whether they are rather similar in these 
respects (as American parties are similar in their 
heterogeneous supporters and targets, and the In- 
dian and some African party systems are similar 
in recruiting militants from a single social segment, 
the urban middle class). There is the question of 
the more general similarity or dissimilarity of par- 
ties in a system: whether the parties fundamentally 
resemble each other, so that there is a rather simple 
pattern of competition between them (for example, 
in that all are “communal” parties or “national” 
parties, all “interest” parties or “aggregative” parties, 
all “ideological” or “pragmatic” parties, all “mass” 
parties or “caucus” parties), or whether they belong 
to fundamentally different species and thus use 
fundamentally different practices in more complex 
patterns of competition (as is the case, for exam- 
ple, in many transitional societies before stable 
regimes, competitive or noncompetitive, are estab- 
lished—for example, in the early years of Indo- 
nesia, where one found simultaneously religious 
and secular parties, broadly aggregative and narrow 
interest parties, national and ethnic parties, organ- 
izationally strong and weak parties, ideological and 
pragmatic parties). 

Despite the rich technical language available for 
characterizing individual parties, there is no gen- 
eral and precise vocabulary for dealing with varia- 
tions of this sort on the party-system level; and one 
must rely, to a large extent, on unsystematic de- 
scriptions of particular party systems. However, at 
least a start has been made toward the construction 
of appropriate concepts, perhaps the most notable 
attempt being Helmut Unkelbach’s use of the con- 
cept “party system integration” (parteipolitische 
Integration), which can subsume many aspects of 
the different modes of party competition (1956, see 
especially pp. 36-41, 51-95). 

To Unkelbach, a party system is integrated when 
it involves a low level of conflict or enmity ( Gegner- 
schaft) or a relatively small “distance” between 
parties; it is unintegrated when the opposite holds. 
The term may describe dynamic processes as well 
as stable states: party systems “disintegrate” when 
Gegnerschaft is activated in them and become in- 
tegrated when it is toned down. And it can be used 
to describe particular parties or complexes of par- 
ties within a party system as well as the system in 
general. Unkelbach mentions, for example, two 
special forms of party integration which obviously 
involve subsystems of party systems: two-party 
systems in which each is highly aggregative and 
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successful in avoiding centrifugal tendencies, and 
groups of parties that are both close to one another 
and capable of commanding parliamentary major- 
ities between them. He also distinguishes between 
genuine integration, which is based upon a con- 
sensus sufficient to allow different parties or fac- 
tions to carry out a common program, and false 
integration, that is, mere electoral collaboration, 
The crucial consideration throughout is the polit- 
ical “distance” between parties or factions, and this 
concept summarizes many important aspects of the 
modes of party competition. It is gauged (unfor- 
tunately, as yet not measured) on the basis of five 
general criteria: (1) differences in fundamental 
views or orientations (weltanschauliche Grund- 
lagen); (2) differences in concrete goals or in the 
rank order of goals pursued; (3) differences in the 
means of striving for concrete goals; (4) differ- 
ences in the assessment and valuation of political 
personalities; and (5) differences in the social com- 
position of the parties or factions, particularly their 
class basis and the extent to which they aggregate 
diverse forces. The results obtained on these bases 
can, however, be modified by two related condi- 
tions: the existence of a higher consensus, which 
creates an emphasis on the requirements of the 
common welfare and thus lessens political distance, 
and the predominance of elements of conflict over 
elements of harmony, which increases the distance 
by creating “artificial” divisions. In parliamentary 
regimes another criterion, one that does not involve 
electoral activity, can be used to supplement, of to 
indicate, the others: the character of collaboration 
in coalitions. The collaboration may be frictionless 
or involve only minor differences, indicating polit- 
ical proximity or even kinship; or it may involve 
great differences or prevent the maintenance of 
coalitions, indicating various degrees of enmity. 
(Collaboration in opposition might indicate the 
same things, as might the collaboration of legis- 
lative factions in presidential regimes. ) f 
Assuming the possibility of reasonably precise 
assessment, the concept of party-system integration 
holds great promise. It can help to determine accu- 
rately the number and relative strengths of truly 
competitive units in a party system; for example, 
whether a single nominal unit, like the Malayan 
Alliance, should be counted as several real units, OF 
whether several nominal units form a single ii 
one, like the Italian communists and Nenni Socia" 
ists before 1956 and the two main Austrian ba 
which contest elections but usually collabo ai W 
government through the proporz system. More al 
portant, the concept can be used to make ene 
distinctions between party systems consisting 


similar numbers of units. As an example, contrast 
a two-party system like the British, in which the 
distance between the parties is slight, with one like 
the Italian (assuming it qualifies as a two-party 
system), in which the distance could hardly be 
greater. Such systems must surely result from dif- 
ferent conditions and have different consequences. 
Furthermore, it might well be found that the con- 
ditions and consequences of party-system integra- 
tion or lack of integration are everywhere the same, 
regardless of the number or strengths of units that 
are involved. Beyond all this, the concept gets sum- 
marily at many aspects of party competition and 
could provide answers for important questions 
about the impacts of party systems on the larger 
political system. 

Party-system salience. The dimensions discussed 
so far—the number, relative strengths, and integra- 
tion of party units—treat party systems entirely 
without reference to their contexts (with the excep- 
tion of one of the subdimensions of party-system 
integration, social composition, which inherently 
requires reference to something beyond the systems 
themselves). But party systems have contextual no 
less than immanent characteristics. Not only are 
they conditioned by (and in turn condition) the 
larger political and social systems and some of their 
other subsystems, but they also participate with 
other structures in various political processes. The 
ways in which they do so are part and parcel of 
their structural characteristics and may, like their 


integration, be placed on a single, but divisible, q 


dimension: “salience.” 

In general, party-system salience refers to the 
functional significance of party systems in political 
processes, in other words, to the extent that they 
are in a position to contribute to, or undermine, the 
Operation of political systems or their parts. This 
involves at least two closely related considerations: 
the significance of party systems relative to other 
Structures in over-all processes of political compe- 
tition (including the extent to which other struc- 
tures are subsumed to the units of party systems) 
and the range of the political (and social) functions 
the party systems actually perform. 

Political competition has one or more of three 
objects: to recruit decision makers, to influence the 
making of decisions, and to affect the specific ways 
in which decisions are applied and values are allo- 
cated. In regard to each of these, party competition 
May range from being all-important to being im- 
material relative to other sorts of competitive proc- 
esses. A familiar example is the difference in 
Fem salience between spoils systems and 

ose in which the appointive and allocative powers 
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of elected officeholders are highly circumscribed. 
In this case it is largely formal—legal rules that 
define the salience of party systems, but many other 
factors may have similar effects. No rules, for ex- 
ample, underlie the presently growing relative sig- 
nificance of pressure groups in most Western 
countries; rather this tendency reflects, among 
other factors, the expansion of governmental activ- 
ities; the increase in comprehensive memberships 
and effective organization of the groups; the fossil- 
ization of many parties (Lowi 1963); and the great 
consensus on the broad, aggregative policy alterna- 
tives that electoral competition usually defines. Nor 
do rules underlie the relative lack of salience in 
leadership recruitment in the case of fragmented 
party systems that do not yield clear electoral re- 
sults and where the recruitment function is largely 
performed jointly by parliamentary groups and in- 
dividuals, who may operate without close reference 
to party lines; Italy in the age of trasformismo and 
France during most of the modern era are cases in 
point. On the other hand, constitutional rules 
greatly contributed to the relative insignificance of 
the party system in Imperial Germany, where the 
executive was not a creature of parliament and was 
accountable to it only in very restricted ways, al- 
though Bismarck’s skill in manipulating the party 
system was certainly a factor in his personal 
success, 

The salience of party systems relative to other 
structures of political competition reflects, but is 
not quite the same thing as, the range of political 
functions to which party systems contribute. It is 
not the same thing because party systems contrib- 
uting to a great variety of functions may not con- 
tribute to them very much; yet the range of func- 
tions is itself a partial measure of their salience. 
Perhaps the main distinctions to be made on this 
subdimension are between party systems that re- 
strict activity to the more manifest functions of 
such systems (recruitment and the aggregation of 
preferences ), those that emphasize more latent 
functions (such as the creation of a sense of na- 
tional identity, the awakening of participant atti- 
tudes toward the political system, and the provision 
of political education and training), and those that 
combine both types of functions. The last category 
undoubtedly is much the largest. However, the 
earliest Western parties, those of the age of repre- 
sentative oligarchy, Duverger argues, belonged uni- 
formly to the first group, while the broader party 
systems formed in response to democratization or 
in anticipation of suffrage expansion at first fell 
mainly into the second group, as do many contem- 
porary non-Western parties. 
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Party-system salience is an important dimension 
not just for purposes of description but also because 
variations in salience may be expected to have im- 
portant consequences for theories about the deter- 
minants and functional impacts of party systems. 
The more salient party systems are bound to have 
more considerable functional repercussions than 
the less salient ones, and such systems are also 
likely to be more sensitive to external conditioning 
factors. They may be expected to be the crucial 
nerve centers, buttresses, or impediments of repre- 
sentative government. But if party systems matter 
little compared to other structures, and this is real- 
ized, the tendency of social forces will be to work 
upon the other structures. Hence any generalization 
about the other dimensions of party systems will be 
vulnerable to the extent that the weight of such 
systems in political processes is ignored. 

Party-system dynamics. Up to this point only 
Static variables have been discussed—that is to say, 
variables that can be applied at any point in time 
as well as to describe persistent patterns over time. 
Undoubtedly these variables are best applied to 
highly persistent patterns, since very transient 
structures resist rigorous analysis. But patterns that 
cover long durations can rarely be discussed ade- 
quately in static terms alone. They are always likely 
to change, to develop; some party systems, indeed, 
are so unstable that a classification of them at any 
moment in time will be virtually useless. While 
changes in party systems can largely be described 
by characterizing them statically at successive points 
in time, inherently dynamic categories that describe 
general, frequently occurring characteristics of the 
process of change itself are also useful. 

A distinction should be made at the outset be- 
tween dynamics within party systems and the dy- 
namics of party systems. For the first, Duverger 
has developed a very useful typology that deals with 
“success patterns” and “displacement patterns” 
within party systems that change in content but not 
in form. Duverger distinguishes four basic types of 
intrasystem dynamics. One is alternation—the reg- 
ular movement from office to Opposition and oppo- 
sition to office which one finds in some, but not all, 
two-party systems, and which may matter to the 
construction of generalizations about such systems 
quite as much as system integration. The second 
he calls stable distribution—the absence of serious 
variations in party strength over a long period (as 
in Holland, Switzerland, and Belgium between the 
wars). The third is leftism—the gradual disap- 
pearance of parties on the right and the birth of 
parties on the left, or the gradual weakening of the 
right and increasing strength of the left, or the 
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displacement of moderate leftist by more radical 
leftist parties, without any change in the formal 
characteristics of the system. Finally, there is dom- 
ination—either the continuous dominance of a 
party (like that of the Radicals in the French 
Third Republic) or alternation in which the swing 
of the pendulum in one direction is consistently 
much longer than in the other. Additionally, Duver- 
ger distinguishes between “normal evolution” and 
“sudden mutations” within a party system, the lat 
ter referring to very abrupt shifts in the political 
balance, as in the rapid rise of Nazism in Germany 
after 1930 and of Rexism in Belgium in 1936. 
The basic distinction to be made in characteriz- 
ing the dynamics of party systems is of course 
between stable and mutable systems—those that, 
despite intrasystem developments, maintain their 
formal characteristics over time and those that do 
not. The latter category, however, obviously re- 
quires some subcategories, For example, continuous 
and rather prolonged tendencies (like the gradual 
fragmentation and weakening of the dominant par- 
ties in India and in Pakistan before 1958) should 
be distinguished from sudden discontinuities, like 
those that result from coups and revolutions (for 
example, the rapid destruction of the old German 
and Italian party systems by the Nazis and Fascists 
and their subsequent reappearance in radically al- 
tered form), One must also distinguish fluidity in 
party systems from patterned trends. Fluidity in- 
volves seemingly random, or at any rate highly 
ephemeral and often contradictory, movements in 
party life—the constant appearance and disappear- 
ance of parties, the continuous ebb and flow of 
consolidation and fragmentation, that characterized 
the limited, oligarchic representative systems of 
nineteenth-century Europe, with their parties cf 
notables and caucuses, and that seems equally typ- 
ical of some new and transitional nations today, 
especially those in which personalism is the charac- 
teristic form of party leadership. In contradistine 
tion are patterned trends toward consolidation oF 
fragmentation (decreases or increases in the num- 
ber of competitive units) or toward the increas 
integration or disintegration of party corir a 
The origins of party systems are also an pe 
of party-system dynamics, for origins are always 
part of developmental patterns. For example, Pa 
systems that originate in the followings of a ne 
crats or other political notables in oligarchic he 
sentative systems may be contrasted with ge 
that begin with mass organizations. Similar disti ah 
tions can be made along many other lines. One Ci 3 
contrast the party systems of new states a a 
Philippines, where the colonial power fostere P 


ties before independence, with those in which the 
beginnings of independence and party life coincide; 
and one can contrast party systems that originate 
in the followings of dynamic personal leaders, in a 
broad but unmaintained nationalist or revolution- 
ary consensus, in ethnic or other “communities,” 
or in interest-group formations that turn from semi- 
political to fully political activity. In making such 
contrasts, however, we are no longer merely charac- 
terizing party systems, we begin to deal with the 
determinants of their varieties; for the origins of 
party systems, while an aspect of their dynamics, 
are of interest primarily because they may condition 
subsequent development. 


Determinants of party systems 


To account for the characteristics of party sys- 

tems, three broad aspects of their contexts can be 
explored. One is the general political system or cer- 
tain of its substructures: party systems might be 
treated mainly as responsive to larger or other 
aspects of politics. The second is social structure 
and culture; party systems might be treated pri- 
marily as structures that politically crystallize socio- 
cultural forces. The third is their own histories: 
party systems with a past might be conditioned 
largely by that past, that is, might be self-propelling 
along predictable paths once set in motion, or even 
self-maintaining, regardless of external forces. 
Each of these approaches obviously comprehends 
many possibilities that must be disentangled, and 
while in some respects they may be mutually ex- 
clusive, in others they can be treated as comple- 
mentary and combined. 
y Political systems. Among the hypotheses that 
link party systems to other aspects of political struc- 
ture, indeed among all existing hypotheses about 
party-system determinants, one looms much larger 
than the rest, in terms of both the frequency and 
rigor with which it has been argued. This is the 
theory that party systems are essentially products 
of electoral arrangements, of the formal rules gov- 
erning their competitions. 

The most compelling reasons for the prevalence 
of this theory are that the manifest objective of 
party competition is to maximize a group's repre- 
Sentation and influence in government and that 
different electoral systems, like different rules of 
games, require varying strategies in pursuit of that 
saad Above all, the formal mechanics of elec- 
Pa systems seem to have a bearing on combina- 
ie raean deciding whether groups with 
fer ar interests and opinions can maximize their 
a uence by operating independently in elections, 

through different kinds of alliances, or by per- 
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manently subsuming themselves to larger aggrega- 
tive groups. Election rules thus influence the num- 
ber, sizes, and integration of units and dynamic 
tendencies in party systems. Hypotheses about the 
effects of electoral systems are also popular because 
they involve formal-legal political structure, long 
the special concern of political scientists, and be- 
cause they are particularly suitable to studies that 
have a remedial intent. Electoral systems, after all, 
are easy to identify and manipulate, while social 
and historical forces are more elusive since they 
are generally givens and thus defy control. [See 
ELECTIONS, article on ELECTORAL SYSTEMS.] 

Despite their frequency and surface plausibility, 
however, theories about the relations between elec- 
toral rules and party systems, closely examined, 
leave one with some feelings of unease. Invariably, 
they have more logical than empirical content. 
Moreover, in the course of their development, they 
have become constantly more complex (reversing 
the usual course of scientific inquiry) and increas- 
ingly qualified by other factors, both tendencies 
resulting from attempts to neutralize cogent criti- 
cisms without yielding fundamental ground. These 
points, which at least make one wonder, are illus- 
trated by the three most comprehensive works so 
far written on the subject. 

The first systematic exposition of the relations 
between electoral and party systems was that of 
Hermens (1941 ). The argument of that work, some- 
what simplified, was that proportional representa- 
tion undermines democracy by the effect it has on 
party systems: it tends to fragment such systems 
into large numbers of independent units and, in 
the sense we have used the term here, to disinte- 
grate them; in contrast, the majority system of 
elections consolidates and integrates the parties and 
thus promotes effective government. The case for 
this theory was in the first instance made logically, 
and that part of it carried much conviction. Then 
empirical materials were produced to support the 
logic; at this point, however, the argument became 
much less compelling, since the data were clearly 
selected to fit the case and contradictory cases could 
easily be found. These cases in fact soon became 
the basis of a large anti-Hermens literature. 

Duverger’s Political Parties (1951) was the next 
major work on the subject (see also Duverger 
1950). It resembles Hermens’ work both in its basic 
arguments and the force of its logic but differs from 
it in several respects. In the first place, Duverger’s 
argument is more complex because it concerns three 
general types of electoral arrangements, not two: 
proportional representation, the simple-majority 
single-ballot system, and the majority system with 
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two ballots (as used in France during most of the 
Third Republic). To the last Duverger attributes 
results similar in many respects to those of propor- 
tional representation, The basic implication is obvi- 
ous: a major modification of the majority system 
can produce results different from those that Her- 
mens attributed to it. More important, Duverger 
conscientiously adduces empirical exceptions to his 
arguments, sometimes indeed as many exceptions 
as cases in point; for example, the hypothesis that 
the simple-majority single-ballot system leads to a 
two-party system with alternation in power (which, 
according to Duverger, comes as close as it is pos- 
sible to come to a sociological “law”) is illustrated 
by Britain, Uruguay, the United States, and Turkey 
(since 1950), but is contradicted by the “excep- 
tions” of Belgium before 1894, Sweden before 1911, 
Denmark before 1920, and contemporary Canada. 
The deviant cases are then explained away by vari- 
ous “special conditions,” implying, again in opposi- 
tion to Hermens, that there is more than electoral 
logic to the conditioning of party systems, even if 
electoral logic retains pride of place. However, to 
call the other relevant factors “special conditions” 
and the cases they govern “deviant cases” is to 
assume that the rule “electoral systems determine 
party systems” has already been established; but 
Duverger nowhere shows just how this has been 
done. His manifest object is to complicate the Her- 
mens thesis somewhat in order to accommodate the 
more obvious contradictory cases and to disarm 
empirical criticisms of the more complicated theory 
itself by invoking saving conditions wherever it 
seems endangered by less obvious cases. 
Unkelbach’s work (1956), which also contains 
many of the same basic arguments, is a distinct 
advance over Duverger at least in one respect: many 
deviations from the general rules are themselves 
due, in Unkelbach’s argument, to variations in elec- 
toral systems, not to “special” conditions. The book 
shows, with mathematical logic, how even minor 
changes in an electoral system can dictate impor- 
tant adaptations in party strategy and thus in party 
systems. Hence, Unkelbach does not deal with only 
three types of electoral rules. Instead he adduces 
Separate arguments for a huge number of them, 
even when they seem to have much in common. 
For example, within the general category of “major- 
ity systems” he distinguishes the effects of the 
relative majority system (Britain), the absolute 
majority system (Imperial Germany), the New 
York City variant of the relative majority system, 
the majority system with ballotage (Third Repub- 
lic), the majority system with the alternative vote 
(Australia), the “two-step” relative majority system 
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(Uruguay), and the various kinds of majority sys. 
tems in multimember constituencies. But this pro- 
cedure, while making it less necessary to introduce 
nonelectoral factors into the analysis and while 
being logically more convincing in its very nature, 
creates an important difficulty of its own. It ends 
up making a separate case for almost every elec- 
toral system in every country and thus makes it 
virtually impossible to test the logical arguments by 
empirical generalization. And even so, Unkelbach 
is compelled to make his findings fit the data by 
systematically invoking three additional determi- 
nants: other aspects of political structure, under- 
lying social forces (a large category indeed), and 
the “free” actions of decisive personalities. These, 
moreover, are not treated as merely minor influ- 
ences that can to some extent modify the tenden- 
cies emanating from electoral systems: they can 
reverse them and must therefore be, in some in- 
stances at least, the fundamental determinants of 
party systems. 

The whole literature linking electoral and party 
systems thus confronts one with a paradox: it has 
great logical force and has acquired increasingly 
greater, if not simpler, force of logic over the years, 
but it often lacks empirical fit, no matter how com- 
plicated the logic has been made. The explanation 
for this discrepancy between logic and experience 
is that the conditions assumed in logical arguments 
are not always satisfied, perhaps even rarely satis- 
fied, in actual cases. These assumed conditions are 
very much like those posited in economic theories: 
that parties try to maximize their influence and 
that they are infinitely malleable institutions that 
can readily adjust their behavior to changes in the 
conditions of competition. But both are doubtful 
assumptions when indiscriminately applied, partic- 
ularly the latter. Because of this, theories like 
Duverger’s and Unkelbach’s may be more useful as 
models against which to gauge the actual ration- 
ality, power drive, and malleability of units in party 
systems than as theories that account for variations 
between such systems. s 

To illustrate, take a case that would seem highly 
deviant to all the electoral-systems theories: a coun- 
try that uses the British electoral system but has 
nothing like a two-party system with alternation = 
power, namely, India. The Indian party system be 
somewhat ambiguously poised between the we 
party dominant and polyparty categories. There A 
one very large aggregative party, Congress, ae oa 
great number of very small, unaggregative, CO 
munal, and ideological parties, like Jan ma 
Mahasabha, Ram Rajya Parishad, and the oe 
munists. Congress may resemble the kinds of p 


ties we normally find in two-party systems but the 
rest certainly do not, and the system as a whole 
Jacks integration; nor has there been any tendency 
toward party consolidation and integration in the 
opposition—if anything, the reverse. 

Weiner’s study of the Indian party system (1957) 
mentions four factors that help explain this situa- 
tion. First, Indian parties typically are not just 
electoral organizations but loci of their members’ 
social life, “total organizations” that are often sub- 
stitutes for family, caste, village, status group— 
even job. Such organizations, frequently found in 
non-Western nations and not unknown in the West 
itself, clearly mean too much to their adherents to 
be very malleable in the light of coldly rational 
electoral calculations. Second, the Indian parties 
lack power drive, since the general Indian culture 
particularly undervalues the political realm. Third, 
many opposition parties fail to make calculations 
that would improve their electoral chances simply 
because they are not committed to working within 
the parliamentary system (something also familiar 
in the West and, even more, in other areas). Finally, 
there is a general lack of interest organizations. 
This seems to militate against party consolidation 
and integration because a group submerged in a 
larger organization for electoral purposes does 
not have available alternative channels—pressure- 
group politics—for pursuing its special interests, 
at least not to the extent it would in many Western 
nations where the value sacrifices involved in 
electoral consolidation are correspondingly much 
lower. 

These and other observations we have made sug- 
gest a generalization about the relations between 
electoral rules and party systems. Electoral rules 
govern the forms of party systems along the lines 
of maximizing models only under specifiable con- 
ditions, the absence of which may lead to results 
very different from, and even contradictory to, 
those predicted by the models. These conditions are 
(1) that the logic of the rules be understood by the 
actors; (2) that there be commitment by the actors 
to representative rule; (3) that they have a strong 
appetite for power as an end and that significant 
power positions be attainable through electoral 
competition; (4) that the actors’ political intensity 
be low (that is, their political style be pragmatic, not 
doctrinaire); (5) that the units through which they 
act be relatively specific, not highly diffuse, in social 
purpose; and (6) that there be available in the po- 
litical system useful alternative channels for pur- 
Suing interests. 

Three points follow from these conditions. Since 
the requirements for unfettered maximizing actions 
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in party competition are numerous, such actions 
are likely to be rare, and hypotheses confined to 
the relations of electoral rules and party systems 
are likely to be weak. Second, one must pay at least 
as much attention to other aspects of the political 
contexts in which party systems operate as to elec- 
toral rules; these include other kinds of formal 
rules and, above all, aspects of political culture and 
the wider structure of political competition in a so- 
ciety. [See POLITICAL cuLTuRE.] And third, since 
the latter are ineluctably tied to sociocultural forces, 
only exceptional party systems may be explicable 
within a framework of purely political considera- 
tions—which raises the possibility that social and 
cultural conditions may themselves suffice to ac- 
count for their characteristics. 

Sociocultural forces. It has been argued (by 
Downs 1957) that party systems may be con- 
ceived of simply as showing the distribution of po- 
litically relevant preferences in society; for exam- 
ple, the more modal preferences there are, the 
larger the number of parties. This position may be 
useful as a premise for abstract model-building, but 
it is not even a tenable truism, Parties, after all, 
may, and often do, “aggregate” to such an extent 
that they comprehend internally various modal 
preferences—think of the Christian Democrats in 
Italy, Congress in India, or either American party. 
Also, modal preference groups have available many 
channels other than parties through which to act 
politically, particularly pressure groups and revo- 
lutionary organizations; for that reason alone 
“modal preference” need not equal “party,” although 
it may in special cases. 

Party systems are obviously susceptible to the 
conditioning of social structure and culture, Trib- 
ally fragmented societies are not likely to produce 
consolidated or integrated party systems and have 
rarely done so, although party itself might some- 
times be an antidote to social fragmentation. The 
doctrinaire and schismatic tradition of Islam seemed 
to make the preservation of an aggregative domi- 
nant party far more difficult in Pakistan than 
was the case in India (Almond & Coleman 1960, 
p. 196). Sometimes, indeed, the salient divisions of 
social structure are virtually mirrored in those of 
the party system, as in the Belgian party split be- 
tween Catholics and freethinkers, a division cor- 
responding to the overriding national split between 
those who speak Flemish and the Walloons who 
speak French (Neumann 1956), or in the division 
in South Africa between English- and Afrikaans- 
speaking people, or in the ethnic bases of the 
Nigerian parties. 

Many such cases can be cited, but the relations 
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between party systems, social structure, and cul- 
ture are far from simple or readily apparent. One 
difficulty is to determine which among the many 
aspects of social structure and culture most im- 
pinge upon party systems, for such systems, on the 
evidence, may or may not reflect almost any aspect 
of social life. In one case, Scandinavia, the most 
relevant factor, if there is one, would seem to be 
the long past division of society into estates; in 
Belgium, coinciding religious and linguistic divi- 
sions; in America, regional divisions; in Italy, per- 
haps a split between the sexes, as much as anything 
else; and in many of the new nations, demographic 
factors and generational conflict. 

More important is the fact that even the pro- 
foundest sociocultural cleavages often are not re- 
flected, or are only imperfectly reflected, in party 
systems, no matter what they are. Some sociocul- 
tural splits never enter the political realm at all. 
Even if they do, they may find political outlets 
other than parties. They may have to find such 
alternative outlets if parties are, as they well 
may be, unresponsive to sociocultural divisions or 
changes in such divisions, particularly since it is 
not easy to found new competitive units where the 
old are well entrenched. And, just as many socio- 
cultural conflicts are integrated in the units of 
party systems, so there are party splits that simply 
do not correspond to sociocultural ones. Party itself 
may be the main thing that divides some societies, 
especially where the appetite for office and spoils 
is great; to what sociocultural divisions, after all, 
did the party and factional divisions of transformist 
Italy correspond? 

All this means that, like electoral exigencies, 
sociocultural conditions give only a very partial 
account of party systems, in that they account only 
for limited numbers of cases and/or account only 
for some aspects of the cases. A combination of 
the two approaches would clearly yield better re- 
sults than either alone, for electoral strategy is 
clearly one factor that bears upon how parties 
reflect sociocultural forces, while sociocultural 
forces in turn can obviously distort and limit a 
purely maximizing approach to elections. 

The past of party systems. There is, however, 
yet a third way to account for the characteristics 
of party systems, which seems to work well in nu- 
merous cases, Party systems, at any point in time 
except their origin, are largely creatures of their 
own histories; once in being, they become self- 
moving, even in the sense of self-perpetuating. The 
argument implies that party systems are highly re- 
sponsive to external conditioning only at their for- 
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mation, that subsequently they either retain their 
essential characteristics or change only very slowly, 
unless totally destroyed in political upheavals, In 
other words, their development is largely inherent 
in their original shape. Party systems may simply 
become incorporated into the habit background of 
society (for example, acquire extensive followings 
of nonfloating voters). Like so many organizations, 
parties, especially modern, bureaucratized, mass 
parties, may spawn groups with strong vested in- 
terests in keeping the systems substantially as they 
are. Or the party system may itself create strategic 
imperatives that condition its development: for ex- 
ample, if a consolidated system emerges early, it 
may in all cases be more politic for small groups to 
seek to work within it rather than independently. 

One of the most conspicuous cases illustrating 
this theory is the remarkable similarity between the 
party systems of Imperial Germany and the Wei- 
mar Republic, despite a change in the electoral 
system, and the great change in German party life 
that followed the destruction of the old organiza- 
tions during the Nazi period. Even electoral-systems 
theorists implicitly provide cases in point. An ex- 
ample is the now common argument that while 
proportional representation, contrary to earlier be- 
liefs, might not fragment or disintegrate party sys- 
tems, it cannot lead to their consolidation and 
integration—which may only be an instance of the 
inability of electoral systems to affect party systems 
at all. 

Party origins are used particularly often to ac- 
count for one-party dominant systems—systems 
that for obvious reasons have never been related by 
anyone to electoral machinery. The most common 
hypothesis is that one-party dominance comes into 
being where party systems are formed in struggles 
for national independence, since such struggles 
channel political activity toward a single, widely 
shared, intensely held objective and reduce domes- 
tic conflict by focusing political Gegnerschaft on 
the clash with the external forces (Hodgkin 1962, 
pp. 22-23), Such dominance may then persist 
through the very machinery, symbols, and habits 
it creates, even where the political system remains 
in some sense competitive. It may even persist 
when unity begins to disintegrate, since the Por 
of disintegration may give rise only to many sma" 
uncompetitive opposition fragments. This we 
Process may be related to extreme ethnic, region 5 
religious, or ideological divisions; it could, howevé 
also result from the fact that the very strength 2 
the dominant party makes the consolidation of 0P- 
position useless (under any electoral system) an! 


thus provides no benefits to balance against the 
unavoidable costs of factional combination. 

Origins may also play a significant role in shap- 
ing other kinds of party systems. It has been ar- 
gued, for example, that fragmented party systems 
tend to come into being (and then to persist) 
where the beginnings of wide political participation 

coincide with the simultaneous existence of deeply 

divisive conflicts over fundamental issues, such as 
issues of national identity, constitutional order, and 
basic internal and international policy. Contrasts 
are often drawn in this connection between Britain 
—where issues were settled in the following se- 
quence: national identity, religious policy, consti- 
tutional order, basic socioeconomic policy, and 
the question of Britain’s international function 
—and France and Germany, where all such issues 
arose at once, just when the party systems were 
being formed. Contrasts have also been drawn 
between party systems that originate in competi- 
tive and noncompetitive systems, as full-fledged 
participants or nonparticipants within competitive 
systems, within or outside of parliaments, in “pre- 
party” formations or directly as party formations. 
To mention some examples: Parties that originate 
as clandestine groups in noncompetitive systems 
seem generally to retain a highly uncompromising 
style and highly disciplined organization, becoming 
either instruments of noncompetitive rule (in the 
manner of the Moroccan Istiqlal or the Tunisian 
Neo-Destour) or disintegrated units in genuine 
party systems. Parties that do not early in their 
histories play a fully participant role in government 
—like the German Imperial or prewar Japanese 
parties, which contested elections but had no 
chance to run affairs—seem to develop highly 
doctrinal, “irresponsible,” and oppositional styles 
that lead to disintegrated party systems. Parties 
that originate in parliamentary combinations, like 
the old British caucus parties, seem to put much 
more emphasis on electoral struggle than other 
goals and ties and thus to respond more to the 
exigencies of electoral systems than parties formed 
outside of parliaments, And parties originating, 
like so many contemporary African parties, in other 
associations —trades and farmers’ unions, youth as- 
Sociations, literary societies, study circles, tribal 
associations, religious sects, etc.—also seem highly 
resistant to electoral logic, tending to be “total or- 
ganizations” in one-party dominant or disintegrated 
party systems. 

In theories of party systems, the idea of inertia 
must play a serious role. Yet changes in party Sys- 
tems do occur, and not only after great political 


PARTIES, POLITICAL: Party Systems 451 


upheavals or always in ways that seem immanent 
in earlier stages. This, coupled with previous argu- 
ments, implies that just as party systems cannot 
be characterized on a single dimension, so their 
characteristics cannot be accounted for by one set 
of determinants. Numerous conditions manifestly 
are involved. 

The present task of study is, however, not merely 
to add up determinants to yield plausible accounts 
of specific cases—nothing is easier—but to dis- 
cover the general circumstances under which 
each of the conditions discussed above may be of 
special significance. For example, it may well be, 
as already suggested, that sociocultural factors are 
of particular significance during the genesis and 
youth of party systems—much as environmental 
stimuli most affect human personalities early in 
life; that thereafter party systems become con- 
stantly less sensitive to such forces and more self- 
determining; but that, under conditions earlier 
specified, they will replace the pulls and pushes of 
broad social forces with responsiveness to electoral 
rules. It may also be that through such a process, 
party systems become gradually less salient in polit- 
ical competition compared to structures more sen- 
sitive to social forces and, in some cases perhaps, 
become a positive hindrance to effective representa- 
tive rule, regardless of the degree of consolidation, 
distribution of strength, or amount of integration 
in the systems (cf. Lowi 1963, pp. 573-575). 


The functional impacts of party systems 


The last sentence touches upon the third set of 
problems that arise in the study of party systems, 
namely, problems about their functional impacts. 
By this is meant broadly how party systems affect 
the contexts that affect the systems: how party 
systems contribute to the viability and effective 
working of other structures, including political sys- 
tems as a whole, and to the achievement of specific 
goals. Questions concerning the functional impacts 
of party systems are inseparable from questions 
about their salience. But since it can be assumed 
that generally only high salience can give party 
systems great functional significance, study can be 
concentrated on the other characteristics of the 
systems, or salience can be treated as itself a gen- 
eral aspect of the functions as well as the forms 
of party systems. 

In a field generally in infancy, the study of func- 
tional impacts is at present much the least devel- 
oped. We lack not merely tested theories but any 
theories at all, or even explicitly stated problems 
about which to theorize. Possible party and party- 
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system functions are usually just inventoried, and 
even that only partially, without attention to the 
kinds of party systems that perform them ade- 
quately or the conditions under which they do 
so (Lowi 1963, pp. 571-573). 

Thus far, the functional impacts of party sys- 
tems have been studied mainly in connection with 
one problem that arose naturally in the therapeutic 
literature on the subject: What kinds of party sys- 
tems support or undermine stable and effective 
representative rule? The answer most often ad- 
vanced has, equally naturally, been couched in 
terms of the classical typology of party systems: 
two-party systems supposedly make democracies 
stable and effective, multiparty systems do not. In 
view of what has been said here, the terminology 
used in this proposition itself casts doubt upon it. 
But even if one ignores that point, the hypothesis 
still leaves much to be desired both in logic and 
empirical fit. Logically, one can construct a situa- 
tion in which a polarization of the party system 
might have devastating consequences—for exam- 
ple, a polarization of moderate and extremist forces 
that would make electoral competition tantamount 
to civil war and concentrate all the energies of the 
moderates on the process of keeping rather than 
using power. Where political conflict has great 
potential for violence a wide dispersion of political 
forces might be much preferable, even if not ideally 
desirable. In addition, a highly consolidated party 
system unsupplemented by alternative channels of 
political competition—a hyperaggregative system 
—could lead to massive alienation from represent- 
ative government. It would be equally easy to con- 
struct logical models of working multiparty sys- 
tems, not least because such systems have actually 
existed and now exist—in Norway, in Denmark, in 
Iceland, in prewar Czechoslovakia, and in postwar 
Holland. All this quite apart from the question of 
what sort of party system optimizes the quite sep- 
arate values of representing opinions and support- 
ing authority, while performing well the more 
manifest party-system functions discussed at the 
outset. 

In the functional assessment of party systems, it 
is precisely such balances between different func- 
tional impacts that must be struck, for party sys- 
tems impinge upon much more than the stability 
of parliamentary institutions. Indeed, in many con- 
temporary societies other functional questions, as 
yet unstudied, even unformulated, loom much 
larger. For instance: What types of party systems, 
under what conditions, conduce to or hinder the 
formation of new political identities, especially 
broad national identities? What systems, under 
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what circumstances, help create feelings of political 
community, that is, ties within as well as ties to 
new political structures? What party systems are 
suitable to political mobilization, that is, to en- 
gendering participant attitudes and behavior among 
political “parochials”? What party systems tend to 
adapt well to new political demands and forces? 
Do any party-system characteristics particularly 
help or hinder the rapid “modernization” of soci- 
eties, in politics or in a wider sense? What party 
systems, under what conditions, encourage the pro- 
liferation and salience of other structures of politi- 
cal competition, and with what general effects on 
politics and society? What party systems produce 
what sorts of special political skills, help overcome 
personal disorientation and insecurity in periods of 
rapid social change, and bring out, accentuate, or 
effectively sublimate destructive conflicts? 

We know little or nothing about such questions; 
they can only be proposed as items on a large 
agenda. But the questions themselves indicate some 
of the most poignant difficulties, even dilemmas, of 
representative rule, as well as the centrality of party 
systems to such rule. Party systems operate at the 
most sensitive points of representative government, 
where special interests, aggregated conflicts, na- 
tional authority, and national purpose conjoin. 
Hence their great multifunctionality, But can any 
party system satisfy equally the various functions 
to which it may be crucial? It seems unlikely. Even 
the basic ends of representing opinions and Sy 
porting authority may be contradictory, not just 
separate; similarly, party systems that activate polit- 
ically large masses may be precisely those that 
accentuate conflict rather than cohesion, segmental 
loyalties rather than national identity. Equally seri- 
ous, party systems appropriate to certain ends may 
be least likely to exist where they are most urgently 
needed. For instance, where national identity and 
community must be created, party systems are 
likely to be most fragmented and unintegrated pre» 
cisely because of the lack of larger social ties; 
where broad political mobilization is still merely ê 
goal, party systems are most likely to consist 0! 
units based on restricted elites; where social trans- 
formation is an overriding end, party systems may 
tend to become foci of social recalcitrance. A 

Few problems of comparative politics, then, 4 
more crucial than those of discovering what wees 
of party systems optimize satisfaction of the a 
ous ends that they may have to serve and how, ina 
to what extent the development of such syste a 
may be promoted. Before these problems Re 
coped with, however, the whole universe of PE 
systems must be better charted, and the forces 


bear upon them, and which they in turn exert, 
must be better understood. My object here, given 
a subject conspicuously understudied, has been to 
provide an indication of how these ends might be 


reached. 
Harry ECKSTEIN 


[See also ELECTIONS, article on ELECTORAL SYSTEMS; 
GOVERNMENT; POLITICAL SOCIOLOGY; POLITICS, COM- 
PARATIVE; REPRESENTATION, article on REPRESENTA- 
TIONAL SYSTEMS; SYSTEMS ANALYSIS, article on 
POLITICAL SYSTEMS.] 
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PASTORALISM 


Pastoralism is an economic activity involving 
the care of herds of domesticated livestock. In its 
traditional forms it is either practiced as the main 
mode of subsistence or combined with agriculture. 
Pastoralism functions as a cultural system with 
a characteristic ecology. The community of the 
pastoralists can be considered in two dimensions, 
as an ecological unit and as a sociocultural 
community. 

Definitions. The pastoral community differs 
from natural animal communities in that it is sub- 
ject to the cultural control exercised by man. More- 
over, this pastoral community is something more 
than the sociocultural community. It is a self- 
perpetuating social group with a characteristic 
population size and composition, geographic locus 
and distribution, and in common with all other 
ecological communities, certain functional requi- 
sites for survival: This community has a set of 
institutions which relate it to a greater cultural 
whole, and in addition it maintains and is main- 
tained by an inner conscious cohesiveness (see 
Arensberg 1961). But as members of a pastoral 
community, pastoralists also have sociocultural in- 
stitutions through which they enter into mutually 
supportive relations with their herds and dependent 
or parasitic relations with the natural environment 
of their eco-system. In the same sense, pasture is 
not solely a natural phenomenon but is culturally 
controlled, being designated or delimited as indi- 
vidual, family, communal, tribal, or national prop- 
erty by pastoralists, who use it to provide natural 
sustenance to their herds. 

Man and herd live in a symbiotic community; 
the human component of this community takes the 
form of a village composed either entirely of pas- 
toralists or of some specialized pasto! lists living 
among agriculturalists. However, the pastoral vil- 
lage is found primarily in those cultures given over 
wholly or in significant degree to pastoralism. 

In the pastoral community man and herds 
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make social and psychological adjustments to 
each other; together they adapt to the natural en- 
vironment in which the herds have their special 
ecological niche, the pasture. This adaptation is 
related to the size of the herd, which is characteris- 
tically large enough to be self-sustaining, and to 
its practice of grazing the grassy cover of the pas- 
ture or, to a lesser extent, browsing the foliage. 

There is little or no improvement of the plant 
or water supply in the pastoral ecology, and hence 
a natural ceiling on nourishment for the stock is 
imposed where pastoralism is the main source of 
subsistence; in such cultures a limit is placed on 
size and rate of growth of human and animal 
populations, This is one of the causative factors 
of the stagnation or slow development that charac- 
terizes pastoralist economic systems. An outstand- 
ing example is the Mongol pastoral pattern, which 
was little changed from the time of their great 
empire in the thirteenth century to the twentieth 
century. 

In order to understand the nature of pastoralism, 
it is necessary to differentiate it from what it is 
not, especially from similar forms with which it 
may be confused. Pastoralism is distinct from gen- 
eral animal domestication and from certain types 
of specialized domestication in which a single ani- 
mal or a small number are raised by farmers, 
These instances of animal husbandry do not in- 
volve maintenance of self-sustaining herds in pas- 
ture, and the animals concerned generally forage 
about the human settlement or depend on farm- 
land stubble, fallow lands, etc. Cattlemen of the 
American grasslands do not form natural com- 
munities of pastoralists because their enterprises 
are artificially organized for the cattle owners’ 
profit. The pastoralist community is also not 
equated with the farming village to which are at- 
tached a few shepherds who tend the flocks. This 
is a form of specialized labor that is widespread in 
rural Europe and elsewhere. However, it is not ex- 
cluded that such a village, while primarily con- 
cerned with cultivation, may form a community 
with its herds through the specialized activity of 
the village herdsmen (see below). 

Principal breeds of pastoral stock. Pastoralism 
in its most developed form has an Old World cul- 
tural origin. The domesticated stock include: 


Reindeer (Rangifer tarandus tarandus ) 

Sheep (Ovis aries) 

Goat (Capra hircus) 

Bovines 
Cattle, European type (Bos taurus) 
Zebu, or humped cattle (Bos indicus) 
Yak (Bos grunniens) 


Equines 
Horse (Equus caballus) 
Ass (Equus asinus asinus) 
Camels 
Dromedary (Camelus dromedarius) 
Bactrian (Camelus bactrianus) 


The relations between the different breeds of 
stock are not clear in many cases, and a number 
of historical and genetic problems remain to be 
solved. The bovine classification is arbitrary, and 
it groups the yak with the taurines and the zebu, 
contrary to usual zoological practice. The reason 
is that the yak interbreeds with the taurines, and 
the pastoral behavior of both is generally similar. 
Again, the ass has been included among the pas- 
toral stock, but several local breeds have been 
omitted. Where cultural considerations do not con- 
flict with genetic differentiations, they have domi- 
nated other considerations in forming and subdi- 
viding the above list of pastoral animals. 

Pastoralism based on these species extends 
across northern Eurasia, from Scandinavia and 
northern Russia to Siberia as far as the Chukchi 
Peninsula (the classical area of reindeer herding) 
and various parts of south and central Asia. Pas- 
toralism has existed longest in the Middle East, 
extending from the Mediterranean to the Indus 
Valley. Pastoralism is practiced among many peo- 
ples of the Caucasus. It is distributed throughout 
Europe in a variety of forms and across northern, 
eastern, and southern Africa. 

In the New World, pastoralism of an indigenous 
type was limited to Camelidae of the Andean high- 
lands; the llama as a pack animal and the alpaca 
as a wool-producer were domesticated by the Incas 
and their forerunners. Pastoralism in modified 
forms is distributed in most parts of the world 
today. 


Domestication in relation to pastoralism 


All pastoral animals are domesticated, but some 
domesticated animals are not pastoral: for exam- 
ple, the dog, cat, pig, fowl. Domestication of a 
mals is by definition based on close association a 
a breed with man, whereby behavior and, by in- 
ference, attitude are mutually influenced. Domesti- 
cated stock breed within the pastoral community, 
accept man as the ecological dominant, and ee 
erally, but not always, supply some product T 
perform some service of use to man. As a resu 
of domestication, man and beast make mutu! 
adjustments; thus dogs and other animals m 
serve as guardians are in turn protected. There a 
moreover, one-sided adjustments: rut patterns 
domesticated animals differ from those of the j 


The culture of domesticating peoples takes account 
of domestication not only in economy and ecology 
put also in religion and mythology, fable, and other 
folklore genres by imputation of an anthropomor- 
phic psychology to animals or by worship of bull 
spirits, etc. 

Domestication comprises a greater range of cul- 
tural activities than pastoralism. Domesticated 
animals, including some pastoral stock, are fre- 
quently raised for participation in religious ritual, 
sport and play, warfare, and for such psychological 
comfort to people as they may provide, as well as 
for economic purposes. 

Pastoralism is an aspect of the domestication of 
animals, having developed as a specialized form of 
the latter in the ancient Near East. There sheep 
and goats were the earliest of the domesticated 
species; remains have been found in mesolithic 
and neolithic sites of the ninth and eighth mil- 
lennia B.C. 

Herding is also to be differentiated from pas- 
toralism, because the care of flocks or herds of 
wild, tame, domestic, and domesticated animals 
may take place in environments other than pasture 
and because pastoral animals require a specialized 
ecological niche for their maintenance. They thus 
differ from nonpastoral domesticated animals, 
which may be kept here and there about the house- 
hold and village, in the interstices of human settle- 
ment. Asses, sheep, goats, horses, cattle, although 
usually pastured, may be raised within the village 
in barns, stalls, pens, tethers, sties, or simply roam- 
ing loose, The domesticated elephant of Asia and 
Africa may not be regarded as a pastoral animal 
because it does not have a specialized ecological 
niche, 

_ There are a variety of pastoral forms. At its 
simplest pastoralism involves a single species, such 
as the reindeer in northern Eurasia or the south 
African Hottentot cattle. In the arid or semiarid 
zones of Manchuria, the Near East, and north 
Africa, in parts of sub-Saharan Africa, and in tem- 
perate Europe, pastoral herds are of a more com- 
plex composition, The Mongols raise conjointly 
herds of sheep, goats, horses, and camels in addi- 
tion to cattle, yaks, and various hybrids. In 
ancient times asses and mules were possibly also 
raised under pastoral conditions in Mongolia. In 
its sociocultural aspects pastoralism is also vari- 
ously organized. In parts of Asia and Africa it may 
r the affair of whole families (as among most 
peat groups), of whole villages (e.g., Turkmen, 
louins), and even of whole cultures (e.g., Mon- 
hy Tuareg). In these circumstances there will 
Samy a modest degree of economic activity 
ithin the herding unit. But traditional pastoralism 
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in Europe and certain neighboring parts of Asia is 
practiced by individuals specialized in herding in 
a context where agriculture is a major economic 
activity. Transhumance is one of the forms that 
this kind of mixed herding-farming pastoralism 
takes. Another form, in which herding and primi- 
tive farming receive equal emphasis, is sometimes 
called seminomadism. 


The pastoral community 


In terms of the ecological system, the pastoral 
community in those cultures having pastoralism as 
the principal mode of subsistence can be easily 
defined; the pastoralists and their herds subsist 
in an ecological symbiosis, the herds supplying 
products for the herders—nourishment, clothing, 
shelter, dung for fuel—and, further, products for 
the herders’ trade with farming peoples. For their 
part, the pastoralists protect the herds from preda- 
tory animals, help their stock to forage under snow, 
help the bovines in calving and mares in foaling, 
and help kids and lambs to suckle. This last is es- 
pecially important, for domesticated pastoral stock 
are woollier than the wild, and the udders are 
harder for the young to find. 

Pastoralists and their herds depend upon the 
natural bounty; the stock graze the grass and 
browse the foliage of shrubs and trees; men and 
animals both utilize the water and salt. The grass 
grazed over must be allowed to grow again the fol- 
lowing year; springs must have a chance to re- 
shape their banks after the herds have trampled 
them. These annual cycles are reflected in the no- 
madic pastoral round. 

The mixed ecological relations of symbiosis and 
parasitism are diagnostic traits of pastoralism. 
Pastoralism thus constitutes a complex ecological 
system, different from the symbiotic relations of 
agriculture and the parasitic relations of hunting, 
fishing, and gathering. The pastoral ecological 
community is based on a man-herd symbiosis 
which is in one of its aspects parasitic in the larger 
eco-system. 

The community of pastoralist and herd is based 
upon ecological relations which are known from 
other contexts: dominance and symbiosis. Man’s 
dominance over the herds is to be seen in the sub- 
mission of the herds to the cultural regime in a 
context of mutual support. In this community, 
factors of psychological affect and rapport are es- 
tablished between man and beast. Man’s cultural 
role is extended to accommodate the behavior of 
the herds. The behavior of the beasts is radi- 
cally changed; there are psychoneural changes, 
such as in rutting patterns and in those processes 
of behavior formation that are influenced by the 
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circumstances of imprinting and nomadization. It 
is in this way that the pastoral community emerges 
as a reality: the animal stock undergoes be- 
havioral changes as the community forms under 
the cultural regime, that is, under the ecological 
dominance of man. 

Imprinting among pastoral animals has been 
given a different meaning than that originally as- 
signed to it by Lorenz (1935) in his work on bird 
imprinting. In its primary meaning, imprinting is 
a form of learning that occurs in the earliest period 
of life, a period which is brief and sharply defined. 
Learning in the period of imprint is in a high de- 
gree ineradicable. In the view of Scott (1962) and 
others who have dealt with pastoral-animal learn- 
ing, imprinting is most importantly identified with 
socialization. Ethology has as one of its branches 
the study of behavioral patterns that are species- 
specific, yet a new branch of ethological study 
should be pursued in the context of pastoralism. 
Although animal behavior patterns are specific, 
the different types of pastoral stock conform in a 
general way to a single behavior pattern under the 
cultural control of complex pastoral nomadism. 
[See IMPRINTING.] 

The ecological community of herders and herds 
has created an innovation, pasture as the ecological 
niche of the herds; both pasture and herds being 
maintained in conjunction with, and in the neigh- 
borhood of, a human settlement, the village of 
pastoralists. In the sociological community the 
stock functions as mobile property. Stock raising 
is an institution common to a number of communi- 
ties. Considered within the sociocultural com- 
munity, pasture is also in a nonecological relation; 
it becomes a real or immobile property which may 
be owned by an individual or by the whole social 
group. As such, pasture is frequently managed by 
several social communities, 

The behavior of wild species of herd animals 
in search of food has much in common with the 
behavior of the pastoralists’ stock. Wild sheep, 
horses, and camels have territories of their own 
and change their grazing places in an annual 
round. This is a nomadic cycle and has been iden- 
tified as such by Japanese investigators who have 
observed the grazing habits of horses (Imanishi 
1954). 

Thus, nomadization is not a monopolistic fea- 
ture of human ecology; still less is it an exclu- 
sively pastoralist trait. Nomadism, the movement 
of a community through an annual cycle, is char- 
acteristic of a number of wild species and of cer- 
tain human societies, for example, some hunters, 
fishers, and gatherers. Moreover, not all pastoral 


societies are nomadic or contain wholly nomadic 
communities. But nomadism is characteristic of 
certain pastoral cultures whose common traits in- 
clude highly mobile living arrangements, such as 
tents and covered wagons, a technology and ma- 
terial culture adapted to mobility, and movement 
of the human group and the herds from pasture 
to pasture over socially recognized routes in an 
annual cycle. 


Types of pastoralism 


Nomadic pastoralism. Pastoralism as tradition- 
ally and, in certain cases, currently practiced by 
Mongol, Turkic, Uralic, Semitic, and Hamitic 
peoples is generally nomadic. 

Pasture is often discontinuous and connected 
by routes of access; villages are mobile and main- 
tained as distinctive entities whether they are 
within or beside the pasture. 

These groups have a well-developed institution 
of property which is held in herds, pasture, and 
the routes between pastures. Their legal arrange- 
ments include the means for control over pasture, 
identification of herds, regulation of access to 
routes, and regulation of disputes over property 
rights. 

The herds of many of these peoples are complex 
in composition, but under the ecological dominance 
of man they have come to nomadize conjointly. 
The Mongols traditionally caused herds of horses, 
cattle, sheep, goats, camels, and yaks to graze 
and move in a single nomadic cycle, according to 
man-made rules; thus the different species con- 
formed in their behavior to the pastoralist culture. 
The gregarious nature of these herds and their 
habits of socialization and feeding have remained, 
while at the same time becoming transformed 
under the cultural regime of the nomadic pas- 
toralists. Not all domesticated stock can make such 
adjustments or make them to a degree necessary 
for pastoral gregariousness. Nor does imprinting 
occur in this way among all domesticated animals; 
the dog and pig adjust otherwise to community 
with man. In other words, culture accounts for the 
patterns of behavior of domesticated animals na 
are different from those of wild and feral animals; 
it also directs those differences in behavior An 
among the domesticated animals, pastoral Ma 
nonpastoral. The pastoralist culture interacts wi 
animal potentialities in intelligence, attitude, ia 
behavior, applying and developing these for +t 
man use, In these terms, nomadic pastoralism 
a unitary cultural type. 

Unspecialized herding—farming. 
herding is sometimes called semin 


A variety of 
omadism. The 


cultures practicing this mixed form of subsistence 
are without specialists in either herding or farm- 
ing, but individuals and certain families of the 
village move back and forth from one occupation 
to the other. This mixed herding-farming economy 
dates from the middle and late bronze age of the 
Near East, about 5,000 years ago, and from a mil- 
lennium later in southern central Asia. According 
to Zhdanko (1963), it was practiced continuously 
into the early twentieth century by successive 
settlers in the delta of the Amu-Darya south of the 
Aral Sea. 

The agriculture of these settlers was primitive; 
their herds, while complex in composition, were 
restricted in their movements about the grazing 
grounds; these were in any case subject to multiple 
uses, including some farming, with scarcely any 
provision for specialized function as pasture. The 
herds were small both in absolute numbers and in 
proportion to the population of the villages; their 
small size required neither vast and complex no- 
madic movements nor the complex management 
of herd and pastoral range found among the great 
pastoral nomads, such as Kazaks and Mongols. 

The villages of the unspecialized farmer-—herds- 
men are sedentary; use of tents to follow the herds 
is not well developed. This form of herding is not 
nomadic; the term “seminomadism” has been ap- 
plied to it, The pastoral ecology is not fully de- 
veloped by the peoples practicing this mixed mode 
of subsistence. 

The Turkmen, neighbors of the Kara-Kalpaks, 
practice whole-village pastoralism, while certain 
of the Turkmen villages are given over wholly to 
farming; still others practice the mixed, unspe- 
cialized form of herding—farming found among the 
Kara-Kalpaks. 

_ Transhumance and estivation. Transhumance 
is a highly developed form of pastoralism practiced 
by sedentary cultures whose major economic ac- 
tivity is agriculture, It is best known from Europe 
but is also practiced, although to a lesser degree, 
by a number of peoples of the Caucasus, the 
Middle East, and mountainous parts of Asia as 
far east as Tibet. In the transhumant pattern, pas- 
toralism is closer to parity with agriculture. 
y In the Mediterranean countries, villages engaged 
in transhumant pastoralism traditionally send their 
herds of sheep, goats, and cattle to summer up- 
land pasture. The stock is given into the charge 
of shepherds, goatherds, or cowherds specialized in 
their tasks, and they have a defined place in the 
vilage and in their respective national economies. 
a terms of the specialization of labor and market- 
8g institutions, transhumance is a higher develop- 
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ment than the mixed farming-herding mentioned 
above, just as the agriculture of these transhumant 
villages is more highly developed both technically 
and institutionally than the agriculture of semipas- 
toralists. Birot and Dresch (1953-1956) have de- 
scribed a type of transhumance in the Mediterra- 
nean plains that is characterized by an element of 
capital-intensive pastoralism. This, however, is not 
of a traditional type. 

Estivation is a further development in terms of 
specialization of labor, product, and market. It 
combines the utilization of specialized labor in 
herding with an investment in cheese manufacture 
and marketing institutions for this product. 

Other forms of pastoralism. There are signifi- 
cant variations of pastoralism, regional to some 
extent but more significantly institutional. Northern 
Eurasian reindeer breeders have adjusted to their 
stock and vice versa but do not have complex herds 
requiring mutual adjustments of different kinds of 
stock. Mongol and Turkic nomads of central and 
inner Asia, Semitic and Hamitic (including Tuareg) 
nomads of southwest Asia and north Africa have 
evolved a complex pastoralism with mixed stock. 
They have developed economic and political insti- 
tutions which relate them to neighboring agricul- 
tural civilizations. Mongol and Chinese economic 
interactions historically have been those of special- 
ized monopolistic producers and consumers, each 
dependent on the product of the other. Certain 
Turkic groups, such as the Turkmen, have devel- 
oped a closer interdependence of herding and farm- 
ing within the same ethnic group and local econ- 
omy. At the same time they traded with, and raided 
on, their Persian farming neighbors. 

South of the Sahara, east African pastoralism 
has evolved with many of the same features: a 
community with complex herds (zebu, camel, goat, 
sheep) and nonsedentary village arrangements. 
The tent is absent here. The productivity of stock 
is lower than in north African and central Asian 
pastoralism, and the nomadic round is less firmly 
regulated. Territoriality of pastoral communities 
is not well fixed; grazing rights and route rights 
between pastures are loosely defined, if at all, for 
example, among the Turkana. The complex set of 
intersocietal institutions relating herding and farm- 
ing peoples—developed in a high degree in north 
Africa and southwest, central, and inner Asia—are 
but modestly developed in sub-Saharan Africa and 


in northern Eurasia. 


Traditional pastoralism 
Pastoralism as an ecological system, while not 
limited to the arid zones of the Middle East, central 


458 PASTORALISM 


Asia, and north Africa, has been most highly de- 
veloped in this environment. Reasons for this ap- 
pear to be both historical and systemic, that is, 
inherent in the nature of pastoralism and inherent 
in its relations to the environment. The latter is 
the more interesting scientific consideration. The 
arid zones may support farming, but only by in- 
tensive exploitation of the ground through irriga- 
tion, that is, by intensive application of labor and 
capital. 

Traditional pastoralism has been maintained as 
an ecological system for more than four millennia 
in the arid zone of the Near East, for more than 
three millennia in the arid zone of central Asia, 
and for almost as long a time elsewhere in the Old 
World. This has been achieved without great labor- 
intensive or capital investment. The result has been 
a level of living for the common pastoralist that is 
often superior to the level achieved by the peasants 
of the agricultural high cultures of China, India, 
and the Near East. Although the political develop- 
ment of pastoralists has led to the growth of em- 
pires, their technology, letters, and science have 
remained backward by comparison with the great 
farming civilizations. 

The ease with which pastoralists have achieved 
their level of minimum accomplishments in eco- 
nomic development has been through the develop- 
ment of an ecological community with the live- 
stock, The community has provided favorable 
conditions for exploitation of the stock as living 
farms, or factories on the hoof, founded on and 
made possible only by the ecological symbiosis of 
herder and herd. Frequently pastoralists have also 
developed institutions for marketing in the neigh- 
boring agricultural lands the pastoral product and 
such natural products of the territory as furs and 
salt. 

Mongol pastoralism. Mongol pastoralist tradi- 
tions exhibit both common features and variations 
in the general theme of the pastoral complex as 
practiced in parts of Asia and north Africa. Pas- 
toralism continues to be practiced by the Mongols, 
although profound modifications were introduced 
during the middle of the twentieth century; above 
all, Mongolian pastoralism in its traditional form 
developed the inherent potentialities of pastoralism 
more fully than most other pastoralist cultures. 

The Khalkha Mongols, who have herded in Mon- 
golia for over a thousand years, number about 
750,000. In recent decades the total number of 
their mixed stock has been about 25 million in the 
following proportion; sheep (53 per cent), goats 
(24 per cent), horses (11 per cent), cattle (8 
per cent), and camels (4 per cent). 

During the 1920s, when the traditional economy 


was still in force, total gross size of herds in Mon- 
golia was only ten per cent below the present; there 
has been little change in the internal composition 
of the herds, to judge by the fairly constant ratio 
of different kinds of stock. Thus, without improve- 
ment of the grass cover or water supply, the ecolog- 
ical system tends to continue, even in the face of 
radical political change. 

Some of the most favorable ecological conditions 
for pastoralism are met in the grassy upland steppe 
of central Mongolia, the Ara Khangai. This is homo- 
geneous pastoralist country, where the population, 
with few exceptions, is directly or indirectly in- 
volved with stock raising. Here the human popula- 
tion density is one of the highest in rural Mongolia, 
1.4 per sq. km. The gross herd density is 52 per 
sq. km.; the ratio of gross herd size to human pop- 
ulation is 36:1. These, too, are exceptionally high 
figures, rare in traditional pastoralism. The mean 
average population density for Mongolia is 0.5 per 
sq. km.; the mean gross herd density is 14 per 
sq. km. (Krader 1955). 

The traditional pastoral village of the Mongols 
was composed of extended families under a patri- 
arch, his wife, his sons and unmarried daughters, 
the wives and the children of married sons; resi- 
dence upon marriage was patrilocal; kinship was 
and is reckoned in the patriline. A wealthy or prom- 
inent Mongol also had poor and distant kin as 
dependents and supporters in the kin village of the 
past. Agnatic kin lived side by side in villages which 
might, in the winter encampments, number ten to 
fifty felt tents arranged in a circle or arc. Men 
tended the herds and supervised the village’s no- 
madic moves, while women maintained the house- 
hold. 

Mongols have done little farming, and some 
Mongols hold that piercing the earth by ploughing 
and planting is a defilement of the earth spirit. 
From time to time agriculture has been ince 
into Mongolia, but it has never become a significan! 
factor in the economy. On the contrary, the Huns 
and other Turks and the Palaeo-Asiatic Kets wh 
occupied Mongolia before the Mongols have 
lived primarily by herding. 

Traditionally, the component fa’ di 
villages were related by descent and grouped a d- 
patriclans that were in turn grouped into o 
erations and principalities; villages, clans, a d 
erations each had a chief or ruler. Thus all indivi 
uals were related by consanguinity, we 
same time they were divided into noble an a 
mon strata. The villages were made up of ie 
folk who conducted their own affairs w g 
leadership of an elder. The larger social group 
were ruled by the nobility. 


milies of Mongol 


Pasture, migration, route and herd rights, mar- 
riage arrangements, and tribute to superiors, 
whether in kind or in labor, were regulated by rules 
embodied in folk tradition and written law; disputes 
were adjudicated through the intervention of higher 
authority. The hierarchical position of each social 
group was established by ranking of the collateral 
lines of descent; to each group obtained certain 
legal rights, economic practice, religious beliefs, 
and coercive force. On occasion, segments of patri- 
lineages left their native pastures and joined other, 
more distantly related villages. Such fusions were 
subject to careful regulation: application had to be 
made to the proposed host group, which if it agreed 
and received the support of its clan chief, would 
accept the newcomers. A fictional set of kinship 
relations was then introduced; the newcomers be- 
came younger brothers, cousins, and nephews of 
the hosts, and their women and children were de- 
nominated accordingly as nephew’s wife, brother's 
child, etc. The economic, social, legal, and religious 
sanctions were at first applied casually, “as though” 
the newcomers were kin; but after two or three 
generations the former strictures were forgotten. 
These were not in any sense segmentary lineages 
which could functionally survive as independen- 
cies, Each village or clan was enmeshed in a great 
social hierarchy subject to the set of group rela- 
tions of the whole, involving agreements of coequal 
kin villages and intervention from the authority of 
clan heads and princes. 

The traditional economy was not entirely self- 
sustaining, although it could maintain itself autoch- 
thonously for a time. The Mongols drove animals 
on the hoof to Chinese markets: horses for riding 
and draft, sheep for mutton; also hides, furs, pelts, 
wool, felt, and sinew. They did not, however, find 
a market for their milk and milk products. The 
Chinese in return provided cotton and silk cloth, 
grain, and tea. 

Mongols and Chinese engaged in commodity ex- 
change in market locations, usually subject to fac- 
tors internal to the market. In addition, circulation 
of pastoral and farming products had a political 
tole. Tribute was exacted from the weaker power, 
Sometimes from the pastoralists, sometimes from 
the farmers, When the Chinese had more political 
and military force, they dominated the tribute col- 
lection, but they also returned economic goods in 
kind to placate or “pacify.” 

Beyond this, Mongol and Turkic nomads main- 
tained the caravans which linked east and central 
Asia with the Mediterranean and Europe. They pro- 
Vided the draft and pack animals from their herds 
of camels, horses, and asses, and they served as 
caravaneers, 
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Mongol pastoralism is a specialized cultural type 
which emerged out of the more generalized herd- 
ing—hunting—farming culture of the bronze age of 
northern Asia; this pastoral community formed a 
stable ecological system which has remained in situ 
over two millennia, maintained by two sets of fac- 
tors—the internal regime of the Mongolian plateau 
and the international set of mutually supportive 
economic relations with the Chinese and other 
farmers. The set of institutions through which 
goods were circulated in markets—through the col- 
lection and counterpayment of tribute and sump- 
tuary gift exchange—constituted a complex system 
with built-in political and economic balances and 
instabilities. The system was an imperfect integra- 
tion of a herding and a farming people, breaking 
down on occasion and giving rise to raids, wars, 
and conquests. Military and political events empha- 
size rather than hide the interdependence of the 
pastoral and farming economies of east Asia, 

Tuareg pastoralism. The Tuareg are a pastoral 
people of the western Sahara, a habitat that is far 
drier than that of the Mongols. Both peoples live 
in communities of men and herds with a high 
degree of dependence on the pastoral product. Total 
population size and density of human and herd 
population per unit of ground are lower among 
Tuareg than Mongols; in the 1930s the Tuareg 
numbered 240,000. Their range includes desert 
and semidesert with rare access to the lusher grass- 
lands; the vegetation cover is dependent on the 
amount and rate of precipitation, and it is highly 
variable. 

The Tuareg herd goats and sheep; of secondary 
importance are camels, cattle, and asses. They are 
nomads and live in communities that typically 
cluster ten to twenty tents; these in turn are com- 
posed of several minor camps. The minor camp is 
an extended family consisting of two to seven tents; 
and this is the cooperative goat-herding unit. Here, 
as among the Mongols, the family is in operational 
control of herd management; the family, however, 
is not the unit of range management that is sub- 
ject, in these complex societies, to regulation by 
higher authority. 

Traditional Tuareg society links together a num- 
ber of village kin communities; these combine, on 
the principle of descent, into tribes and, further, 
into federations. The federations form political 
unities with centralized rule over each in the hands 
of a nobleman. Just as among the classical Mon- 
gols, Tuareg society is stratified, and lord and vas- 
sal relations are established for the purpose of 
sustaining and extending political power. 

Agriculture had little importance in much of the 
Tuareg economy until irrigated agriculture was in- 
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troduced to the Ahaggar Tuareg in the nineteenth 
century. 

Labor is specialized chiefly by age and sex and 
then by crafts—ironworking, stone work, pottery 
making, and leather working. Thus, women of the 
sedentary Tuareg are pottery workers; pastoral 
women do not work in this line. As in other parts 
of Africa and Asia, a special ritual status is given 
to smiths, and they have an argot of their own. 

Caravan trade is a necessary source of livelihood 
for the Tuareg, notwithstanding the inroads of 
European transportation into the Sahara. Agricul- 
tural products such as millet are transported from 
the Sudan to the south; salt is mined within the 
Tuareg land and traded without. Butter, cheese, 
dates, and wheat are circulated in camel caravans, 
which also supply agricultural products to the pas- 
toralists. Tuareg with no agriculture are even more 
dependent on the caravan trade than those with 
agriculture. 

Despite differences in habitat and certain cul- 
tural features, the internal economies of the Mon- 
gol and the Tuareg are comparable. This is true in 
terms of herd, pasture, route and range manage- 
ment, social and political structure, legal practices, 
village and principality organization, and ecological 
community arrangements. 

Pastoral economies in these complex societies are 
in interdependent economic exchange with farmers 
through systems of market and caravan trade. 
Macro-institutions of state and empire, transconti- 
nental caravan trade, and international tributary 
relations are defective, instable, and evanescent. 
But small-scale institutions of the herding village 
and pastoral community have adapted to modern 
life. 


Economic prospects of pastoralism 


The future of pastoralism is mixed. In its no- 
madic form it has been reduced in importance or 
transformed. In central Asia and Mongolia it has 
been changed into sedentary pasturing and stabling 
of animal stock. Tuareg nomads today comprise 
but two per cent of the population. The principal 
factors in the retrenchment of nomadism are its 
association with “anti-progressive” forces: nomads 
constitute a special problem in education and in- 
dustrialization; central governments of the Middle 
East generally regard nonsedentary populations as 
tribes, that is, forming a state within a state, and 
do not trust them. 

Nevertheless, meat, leather, milk, and wool have 
continuing world markets, and modified forms of 
pastoral communities continue to occupy their spe- 
cialized ecological niche in the arid zones. These 


may be reduced, but only where there is sufficiently 
heavy investment of capital and labor—as, for 
example, in the exploitation of oil and water re- 
serves—to make mechanized transport and indus- 
try economically feasible. 

In general, migration of labor to urban, indus- 
trial, and other money-economy centers is depleting 
pastoralist manpower in the rural sectors of 
underdeveloped economies. Pastoralism is also en- 
dangered by national public policies which place a 
low social value and therefore a low priority on 
pastoralism. This has the effect of eliminating cer- 
tain pastoral areas from the economy without sub- 
stituting another economic function for the lands 
vacated or for the pastoral people themselves; and 
it has turned certain former animal and animal- 
product exporting nations into net importers, plac- 
ing a further strain on development. 


LAWRENCE KRADER 


[See also DOMESTICATION, article on ANIMAL DOMES- 
TICATION; ECOLOGY. ] 
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PATENTS 


Patents are public documents conferring Cer- 
tain rights, privileges, titles, or offices. The word 
aes from the Latin litterae patentes (open 
letters); the expression “letters patent” is still used 
in legal parlance. 
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A patent of invention is the grant of an exclu- 
sive right to the use of a technical invention. This 
is the only meaning we attach today to the word 
“patent” standing alone. Defined more precisely, 
a patent confers the right to secure the enforce- 
ment power of the state in excluding unauthorized 
persons, for a specified number of years, from 
making commercial use of a clearly identified new 
and useful technological invention. 


Methods of operation 


What are inventions? If the patent law is to 
protect novel and useful inventions, it must be 
determined just what should be regarded as an 
“invention” and what as “novel” and “useful.” Legis- 
lators and judges have had much trouble answer- 
ing these questions. 

An invention is a new contrivance, device, or 
composition of matter, or a technical art newly 
created—in contrast with a discovery of a prin- 
ciple or law of nature that has already “existed” 
although unknown to man. But not every new way 
of doing or making something, not every new 
thing never made before, is regarded as an inven- 
tion. Some hold that it must be “an unusual mental 
achievement,” involving more than the exercise of 
ordinary technical skill. Some courts have declared 
that to be an invention, a new idea must originate 
in a “flash of genius,” not merely in the routine 
experimentation of large laboratories. The United 
States Congress, annoyed by such narrow judg- 
ments, stated in the Patent Act of 1952 that “pat- 
entability shall not be negatived by the manner in 
which the invention was made.” 

What is novel and useful? Subjective novelty 
is universally rejected in favor of objective tests 
such as “not previously patented, published, or 
used.” But whether novelty is to mean “never 
known anywhere” or only “not yet used within 
the country” was controversial for a long time. 
Among other questions long debated and arbitrarily 
solved were these: Should novel combinations of 
well-known elements or novel uses of known de- 
vices or techniques be patentable? How should 
priority be decided in cases of nearly simultaneous 
inventions: Should the patent go to him who was 
first in getting the idea, or to him who was first 
in putting it into patentable form, or to him who 
was first in submitting it to the patent office? 

The usefulness of an invention cannot well be 
determined when the patent is applied for: How 
can one know whether what may seem useless 
at the moment will not become useful later? For 
this reason, patent offices usually have not both- 
ered about economic or commercial utility. Several 
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patent laws, however, deny on grounds of “social 
usefulness” the patentability of articles designed 
for “immoral purposes.” Some laws deny protection 
for “trivial” inventions; some confer only shorter 
periods of protection for “petty” inventions, such 
as Gebrauchsmuster (utility models) in Germany; 
some make “improvement inventions” eligible only 
for supplementary “patents of addition” of shorter 
duration, terminating, for example, with the pri- 
mary patent on the invention that they improve. 

Procedural questions. The questions of who is 
to judge the novelty and general patentability of an 
invention, and at what stage, have received differ- 
ent answers, and different procedures have been 
adopted in different countries. Under the registra- 
tion system the validity of a registered patent is 
examined only if an interested party attacks it in 
the courts and asks that the patent be invalidated. 
Under the examination system a patent is issued 
only after the patent office has carefully examined 
the patentability of the invention. This examina- 
tion may include so-called interference proceed- 
ings, when the office finds that two or more pend- 
ing applications seem to claim, partly or wholly, 
the same invention, so that the priority of one 
invention has to be established. The examination- 
plus-opposition system (Aufgebotssystem ) provides 
for an interval of time after the publication of 
the specifications examined and accepted by the 
official examiner and before the issuance of the 
patent, in order to enable interested persons to 
oppose the patent grant. In such proceedings the 
grounds of the opposition, such as “prior use” or 
“prior patent grant,” are heard and examined by 
the patent office. 

The registration system is administratively the 
cheapest but may burden the economy with the 
cost of exclusive rights being exercised for many 
inventions which, upon examination, would have 
been found nonpatentable. The examination sys- 
tem, it has been argued, avoids a mass of worth- 
less, conflicting, and probably invalid patents, 
onerous to the public as well as to bona fide 
owners of valid patents; it prevents fraudulent 
registration and sale of patents similar to claims 
patented by others; and it reduces the extent of 
court litigation. 

Period of protection. The duration of patents 
has been determined by historical precedent and 
political compromise. The 14-year term of the 
English patents after 1624 was based on the idea 
that two sets of apprentices could, in seven years 
each, be trained in the new techniques, although 
a prolongation by another seven years was allowed 


in exceptional cases. The 17-year term in the 
United States was a compromise between the 
“historical” 14 and the potential 21 years of pro- 
tection. The terms vary from country to country; 
Italy and Mexico grant 15 years from the date of 
application; Japan, 15 years from publication; 
United Kingdom, 16 years from the filing of speci- 
fications; United States, 17 years from the date of 
the grant; Germany, 18 years from application; 
France, 20 years from filing; and Argentina and 
Chile, 5, 10, or 15 years, depending on the type 
of invention, 

The literature contains a variety of arguments 
in favor of a longer period of protection: it should 
be long enough to protect the inventor for the rest 
of his life; for the average length of time for which 
a user of an invention might succeed in keeping 
it secret; for the average time it would take for 
others to come up with the same invention; or for 
the average period in which investments of this 
kind can be amortized. Some pleas have been 
made for eternal protection through perpetual 
patents. 

In several countries patents terminate prema- 
turely upon failure to pay renewal fees; such fees 
may increase from very modest charges for the 
first years to progressively higher levels in later 
years. The fiscal results of this scheme are insig- 
nificant, but it probably fulfills the economic put- 
pose of weeding out worthless patents. Premature 
termination—revocation, forfeiture, or nonenforce- 
ment—of patents may in some countries be or- 
dered as a legal sanction for an “abuse” of the 
patent grant. 


History 

The beginnings, 1200-1624. 
leges by which a sovereign granted the exclu 
use of an invention to its inventor or introducer 
were known as early as the thirteenth century and 
were quite frequent in the fourteenth. They con- 
ferred manufacturing monopolies or only licens- 
ing monopolies; they were given either to 4 D 
ducer introducing technology already used abroa 
or to the “first and true inventor.” u 

The first patent law was adopted in 1474 by n 
Republic of Venice, but its grants of protection 3 
inventors existed side by side with special Dee 
leges to makers of novel products; Venetian r 
tice included revocation of grants if the m 
was not used (compulsory working) and Loa 
reduction of excessive royalty rates (determinati! 
of “reasonable royalties”). 

Most privileges granted in the Nether! 


Occasional privi- 
sive 


Jands after 


1581 were clearly designed as incentives to invent, 
whereas most of the patent monopolies granted 
by the Tudor and Stuart kings in England served 
protectionist, autarkic, or fiscal purposes, if they 
were not downright misappropriations and hand- 
outs. After several parliamentary attempts to stop 
this misuse, a court, in the Case of Monopolies in 
1603, declared that monopolies, such as that in 
playing cards, were unlawful unless they were 
granted for new inventions. The Statute of Mo- 
nopolies of 1624 outlawed all monopoly patents ex- 
cept a few specified kinds, such as “letters patent 
and grants of privilege for the term of 14 years 
.., to the true and first inventor or inventors of 
new manufactures.” Incidentally, the inventor 
under this law was less “true and first” than the 
inventor under the Venetian law of 1474, since the 
English “inventor” could be an importer of foreign 
skills and know-how. Novelty “to the realm” was 
sufficient; “whether learned by travel or by study, 
it is the same thing,” declared a court in 1693 (see 
Fox 1947). 

Spread of patent legislation, 1624-1845. The 
English law became the model for statutes adopted 
by some of the American colonies (Massachusetts 
in 1641, Connecticut in 1672). South Carolina in 
1691 wrote the first law that recognized the in- 
ventor’s right to a patent. In 1787 the constitution 
of the United States gave to Congress the power 
“to promote the Progress of Science and useful 
Arts, by securing for limited Times to Authors and 
Inventors the exclusive Right to their respective 
Writings and Discoveries.” The first patent law 
of the United States was enacted in 1790 and 
amended in 1793. 

In France in 1791, the Constitutional Assembly 
adopted a comprehensive patent law, in which the 
inventor's right in his creation was declared a 
Property right” based on the “rights of man.” De- 
spite such ringing words, the law gave the right 
to the use of foreign inventions, not to the true 
Inventors, but to the domestic entrepreneurs who 
introduced them in France. 

The patent law of Austria (1810, amended 
1820) provided that inventors had neither any 
Property rights in their inventions nor any rights 
to patents. However, it provided that the govern- 
ment could, if this was in the national interest, 
grant to inventors privileges that would restrict 
other subjects’ “natural rights” to imitate the in- 
ventor’s ideas. 
re 1810 and 1843 patent laws were en- 

ed in 14 of the states (kingdoms and princi- 
Palities) of Europe, including four German and 
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four Italian states. Brazil and Mexico were the 
first Latin American countries to adopt patent laws 
(in 1830 and 1832, respectively), followed by 
Chile in 1840 and Paraguay in 1845. 

The antipatent movement, 1850-1873. In the 
early 1850s a movement arose for the abolition 
of patent protection and against the adoption of 
patent laws in countries which had none. In Britain 
three parliamentary committees or royal commis- 
sions, in 1851-1852, 1862-1865, and 1869-1872, 
reported serious abuses of the system. The last 
committee recommended drastic reforms, includ- 
ing reduction of the patent term to seven years, 
annulment of patents not put into practice within 
two years, and compulsory licensing for all pat- 
ents. A bill with these provisions passed the House 
of Lords in 1872. 

In Germany several trade associations, in 1853 
and 1863, recommended weakening or abolishing 
the system. The Congress of German Economists 
passed a resolution condemning the patent system 
as injurious to public welfare. Prussia opposed a 
patent law for the North German Federation, and 
in 1868 Chancellor Bismarck recommended that 
all patent laws be repealed in all of Germany. 

In Switzerland, the only industrial country of 
Europe that had remained without patent legisla- 
tion, the government and legislature rejected pro- 
posals for the introduction of a patent system in 
1849, 1851, 1854, and 1863, the last time with a 
reference to economic experts who had declared 
the principle of patent protection to be “pernicious 
and indefensible.” In the Netherlands, which had 
had a patent law since 1817, the parliament re- 
pealed the law in 1869 after a long debate. 

The system sustained, 1873-1912. It looked as 
if patent protection would be abolished every- 
where, when the tide turned and, in and after 
1873, the friends of the system overwhelmed the 
opposition by a well-organized counterattack. Vari- 
ous reasons have been advanced for the sudden 
change. The best historical evidence links the de- 
feat of the antipatent movement with the defeat 
of the free trade movement after 1873. The de- 
pression strengthened the supporters of tariff pro- 
tection as well as those of patent protection. 

The defeat of the opposition was reflected in 
legislative actions. In Britain the drastic patent 
reform bill, passed by the House of Lords, was 
withdrawn in the House of Commons in 1874. In 
Germany a uniform patent law for the entire Reich 
was adopted in 1877. Japan, which had adopted 
her first patent law in 1872, only to abolish it in 
1873, enacted another law in 1885. Switzerland 
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held out longer: a referendum in 1882 still re- 
jected patent legislation. A new referendum in 
1887 enabled the legislature to pass a law, under 
which, however, most inventions in the chemical 
and textile industries were not patentable. Only 
in 1907, after a German threat of punitive tariffs 
on Swiss products, did Switzerland drop these ex- 
emptions from patent protection. The Netherlands 
withstood for 40 years all recriminations about 
“piracy” of foreign technology but finally reintro- 
duced a patent system in 1910, which became 
effective in 1912. 


The international convention 


In 1883 an international convention created the 
International Union for the Protection of Indus- 
trial Property (UIPPI) and established several 
rules to be observed by participating countries (in 
part amended in 1900, 1911, 1925, 1934, and 
1958). Only three provisions will be mentioned 
here. 

The “national treatment” clause provides that 
foreigners (nationals of other Union countries) 
shall receive in each country the same treatment 
as the nationals of that country. Hence, reciprocity 
is ruled out: Swiss inventors were able to get pat- 
ents abroad even when Switzerland granted no 
patents at all. 

The “priority” clause establishes the right of the 
inventor to obtain patents in all Union countries 
in which his kind of invention is patentable and 
in which he applies within 12 months from the 
time of the original application. Hence, he is pro- 
tected against others trying to patent elsewhere 
what he has disclosed to the patent office of one 
country and against foreign patent offices denying 
him patents on the ground of “lack of novelty” 
because of “prior publication.” 

The “abuse” clause provides that each country 
may take measures to prevent abuses resulting 
from the exclusive rights conferred by patents, 
such as “failure to use,” but it may revoke these 
patents only if compulsory licensing is an insuffi- 
cient remedy. Compulsory licenses cannot be re- 
quired until three years after issuance of a patent, 
and then only if the patentee does not produce ac- 
ceptable excuses. 

As of December 1963, only 51 countries be- 
longed to the Union, but the pressure on nonmem- 
ber countries to join is strong. 


Patent statistics 


In view of the role attributed to patent protec- 
tion in early stages of industrialization, statistics 
of patents granted in eighteenth-century England 


Table 1 — Patents of invention issued inthe United States 


AVERAGE NUMBER 


PERIOD PER YEAR 
1790-1800 28 
1801-1825 164 
1826-1850 539 
1851-1875 6,540 
1876-1900 19,703 
1901-1925 36,084 
1926-1950 38,704 
1951-1960 43,012 


Source: Calculated from data in U.S, Bureau 
of the Census 1960 and U.S. Bureau of 
the Consus Statistical Abstract... 


are of interest; The average number of patents 
taken out per year between 1700 and 1750 was 
nine; between 1751 and 1800 it was 34. From 
1801 to 1830 an average of 116 patents per year 
were issued in England. 

Patent statistics for the United States from 1790 
to 1960 are summarized in Table 1. They show 
that patenting increased rapidly during the nine- 
teenth century but reached a plateau early in the 
twentieth century. As the number of scientists and 
engineers employed in research and development 
has increased, the number of patents per research 
and development worker has declined drastically. 
This decline holds for patent applications as well 
as for patents issued. The number of scientists and 
engineers employed in research and development 
in the United States increased from 87,000 in 1941 
to 327,000 in 1958. During these 18 years the num- 
ber of patent applications filed per year fell from 
55 to 24 per 100 persons in this group. One must 
conclude that it has become more difficult to create 
patentable inventions, or less customary to take 
out patents, or both. x 

Foreign patenting. Since every patent is 4 po- 
tential source of revenue, and since the interna 
tional distribution of these revenues may affect 4 
country’s evaluation of the patent system, one may 
be interested in the relative number of patents 
granted to foreigners. Table 2 gives this informa- 
tion for several countries for 1957—1961. ia 

The percentage of patents granted to (or applie 
for by) foreigners evidently reflects the size Fes 
the degree of industrialization of the country: A 
lowest in large and highly industrialized coun 
and is high in small countries even if they cE 
dustrialized (as in Belgium) and in paninda 
ized countries even if they are large (as in ing 
Available data are insufficient to prepare am 25 Aa 
showing the international balances of fol d 
enting (i.e., the number of patents obtained a TE 
compared with the number of patents gati ar 
foreigners ) and still less so for the internationa! 


Table 2 — Patents granted, in total and fo foreigners, 
selected countries, 1957-1961 


PERCENTAGE 
To TO 

TOTAL FOREIGNERS FOREIGNERS 
United States 239,594 37,669 157 
United Kingdom 218,995 102,866" 47.0 
France 157,702 93,617 59.4 
Canada 100,261 94,906 94.7 

Federal Republic 

of Germany 89,953 33,416 37.1 
Italy 77,698 48,835 62.9 
Japan 62,261 21,183 34.0 
Belgium 44,546" 38,112° 85.6 
Switzerland 41,050 26,602 64.8 
Netherlands 19,729 15,567 78.9 
India 13,195 11,794 89.4 
Ireland 2,009 1,939 96.5 
Trinidad and Tobago 495° 466" 94.1 


a. Number of patents applied for. Numbers granted not available. 
b. 1957-1960. Dota for 1961 not yet available. 


Source: United Nations . . . 1964. 


ances of earnings from patents (i.e., royalties re- 
ceived from and profits attributable to foreign 
patents compared with royalties and higher prices 
paid to foreign holders of patents). The data for 
Trinidad and Tobago are given only to illustrate 
the situation of a small, developing country. 


Arguments for patent protection 


Arguments to justify the patent system have 
been based on natural law, on moral requirements, 
or on economic considerations. Economic consid- 
erations weigh the objectives which patent protec- 
tion is to serve, the benefits which society is likely 
to obtain, the excess of these benefits over social 
cost, and the superiority of the patent device over 
alternative means to the same end. For moral con- 
siderations, material benefits to society are not rele- 
vant, although one may still inquire about the cost 
to society—“justice at any cost” may be too expen- 
Slve—and whether there are not cheaper ways to 
discharge the particular moral obligation. The 
natural law argument virtually excludes all assess- 
Ments of social benefits, costs, or alternatives, since 
it deems the inventor's rights to be indisputable, 
inalienable, and immutable. 

Inventors’ natural property rights. As we have 
Seen, the French Constitutive Assembly of 1791 
proclaimed the inventor's “property right” in his 
idea to be one of the “rights of man.” Statutory 
patent law was merely to stipulate the ways and 
Le ie safeguarding the inventor’s “natural” 
erin ne property right in the patent was only a 
ko PoR by which infringements of the inven- 

atural property right in his idea could be 
prevented. 
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Under this juridical construction the word 
“monopoly,” with its bad connotations, was re- 
placed by the word “property,” recognized as a good 
thing. The public-relations scheme—admitted to be 
such by Stanislas de Bouffler when he proposed it 
in introducing the French patent bill of 1791—was 
successful. It persuaded the public to regard patent 
protection not as a government intervention de- 
signed for a purpose but, rather, as an integral part 
of the institution of private property; not as an 
enforcement of a monopoly granted by the state 
but, rather, as a prevention of theft. 

The moral obligation to reward inventors. The 
moral argument for patents assumes that the sys- 
tem is the best possible way in which society can 
meet its moral obligation to secure to each inventor 
his just reward, Critics of this position have ques- 
tioned whether justice calls for such material re- 
wards and also whether the patent system is the 
best method of securing them. 

The critics contend that the development of tech- 
nology is largely a cumulative process to which 
many— scientists, technicians, and tinkerers—con- 
tribute in succession until a particularly lucky one 
happens to put the finishing touch to a novel art. 
If he alone is rewarded, while those who have pre- 
pared the way for him get nothing, critics doubt 
that this is “just.” Critics also question the justice 
of the system when two inventors have reached the 
same solution almost simultaneously and one of 
them is declared to have been a little earlier and 
to deserve the full reward; or when the inventor of 
a toy or a kitchen utensil receives fat rewards, 
while a scientist who makes a fundamental dis- 
covery must be satisfied with a meager bit of glory. 

Classical economists, especially Bentham and 
J. S. Mill, argued that the material reward which a 
patent can secure for the inventor is fair and just 
in that it is proportional to the service the inventor 
has rendered to society. Critics deny that the re- 
wards are commensurate with the achievement. 
Some of the greatest inventions are ahead of their 
time and, since their practical application will be 
deferred, no earnings can be expected from their 
use within the period of patent protection. Even 
where inventions can be used without delay, no 
close relationship, let alone a proportional one, can 
be expected between the earnings from their exclu- 
sive use and the benefits accruing to society. The 
earnings are associated with the optimum restric- 
tion on use rather than with the social optimum of 
utilization. The profits derived from a patent mo- 
nopoly cannot be indicative of the social benefits 
derived from the invention it covers. 

This does not mean that expert appraisers esti- 
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mating the social contribution of inventions would 
be more successful. Rewards in the form of prizes 
and bonuses determined by the government may 
not be any more “just” or “fair” than the monopoly 
rents from patents. Grave errors of judgment, and 
perhaps severe misuse of discretionary powers, 
might occur if government had to determine “just 
awards” to inventors, 

Criticisms leveled against the moral argument 
for patents need not apply to the various economic- 
incentive arguments. Incentives, to be effective, 
need not be just: high prizes in a lottery are de- 
signed to attract buyers of chances, not to mete 
out justice. 

Compensation for disclosure of secrets. One of 
the economic arguments for patents emphasizes 
the benefits society can derive from the disclosure 
of secret technological information made public 
through the patents. Society pays for these benefits 
with grants of exclusive rights as it offers tempo- 
rary monopolies in exchange for the surrender of 
“monopolies through secrecy.” In a simpler version 
of the argument, the promise of a patent monopoly 
serves as incentive for the inventor to give up his 
secret. 

The formulation of the argument as a “social 
contract,” or bargain between the state and the 
inventor, has often been criticized, by James E. T. 
Rogers (1863), among others, on the ground that 
it is a rather poor bargain for society. The contract 
theory held that protection of exclusive use for 14 
to 20 years was justified because supposedly this 
was the average length of time during which a 
user of a secret technology could expect to keep it 
secret. The critics objected that this average dura- 
tion of secrecy becomes a sham if each inventor 
has the choice of taking the patent or keeping his 
secret, for he will patent only what he cannot ex- 
pect to keep secret. Society thus fails to secure dis- 
closure of safe secrets but agrees to grant monop- 
olies restricting the use of technologies that could 
be found out and made available for general use. 

The argument for patent protection as an incen- 
tive to disclose technological secrets also includes 
another effect of early disclosure: knowledge of the 
patent may inspire competitors to look for substi- 
tute processes and substitute products, that is, to 
try to “invent around the patent.” Lawyers and 
engineers stress the supposedly beneficial effects of 
such “competitive invention,” but some economists 
point to the waste involved in searching for alterna- 
tive technologies. 

Other versions of the argument stress the in- 
direct effects of disclosure. Early disclosure may 
give other inventors new ideas, not for alternative 


techniques of making the same products but for 
entirely different processes or products, perhaps in 
different industries. In addition, the dissemination 
of technological information in the descriptions of 
inventions may serve to. bolster general technical 
knowledge and curiosity. Critics have asked 
whether such benefits could not obtained at lower 
cost by other institutions or methods. 

Another aspect of this argument deserves con- 
sideration: some inventions are disclosed by publi- 
cation in technical journals or by patents taken out 
years before the first use, whereas without patents 
these inventions might become known only when 
the products for which they are used reach the 
market. How important this is, no one knows. 

Incentives for inventive activity. The classical, 
traditional argument for patents stresses that the 
promise of patent monopolies is an incentive to 
engage in inventive activity. The argument pre- 
supposes (1) that without government intervention 
society would undertake less inventive activity than 
was good for it and (2) that patent protection is 
the best possible form such intervention can take. 

If “perfect competition” is defined to imply 
immediate imitation of anything that promises 
supernormal profits, it follows that under such con- 
ditions inventive activity will not pay, and some 
artificial obstacles to quick imitation will be needed 
to promote invention. Differently defined, however, 
perfect competition allows for natural delays and 
normal friction, and so does not imply immediate 
imitation. The innovator may then have a natural 
head start in the practical application of a new in- 
vention even without patent, for it takes time until 
competitors can examine the product, detect the 
technology used, build the plant and equipment 
required to produce it themselves, and bring their 
competing product to the market. In the meantime 
the first user may have made enough profits to pay 
for his efforts. This was the counterargument made 
by Schäffe (1867). Competition, moreover, 18 sel- 
dom perfect, even in the absence of patent monop- 
olies. Newcomers’ competition is slow and imper 
fect under the oligopolistic organization of ae 
industries, which lengthens the first user's ae 
start. Finally, the oligopolist’s fear of falling behin 
in the race with his competitors may be as ae 
of an incentive to invent as the hope for ae 
monopolies can be. For all these reasons, sever 
economists have refused to accept the assumption 
that there would not be “enough” inventive activity 
in the absence of government intervention. a 

Those who recognize that the state should P A 
mote invention may nevertheless reject the a 
tion that patents are the best method for 


purpose. They may hold that prizes and bonuses 
for important inventions, or subsidies and govern- 
ment contracts for research and development, can 
serve the purpose more efficiently and at lower cost 
to society. These alternative methods can do what 
the patent system cannot, namely, promote tech- 
nological progress in selected areas. Arnold Plant 
and others have said that the patent system serves 
chiefly to steer inventive efforts into different 
channels, away from industries in which improve- 
ments rarely take the form of patentable inventions 
and toward industries that are favored by the re- 
quirements of patentability. Only by coincidence 
will these be the industries in which inventive ef- 
forts would be most productive for society. Selec- 
tive promotion of technological advance through 
direct government support of research and devel- 
opment has therefore become the principal ap- 
proach to the problem, at least in the United States. 
The need for and effectiveness of patent protec- 
tion as incentive to inventive activity has been 
questioned for economies in which large corpora- 
tions employ salaried inventors in large labora- 
tories, It is held, however—for example, by John 
Jewkes—that the patent system still serves its 
traditional purpose in protecting independent in- 
ventors and small producers who could not survive 
without this protection (Jewkes et al. 1958). 
Incentives for development and investment. 
The cost of postinventive development has become 
so large relative to the cost of strictly inventive 
activity that the arguments for patent protection 
have changed. The emphasis has been shifted from 
promotion of invention to promotion of develop- 
ment and innovative investment. Patent protection, 
it is now said, serves less as an incentive to engage 
in inventive activity than as an incentive to under- 
take the large investments needed to develop in- 
ventions, make them usable in mass production, 
construct pilot plants, create mass markets, and 
build the productive facilities to supply the goods. 
These outlays are said to be so large and so risky 
that they would not be undertaken without patent 
Protection against competition. 
i Critics of this argument point to large innovative 
investments in many industries in which there are 
no important patents. One cannot deny, of course, 
that there may be investments which would not 
oe without the hope of monopoly profits de- 
rom patent protection. The question is 
Aiea these investments are more productive or 
Wh y more beneficial than other investments. 
ae em investments that are made attractive 
the y. patent protection be more productive, from 
Point of view of society, than alternative invest- 
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ments which, if they were not placed at a disadvan- 
tage by the incidence of patent protection, would 
be the highest bidders for available funds? There 
is no reason why it should be socially desirable to 
divert investment funds from industries without 
patents to industries with patents. 

The theory of a general lack of investment op- 
portunities has also been adduced to support the 
argument that patent protection is needed as an 
incentive to invest. Investable funds might go beg- 
ging, it is said, and savings might find no outlets, 
if it were not for the profit opportunities afforded 
by patents. 

Sources of investable funds. Another argument 
for patent protection is that firms sheltered from 
competition will, thanks to their higher profits, 
have larger funds available for investment than 
will firms exposed to competition. To claim that 
patents will cure both a scarcity of investment op- 
portunities and a scarcity of investment funds is 
self-contradictory if “scarcity” refers to an imbal- 
ance of potential sources and uses of funds. The 
contradiction disappears if the alleged scarcity is 
meant to relate to a desired rate of progress, for 
one may hold that progress is accelerated if firms 
have more funds as well as more opportunities to 
invest. Monopoly positions due to existing patents 
afford a larger flow of funds to firms, which will 
be induced to use these funds for activities that 
will, by leading to additional patents, perpetuate 
and strengthen their monopoly positions and, in 
the process, enhance technological progress. This 
theory as Schumpeter observed (in a short discus- 
sion in Capitalism, Socialism, and Democracy), 
may justify not only patent protection but also 
other restrictions on competition. 

Excess of social over private benefits. Most eco- 
nomic arguments for patents can be formulated 
as applications of the standard justification of gov- 
ernment intervention in a competitive market econ- 
omy: Where the private marginal product of an 
activity is smaller than its social marginal product, 
while private marginal cost is not correspondingly 
below social marginal cost, the state can increase 
total welfare by promoting the allocation of addi- 
tional resources to that activity. 

If a new process or product can be imitated by 
competitors without delay, and output is not limited 
by any monopolistic restrictions, the market price 
of the product of the new technology will soon fall 
to the level of its production cost. This cost does 
not include expenses for inventing and developing 
the novel art, for an imitator incurs no such ex- 
penses, nor are they a part of the innovator’s mar- 
ginal costs of production. They are sunk costs; they 
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do not increase as the use of the new technology 
is increased. Thus, under unlimited competition, 
output will increase and price will fall to the com- 
petitive level, which does not allow recovery of 
sunk costs. 

New technology undoubtedly has “social value,” 
since it contributes to an increase in national prod- 
uct. But its private value will approach zero if it 
can be used without restriction. Thus, while con- 
sumers will benefit, the inventors, developers, in- 
novators, and their financiers may get insufficient 
returns on their investments of time and money. 
Hence, without government intervention, the allo- 
cation of resources to inventive and innovative 
activities would be less than “optimal” from the 
point of view of society. Intervention attempts to 
compensate for the shortfall of the private below 
the social marginal productivity of inventive and 
innovative activities. 

This is a cogent argument for promoting inven- 
tion, but not necessarily for patent protection; other 
methods of encouraging, subsidizing, or financing 
inventive activity may well be more efficient or less 
costly. The bias inherent in the patent system in 
favor of technologies that happen to satisfy the 
requirements of patentability is one of the defects 
of the system. Another is that the system tries to 
promote the creation of technical knowledge by 
restricting its use. What is wanted is a system that 
does not discriminate between types of technology 
or operate by restricting the application of the in- 
ventions that it stimulates. 


Charges of abuse and proposed remedies 


Some nonlawyers speak of an “abuse” of the pat- 
ent monopoly when the social objectives it is sup- 
posed to serve are not promoted but, rather, are 
jeopardized by the way it is used; for example, 
when the “limits” of the monopoly “intended” by 
society are overstepped and the monopolistic con- 
trol is extended in time, scope, or strength. In most 
countries, the law does not recognize this broad 
concept of “abuse.” 

Patentees may succeed in extending the effective 
duration of control (a) through procedural devices, 
especially by prolonging the pendency of the patent 
between application and issuance; (b) through 
secret use of the invention prior to the filing of the 
application; (c) through incomplete disclosure, 
making it impossible for those without special 
know-how to use the invention even after the expi- 
ration of the patent; (d) through the successive 
patenting of strategic improvements that make the 
unimproved invention commercially obsolete after 
the expiration of the original patent; (e) through 


the creation of a monopolistic market position 
based on the good will of a trademark associated 
with the patented product or process, where the 
mark and the consumer loyalty continue after ex- 
piration of the patent; and (f) through licensing 
agreements that survive the original patent because 
they license a series of improvement patents and a 
possibly endless succession of future patents. 

The patentee may succeed in extending the scope 
and strength of the monopoly beyond the limits 
consistent with the social objectives of the system 
—especially, beyond the control] of the use of a 
single invention supposedly in competition with 
other inventions—to achieve control of an entire 
industry or of the markets of other goods not cov- 
ered by the patent. Substantial control of an indus- 
try can be achieved by a “basic patent” (on a bona 
fide basic invention); by an “umbrella patent,” 
under which illegitimately broad or ambiguous 
claims, covering the entire industry, have been al- 
lowed and are not tested in the courts; by a “bottle- 
neck patent,” which is not basic but is good enough 
to delay or exclude the entrance of newcomers to 
the industry; by an accumulation of patents which 
secure domination of all existing firms and effec- 
tively close the industry to newcomers; or by the 
use of restrictive licensing agreements establishing 
domination or cartelization of the industry and ex- 
clusion of newcomers. Control sometimes is extend- 
ed to markets for products not covered by the 
patent through the use of tying clauses in licensing 
agreements, chiefly in order to increase monopoly 
revenues by means of price discrimination [see 
Monopoty]. (For example, by selling a material 
that must be used on a patented machine at @ 
monopoly price, the seller collects more from those 
who make heavy use of the machine than from 
those who make less use of it, although the cost 
of making and using the machine is the same no 
matter who uses it.) All these “extensions” of con- 
trol have been objected to by opponents of monop- 
oly, but this does not mean that they have been 
found to be unlawful. 

Patent pooling arrangements, sometimes ne 
sary in order to permit the efficient use of com 
mentary inventions controlled by different 
have often been the vehicles for highly res a 
cartel agreements. Indignant complaints have we 
raised against the use of patents to oppress hee 
firms by harassing litigation or threat of mi 
and against the use of license agreements tot a 
competitors or customers not to contest the ees 
of dubious patents. There have been similar i to 
plaints against the taking out of patents, ee 
work the patented invention, but to keep © 


ceS- 
ple- 


from working it, especially to “fence out” possibly 
competing developments of the patented invention 
or to “fence in” the competition by blocking pos- 
sible developments of inventions patented to them. 

Nonworking and insufficient working. Non- 
working of patented inventions has been high on 
the list of grievances against patent protection. One 
must distinguish, however, between the nonuse of 
inventions whose use would be uneconomic and 
the “suppression” of inventions that could be used 
economically. In the first category are inventions 
of nonmarketable articles, inventions of processes 
that are inoperable or too expensive, and inventions 
of alternative processes, instruments, or products 
that are not superior or perhaps are inferior to 
those in actual use. Neither the patentees nor any- 
one else may want to use such inventions. If others 
do want licenses that a producer holding the patent 
refuses to grant, although he himself does not use 
the invention, one may suspect a case of suppres- 
sion. (This prima facie indication could be rebutted 
by showing that the applicants would not use the 
invention either, if they could freely choose among 
all known technologies, including the ones used by 
the producer who refused to license.) In the ab- 
sence of any applications for licenses, suppression 
of inventions is difficult to prove: one would have 
to show that their use would be economically prac- 
tical and desirable although the patent owners, per- 
haps in view of the “premature” obsolescence of 
their capital equipment, have decided to keep these 
inventions on the shelf. The proof might be feas- 
ible for cost-saving inventions, but hardly for prod- 
uct-improving ones; after all, cost calculations can 
be checked, but demand estimates are mere con- 
jectures. 

Under the laws of some countries, especially 
England, “insufficient working” is regarded as an 
abuse of the patent monopoly, as is the charging 
of “excessive prices” for patented articles. Since it 
E the very essence of patents to restrict competi- 
tion and to permit output to be kept below, and 
price above, competitive levels, it is difficult to con- 
ceive of economic criteria by which one could judge 
whether output is less than what would be “reason- 
ably practicable” and whether price is “unreason- 
ably high.” 

The patent laws of some countries provide 
against the domestic nonuse of inventions patented 
to foreigners who seek mainly to have a sheltered 
ee for their imports. Compulsory working—a 
a of revoking patents not worked after a few 
ý P ae operate like a protective tariff, leading 
Se economic uses of resources. For this reason, 

pulsory working was replaced, as a remedy 
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for domestic nonuse of patented inventions, by 
compulsory licensing of the patents to those inter- 
ested in taking up production. 

Compulsory licensing. Compulsory licensing is 
not always instituted as a penalty or remedy for 
“unlawful abuse”; in some countries it may be re- 
sorted to whenever deemed necessary to safeguard 
the public interest. Be it because of “abuse,” as in 
England, or “in the public interest,” as in Germany, 
the issuance of a compulsory license may be re- 
quested by an interested party whom the patentee 
has refused to license or may be proposed by a 
government department. In Germany such actions 
have been taken to facilitate the use of dependent 
patents, that is, of patents covering inventions that 
could not be worked without license under a patent 
held by someone else. In Britain insufficient use of 
a patent may be charged in a claim for a compul- 
sory license or for “licenses of right,” especially in 
a case of food products and medicines (which were 
made patentable only by the Patent Act of 1949), 
although these provisions have rarely, if ever, been 
used. In the United States compulsory licensing 
has sometimes been ordered by the courts in cases 
in which patentees have used their patents in viola- 
tion of the antitrust laws. Some of these compulsory 
licenses have been royalty-free, while others have 
provided “reasonable” royalties. 


Economic effects of patent protection 

The “effects” of an existing social (legal, eco- 
nomic) institution can be seen only by a mental 
experiment: one must imagine what difference it 
would make if the institution were abolished. This 
is difficult, for institutions are rarely eliminated 
without something else taking their place. If there 
were no patent system, what would be done to pro- 
tect industrial secrets; to prevent the raiding of 
personnel with special know-how; to maintain, 
strengthen, or weaken the market positions of the 
technologically most progressive firms; or to sup- 
port research and development by trade associa- 
tions, by government bureaus, and by individual 
firms? Thus, should one analyze the benefits and 
costs of the patent system on the assumption that 
there would be other ways of promoting techno- 
logical progress or that there would be nothing of 
the sort? Obviously, arbitrary assumptions cannot 
be avoided. 

Social benefits. One may attribute to the patent 
system all additions to the national product that 
are obtained through the use of such technology 
as is invented, developed, or put to use thanks only 
to patent protection, actual or expected. Not all 
inventions actually covered by patents owe their 
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existence or utilization to the patent system. Many 
inventions would have been made in any case. On 
the other hand, some inventions not covered by 
patents may have been inspired by patented tech- 
nology or stimulated by the hope for patents. Per 
contra, the patent system may have prevented some 
inventions from being made or used; for example, 
if the strong patent position of one firm discourages 
others from working in the particular field. 

The most important distinctions are between 
(1) inventions which, without patent protection, 
would never have been made or used; (2) inven- 
tions which, without patents, would have been 
made or used only at a later time; and (3) inven- 
tions which would have been made or used at the 
same time. The additions to national product that 
are due to the third category of invention cannot 
be credited to the patent system; and those due to 
inventions of the second category can be credited 
only for the years of earlier use. 

There is no objective evidence for judging the 
relative frequency of inventions in the three cate- 
gories. Some patent lawyers think that almost all 
inventions fall into the first category; some sociol- 
ogists regard this as a nearly empty class and judge 
most inventions as belonging to the third category. 

The problem of estimating the increments of 
national product that depend on new inventions is 
quite complicated. It is soluble as far as cost-reduc- 
ing inventions are concerned; for quality-improving 
and product-innovating inventions one would have 
to use “welfare indices” based on hypothetical con- 
sumer surpluses. There is a further complication 
in the implications of economic growth. It is rela- 
tively easy to estimate the benefits due to a cost- 
reducing invention if the supply of labor and other 
resources remains constant, but if resources and 
demand grow from year to year, the effects of a 
cost reduction will increase accordingly. Thus, the 
present value of the increment to national product 
attributable to a particular invention will depend 
not only on the rate of interest but also on the 
prospective growth rate of the supply of resources. 

Social costs. When some economists said that 
the patent system costs society nothing, they must 
have meant that there was no net cost, that is, that 
the benefits exceeded the cost. Six categories of 
cost may be distinguished. z 

(1) Operating costs of the patent system. These 
are the remuneration of administrators, lawyers, 
technicians, and clerks in government, in the law 
offices, and in industry. One must include the time 
that scientists and engineers spend in expressing 
their findings in a form that qualifies them as pat- 
entable inventions. 

(2) Cost of research, invention, and develop- 


ment. These may be counted only to the extent 
that the activities are induced by the patent incen. 
tive. The money cost of research may understate 
the opportunity cost if, for example, scientists em- 
ployed in industrial research could do more produc- 
tive work in teaching or basic research. 

(3) Cost of innovation and resource realloca- 
tion. The innovation induced by patent protection 
requires resources in itself and also may accelerate 
the obsolescence of capital equipment and cause 
losses through the necessary retraining of displaced 
labor with specific skills. 

(4) Restriction of output due to limited use of 
patented technology. Such restriction is inherent 
in the system, since only by holding output below 
the “competitive norm” can any earnings be se- 
cured from the patent monopoly. In the case of in- 
ventions that would never have been made without 
patents, the loss due to output restriction may be 
entered into the social-cost account only if the 
benefits from the full utilization of the invention 
are credited to the patent system. For inventions 
that would have been made later without patents, 
the ratio of benefits to costs depends on how much 
earlier the new technologies come into use as a 
result of the patent system and for how many years 
their use is restricted under the exclusive rights. 

(5) Restrictions of output due to transcendent 
strengthening of monopoly positions. The acquisi- 
tion of strong patent positions may strengthen the 
market power of firms in areas other than those 
covered by its patents, leading to smaller outputs 
in these other areas. 

(6) Loss of output due to delayed invention and 
use, Losses occur where the existence of strong 
patent positions discourages potential inventors 
and potential users of inventions from invading the 
protected area. 

Net benefit or loss. Present knowledge do 
enable us to make numerical estimates of the tot 
social costs and benefits of the patent a 
Apodictic assertions about the “great” net bene: 
or net loss are without any foundation. Most © 
those who have made such assertions have not Sa 
considered the relative order of magnitude of t 
various items. 

While it may be impossible to estimate the e 
benefits and costs of the patent system, one Pee 
attempt to analyze the marginal benefits and aa 
of particular moderate changes in the dura oe 
scope, or strength of patent protection. For z of 
ple, one may examine the probable has 
lengthening (or shortening) the dural a 
patent term by one year. The marginal bene al prof: 
be found by considering what effects margin Ei 
it expectations would have upon marginal re 


es not 


and development budgets, upon real resources 
acquired with these funds, upon the flow of inven- 
tions and the portion of the flow that can be prac- 
tically applied, and upon the change in total out- 
put that depends on it. The marginal cost may be 
found by concentrating chiefly on cost items (2) 
and (4) in the list above. An analysis of this sort 
for a patent term longer than 15 years conveys the 
impression that the marginal benefits fall short of 
the marginal costs. 

Similar techniques of analysis may be employed 
for other changes in the patent system, such as the 
introduction of compulsory licensing or the exten- 
sion of the patent system to (or its withdrawal 
from) particular types of products or processes. 
Not that we may hope for a consensus of the ex- 
perts (with their judgments often affected by their 
interests), but some crucial issues can probably be 
isolated for special study. 

Present knowledge and ignorance. Even where 
economic analysis seems to have reached conclu- 
sive results, the conclusions often are politically 
ineffectual. There can be little doubt, for example, 
that in small countries and in nonindustrial coun- 
tries the social costs of the patent system exceed 
its benefits, chiefly because many of the benefits 
could be obtained there without granting patents. 
Nevertheless, these (perhaps newly independent) 
nations are anxious to adopt a patent system and, 
thus, to have the honor of paying higher prices for 
imported products. 

For large industrial countries the net effects of 
patents are as yet unknown. This ignorance is not 
likely to be overcome soon, since the information 
needed to test the essential factual assumptions 
does not exist. Until we have the evidence, we shall 
have to be cautious and skeptical regarding both 
the claims made by enthusiastic supporters of the 
patent system and the charges preferred by fanatic 
abolitionists. 

Fritz MACHLUP 


[See also INNOVATION; RESEARCH AND DEVELOPMENT. ] 
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PATERNALISM 


The derivation of the term “paternalism” from 
a Latin-English kinship term suggests its root 
meaning: a type of behavior by a superior toward 
an inferior resembling that of a male parent to his 
child—in most cases, a son. The precise forms of 
this behavior vary from society to society because 
the culture of kinship varies, and also because the 
nature of the tasks performed in paternalistic re- 
lationships vary. There are two general functions 
of paternalism: (1) the transmission of goods and 
services across gaps between generations and 
status levels and (2) the provision of means of 
access to resources to persons ordinarily excluded 
from such means. While economic phenomena 
probably are more commonly associated with pa- 
ternalism than are any others, almost any kind of 
social content can appear: political, intellectual, 
religious, or marital. 

Within different types of paternalistic systems, 
the following three basic ideas and two modes of 
motivated action can be found. First, since a 
“child” is defenseless and lacks property, he re- 
quires assistance and support. Second, since a 
“child” is not fully aware of his role and therefore 


not fully responsible, he requires guidance. These 
first two ideas underlie the benevolent mode of 
paternalistic action, where the superior person's 
actions are dominantly supportive of the inferior, 
The third idea holds that since a “child” is igno- 
rant, he can be deceived, or treated in such a way 
as to serve the interests of the “adult,” without be- 
coming aware of this. This third idea obviously 
leads toward exploitative paternalism. 

The theoretical relevance of paternalism was 
first noted by Max Weber, who developed the con- 
cept of patrimonialism ( [1906-1924] 1946, p. 297). 
Patrimonial relations were defined by Weber as 
consisting of those existing between a boss, em- 
ployer, feudal lord, or other similar figure and his 
band of henchmen, who give loyalty and obedience 
in return for protection. Weber pointed to the great 
historical depth of this type of relationship. It is 
well to remember that paternalistic systems con- 
tribute clear-cut solutions to the problem of com- 
plementarity of roles within a hierarchical struc- 
ture: status differences within a hierarchy can be 
organized by complementary services and functions 
so that the necessary tasks are performed smoothly 
and efficiently. 

In the present article we shall be concerned, 
first, with economic paternalism in plantation and 
industrial contexts. Second, we shall examine cer- 
tain patron-client relations in local communities. 
In many cases both economic paternalism and 
patron-client systems are modeled on roles and 
terminology of folk kinship systems. The selection 
of cases in these two contexts is by no means eX 
haustive, since paternalism can become a model 
for relationships in any hierarchical setting. How- 
ever, the cases selected for description represent 
the majority of types. 


Labor paternalism 


The plantation. Plantation systems in both the 


New World and the Old World have received consid- 
erable attention from social scientists interested in 
paternalism. These observers agree that plantation 
paternalism was in large part a projection of soci 

patterns found elsewhere in the national society: 


at 

For example, Siegel (1955, p. 405) no 
5 y rela 

everywhere in rural Brazil structural be found: 


similar to specific plantation forms are to 
“There is the set of social and economic 
which determine the relationship of the v44 cae 
the northeastern cowboy, to the ranch onmia a 
patterns linking the guarimpero to the aoa F 
and diamond buyer, and the camarada sys aradi 
the plateau region of southern Brazil” (a oe onal 
is an agricultural worker who enters into på 


patterns 
ueiro, 


istic agreements with a farmer). The fact that 
plantation paternalism is a symptom of more 
widespread social patterns highlights the general 
problem of change which these systems present to 
the social scientist. Paternalism in any setting 
must be considered from at least two points of 
view: (1) the specific causes which call it forth in 
a particular setting and (2) the extent to which its 
presence is related to traditional forms in the 
society at large. 

One cause of paternalism in plantation systems 
is the character of the labor force. Greaves (1959) 
and Stein (1955) see paternalism as the result of 
the existence of a poor and uneducated aboriginal 
labor force, which led plantation owners to supply 
many personal services in order to induce laborers 
to remain or, in the exploitative mode, in order to 
hold them as virtual slaves (see Alexander 1962, 
on northeast Brazil, where a slave-owning economy 
influenced the later plantation system). The case 
of Argentina is instructive in that paternalism in 
both the plantation system and industry has been 
much less marked than in other Latin American 
countries, because the labor force was drawn from 
better-educated European immigrants lacking tra- 
ditions of dependence (Alexander 1962). 

Also relevant to the discussion of causes is the 
economic format of the plantation: rural and rela- 
tively isolated, thus requiring that both managers 
and workers live at the site of work. In this situa- 
tion daily relations between management and labor 
take intimate forms, and the possibility of contrac- 
tual, functionally specific relations is less than 
in the case of an urban factory. This situation 
underlines a feature found in many paternalisms: 
the superior or boss is a kind of “gatekeeper”; 
access to rewards and necessities can be obtained 
only through him, and hence he can demand more 
than just adequate performance. The fact that the 
labor force is often derived from peasant or tribal 
communities plays a role insofar as these people 
are accustomed to authoritarian and dominance- 
Submission patterns based on kinship. However, 
Wolf (1959) saw the personalized relationships of 
the plantation as a “ritual pantomime of depend- 
ence”: not a genuine personalization, but forms 
used by the manager to soften the rigor of an in- 
herently exploitative labor system. 

In agricultural tenancy, paternalistic relations 
are generally less well developed than on the plan- 
tation. However, in parts of Japan ( Beardsley et al. 
1959) and in Lebanon (Sayigh 1958) the gen- 
bend low educational level of the tenants, plus 
pa levels of productivity, encouraged certain 

S of paternalism. 
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Change in labor paternalism. While there is no 
detailed survey specifically aimed at studying 
change, the cumulative evidence suggests that 
paternalism on plantations has been in a process 
of decline, especially since World War u. Adams 
(1959) points out that in Guatemala the rise of 
labor unions since 1944, coupled with the national 
government's labor security and land reform meas- 
ures, had the effect of dissolving the old patrón- 
mozo relationship between plantation owner and 
worker. Specifically, a hostile relationship devel- 
oped between workers and employer, promoting a 
willingness on the part of the former to look to 
labor unions and the government for the supports 
formerly furnished, at the price of low wages and 
semibondage, by the plantation owner. This case 
illustrates a phenomenon noted also in the case of 
Japan (Bennett & Ishino 1963, chapter 5): when 
external agencies began to supply unemployment 
relief and medical care to boss-organized dock 
workers, the workers deserted the boss systems and 
signed up with the public employment offices. In 
Hawaii the deeply rooted paternalism of the sugar 
and pineapple plantations has been giving way 
under the impact of unionization and the Ameri- 
canization of Japanese and Filipino laborers (Nor- 
beck 1959, p. 148). However, it should not be 
assumed from ‘these cases that all paternalistic 
systems automatically vanish when other forms of 
labor security are introduced. In the cases noted, 
there was evidence that these particular organiza- 
tions had strong exploitative tendencies and that 
workers rejected them because the price paid for 
security was no longer bearable. The process was 
also facilitated by changes in the national cultural 
climate toward more rational, universalistic values. 

In other cases paternalistic plantation systems 
have survived or even have been created under 
modern conditions because the workers needed 
these kinds of relationships and because the price 
of protection was not excessive. Ford (1955), in a 
report on Peru, notes that sugar plantation owners 
and managers would willingly give up paternalistic 
practices, as demanded by labor leaders, but can- 
not, because the workers expect this kind of treat- 
ment. Somewhat similarly, in Indonesia the Dutch 
introduced a paternalistic system on the sugar 
plantations not only because the labor force con- 
sisted mainly of small farmers removed from the 
land and, hence, devoid of any important support 
other than wage work, but also because the workers 
expected this type of treatment, having received it 
from wealthy Muslim employers under the basic 
obligation of charity in their religion (Hawkins 
1962; Geertz 1965). However, Hawkins (1962, 
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p. 84) also notes that in spite of these expecta- 
tions, workers on the plantations had “a strong 
feeling that work was something the Dutch im- 
posed upon them.” Thus, one might reasonably 
expect that as labor organization and government 
intervention continue to develop in the formerly 
colonial countries, paternalistic relations in the 
plantations will give way to a more rationalized 
employer-employee relationship, The idea of indi- 
vidual freedom of contract, coupled with national- 
istic feelings, may be sufficiently strong to dissolve 
the traditional sentiments. Moreover, there is no 
denying that labor paternalism, however benevo- 
lent, has an element of bondage; the worker simply 
is not free to move from job to job. Whether or not 
his personal welfare is better served under a freer, 
but less paternally secure, system is another issue. 

At least three points of view have emerged on 
the question of change in labor paternalism (see 
Odaka 1964). The first, or “Marxist,” position sees 
paternalism as a traditional residue which will 
pass when the final vestiges of “feudal” society 
have been removed. The second views paternalism 
as a necessary, and often humane, system of re- 
lationships which organizes modern economic 
effort in ways acceptable to the local cultural tra- 
ditions; hence, it need not disappear, since modern 
economic effort can be organized in ways other 
than those common in the Western countries (e.g., 
C. S. Belshaw 1955; 1960; Abegglen 1958). The 
third sees paternalism as a response to particular 
socioeconomic conditions associated with scarce 
capital, needs for concentration of capital in a few 
hands, and general low levels of living (e.g., Ben- 
nett & Ishino 1963). Thus, in this third view, pa- 
ternalism may appear or recur in any country dur- 
ing any historical period when these conditions 
are present. 

Industrial paternalism. The factory usually dif- 
fers from the plantation insofar as it is located in 
a town and does not require the isolated, in-service 
residence of both management and labor. This fea- 
ture may be less important in cases where the 
town is particularly small and/or the factory is 
the sole source of wage labor. Moreover, the fac- 
tory, except in a few cases where the organization 
is a lineal descendant of “preindustrial” systems 
(as in the case of provincial Japanese industry), 
was usually established on the concept of the labor 
contract; hence, the feature of bondage or inden- 
ture, so prominent in the case of the plantation, 
was generally of lesser significance. Finally, labor 
organization was introduced earlier and was more 
influential in the factory system, in any given case, 
than on plantations, because it stems from an 
urban—industrial tradition. 


However, as in the case of plantations, paternal- 
istic relations in factory systems have appeared 
wherever the national social structure contains 
such patterns to an appreciable degree, In Iran, 
Vreeland (1957, p. 166) notes, “the employer- 
employee relationship . . . derives from the ancient 
patterns of work organization . . . the traditional 
authority pattern found in the Iranian family and 
government. The worker . . . still commonly looks 
to his employer as a son looks to his father, for 
direction, for protection, for control.” In Brazil the 
cotton-textile factories built housing for their work- 
ers, established company stores, built churches on 
factory grounds, and financed the saints’ festivals 
(Stein 1955). In Japan the nineteenth-century 
spinning mills did similar things and even more— 
the plant owner became a foster father to the 
young girls indentured to him and even arranged 
marriages. In modern Japanese factories such ex- 
treme practices have withered, but a similar sense 
of responsibility often pervades the relationships 
between labor and management (Abegglen 1958). 
However, there is considerable evidence that the 
rise of vigorous labor unions in postwar Japan is 
contributing to a decline of this type of hierarchical 
mutualism. 

Industrial paternalism was common in the 
United States in the nineteenth and early twentieth 
centuries, and it continues in many parts of Europe 
today. The Pullman Company case is perhaps the 
classic example of American paternalism: the 
company created a total utopian world for its 
workers, including housing, but forbade unions and 
any sort of bargaining. In spite of the many fringe 
benefits, the Pullman workers mounted one of the 
most massive strikes in the history of the American 
labor movement. The Pullman and similar cases 
demonstrate the inherent incompatibility of pè- 
ternalism with a national social system which 
stresses individualism and free contract. Again the 
evidence suggests that paternalistic systems 
wither once these values become part of the ni 
tional culture, Paternalism endures if its partiot 
pants believe in it and if they are getting sona 
thing out of it at a bearable price. 

A case with transitional features is t 
Rico (Gregory 1960). Here the modern 
of unions and rationalized industry coe* gly, the 
traditional paternalistic practices. Interestin| Ta 
most effectively unionized sectors are the S A 
ones, while the many new factories often a nd 
have the most paternalistic systems. This draw 
plained by the fact that the new industries ©” 
their labor from a rural population th 
with Hispanic patronage practices. mete 
grants, eens to wage labor and Jost in th 
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teeming cities, look to the employer for comfort 
and protection. The existence of a benevolent, 
paternalistic governor also encouraged these senti- 
ments, and government paternalism in Puerto Rico 
has often been accused by intellectuals of fostering 
traditionalism and delaying individual responsi- 
bility. However, these are transitional aspects of 
a society undergoing rapid development; they per- 
haps bear out the point that a little paternalism 
may be a good thing in warding off the shock of 
rapid change—although it creates patterns of de- 
pendent behavior which must be changed in later 
stages of development. 

In Brazil industrial paternalism was developed 
after World War 1m, as a measure designed to help 
the workers adapt to a new industrial economy and 
to prevent them from turning to the unions and 
the Communist party (Alexander 1962, p. 178). 
The industrial federations pooled their resources 
to establish two organizations which provided med- 
ical care, schools, libraries, low-cost grocery stores, 
insurance, and many other services for workers in 
various industries. In Argentina, as noted, paternal- 
ism was generally of less importance because of 
the relative sophistication of the labor force. How- 
ever, after 1943 the expansion of industry brought 
large numbers of rural people and relatively un- 
educated urban proletarians into the factories; 
hence, the Perón government found an opportunity 
to develop paternalistic assistance. The conse- 
quence was the descamisados movement, the “shirt- 
less ones” who formed the bastion of Perén’s mass 
support. Thus paternalism, on a national indus- 
trial scale, can have major political significance 
(as in the Pullman case). 

In Turkey the state enterprises have provided 
considerable aid to workers in the areas of housing, 
medical care, food, and clothing. The 1936 labor 
code required private employers to provide similar, 
although not so elaborate, benefits. Critics claim 
that “a benevolent, though somewhat stifling pa- 
ternalism dominates the personnel policies of the 
State enterprises” (Rosen 1962, p. 264). These 
State and legalized private paternalistic practices, 
based on folk paternalism rooted in the Turkish 
Social structure, provide an example somewhat 
comparable to that of Puerto Rico. Folk paternal- 
ism has not become translated into institutionalized 
forms in so efficient a manner in all cases. 

The general features of both plantation and in- 
dustrial paternalism may be summarized. In both, 

e managerial element assumes responsibilities 
for workers over and beyond the basic contractual 
Provisions for wages and routine working condi- 
eee These responsibilities include various bene- 

ts, depending upon local customs: financial pro- 
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tections and guarantees concerning the payment 
of medical expenses for the worker and his family; 
basic services and supplies, such as clothing and 
food; the practice of carrying workers on the pay- 
roll in periods of business decline; housing; re- 
ligious facilities; wage differentials, depending 
upon size of family or length of service; and many 
others. Motives for extending such services are 
various and can take a benevolent form as well as 
calculating and exploitative modes. However, work- 
ers in all cases are expected to remain with the 
employer and to exhibit loyalty in return for these 
“fatherly” manifestations. Political support for the 
owner—manager class, in the event this class enters 
politics, is anticipated and used. 


Patron-client paternalism 


We turn now to a brief discussion of paternal- 
istic systems of a different sort: those which in- 
volve not an entire labor force or group but a 
relationship between two individuals, the “patron” 
and the “client” (sometimes more than one client). 
These patron-client systems are extremely wide- 
spread, since they usually do not require institu- 
tionalized recognition. They appear in such cul- 
turally disparate situations as among ranchers and 
cowboys in the North American West (writer's 
own research, unpublished); in the Japanese lum- 
bering and forestry industries (Bennett & Ishino 
1963); in Greek villages (Campbell 1963); and 
between castes in India (Kolenda 1963). Many of 
these systems take on quasi-kinship attributes. 

Kenny's description of the Spanish village patron 
may serve as a general model of the Mediterranean 
and Latin patron—client systems: “I define patron 
as . . . someone who is regarded (and who regards 
himself) at once as a protector, a guide, a model 
to copy, and an intermediary to deal with someone 
else or something else more powerful than one- 
self, whether or not such power is imaginary or 
real in a single context or in all, and whether or 
not the advantages to be gained from his patron- 
age are material or intangible” (1960, p. 15). In 
Mediterranean and Latin countries (both Old 
World and New World), the individual will seek 
out such patrons in order to receive certain bene- 
fits and protection; above all, the patron serves as 
an intermediary who can deal with the official and 
professional world. The specific methods of ac- 
quiring such benefactors differ from country to 
country, but their basic functions appear remark- 
ably constant. It is important to note that these 
functions are entirely desirable from the point of 
view of the client; that is, on the whole, patron— 
client systems, as a variant of paternalism, lack 
exploitative or clear-cut dominance-submission 
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features. In return for the favors granted, the 
client is of course expected to perform certain 
services, but generally these appear to be well 
within the limits of a permissible price. 

These functions are underlined in the reports on 
Greek patron—client systems by Friedl (1962) and 
Campbell (1963), who both point out that these 
systems actually result in a more equitable dis- 
tribution of facilities and privileges which other- 
wise would tend to be concentrated in educated 
and wealthy elites, The systems, moreover, serve 
a vital function in keeping the relatively anony- 
mous and isolated village population in touch with 
the institutions of the national society. 

This Mediterranean—Latin style of patron—client 
system is found in other parts of the world, notably 
certain parts of the Far East (see Dewey 1962, on 
Java). However, in these Oriental cases, including 
Japan, there is a much stronger element of feudal- 
derived noblesse oblige; that is, the patron and 
client have rather clearly understood, and often 
ritualized, obligations vis-à-vis each other. The 
local prestige of either can be injured if these ob- 
ligations are voided or ignored. In the Japanese 
case, Bennett and Ishino (1963) reported on 
patron-client systems in the Japanese forestry in- 
dustry, where a timber dealer would become a 
patron to a group of lumbermen (nakama), who, 
in return for his guarantee of steady employment 
and support even in economically depressed peri- 
ods, would agree to cut trees for him only and to 
work around his house and grounds. The dealer's 
prestige in the community was dependent upon his 
continued observance of the agreement, and the 
lumbermen would suffer a loss of prestige and work 
if they left his employ without notification. Ben- 
nett and Ishino contrasted this type of patron—client 
system with another paternalistic arrangement in 
the same community, where a big lumber entre- 
preneur had organized elements of the industry 
from dealers and lumbermen to factories, running 
the whole enterprise in a domineering manner. In 
Japan the latter type of arrangement, verging on 
the exploitative mode of the employer—-employee 
systems described earlier, is often referred to as an 
oyabun—kobun system (oya: parent; ko: child; 
bun: status). 

A patron—client system of considerable antiquity 
and pervasiveness is found in India, where it may 
be called the jajman-kamin relationship. The 
jajman is the patron figure, a person of a higher 
caste than the kamin, or client. The latter performs 
numerous services for the jajman, who pays the 
kamin in kind, with products of the agricultural 
economy, clothing, or other articles. Kamins may 


also perform informal services for their jajmans, 
such as communication to lower castes. In the 
very large number of analyses and critiques of 
analyses of this system, there are few which at- 
tempt to handle it as a variant of patron-client 
systems (see Kolenda 1963; Pocock 1962 for sum- 
maries of the literature). Some emphasize the eco- 
nomic distribution feature, while others emphasize 
role theory, power relations among castes, “feudal” 
relationships, and the like. 


Reviewing the evidence, it appears that paternal- 
istic systems have two main currents. One, stem- 
ming from purely instrumental institutions and 
goals with hierarchical structures, moves be- 
tween exploitative and benevolent modes, with the 
former possibly somewhat more prominent. The 
other stems from the genuine needs of inferiors 
and superiors in hierarchical social systems for 
mutual support; in this category we have the 
patron-client systems described, and also a large 
variety of other types, including professor—graduate 
student, master—apprentice, and _priest~neophyte. 
In all cases, persisting feudal or caste traditions 
can have a powerful influence on paternalistic 
arrangements. 

The question of trends toward disappearance or 
persistence of paternalistic relations is difficult to 
answer. In some contexts economic modernization 
and social democratization definitely encourage 
their decline; in other cases rapid change creates 
conditions exposing a proletariat to insecurity, 
which can create new forms of paternalistic de 
pendency. The survival or creation of a paternal- 
istic system depends on needs and on the existing 
social organizational patterns and traditions. T 
memories of suitable forms of paternalism, ani 
the requisite familial models, may survive ind : 
nitely, even in societies with highly vations 
economic and political institutions, and thus 


available for use if the need arises. 
Jonn W. BENNETT 


po-xınsHiP. Other 


į EU] 
[See also KinsuiP, article on PS TATIONS: 


relevant material may be found in PLAN 
JAPANESE SOCIETY.] 
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Donald Gildersleeve Paterson (1892-1961), 
American applied psychologist, was a powerful 
force in shaping both content and method in ap- 
plied psychology and in supervising the training 
of hundreds of students who now direct research 
and practice in applied psychology all over the 
world. He was especially active in the development 
of vocational and occupational guidance and be- 
came an important leader in the field of student 
counseling. He made significant advances in the 
measurement of intelligence, mechanical skill, and 
other abilities and wrote a notable book on the 
relation of physical to psychological traits. Outside 
these major areas of research, he became well 
known for a wide variety of contributions, ranging 
from studies on the legibility of type (see Paterson 
& Tinker 1940) to labor market research (see 
Minnesota, University of ... 1948). 

Paterson was born in Columbus, Ohio, the young- 
est of five children. Both his father and mother 
were totally deaf as a result of childhood illnesses, 
and his father taught at the state school for the 
deaf. After his mother died, when Paterson was 
four years old, his father married a woman who 
was also a teacher of the deaf. 

The Paterson children grew up learning sign 
language and the finger alphabet. In addition, they 
gained an intimate awareness of the special prob- 
lems the deaf individual faces in education and 
daily life, and they also acquired a sensitivity to 
the effects both of handicaps and of special abilities 
of all kinds. 

In 1910 Paterson entered Olivet College, where 
one of his sisters was teaching. Two years later he 
transferred to Ohio State University, where he fell 
under the spell of the dynamic young psychologist 
Rudolf Pintner, whose lectures in abnormal psy- 
chology Paterson found extraordinarily stimulating. 
Paterson at first rejected Pintner’s claim that some- 
thing as “intangible” as intelligence could be meas- 
ured objectively. After a vigorous class debate, 
Pintner challenged him to take the Binet test mate- 
rials home to test the children in his neighborhood. 
Paterson later wrote, “I . . . discovered the range of 
individual differences among these neighborhood 
children. From that day, I became enthusiastic 
about the possibilities of the mental test method” 
(1948, p. 51). 

Pintner then suggested, since Paterson had ready 
access to deaf children, that he find out if the Binet 
tests were applicable to this special group. Paterson 
found that they were not, so he and Pintner to- 
gether set about developing their own tests for the 
deaf. Paterson later wrote, “Before I knew it, we 


were hard at work on the development of nonlan- 
guage performance tests. He (Pintner) had entered 
on a lifetime program of research on problems of 
the deaf and he quite simply swept me along with 
him” (ibid., p. 51). 

Paterson took his A.B. in 1914 and his M.A, in 
1916 under Pintner. By the time he left Ohio State, 
he and Pintner had written 15 articles and a pio- 
neering book, A Scale of Performance Tests (Pint- 
ner & Paterson 1917), In four years Paterson had 
mastered the rigors of test construction and stand- 
ardization, had committed himself to a career in 
research, and had become an apostle of applied 
psychology as an instrument for the realization of 
individual, educational, and social goals. 

In 1916 Paterson was appointed instructor in 
psychology at the University of Kansas, where 
Walter Hunter was chairman. With Hunter's en- 
couragement he began a series of studies of the 
reliability of mazes and wrote a statistical critique 
of published maze studies, pointing out that the 
averages reported had not been tested for statistical 
significance; when they were, most proved to be 
only unreliably different (1917). This kind of at- 
tack, new to psychology, was published in the im- 
portant journal Psychological Bulletin and must be 
counted as one of the specific forces which drove 
psychology toward a more scientific methodology. 

It was also at Kansas that Paterson met Margaret 
Young, the only student in Hunter’s large class who 
could answer the first tough question Paterson 
asked in his quiz section. They were married in 
1920 and had two children. 

In 1917 Paterson left Kansas to undertake psy- 
chological work for the U.S. Army, first in a civilian 
capacity and later as an officer. He attained the 
rank of captain in the Surgeon General's Office, be- 
coming chief psychological examiner at Camp 
Wadsworth. Here he gained experience in group 
testing with the Army Alpha and Beta ee 
test derived in large part from the work of Pintn 
and Paterson) and firsthand contact with the a 
sive problems of personnel evaluation and plae 
ment that confronted the army. real 

After the war, Paterson joined the newly i ie 
Scott Company, one of the first modern psyc e 
ical consulting organizations. He remained Ai 
1921, when R. M. Elliott, a close friend from 

s ZA 3 associate pro- 
army days, invited him to accept an 


ee n- 
- fessorship in psychology at the University ofM 


si- 
nesota. In 1923 he was made full professor, 4 PO 


tion he held until his retirement in 1960. fancy, 
The department at Minnesota was in its Ne art- 
having been separated from the philosophy P 
ment in 1919, and Elliott, its first chairma” i to 
building a new kind of psychology departm 


meet the needs of the postwar world. Elliott and 
Paterson worked together to bring about the bright 
future they saw for applied psychology. At the 
same time, Paterson became associated with Dean 
John B. Johnston in effecting far-reaching develop- 
ments in the university's program of service to the 
individual student. The “Minnesota point of view” 
which emerged had three guiding principles: con- 
cern for the individual, with his unique combina- 
tion of abilities, interests, and attitudes, all of 
which should be assessed and evaluated in relation 
to his opportunities, choices, and goals; respect for 
objective data and methods of data collection; and 
the belief that proper data would provide a sound 
basis for action. This program produced a large 
number of studies aimed at solving immediate and 
limited problems, as well as research on more gen- 
eral problems having psychological and social 
importance. 

Paterson was keenly responsive to the social con- 
ditions of his time, and this sensitivity, coupled 
with his basic orientation, produced enduring proof 
that psychology can play an important role in so- 
ciety, The need for better vocational guidance led 
to the Minnesota Mechanical Abilities Project; the 
great depression of the 1930s led to the Employ- 
ment Stabilization Research Institute and, later, to 
the Industrial Relations Center; and the dream of 
individual guidance within mass education led to 
the Student Counseling Bureau, all at the Univer- 
sity of Minnesota. In each of these efforts Paterson 
was an instigator and worker, establishing proto- 
types for similar studies and programs throughout 
the country. 

Outside the university Paterson’s influence was 
central in the organization of the Minnesota Col- 
lege Association and of a statewide college testing 
program, He worked for the founding of national, 
state, and city organizations in applied psychology 
that would stimulate opportunities for training psy- 
chologists, exchanging professional communica- 
tions, and using informed lobbies where political 
and social action was needed. In addition, Paterson 
occupied many national advisory positions, such as 
the chairmanship of the Technical Committee of 
the National Occupational Conference from 1933 
to 1938, membership in the Occupational Research 
eam of the U.S. Employment Service from 

934 to 1940, and chairmanship or membership in 
numerous committees of the American Council on 
Education for developing new instruments for eval- 
uating achievement. 

Paterson and his colleagues at Minnesota argued 
that the master’s degree should be recognized as 
a practicing professional degree; they believed the 
need for applied psychology was so great that the 
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demand for workers could not be met by psychol- 
ogists at the PH.D. level. Although Paterson’s point 
of view has won many adherents, it is still not 
accepted, for many pressures and sentiments tend 
to decree that the professional psychologist should 
hold the doctorate. 

Approximately three hundred students earned 
M.A. degrees and 88 earned PH.D. degrees under 
Paterson’s guidance, Additional thousands of stu- 
dents were influenced by him in his courses in 
individual differences, personnel psychology, and 
vocational and occupational psychology. He com- 
municated his zest for research to his students. 


_ While the total contribution of his students is 


neither precisely known nor complete, a booklet 
prepared in his honor at his retirement lists over 
1,400 books, articles, and other publications by his 
students, in addition to his own bibliography of 
more than 300 items (Viteles et al. 1961). 

From 1943 to 1954 Paterson edited the Journal 
of Applied Psychology and at various times also 
served as consulting editor for almost every applied 
psychology journal published in the United States. 
His skill in unraveling a manuscript and dissecting 
a body of data was unmatched. Students and col- 
leagues alike profited from his keen (if sometimes 
lethal) comments. 

Paterson’s research interests and talents exhib- 
ited great diversity. As an example, from 1928 to 
1955 he and Miles Tinker conducted a series of 
studies of the legibility of type, with a major land- 
mark, How to Make Type Readable, appearing in 
1940. The central theme of most of his varied ac- 
tivities, however, was his concern with the individ- 
ual person. 

In his social and political views Paterson pre- 
sents a reproof to those who associate a strong 
hereditarian position with right-wing opinions. He 
firmly believed (even in matters where such beliefs 
are unpopular) that the limits of psychological traits 
are genetically fixed, but he was always alert to 
data which suggested modifiability of traits and 
was constantly searching for better educational 
strategies and techniques which would permit the 
individual to capitalize on the aptitudes he pos- 
sessed. His devotion to the individual led him to 
defend with ardor every form of civil liberty. He 
was a founder of the Minnesota branch of the 
American Civil Liberties Union and for six years 


edited its newsletter. 
James J. JENKINS 


[Other relevant material may be found in INTELLI- 
GENCE AND INTELLIGENCE TESTING; PSYCHOLOGY, 
article on APPLIED PSYCHOLOGY; VOCATIONAL INTER- 
EST TESTING; and in the biography of BINET.] 


480 PATRONAGE 


WORKS BY PATERSON 


1917 The Johns Hopkins Circular Maze Studies. Psycho- 
logical Bulletin 14:294-297. 

1917 PintNER, RUDOLF; and PATERSON, DonaLp G. A 
Scale of Performance Tests. New York: Appleton. 

1930 Physique and Intellect. New York: Appleton. 

1930 The Measurement of Man, by J. A. Harris, R. E. 
Scammon, Donald G. Paterson et al. Minneapolis: 
Univ. of Minnesota Press. 

1930 MINNESOTA, UNIVERSITY OF, DEPARTMENT OF Psy- 
cHoLocy Minnesota Mechanical Ability Tests, by 
Donald G. Paterson et al. Minneapolis: Univ. of Min- 
nesota Press. 

1936 PATERSON, Donatp G.; and DARLEY, JoHN G. Men, 
Women and Jobs: A Study in Human Engineering. 
Minneapolis: Univ. of Minnesota Press. 

1938 PATERSON, DONALD G.; SCHNEIDLER, GWENDOLEN 
G.; and WirLiamson, Epmunp G. Student Guidance 
Techniques: A Handbook for Counselors in High 
Schools and Colleges. New York: McGraw-Hill. 

1940 PATERSON, Donatp G.; and TINKER, MILES A. 
How to Make Type Readable: A Manual for Typog- 
raphers, Printers, and Advertisers. New York: Harper. 

1943 Rudolf Pintner. Journal of Consulting Psychology 
7:50-52. 

1948 MINNESOTA, UNIVERSITY OF, INDUSTRIAL RELATIONS 
CENTER Local Labor Market Research: A Case Study, 
by Dale Yoder, Donald G. Paterson, H. G. Heneman 
et al. Minneapolis; Univ, of Minnesota Press. 

1953 PATERSON, DONALD G.; GERKEN, D. D'A.; and HAHN, 
MiLTon E. Revised Minnesota Occupational Rating 
Scales. Minneapolis: Univ. of Minnesota Press. 

1961 JENKINS, JAMES J.; and PATERSON, DONALD G. 
(editors) Studies in Individual Differences. New York: 
Appleton. 


SUPPLEMENTARY BIBLIOGRAPHY 
VirELES, Morais S. et al. 1961 Vocational Counseling: 


A Reappraisal in Honor of Donald G. Paterson. Min- 
neapolis: Univ. of Minnesota Press. 


PATRONAGE 

See CIVIL SERVICE; CREATIVITY, article on so- 
CIAL ASPECTS; FINE ARTS; OFFICE, MISUSE OF; 
PATERNALISM. 


PATTERN ANALYSIS 
See CLUSTERING. 


PATTERN VARIABLES 

See ANGLO-AMERICAN SOCIETY; COMMUNITY- 
SOCIETY CONTINUA; ECONOMIC GROWTH, article on 
NONECONOMIC ASPECTS; INTERACTION, article on 
SOCIAL INTERACTION; SYSTEMS ANALYSIS, article 
On SOCIAL SYSTEMS. 


PAVLOV, IVAN PETROVICH 


Ivan Petrovich Pavlov (1849-1936), Russian 
physiologist, psychologist, and Nobel prize winner, 
was born in the city of Ryazan, the eldest in a 
family of ten. His father was a priest and teacher 
of Greek and Latin, his grandfather and great- 


grandfather were sextons and produce farmers, His 
great-great-grandfather, Mokey Pavlov, was a freed 
serf, the son of a serf who had no surname and 
whose Christian name, Pavel, became the family 
name of succeeding generations. Pavlov’s mother 
was also the daughter of a priest, yet was herself 
illiterate; she was known in the vicinity for her 
ability to heal jaundice by having her patients gaze 
steadily at a pike swimming in a tank of water. 
Pavlov was enrolled in the second grade of the 
First Ryazan Parochial School in 1860, at the age 
of 11 (an injury caused by a fall had delayed his 
formal schooling), and in the Ryazan Theological 
Seminary in 1864. He left the seminary a year be- 
fore graduation to enter the division of natural 
sciences of the physicomathematical faculty of the 
University of St. Petersburg in 1870. The radical 
change in his outlook and career plans appears to 
have been brought about by his reading of Russian 
translations of Jacob Moleschott’s Physiologisches 
Skizzenbuch and G. H. Lewes's Physiology of Com- 
mon Life, as well as the writings of D. I. Pisarev, 
and particularly, I. M. Sechenov’s “Refleksi golov- 
nogo mozga” (“Reflexes of the Brain”; 1863), 
which Pavlov had managed to digest in his semi- 
nary days. ” 
Pavlov was graduated from the university in 
1875 and became a student of the Imperial Medico- 
surgical Academy in the same year. As a university 
student, he had performed two original experi- 
ments, one with V. N. Veliky on “the effect of the 
laryngeal nerve on blood circulation” and “the af 
ferent accelerators of tachycardia,” the other with 
M. Afanas'ev on “the nerve supply of the pancreas. 
An abstract of the report of the first experiment 
was published in 1874 in the Transactions of the 
St. Petersburg Society of Naturalists. Pavlov was 
awarded a gold medal for his second experiment, 
a full report of which was published in 1878 in the 
highly authoritative Pfliiger’s Archiv für die ge 
samte Physiologie des Menschen und der M 
These two experiments substantially foreshadowe 
Pavlov’s research interests for the next 16 years 
innervation of the vascular and digestive eet 
As a student at the academy, Pavlov served ie 
as a research assistant to K, N. Ustimovich ei 6 
veterinary division and later as an assistan! n 
Sergei P. Botkin in the clinical division. He we 
graduated from the academy in 1879 with the ae 
usual record of 11 scientific publications, ont 
of which were collaborations and seven of Ht ie 
appeared in Pflügers Archiv. He stayed on â his 
academy for postgraduate research, conte 
work in the physiological laboratory of ae 
clinic. His 77-page doctoral thesis, Rowe. 
nervy serdtsa” (“Efferent Nerves of the He 


appeared in 1883. In 1884 he was accorded the 
rank of lecturer in physiology and awarded a two- 
year foreign travel fellowship that he spent at the 
laboratory of C. F. W. Ludwig in Leipzig and at 
that of P. Haidenhain in Breslau (he had visited 
the latter laboratory earlier, as a student). 

In 1890 Pavlov was made professor of pharma- 
cology of the academy and director of the physio- 
logical laboratory of the newly founded St. Peters- 
burg Institute of Experimental Medicine. He 
became professor of physiology in the academy in 
1895, holding the post until 1924, when the 
Soviet Academy of Sciences established a special 
Institute of Physiology under his direction and be- 
gan planning the transfer of its laboratories to the 
village of Koltushi (now named Pavlovo), about 
twenty miles from Leningrad. The compound of 
imposing buildings of the Institute was completed 
in 1935 and has since expanded. Comprising not 
only outstanding and interrelated research labora- 
tories but also neuropsychic clinics and young 
children’s experimental schools, Pavlovo is a unique 
scientific community, centering on what is perhaps 
man’s leading life science—behavioral change 
(conditioning) and related neural action. Pavlov 
was elected a member of the Russian Imperial 
Academy in 1907, a member of the U.S. National 
Academy of Sciences in 1908, and a member of 
the American Philosophical Society in 1932. His 
honoris causa doctorates include those awarded by 
the universities of Geneva, Vienna, Cambridge, 
Edinburgh, and Paris. In 1935, at the age of 86, 
Pavlov was honorary president of the Fifteenth In- 
Savana Congress of Physiology, held in Lenin- 
grad. 


Behavioral psychology 


Pavlov’s 62 years of almost unceasing experi- 
mentation (eight days before his death he dis- 
cussed and directed the conduct of nine experi- 
ments on dogs and discussed and interpreted the 
behavior of five mental patients) illustrate well the 
Scientist's way of learning—by studying “more and 
more about less and less.” His first 16 years of 
research centered on uncovering the neural mech- 
anism of blood circulation and digestion; by 1890 
he had restricted his research almost wholly to the 
mechanisms of digestion, a fact cited when he was 
awarded the Nobel prize; and after 1903 he con- 
centrated almost entirely on the study of saliva- 
tion. As early as his doctoral dissertation, Pavlov 
had begun to uphold Botkin’s view of nervism— 
the nervous system in some way affects both nor- 
mal and pathological activities of the organism. 
(This doctrine continues to make Russian medicine 
and physiology distinctive by emphasizing the func- 
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tional over the organic, becoming over being.) 
However, Pavlov went much beyond Botkin. He 
combined nervism with objective associationism, 
thereby founding a new life-science discipline— 
“higher nervous activity’—in his own country and 
laying a firm basis for objective and behavioral psy- 
chology and for behavioral sciences in general. His 
specific physiological discoveries and surgical in- 
novations (e.g., the Pavlov pouch, permitting the 
collection of pure gastric juice) developed into a 
general, novel and, to a large extent, revolutionary 
system of studying normal and abnormal action in 
man and animals. 

Objective associationism. In a more specific 
sense, one might say that almost all of Pavlov’s 
research revolved around the action of reflexes— 
in the nineteenth century, unlearned, or what he 
came to call “unconditioned,” reflexes and in the 
twentieth century, his famous “conditioned” re- 
flexes [see LEARNING, article on INSTRUMENTAL 
LEARNING]. Here he was influenced by a distinctive 
feature of the life sciences in Russia: Sechenov's 
concept of the reflex. Whereas in western Europe 
and in America the reflex denoted only a special 
reaction—peripheral, segmental, simple, quick, un- 
conscious, involuntary—Sechenov had made it the 
essence of all reactions. As early as 1860, in his doc- 
toral dissertation, Sechenov had declared that vol- 
untary reactions are reflexive in nature. He devel- 
oped this idea into an all-embracing and radical 
life-science philosophy in his classic “Refleksi . . Ka 
(1863). All significant animal and human reac- 
tions, wrote Sechenov, are (a) determined and de- 
terminable, i.e., evoked by measurable and con- 
trollable physical stimuli and (b) mediated by the 
nervous system. Hence, they are all reflexes, 
whether they be simple or complex, peripheral or 
central, involuntary or voluntary, unconscious or 
conscious, physiological or psychical. The reflex 
was set forth not only as the unit of body action 
but also of body—mind or body-including-mind ac- 
tion. Indeed, present-day Soviet psychologists typi- 
cally state that Sechenov uncovered the reflex na- 
ture of the psyche, meaning thereby that he upheld 
the materialistic—or physicalistic—philosophy that 
psychical reactions invariably originate in physical 
stimulation and invariably are mediated by neural 
reactions. [See the biography of SECHENOV.] 

Although Pavlov was fully familiar with Seche- 
nov’s writings, he did not at first realize their full 
implication. In the 1890s, during his experiments 
with digestion, he ascertained that glandular secre- 
tion is by no means only a function of measurable 
physical stimulation. However, at that time, he 
freely used the concept of “psychic secretion,” 
which, as he wrote later, he had “then contrasted 
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sharply with reflex secretion”; he also spoke of the 
experimental animals’ “thoughts, desires, and feel- 
ings” ([1897] 1955, pp. 83-85.) By 1898, however, 
Pavlov had deleted from the manuscript of his 
pupil's, S. G. Wolfson’s, doctoral dissertation, “The 
Work of the Salivary Glands,” several paragraphs 
containing such sentences as the “role of the 
psyche in salivary secretion is of course undeni- 
able” and the “psyche determines the work of the 
salivary glands, choosing between acceptable and 
rejectable substances.” And in 1901 he took similar 
exception to a psychic interpretation in a disserta- 
tion, entitled “Analysis (of the Normal Conditions) 
of the Work of Salivary Glands in the Dog,” by 
another of his students, A. T. Snarsky. Pavlov 
wrote that: 


Dr. Snarsky clung to subjective interpretations of the 
phenomena, but I, taken aback by the fantasy and 
scientific barrenness of the approach, began looking 
for another way out of the difficult position. After per- 
sistent thought and mental conflict I finally decided 
that with regard to so-called psychic stimulation I 
must remain in the role of a pure physiologist, that 
is, of an objective external observer dealing only with 
external phenomena and their reactions. . . . The chief 
stimulus for my decision, although then an uncon- 
scious one, arose out of an impression made upon mé 
in my youth by the brilliant monograph of I. M. 
Sechenov, father of Russian physiology, entitled Re- 
flexes of the Brain (1863). ([1923] 1928-1941, vol. 1, 
pp. 38-39) 

The decision in favor of the objective-physio- 
logical approach was fully realized in 1903 in an 
address before the Fourteenth International Medi- 
cal Congress. Entitled “Experimental Psychology 
and Psychopathology in Animals,” the address put 
forward clearly and cogently the Pavlovian doctrine 
of unconditioned and conditioned reflexes, forever 
replacing psychic secretion with conditioned- 
reflex secretion and declaring boldly that: “Vital 
phenomena that are termed psychic are distin- 
guishable from pure physiological phenomena only 
in degree of complexity. Whether we call these 
phenomena psychical or complex-nervous is of 
little importance, as long as it is realized and 
recognized that the naturalist approaches them 
only objectively, leaving aside the question of their 
essence” ([1903] 1928, pp. 59-60). 

This statement combines with two others—first, 
that “the physiology of the higher nervous system 
[read: the behavior] of higher animals can be suc- 
cessfully studied only if one completely renounces 
the indefinite formulations of psychology and 
stands wholly upon a purely objective ground” 
([1904] 1928, p. 75), and second, that “the natu- 
ralist has no right to speak of higher animals’ 


psychic processes [Pavlov’s italics] without desert. 
ing the principle of natural science—which is the 
work of the human mind directed to nature through 
studies that derive their assumptions and inter. 
pretations from no other sources than nature itself” 
([1906] 1928, p. 82). These statements were surely 
clarion calls to behavioral objectivism and were 
sounded in almost the same form a decade later 
by John B. Watson, the father of American be- 
haviorism. Yet it must be made clear that Pavlov 
was not simply a “general behavioral objectivist’ 
but a specific “objective associationist.” His historic 
contributions inhere not just in advocating the 
objectification of categories of the mind through 
behavior categories but in providing a specific ob- 
jective method of studying the chief traditional 
explanatory principle of mind—associationism. 
What had, ever since Aristotle, been merely imaged 
and meditated on, Pavlov made to flow, so to speak, 
in capillary tubes and electric batteries. Through 
the conditioned reflex, the study of association 
became subject to all the refinements and meas- 
urements and means of verification of modern 
science and technology. 

The conditioned reflex. In its original, classical 
paradigm the conditioned reflex demonstrates that 
a stimulus initially inadequate to evoke some re 
flex may become adequate after it has Been ad- 
ministered one or more times together with a 
stimulus adequate to produce the reflex. Pavlov 
called the inadequate stimulus the conditioned or 
to-be-conditioned stimulus; the adequate stimulus 
was called the unconditioned stimulus; the reflex 
in response to the originally adequate stimulus, 
the unconditioned reflex; and the reflex in response 
to the originally inadequate and later adequate 
stimulus the conditioned reflex. Later, Anatolii G. 
Ivanov-Smolenskii (1927-1952), working in Pav- 
lov’s laboratory, J. Konorski and S. Miller, in their 
own and in Pavlov’s laboratory (see Konorski pe 
and Starytsin in Bekhterev’s laboratory, ie 
fied the original Pavlovian paradigm. Similarly, j 
the United States the paradigm was modified ve 
was called instrumental conditioning (Hilgard er 
Marquis 1940) or operant conditioning (Skinn h 
1938), and it stimulated a great deal of reseate 
and thought in the field. The modified paana 
however, added little to Pavlov’s empirical laws fe 
and generalizations about, conditioning, its te 
tribution being more in the realm of expan 
theory. a 

Apnoea six thousand successful ara 
ments employing the exact Pavlovian Pa 
of pairing unconditioned with to-be-condi ds 0 
stimuli have by now been reported. All kin 


organisms, from protozoa to men and from neo- 
nates (even fetuses) to the most advanced human 
and animal seniles, and all kinds of reflexes and 
stimuli have been used in these experiments. More- 
over, conditioning has been related experimentally 
to almost all other known organismic changes, 
either as the changes affect the conditioning or 
as the conditioning affects the changes. To cite 
two out of hundreds of examples: data from eight 
experiments are available on the effects of “ex- 
perimental clinical death” on conditioning, that is, 
on the fate of conditioned reflexes when an animal 
is experimentally put to death and then revived 
(Negovskii 1954); and data from 82 experiments 
are at hand on the effect of ionizing radiation on 
conditioning (Voprosy . . . 1962). Reports of condi- 
tioning experiments exist in 29 different languages, 
although the large majority are in Russian and 
English. 

Early in his experimentation Pavlov noted that 
the study of conditioning yielded sets of inter- 
related functional laws or generalizations and 
suggested related sets of testable hypotheses for 
further laws or generalizations. On such a basis 
a complete systematic discipline could be erected, 
and Pavlov gradually proceeded to do so. 

The chief functional laws or generalizations con- 
cerning empirical conditioned-reflex data estab- 
lished by Pavlov include: (a) extinction, the 
gradual diminution and final disappearance of a 
conditioned reflex, when the conditioned stimulus 
is repeatedly administered without the uncondi- 
tioned stimulus; (b) spontaneous recovery, the 
partial reversal of the effects of extinction and re- 
gaining of the original conditioning in the course of 
time—a sort of forgetting of extinction that is 
greater than the forgetting of the original condi- 
tioning; (c) generalization, the partial transfer of a 
conditioned reflex to stimuli similar to the condi- 
tioned stimulus (stimulus generalization) and to re- 
flexes similar to the conditioned reflex (reflex gen- 
eralization); (d) differentiation, the training that 
involves alternating presentations of the condi- 
tioned stimulus, accompanied by the unconditioned 
stimulus, with presentations of related stimuli, un- 
accompanied by the unconditioned stimuli (differ- 
ential conditioning); (e) higher-order conditioning, 
the formation of a conditioned reflex through pair- 
ing a to-be-conditioned stimulus with a previously 
Conditioned stimulus (instead of an unconditioned 
stimulus); (f) dynamic stereotypy, the patterning 
of successively applied conditioned stimuli in such 
a way that the positional order of each stimulus 
rather than its specific nature determines the 
nature and magnitude of the conditioned reflex 
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evoked by the stimulus; (g) disinhibition, the par- 
tial regaining of conditioning, when a partially or 
totally extinguished conditioned stimulus or a par- 
tially or totally differentiated similar stimulus is 
applied together with some new or extra stimulus— 
putative indication that extinction and differentia- 
tion are inhibitory processes that may themselves 
be inhibited (i.e., disinhibited) by the action of 
an extra stimulus. Obviously these generalizations 
are basic to the understanding of the mechanisms 
of change in human and animal behavior—ways 
through which the change occurs in normal (and 
abnormal) life situations, methods and techniques 
for its full analysis in the laboratory, and means 
of producing and controlling it through special 
education. 

In the main, Pavlov’s formulations—although 
not necessarily his whole system—have been veri- 
fied and accepted. In a glossary in a recent stand- 
ard textbook on conditioning and learning (Hilgard 
& Marquis [1940] 1961) 36 terms are attributed to 
Pavlov and only 25 to all other students of the field 
combined. A perusal of the two current official 
periodicals of American experimental psychology 
—the Journal of Experimental Psychology and the 


-Journal of Comparative and Physiological Psy- 


chology—reveals that their contributors are still 
very much concerned with Pavlov’s work, perhaps 
more so than with the work of any other physiolo- 
gist or psychologist. A recent survey of members 
of the American Psychological Association placed 
Pavlov second only to Freud in the members’ esti- 
mate of who most influenced contemporary psy- 
chology. And there is no doubt that Pavlov would 
have ranked first, had the survey been confined to 
experimental psychologists. 

Conception of higher nervous activity. Pavlov 
named the new life-science discipline of the forma- 
tion and disruption of conditioned reflexes “higher 
nervous activity,” and in his first book on the topic 
(1923) he added to it the term “behavior” in pa- 
yentheses. This addition indicated both Pavlov’s 
realization that others—notably American psychol- 
ogists—saw the new discipline as the study of 
behavior and also his intention to convey that, while 
conditioned reflexes become manifest in behavior, 
their interpretation inheres in higher-nervous or 
cortical action—indeed, that they reveal this action 
and have no real systematic or scientific import 
without it. Since, however, no technique for di- 
rectly probing higher-nervous or cortical action was 
as yet known in Pavlov’s day, he proceeded to de- 
velop a hypothetical, yet highly comprehensive, 
system of such action: a postulated complex inter- 
play of neural excitation and inhibition, their ir- 


484 PAVLOV, IVAN PETROVICH 


radiation, concentration, and induction, underlying 
all behavioral conditioned-reflex manifestations. 

The Pavlovian neuroconditioning system was 
often called by others a conceptual (i.e., inferen- 
tial) nervous system: a system based only on in- 
ferences—which is all that it originally was. 
Modern research, however, with its direct electro- 
encephalographic, microelectrodic, and even bio- 
chemical probing of neural action during condition- 
ing, seems to show that Pavlov’s inferences were 
for the most part correct. In any event, his original 
conditioning scheme, which was like the modified 
paradigm, provides complete information on de- 
pendent and independent variables—control and 
measurement of both the conditioned and the un- 
conditioned stimuli and measurements of condi- 
tioned reflex, unconditioned reflex, and original 
reflex in relation to conditioned stimulus—and has 
in recent years become the most serviceable means 
of correlating direct neural action and conditioned 
action, thus unveiling the true physical basis of 
learning. This enterprise, one of the most chal- 
lenging in our age, is also one in which the con- 
tact between research workers in the East and 
West has been very close, a fact witnessed by the 
Colloquium on Electroencephalography of Higher 
Nervous Activity in Moscow in 1958, the Macy 
Conferences on the Central Nervous System and 
Behavior in 1958, 1959, and 1960, the Montevideo 
Conference on Brain Mechanism and Learning in 
1959, the New York Pavlovian Conference on 
Higher Nervous Activity in 1961, the California 
Conference on Brain and Behavior in 1961, and 
several others. 

While a system of behavior based wholly on 
conditioned reflexes might be expected, at least 
superficially, to be highly mechanistic and reduc- 
tionistic and not to draw a basic distinction be- 
tween animal and human learning, Pavlov had, 
in his later years, prevented his system from having 
these defects. In 1927 he wrote: “Of course a word 
is for man as much a real conditioned stimulus as 
are other stimuli common to men and animals, 
yet at the same time it is so all-comprehending that 
it allows no quantitative or qualitative comparisons 
with conditioned stimuli in animals” ( [1927] 1960, 
p. 407). In 1932 Pavlov asserted that speech, and 
especially the kinesthetic stimuli to the cortex 
from the speech organs, are second signals—sig- 
nals of signals. These second signals are in es- 
sence abstractions of reality and means of gener- 
alization uniquely characteristic of human higher 
thought ([1932a] 1941, p. 69); and, similarly, also 
in 1932; “In man there comes to be . . . another 


system of signalization, a signalization of the first 
system . . . a new principle of neural action is 
[thus] introduced” ({1932b] 1941, p. 113), This 
is the Pavlov “second-signal system” principle that 
distinguishes verbal conditioning, or language ac- 
quisition, in man from first-signal conditioning in 
men and animals. And it is a principle that is 
clearly broader than that of most American behav- 
iorists, for whom language is either a mediator 
operating essentially according to the laws of the 
reactions that it mediates or is merely a condi- 
tioned vocal reaction. 

Psychopathology. From the very beginning 
Pavlov was concerned with the relation of condi- 
tioning to psychopathology (as indicated in the . 
title of his 1903 address )—an unusual concern for 
that time. His interest continued, and in 1920, in 
Pavlov’s laboratory, N. R. Shenger-Krestoynikova 
demonstrated empirically how a conditioned reflex 
produces experimental neurosis. Thereafter Pavlov 
devoted a large porticn of his experimental re- 
search and thought to this topic, supplementing 
his laboratory work with frequent visits to mental 
clinics. He postulated the “clash” of excitation and 
inhibition as the general cause of psychological 
disturbances and “protective inhibition” as a cor- 
rective cure. He assumed the existence of special 
psychopathological forms of neural action, which 
he termed “paradoxical,” “ultraparadoxical,” “total- 
inhibitory,” and “egalitarian”; he investigated them 
through unconditioned and conditioned behavioral 
manifestations, In the early 1930s K. M. Bykov, 
and E. Sh. Ayrapet'yants and V. L. Balakshina, 
began a stimulating series of experiments on inter- 
oceptive conditioning (conditioning in which direct 
stimulation of an internal organ is the conditioned 
stimulus ). These experiments, which suggest prom- 
ising leads to a fully objective science of psycho- 
pathology and of psychosomatics, are only now 
beginning to be duplicated in American labora 
tories. M. K. Petrova (1925) was an early pioneer 
in drug and sleep therapy, while a number of Pav- 
lov’s other students made both diagnostic and thera- 
peutic use of concepts based on interaction beo 
the patients’ first-signal and second-signal systems. 
[See also NEuROsIs; PSYCHOSOMATIC ninas 7 

Pavlovian psychology, indeed learning psycho 
ogy in general, has become so replete with temp 
interpretations of psychopathology that uao 
nately a good deal of uncritical writing has it 
published both in the United States and in ae 
Soviet Union. Yet there is no denying the porh ‘iil 
soundness of the approach, In the Soviet AA 
it is the only one in existence, and in the Un 


States it is gaining adherence in the face of com- 
peting approaches—notably psychoanalysis; there 
continue also to be many efforts to synthesize Pav- 
lov with Freud. 


Pavlov and American psychology 


As already indicated, Pavlov, Bekhterev, and the 
earlier work of Sechenov had in essence set forth 
the requisite principles for an objective and behav- 
ioral psychology a number of years before Watson 
launched behaviorism, However, while Watson was 
generally apprised of the “Pawlow method” through 
a short English summary in 1909, he learned of 
the Russians’ systematic principles only in 1914, 
when as he stated himself, he read the 1913 French 
and German translations of Bekhterev’s Objective 
Psychology. Watson’s 1913 article, “Psychology as 
a Behaviorist Views It,” in which behaviorism was 
first promulgated, mentions nothing of Russian 
research, and there is no doubt that he developed 
his system wholly independently. Yet Watson re- 
lated himself unequivocally to the Russian enter- 
prise afterward. When he was elected president of 
the American Psychological Association, the title 
of his presidential address was “The Place of the 
Conditioned Reflex in Psychology” (Watson 1916). 
He continued to uphold the conditioned reflex as 
the high hope fora true science of man and so- 
ciety and to produce challenging Pavlov-like and 
Bekhterev-like laboratory experiments. Yet Wat- 
son’s conditioned-reflex edifice—and that of early 
classical behaviorism in general—was in the main 
only programmatic and general, lacking both Pav- 
lov's well-knit system and its abundance of empiri- 
cal material. Between 1916 and 1930 American 
behaviorism dealt with a single “law” of condition- 
ing rather than with system-forming interrelated 
“laws,” and the number of American experiments 
in the area was only 16 compared to the Russians’ 
two hundred. 

The development of systems of behavior based 
predominantly on the conditioned reflex was begun 
in the United States in the early 1930s, a few 
years after the English translations of Pavlov’s 
two major books on the topic. This period marked 
the rise of American neobehaviorism—primarily 
the work of C. L. Hull, B. F. Skinner, E. R. Guthrie, 
and E. C. Tolman. Roughly, the two systems, Pav- 
lovian and American, differ in that Pavlov's is a neu- 
robehavioral, or S-N-R (stimulus > neural process 
> response), system, while American neobehavior- 
istic Systems are in the main positivistic, or autoch- 
thonous-behavioral, systems: S-R ( stimulus > re- 
sponse) or S-O-R (stimulus > organism > response), 
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where O is a postulated matrix of mediating vari- 
ables. Yet it is also true that in significant respects 
the difference between the Pavlovian system and 
the American neobehavioristic systems may well 
be less than the difference among the American 
systems themselves. Each American neobehavioris- 
tic system is broadly based on Pavlov-established 
conditioning laws or generalizations, plus its own 
superstructure. 

In the United States and elsewhere outside the 
Soviet sphere, Pavlov has influenced all concepts of 
modifiable behavior or learning—what are known 
as the vertical dimensions of mind. However, in 
the Soviet Union itself his influence has also ex- 
tended to mind's horizontal dimensions—through 
the doctrine of types of nervous systems based on 
alleged genotypic individual differences in five 
phases of neural action (strength of excitation and 
of inhibition, mobility of each, and balance be- 
tween the two). Roughly speaking, the doctrine 
posits that individual variations in unconditioned 
and conditioned effector reactions—in recent years 
also in electroencephalographic and biochemical 
reactions—demonstrate that the nervous systems 
of men and animals fall into several specific ge- 
netic types that most clearly differentiate all 
phases of living and behaving: from susceptibility 
to disease (and to ionizing radiation) and life ex- 
pectancies, to work styles, motor deftness, modes 
of thinking, and of course temperament, person- 
ality, and emotional balance. Even variation in 
general intelligence—“generad ability” in Russian 
terms—is held to be a function of the type of 
nervous system. To American researchers the doc- 
trine seems hardly credible, yet it has not been 
tested in any significant manner outside the Soviet 
Union. [See GENETICS, article on GENETICS AND 
BEHAVIOR. ] 

As of today one might argue with good reason 
and abundance of evidence that Pavlov’s neurobe- 
havioral S-N-R system is gaining the upper hand 
over American pure behavioral systems—S-R and 
S-O-R. Recent spectacular advances in techniques 
of neural recording and interpretation of the infor- 
mation thereby obtained stress the need for inte- 
grating neural and behavioral data and relating 
the latter to the former. The earlier American view 
that the neural has no information significant to the 
analysis of the behavioral no longer holds. Modern 
psychology demands that the neural level have, and 
continue to have, an important role in behavioral 
analysis as neo-behaviorism is gradually being re- 
placed by brain-behaviorism, which is in essence 
identical with Pavlovian higher nervous activity. 
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There is also strong evidence that Pavlov was on 
the right track when he accorded true verbal con- 
ditioning, or language acquisition—what he called 
the second-signal system—a higher ontological 
status and when he refused to class it as merely a 
conditioned vocal reaction, thereby upholding the 
qualitative distinctness of man. Consider the clini- 
cal-neurological evidence that the human speech 
area is found in the associational cortex, while the 
mechanisms of mere vocalization—animal and 
human—are imbedded in the deep-lying mesen- 
cephalon. Moving away rightly from anthropomor- 
phizing animals, psychology need not bounce to 
the other extreme—human zoomorphism. 


Social science implications 


In general terms Pavlov expressed a “deep, irrev- 
ocable and uneradicable conviction that this path 
[study of conditioned reflexes] is the path of the 
final triumph of the human mind over its last and 
uppermost problem—full knowledge of the laws 
and mechanisms of human nature and thus full, 
true, and permanent happiness . . . deliverance 
from present gloom and the purge of present-day 
shame in interhuman relations” ({1923] 1928- 
1941, vol. 1, p. 41), He did not specifically attempt 
to extend his principles to any system—or indeed 
any analysis—of social and societal behavior, hold- 
ing social science to be embryonic. But Pavlov’s 
students and particularly Bekhterev and his stu- 
dents did thus utilize conditioning concepts. How- 
ever, in the middle 1930s, sociology and social 
psychology ceased to be independent fields of study 
in the Soviet Union, and the entire approach lay 
fallow. The two fields are now being revived and, 
by all tokens, so is a Marxist-Leninist and Pavlo- 
vian approach to them. In the United States the 
systematization of social phenomena in terms of 
conditioned reflexes met with considerable approval 
in the 1920s and early 1930s, as witnessed by the 
influence of Allport’s Social Psychology. But in the 
mid-1930s phenomenology and psychoanalysis be- 
came dominant in both sociology and social psy- 
chology, presumably because they could more read- 
ily be applied to immediate social problems. There 
can be no doubt, however, that ultimately, as in the 
case of psychopathology, societal and individual 
social-attitudinal changes are intimately related 
to learning processes and conditioning principles. 
Thus, a Pavlovian social psychology and sociology 
that is critical, comprehensive, and well system- 
atized may well re-emerge with the ongoing rapid 
development of psychophysiology in the United 
States. 
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PEACE 


Social science has uncovered more knowledge 
about war than about peace, just as psychology 
probably has yielded more insights into negative 
deviance (such as mental illness) than into posi- 


_tive deviance (such as creativity). Unfortunately, 


studies tend to be focused on wars as units of anal- 
ysis rather than on periods of peace, and there is 
a tendency to define peace simply as “nonwar.” 
Thus, peace thinking has had a tendency to become 
utopian and to be oriented toward the future; it has 
been speculative and value contaminated rather 
than analytical and empirical. It is conceivable that 
this might change if research were to be focused 
more on peace than on war. 

Two concepts of peace should be distinguished: 
negative peace, defined as the absence of organized 
violence between such major human groups as 
nations, but also between racial and ethnic groups 
because of the magnitude that can be reached by 
internal wars; and positive peace, defined as a pat- 
tern of cooperation and integration between major 
human groups. Absence of violence should not be 
confused with absence of conflict: violence may 
occur without conflict, and conflict may be solved 
by means of nonviolent mechanisms. The distinc- 
tion between these two types of peace gives rise 
to a fourfold classification of relations between 
two nations: war, which is organized group vio- 
lence; negative peace, where there is no violence 
but no other form of interaction either and where 
the best characterization is “peaceful coexistence”; 
positive peace, where there is some cooperation 
interspersed with occasional outbreaks of violence; 
and unqualified peace, where absence of violence 
is combined with a pattern of cooperation. 

The conception of peace as “nonwar” is neither 
theoretically nor practically interesting: as used, 
for instance, in describing the relationship that 
obtains between Norway and Nepal, it can often 
be explained in terms of a low level of interaction 
resulting from geographical distance and thus 
will hardly be identified by many as an ideal rela- 
tion worth striving for. For peace, like health, has 
both cognitive and evaluative components: it desig- 
nates a state of a system of nations, but this state 
is so highly valued that institutions are built 
around it to protect and promote it. It is the concept 
of positive peace that is worth exploring, especially 
since negative peace is a conditio sine qua non and 
the two concepts of peace may be empirically re- 
lated even though they are logically independent. 

In the absence of what one might call solid 
empirical research and a coherent peace theory, 
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the concept of peace can best be explicated by 
means of an examination of peace thinking. Just 
as there has been no lack of attention paid to war 
(Sorokin 1937-1941; Wright 1942; Richardson 
1960a), so there is no scarcity of peace plans 
(Wynner & Lloyd 1944; Hemleben 1943; Doob 
1960; Hinsley 1963; Murty & Bouquet 1960), and 
an extensive typology would be needed to do full 
justice to most of the latter. The approach here will 
be to present the outlines of such a typology, and 
to bring the insights and knowledge of social sci- 
ence to bear upon some of these ideas. For peace 
is a problem of social organization, and the theory 
of peace and war will someday be subsumed under 
the general theory of social organization. 

A typology of peace plans. A major axis for the 
classification of any peace plan is the level of or- 
ganization that it singles out for reform, The prob- 
lem of peace, in the mind of a person proposing 
something, can be located at any of five levels: the 
level of the isolated individual; the level of human 
groups; the level of human societies or nations; the 
level of the international system of nations; and 
the emerging level of a world state, For simplicity, 
the first three can be collapsed, leaving only three 
levels of peace plans: the subinternational, the in- 
ternational, and the level of the world state. 

Of these three major types of peace plans, the 
first is not oriented toward the international system 
at all, whereas the other two are, but in very differ- 
ent ways, Plans that focus on the international 
system seek to preserve the essential characteris- 
tics of the present system of nation-states while 
organizing it in such a way that it will be stabilized 
at some level of interaction that can be character- 
ized as “peaceful.” Those who seek a world state 
take for granted that some kind of integration is a 
necessary condition and ask how this integration 
can be stabilized in favor of peace. Obviously, the 
world-state approach is oriented toward both posi- 
tive and negative peace, whereas the focus on the 
international system asks less of the system in 
terms of positive peace as long as only negative 
peace is obtained. 

Within each class of models there are a number 
of more specific ideas; we shall mention some of 
particular importance. All of them take their point 
of departure from one specific variable used to 
characterize the system, and the model is identi- 
fied with one particular value of that variable. 


Subinternational peace plans 


There are several well-known approaches at the 
subinternational level, and although they play a 


minor role in contemporary thinking about peace, 
they should be mentioned because of their preva. 
lence. The basic idea of intrahuman approaches to 
peace is that intergroup, and also interhuman, con. 
flicts are nonrealistic conflicts (Coser 1956) and, 
more particularly, projections of intrahuman con. 
flicts. Hence, if man could be freed from more of 
his inner conflicts, he would behave in a less ag. 
gressive manner at the international level. Past 
generations’ techniques of freeing individuals from 
internal conflicts depended on religious conversion, 
whereas in contemporary societies psychotherapy 
is more frequently called for—if not for the whole 
population, at least for its leaders, and if not for its 
present leaders, at least as a screening device for 
future leaders (Klineberg 1964). 

Interhuman approaches to peace emphasize the 
idea of projection of interhuman rivalry onto higher 
levels of human organization and, more positively, 
the idea that training in peaceful conflict resolution 
at lower levels may be transferred to higher levels, 
including the international level. Life in the family, 
at school, at work, or in associations may be seen 
as possible training grounds, particularly for those 
who are to become world leaders. 

Likewise, intrasocietal approaches to peace em- 
phasize the idea of projection: societies that are 
especially conflict-ridden will use external aggres 
sion as a means to force internal cohesion. This 
approach also includes the idea that some political 
systems are more peace-loving than others; more 
particularly, great importance is given to a fair dis- 
tribution of the wealth of a nation to its inhab- 
itants, 

The major objection to these subinternational 
approaches is, of course, that there is a confusion 
of levels of analysis. Wars between nations take 
place at the international level; this level is sut 
generis and requires analyses and reforms at that 
level. Lower levels may be of some importance, but 
their impact is likely to suffer a quick decrease as 
one moves into the foreign offices and other on 
of international policy decision making. This is Be 
to deny the possible importance of screening app! 
cants for key positions in the international system 
in order to prevent an accumulation of trusts 
that may be converted into aggression pene 
portant decision makers. This proposal assumes 
clinical psychology is further advanced in o pe 
and prognosis than in the cure of the menta ie 
viant, And it is probably also true that the A 
training ground for desired types of behavior dof 
real-life situations that approximate the kini 
situations in which decision makers work. 


As for the intrasocietal approach, a study by 


Michael Haas (1965) shows that it has so far 


failed to produce very significant correlations be- 
tween intrasocietal structure and external behavior. 
There is some evidence that democratic societies 
are less belligerent and also that more developed 
societies are more belligerent, The latter is con- 
firmed by a study from the International Peace Re- 
search Institute in Oslo, based on Quincy Wright's 
data (Broch & Galtung 1966). Moreover, there is 
nothing that seems to confirm the widely held idea 
that a major increase in the standard of living of 
the world population or a fairer distribution of the 
fruits of man’s labor would contribute significantly 
to a more peaceful world. A better distribution may 
solve internal problems but at the same time free 
resources for external aggression. 


International peace systems 
Various suggestions for international peace plans 


are both theoretically and practically more promis- 


ad than those that focus on the subinternational 

Based on distribution of power, Most peace 
thinking has centered on the problem of how power 
shall best be distributed among the nations of the 


= world. Theories relating to this are usually marred 


by the neglect of other kinds of power than coer- 
cive power; influence potential in its most general 
sense is rarely considered. If we stick to this tradi- 
tion of studying the distribution of military power, 
there are four major models of peace. 

The first model is that of minimum equality, 
which is based on the theory that the international 
System is best served by making power the monop- 
oly of one nation or system, just as it is monop- 
olized by some statuses in the intranational system. 
Examples are the Pax Romana, Pax Ecclesiae, and 
Pax Britannica—and the contemporary efforts to 
establish a Pax Americana or a Pax Sovietica before 
the stalemate was crystallized in the idea of a sys- 
fem of peaceful coexistence. 

The second model focuses on maximum equality, 


_ OF what is usually referred to as a “balance of 


Power” (Kaplan 1957), in the sense that no nation 
Or alliance is strong enough to defeat another na- 
tion or alliance. A modern version is the “balance 


Of terror,” in which a nation may defeat other na- 


tions, but only at the risk of being completely 

yed itself, War becomes impossible under the 

lance of mutual destruction of a Pax Atomica: 
risks are too great. 

A third model views military powers as stabi- 


lized at a low level; this refers to all kinds of arms- 
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control efforts, especially those that have taken 
place from the Hague Peace Conference of 1899 
to the present day, including contemporary think- 
ing that aims at subtracting from a Hobbesian 
bellum omnium contra omnes both some means of 
violence and some objects of violence. The idea 
is to rule out general and complete war. 

Finally, there is the model that views power as 
stabilized at a zero level; this refers to the general 
(all nations) and complete (all weapons) disarm- 
ament advocated by pacifists, Pacifism asserts that 
this state may be obtained unilaterally by the effect 
of example, because weapons become meaningless 
when they do not encounter similar weapons, and 
by the refusal of soldiers to use arms, as well as 
by governmental decisions. 

Where the model of minimum equality is con- 
cerned, there might perhaps be agreement among 
nations about the appointment of a police nation 
in the world but not about its consequences—i.e., 
that coercive power usually will be accompanied by 
other kinds of influence. The police nation, it may 
be feared, will abuse its power and impose its value 
system and sociopolitical system on other nations, 
This would be acceptable if there were a general 
value consensus, in which case the system would 
be close to one of the many world peace systems 
described below. 

The major difficulty in the model of maximum 
equality seems to be that the system, although in 
momentary equilibrium, is not in stable equilib- 
rium, It is based on the relative evaluation of two 
power potentials, and since military power is many- 
dimensional (because it consists of many weap- 
ons systems), this evaluation may be far from con- 
sensual. There will always be room for the idea 
that one’s own power is not sufficiently developed, 
Thus, the basis is laid for arms races, and it is 
difficult to see any good theoretical justification for 
the thesis that there will be points of stability—for 
instance, that major technical breakthroughs will 
not occur (Richardson 1960a; Galtung 19644). The 
need for sufficient retaliatory power after an en- 
emy’s first strike also makes the terror balance 
unstable, Overkill capacity in peace is no argument 
against a continued arms race; what matters Is 
what is left after the first strike, 

One major difficulty in the model that focuses 
on arms control is the arbitrariness of all border 
lines between permissible and illegitimate weapons. 
For such border lines to be consensually accepted, 
they must be protected by some kind of disconu- 
nuity (Schelling 1960), such as the clear line that 
existed between conventional and nuclear weapons 
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before the overlap in destructive power became too 
conspicuous with the introduction of the variety of 
tactical atomic weapons. Thus, such agreements 
are likely to be highly technical, difficult to under- 
stand, easy to evade, and difficult to supervise. 

Where the model of general and complete dis- 
armament is concerned, one major objection is its 
failure to consider the need for countervailing 
power. One evader of an agreement may dominate 
the total system if he has an absolute weapon at his 
disposal. For this reason, general and complete dis- 
armament can preserve peace only if the distribu- 
tion of power in the system accords with the mini- 
mum.-equality model (or with one of the world 
peace systems, discussed below), or if provision is 
made in the system for the effective use of non- 
military forms of power (economic sanctions, non- 
military defense systems, etc.) against those who 
evade disarmament agreements. It should be em- 
phasized, however, that there is no reason to believe 
that weapons, once they exist, will have to be used. 
Any city has in its hardware stores and its phar- 
macies more than enough weapons and poisons to 
provide for overkill capacity, and this weaponry is 
usually not very well protected, Nevertheless, it is 
not usually abused, and the reasons for this may 
also become operative at the international level. 

Based on organization of conflicts. The second 
general type of international peace plan focuses on 
the organization of conflicts. The basic model here 
is the crisscross model, which is based on the idea 
that a system is strengthened, not weakened, the 
more conflicts it harbors, provided all these con- 
flicts do not divide the units the same way. If two 
nations are allies in one conflict (for instance, be- 
tween East and West, in the language of the cold 
war), they may nevertheless be antagonists in an- 
other conflict (for instance, between rich and poor 
nations), and this subjects them to cross pressures, 
The effects of cross pressures are a tendency toward 
withdrawal and neutrality, or nonalignment; the 
development of multiple loyalties that prevent com- 
plete identification and involvement in any conflict; 
and a tendency to serve as a channel of communi- 
cation between the groups that are not exposed to 
cross pressure. 

A major weakness of this theory is that one con- 
flict will probably be defined as more salient than 
the other, so that the crisscross effect will be re- 
duced. Another weakness is that if both conflicts 
are concerned with struggle for rank (for instance, 
political and economic rank), then nations that are 
high on one and low on the other will be in “rank 
disequilibrium” and will probably feed much ag- 
gressiveness into the system (Galtung 1964b). 


Based on individual loyalty conflicts. Whereas, 
according to the crisscross model, peace is obtained 
when nations are caught in cross pressures, other 
peace models imply that the violence potential of 
nations is reduced when divided loyalties are in. 
duced in individuals. One such plan seeks to ma- 
nipulate multiple national loyalties: the idea is to 
impede conflict polarization by institutionalizing, 
preferably across potential conflict border lines, 
secondary and primary relations between individ- 
uals from different nations. In practice, the sugges- 
tions are many: for example, mixed marriages, ex- 
changes of all kinds (children, youths, students, 
professionals, politicians), and increased knowl- 
edge and empathy. 

A second plan seeks to build crosscutting organ- 
izational loyalties. The division of the world into 
about 130 nations and territories provides, roughly, 
an exhaustive and mutually exclusive division of 
mankind. But it is also divided into nearly two 
thousand international organizations, which provide 
many individuals with other focuses of identifica- 
tion, although this division is far from exhaustive 
(and not exclusive either). With increased com- 
munication, geographical distance decreases in im- 
portance; thus, organizational identification may 
gradually gain in importance relative to national 
identification and eventually provide a multiple 
loyalty that would make it difficult to organize in- 
dividuals in wars against their peers. 

Probably the only major objection to the idea of 
multiple national loyalties is that it may be of rel- 
atively low significance, given the conformity of 
most people to the calls of their national leaders. 
However, the idea of crosscutting organizational 
loyalties raises some important theoretical issues. 
An organization is built around values but may 
span a wide spectrum of geography—possibly the 
whole world. Conversely, a nation is built aroun 
a territory, usually based on geographical contigu- 
ity, but may span a wide spectrum of value orien- 
tations if it is of the pluralist variety. However, ee. y 
is also the classical idea of the nation-state, which i$ 
both homogeneous in values and contiguous in a 
ritory and in which the inhabitants are protecte 
against violent internal ideological conflicts because 
of cultural homogeneity. (Belgium and India ee 
vide examples of what may happen when this ¢ i 
dition does not obtain.) This internal homogeni Ši 
is bought at the expense of heterogeneity bale i 
nations, which means that complete identifica a 
with the nation-state yields a tremendous C0) 

i i with complete 
potential. And this is also the case a 
identification with the organization, as long Je 
geographical proximity means something. Peop 


seem to want others to share their values, and in 
a world divided into organizations they would not 
be protected against the proselytizing efforts of 
rival organizations by the walls set up by the 
nation-state. Hence, where identification is con- 
cerned, the most peaceful world is probably found 
at an intermediate point, with most people sharing 
their loyalty equally between the two focuses or 
some identifying with the national focus and some 
with the organizational. 

Based on degree of homology. Homology be- 
tween nations—the extent to which they are sim- 
ilar in social and political structure, so that each 
institution and status in nation A has its “opposite 
number” in nation B—has been singled out as a 
factor of importance. There are two models of 
peace based on this concept. In the models of 
minimum homology nations are as different as 
possible, in the sense that they have different 
value structures; thus, they will not compete for 
the same scarce values but can establish a very 
specific pattern of interdependence—a kind of sym- 
biosis based on complementarity. On the other 
hand, where maximum homology prevails, nations 
are as similar as possible in social structure and 
value structure, thus facilitating diffuse interaction 
and creating a value consensus. It is argued that 
the more similar nations are in terms of their cul- 
ture (including language), the lower is the prob- 
ability of misunderstanding. The theoretical basis 
is the thesis of “value homophily’—i.e., that sim- 
ilarity tends to produce liking and interaction. 

Again, it is difficult to see that there is any guar- 
antee for peace in any of these theories. In partic- 
ular, there is no basis for equating value consensus 
with absence of conflicts; value consensus means 
that the same objects are valued positively, and if 
the objects are scarce, the probability of conflicts 
is higher, not lower. Misunderstandings may cause 
conflicts, and they may also conceal them. And 
Where minimum homology is concerned, as be- 
tween the European powers and the political sys- 
tems found in Africa and America at the time they 
were opened up to European penetration, it seems 
quite likely that one nation will prevail over the 
other—militarily or culturally—and unlikely that 
the symbiosis will be stable. But again there may be 
a curvilinear relationship at work: it may be that 
at a very high level of mutual penetration, values 
S shared to the extent that a basis for cooperation 
n pursuing the same goals, such as high produc- 
tivity or happiness, may be present. To the extent 
that this is the case, it means that on the path to 
maximum homology there is a danger zone to pass, 
where the pursuit of goals has still not been coordi- 
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nated and the goals are both consensual and 
scarce. 

Based on international stratification. We may 
assume that international stratification—that is, 
the ranking of nations as high or low on such rank 
dimensions as size, population, power, natural re- 
sources, income per capita, cultural level, social 
level, and urbanization—is multidimensional and 
that there is a tendency toward rank equilibrium, 
in the sense that nations tend to divide into those 
that are high and those that are low on most or 
many dimensions. The problem, as for systems of 
individuals, is how the interaction between nations 
is regulated; there seem to be two models that focus 
on regulating “class conflict” at the international 
level. 

First, there is the feudal type of system, where 
there is a high level of interaction at the top and a 
low level at the bottom—that is, the international 
system is tied together at the top by trade, diplo- 
macy, and all other kinds of exchange. Although 
there is very little interaction at the bottom, there 
may be some from top to bottom. This system is 
easily controlled by the wealthy nations; if there 
is a consensus among them, the system may 
achieve a stability similar to that of the caste sys- 
tem or of slave societies. 

By contrast, in what may be called the modern 
system there is an equal level of interaction at the 
top and at the bottom: the “underdogs” unite in 
organizations that make it possible for them to 
countervail the influence of the “top dog” nations. 
Thus, one might envisage a kind of trade union of 
small nations that is able to strike against the big 
nations, organize embargoes to obtain better prices 
for raw materials, etc., and thus force upon the 
world a more equitable distribution of world output. 
Over time the importance of class lines may then 
decrease in the international system. 

Again, there are many objections to these models. 
Most people today would probably object to the 
feudal system on a purely ideological basis. There 
is also the important difference between intra- 
national and international levels of organization: 
the upper class in a national society is often marked 
by solidarity in its relationship to the lower classes, 
whereas upper-class nations in the international 
system have a higher tendency to fight among 
themselves (which is just one more expression of 
the lack of value and system integration at the 
international level). Thus, the international system 
tends to be divided into subsystems, each of which 
consists of one big power and its satellites, with a 
feudal structure both within and between these 
subsystems. 
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The modern system seems more promising, but 
that may only be because it has never really been 
tried. One objection is that it lends itself readily to 
international class warfare of poor and small na- 
tions against rich and big nations. A likely variant 
is that one big power will try to win over as many 
as possible of the satellites of the other big powers 
and become the leader of these satellites in such 
class warfare. Another objection is that the modern 
system will probably have less rank equilibrium, 
that is, there will be more nations that can be 
ranked high on one dimension of the international 
stratification system and low on one or more of 
the others. This will increase the potential for inter- 
national aggression. 

The latter objection, at least, seems consistent 
with current developments: the “revolution of 
rising expectations” that is taking place has lifted 
some lower-class nations over temporary disequilib- 
ria, such as being rich but not powerful, toward 
new positions of equilibrium at a higher level, such 
as being both rich and powerful after capital has 
been converted into weapons and weapons into 
territorial gains. Thus, as in the case of models 
based on degree of homology, there is probably a 
curvilinear relationship at work between the sys- 
tem elements, which could mean that the period 
ahead will be a particularly difficult one. 

Based on degree of interdependence. Some 
models of peace are based on interdependence 
between nations. Interdependence, or interaction 
where some kind of positive value is exchanged 
between the parties to the interaction, may vary 
in frequency (how often), volume (how much 
is transferred), and scope (variety of value ex- 
changed). Trade between two nations is a good 
example of how all three can vary independently. 
There are two principal models based on this con- 
cept. The model of minimum interdependence en- 
visages a world where each nation is autonomous 
and self-sufficient and no nation intervenes or in- 
terferes in the affairs of any other nation; this is 
a clear case of negative peace, where positive peace 
is explicitly ruled out (Burton 1965). On the other 
hand, in the model of maximum interdependence 
all pairs of nations have maximum interaction in 
terms of frequency, volume, and scope. The idea 
is that all pairs of nations would be protected from 
rupture and violent conflict by the web of affilia- 
tions spun between them; positive interaction with 
other nations would be built into each nation in 
such a way that wars would be too costly. Thus, all 
nations should enjoy relations somewhat like those 
between Norway and Sweden. 

A major difficulty with these two models lies in 


their failure to deal with the essential character- 
istics of violence. Violence in interindividual rela. 
tionships seems to be at a maximum under the 
conditions of either maximum or minimum scope 
of interaction. In the case of maximum interac- 
tion, which sociologists would refer to as “diffuse 
interaction,” conflicts may lead to extreme patterns 
of violence, as seen in civil wars or in enmity be- 
tween former friends (Coser 1956). In the case of 
no interdependence, empathy serves less as a pro- 
tection against application of extreme violence. In 
general, most violent conflicts are both preceded 
and accompanied by polarization of attitudes (Cole- 
man 1957); in this case polarization is already 
built into the system. Thus, we are probably justi- 
fied in assuming a U-shaped relationship between 
scope and intensity of conflicts, which would mean 
that precisely the two values singled out in these 
two models—that is, the values of minimum and 
maximum interdependence—are the most danger- 
ous ones. 

The probability of conflict, however, may be 
highest when the interaction is specific—that is, 
limited in scope but not down to zero—but in that 
case conflicts are most likely to be regulated by 
contract and rational calculation, so that they will 
be less intense. With very diffuse interaction be- 
tween nations, conflicts along one dimension of 
interaction may be dampened by the other dimen- 
sions, but they may also be reinforced and esca- 
lated; moreover, even if this is improbable, the dis- 
utility of open conflict is high. This does not mean 
that there does not exist a level of interdependence 
so high that the probability of all-out conflicts is 
almost zero, but on its path toward that point the 
international system will have to pass through dan- 
gerous zones of nonzero probability and high in- 
tensity of conflict. Moreover, statistics on Ta 
show that they are highly correlated with the a 
gree of interdependence between nations (Wrig i 
1942); only rarely are wars directed toward TA 
plete strangers. Finally, it should be on : 
that much of what is called “economic develop 
ment” is an effort to obtain self-sufficiency an 
hence to reduce the need for interdependence. Ai 

Based on functional cooperation. ne of 
interdependence that implies only agate eer 
exchange should not be confused with funci erie 
cooperation, which implies that the parties oS ane 
produce something they may then share. Copr z 
tion is one form of functional cooperation, aS w as 
several nations combine resources in order Be 
duce something really big, such as 4 wee ae 
commercial airplane or a gigantic deve ae al 
project. There is good reason to believe th 


the level of individuals, functional cooperation on 
equal terms is one of the factors most efficient in 
producing integration (Berelson & Steiner 1964, 
p. 513). This hypothesis of functional cooperation 
has often been put forward in connection with 
nations (by President Kennedy, for instance, in 
connection with the moon race). The idea, then, is 
that any specific cooperation provides training for 
more comprehensive cooperation. 


World peace systems 


All of the following models of world systems 
have in common a certain resemblance to a nation- 
state, usually one held to be successful by the 
person who puts forward the proposal. The idea 
is that since many nation-states have obtained rea- 
sonable security and equity for their inhabitants, 
there must be something in their structure that is 
worth copying at the world level. Proposals vary, 
but they all have in common the idea of a center 
of decision making with means at its disposal to 
obtain compliance from the constituent units. Of 
the many dimensions that can be used to describe 
such models only three will be discussed. 

First, models of world systems can be described 
in terms of the type of unit on which the system 
is based. When the basic unit is the individual, the 
world system is conceived as a world state, with 
a very low level of autonomy for intermediate 
levels, such as the nation. With nations as units 
the world system becomes a confederation, with 
the nation as a political level interposed between 
the individual and the government. Congruence 
between the authority structures of nation and con- 
federation may have a stabilizing effect on the 
system as a whole (Eckstein 1961). 

The difference between these two models is 
rarely argued in terms of their relevance for peace. 
Rather, the world confederation is seen as an in- 
termediate step in a more gradualist approach 
toward the world state or as a system with the 
built-in protection of some internal autonomy. Also, 
there is the idea that border lines should be pre- 
served to some extent, precisely because they slow 
down cultural diffusion and influence and thus con- 
tribute to the preservation of sociocultural pluralism 
—which many fear might disappear in a world 
State because of the homogenizing effect of a strong 
a of decision making. But cultural differ- 

ces in a pluralist system may serve as focuses of 
Conflicts or, at least, as focuses for prejudices and 
ue forms of discrimination. The costs of plural- 
sm would therefore have to be calculated in terms 
of potential for conflict. 

Models of world systems can also be described 
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in terms of their scope and domain. By “scope” we 
refer to the variety of needs satisfied by the world 
system; and by “domain .” to how many receive need 
satisfaction from the system. 

Classification in terms of these two system func- 
tions produces two basic models. The first rates 
high on scope but low on domain. It is the form 
taken by the regional federation, which gives much 
in terms of scope to its members but is exclusive 
in terms of membership; a leading example is the 
European Economic Community. The second model 
rates high on domain but low on scope. It is the 
form taken by the functionally specific organiza- 
tion, which sets no limits, at least in principle, to 
the number and type of people whose needs it may 
serve but is able to do this only because both the 
needs and the type of service provided are of a 
limited type; an example would be any specialized 
agency of the UN. 

A true world system has to rate high on both of 
these dimensions because it cannot exclude any 
class of units, whether nations or individuals, if it 
wants to be universal, and it also has to offer a 
wide variety of goods and values if it is to be seen 
as a source of gratification. Thus, the two models 
just described must be seen as steps on the road 
toward a closer analogy with the nation-state 
(Œ. Haas 1964). The world system would rate high 
on domain and high on scope; it would be a true 
world state or federation, which excluded nobody 
and tried to satisfy a wide variety of needs that 
were formerly satisfied at subordinate levels 
(Deutsch 1966). No agreement exists on whether 
the regional or the functionally specific model bears 
more promise as a step toward this full-fledged 
world system. There is the pragmatic point of view 
that the regional model is needed as a training 
ground in systems that are low on scope, and the 
functionally specific model in systems that are low 
on domain, But there are also the arguments that 
the bigger the unit, in terms of number of mem- 
bers, the bigger will be the wars involving that unit 
and that regionalism is likely to unite upper-class 
nations (because they have the most interaction 
experience) and thus contribute to a feudal world 
pattern. 

Nature of compliance systems. A political sys- 
tem is meaningless unless there is a relatively high 
degree of compliance with a high proportion of 
centrally decreed norms. According to Etzioni 
(1961), there are three basic types of compliance 
mechanisms: the normative, the contractual, and 
the coercive [see SOCIAL CONTROL, article on OR- 
GANIZATIONAL ASPECTS]. This typology can usefully 
be applied here. 
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Normative compliance means simply that there 
is an internalized desire to comply; behavior that is 
institutionally necessary is internalized as a need 
disposition in the personal system. The many sug- 
gestions for a world system based on this type of 
compliance focus on two main ideas: creating 
loyalty to the central agency on the grounds that it 
provides many services; and creating loyalty on a 
more ideological basis by a suitable use of symbols, 
such as the idea of the brotherhood of all men. A 
basic idea here is the democratic idea that nor- 
mative compliance is promoted by a pattern of 
representative and direct democracy on the world 
level; another idea is that of creating a “welfare 
world” after the model of the welfare state. 

Contractual compliance, by contrast, is built into 
the institutional structure by making compliance 
pay, just as an employee is motivated to perform 
his tasks because he knows he will receive a salary. 
In the same way, it is argued, nations and individ- 
uals can be motivated to perform services for the 
international community, provided that the latter 
gives them something in return. They may or may 
not combine this with a subjective feeling of 
loyalty; what is essential is that they should per- 
form according to a quid pro quo principle. The 
system is based on exchange, and disagreements 
can be resolved by mediation, arbitration, and 
adjudication, as well as by a suitable clearing sys- 
tem for the exchanges. 

Coercive compliance is, of course, compliance 
based on the use or threat of force, especially 
against members defined as aggressors. At the in- 
ternational level the model is the deviance—detec- 
tion—conviction—adjudication-sanctioning scheme 
borrowed from the control systems of national so- 
cieties. International peace based on coercive com- 
pliance is enforced by such institutions as those 
provided for in the UN Charter: observer corps, 
peace-keeping forces of different kinds, the World 
Court, and sanctions of all kinds built into a sys- 
tem of international law. 

Particularly important is the search for sanc- 
tions, whether economic or diplomatic, that stop 
short of war (Galtung 1965). This is the legal ap- 
proach to the problem of international conflict 
management; it presupposes a nucleus of global 
institutions and is inconceivable at the purely 
international level unless one nation takes upon 
itself the task of acting as a third party in the inter- 
national system. The problems of the legal ap- 
proach, which is essentially an effort to obtain 
predictability by codifying international behavior, 
can also be analyzed in terms of the three modes 
of compliance. Thus, for legal rules to be adhered 


to, normative compliance is not strictly necessary, 
but some element of internalization built around 
important symbols in the system is an important 
positive contribution. To obtain contractual com- 
pliance, legal rules must be equitable and reason- 
able; to coerce, they must be institutionalized by 
means of credible negative sanctions. 

To be implemented, all of these ideas need a 
central agency, whether it takes the form of the 
concert of Europe, the League of Nations, the 
United Nations, or some other form. The central 
agency will have to do what is needed to build upa 
basis for all three types of compliance, whether it 
takes the form of information or propaganda and 
manipulation of symbols, the administration of 
services in such a way as to buy loyalty, provision 
of a clearinghouse of exchanges of all kinds in 
order to make interdependence under a central 
organization pay, or the administration of en- 
forcement mechanisms. Crucial for all three types 
of compliance is the extent to which the cen- 
tral world agency is able to compete with other 
levels of organization, such as nations, which 
provide the same kind of basis for compliance but 
possibly also promote compliance with norms that 
conflict with those of the world government. It 
should also be pointed out that any political system 
will probably need all three types of compliance. 
Normative compliance alone may not be enough 
in the long run; the value of a “good conscience 
will show a rapidly diminishing return. The ight 
behavior must somehow be made to pay, and if the 
system runs out of resources for rewards, it may 
have to resort to force. But force without some 
basis in normative commitment is tantamount to 
terror, and terror is notoriously ineffective in the 
long run. 


Peace movements 


The tremendous disparity between the different 
approaches to peace that have been described ie 
be interpreted as a sign of basic confusion in oa 
ing on the topic, But it is more likely to be a Te d 
tion of the complexity of the problem itself. It R 
be that, in this respect, peace is somen i 
health: the phenomenon is extremely a 
grasp as a whole, and one’s approach there a 
tends to be determined by the kind of rea 
health—he is interested in obtaining, Clearly, at 
are good reasons for these differences of ee sos 
in fact, a peace plan can be classified not on. ss, 
cording to its content but also according P an- 
put it forward. This may be a person or an o 
ization, located either in the decision-™ of a 
nucleus of the world system or in the center 


society or on the periphery of a society. If the last 
is the case, the proposal is likely to bear some of the 
imprints of marginality: an absolutist and moral- 
istic, as opposed to a gradualist and pragmatic, ap- 
proach; a tendency toward single-factor, as opposed 
to multiple-factor, thinking; and a tendency to con- 
fuse organizational levels, so that the training and 
capacity of the plan’s author are made to seem 
more important than the possible merits of the 
plan itself (Galtung 1966). Thus, psychologists 
will talk about personality and aggression, sociol- 
ogists about conflict-inducing social structures, ed- 
ucators about the school as the pivotal element in 
peace building—and the result is the well-known 
collection of single-factor theories, around which 
one may construct an organization and rally to- 
gether people who share the belief. Thus, almost 
all the theories referred to above have found their 
organizational expression. 

On the other end are the decision makers in 
the center of society, who have a gradualist, prag- 
matic approach and employ multiple-factor think- 
ing. In their effort not to confuse organizational 
levels their approach becomes so slow, so careful, 
and so withdrawn from the public eye, that the 
tension between center and periphery becomes high 
enough to produce impatience and conflict, lead- 
ing to demonstrations, party formation, and other 
forms of protest. The result is the traditional dia- 
logue between center and periphery. The periphery 
gradually becomes like the center as it gains in 
power and leaves a certain imprint on the total 
System; however, in turn, a new protest movement 
is created at the periphery. In other words, peace 
movements, like other social movements, follow 
the “church-sect” cycle, Since the periphery—the 
peace movement—is split into single-factor organ- 
izations, the world does not get a multiple-approach 
impact from the movement. 

Historical development. The fact that there are 
around 1,500 groups working for disarmament and 
peace (International Peace/Disarmament Direc- 
tory [1962] 1963) suggests, among other things, 
an enormous democratization of the struggle for 
peace; even though their total membership may be 
small, these organizations do provide open forums 
for expressions of public opinion. Such forums 
are relatively new. Although the organized peace 
Movement can be said to have begun in the United 
States in 1815 (Angell 1935), it was not until 
1843 that the first international peace congress was 
held (in London). The Nobel peace prize (instituted 
in 1897) also stimulated much interest in the 
problem of peace. The history of the award repays 
Study. The Nobel Foundation Calendar of 1963- 
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1964 (see Nobelstifteltsen, Stockholm, Prix Nobel) 
listed 59 prizes, of which 13 went to the United 
States, 8 to France, and 7 to Britain; thus, 3 coun- 
tries took 46 per cent of the prizes, 12 other coun- 
tries took 40 per cent, and the remaining 8 prizes 
went to institutions, such as the Red Cross. There 
is thus a Western bias in the geographical distri- 
bution of the Nobel prizes, and it is not surprising 
that the socialist countries have instituted their 
own prizes for peace, the best known being the 
Lenin prize. 

The peace movement received a terrible shock 
when World War 1 proved how much stronger was 
attachment to the nation than adherence to inter- 
nationalist and pacifist principles and how much 
stronger was the fear of the sanctions of one’s own 
government and compatriots than of the sanctions 
of fellow members of organizations. The resolu- 
tions passed at the Universal Peace Congress in 
Geneva in 1912 and at the congress of the Second 
Socialist International in Stuttgart in 1907 were 
strongly pacifist, but the weakness of the peace 
movement then is also its weakness today: loyalty 
to the peace movement is based on normative com- 
pliance alone, not on contractual or coercive com- 
pliance. In times of crises, only extremely idealistic 
or very peripherally located people are likely to 
remain faithful to their ideals. World War 11 pro- 
vided the peace movement with a new abundance 
of examples of this fact. However, this does not 
mean that the peace movement has no impact. 
Indeed, it serves as an imperfect substitute for 
a foreign-policy national assembly, since public 
opinion probably has less influence on foreign- 
policy than on domestic-policy decisions in many 
countries. 

Peace research. Since the late 1950s and early 
1960s there has been an intensification of the more 
academic study of peace and a drive toward pro- 
fessionalization of the peace movement (Galtung 
1963). The Repertory on Disarmament and Peace 
Research Institutions lists close to one hundred 
institutions in twenty countries as being active in 
the field; and two scientific quarterlies, the Journal 
of Conflict Resolution (edited at the Center for 
Conflict Resolution, University of Michigan) and 
the Journal of Peace Research (edited at the Inter- 
national Peace Research Institute in Oslo), are de- 
voted to research in this interdisciplinary field. 

One major difficulty with peace research is the 
problem whether the future will be a continuous 
extrapolation of the past or qualitatively different. 
For instance, it can be argued that the model for 
the disarmament process currently under discus- 
sion—the model that presupposes balance and con- 
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trol—probably has few, if any, counterparts in the 
history of the past. Another set of models for peace 
preservation, namely, balance-of-power models and 
collective security systems, have shown very severe 
limitations in the past. But can one assume that if 
a structure has not appeared in the past, it is be- 
cause it is not viable or that if a structure has failed 
in the past, it will also fail in the future? Or if one 
studied how city-states and nation-states have suc- 
cessfully reduced their armaments in the past 
(probably more through normative and contractual 
than through coercive compliance), could this be 
a guide to future action? Acceptance of this simple 
inductive approach might mean that peace re- 
search would become merely a scientific-sounding 
pretext for imposing the past on the future. 


JOHAN GALTUNG 


[Directly related are the entries DISARMAMENT; INTER- 
NATIONAL CONFLICT RESOLUTION; PACIFISM; WAR, 
article on THE STUDY OF WAR. Other relevant mate- 
rial may be found in Conriict; DipLomacY; FOR- 
EIGN POLICY; SYSTEMS ANALYSIS, article on 1N- 
TERNATIONAL SYSTEMS; and in the biography of 
RICHARDSON. ] 
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PEARSON, KARL 


Karl Pearson, “founder of the science of ea 
tics,” was born in London in 1857 and died 4 
Coldharbour in Surrey, England, in 1936. T 

Pearson’s father, William Pearson, was ‘ehan 
rister, Queen’s Counsel, and a leader in the a 
cery courts. He was a man of great ability, ke 
exceptional mental and physical energy and A son 
interest in historical research, traits which his 
also exhibited. 


An incident from Pearson’s infancy, which Julia 
Bell, his collaborator, once related, contains in 
miniature many of the characteristics which 
marked his later life. She had asked him what was 
the first thing he could remember. He recalled that 
it was sitting in a highchair and sucking his thumb. 
Someone told him to stop sucking it and added 
that unless he did so, the thumb would wither 
away. He put his two thumbs together and looked 
at them a long time. “They look alike to me,” he 
said to himself. “I can’t see that the thumb I suck is 
any smaller than the other. I wonder if she could 
be lying to me.” Here in this simple anecdote we 
have rejection of constituted authority, appeal to 
empirical evidence, faith in his own interpretation 
of the meaning of observed data, and, finally, im- 
putation of moral obliquity to a person whose judg- 
ment differed from his own. These characteristics 
were prominent throughout his entire career. (The 
chief source of information about Pearson's early 
life is a 170-page memoir written immediately after 
his death by his son, Egon S. Pearson [1938], also 
a distinguished statistician. ) 

In Pearson’s early educational history there are 
indications of a phenomenal range of interests, 
unusual intellectual vigor, delight in controversy, 
the determination to resist anything which he con- 
sidered misdirected authority, an appreciation of 
scholarship, and the urge to self-expression, but 
there is almost no suggestion of any special lean- 
ing toward those studies for which he is now chiefly 
remembered. 

In 1866 he was sent to University College School, 
London, but after a few years was withdrawn for 
reasons of health. At the age of 18 he obtained a 
scholarship at King’s College, Cambridge, being 
placed second on the list. 

In an autobiographical note entitled “Old Tripos 
Days at Cambridge” (1936), published shortly be- 
fore his death, he wrote of those undergraduate 
days as some of the happiest of his life. “There 
Was pleasure in the friendships, there was pleasure 
in the fights, there was pleasure in the coaches’ 
teaching, there was pleasure in searching for new 
lights as well in mathematics as in philosophy and 
religion.” His tutor was Edward J. Routh, consid- 
ered by some the most successful tutor in the his- 
tory of Cambridge, a man for whom he developed 
a real affection, Pearson used to speak of the stim- 
ulation received from his mathematics teachers, 
Routh, Burnside, and Frost, and described contacts 
with other distinguished persons. He gave an 
amusing account of an examination held on four 
ae in the homes of the four examiners: George 

abriel Stokes, whom he venerated as the greatest 
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mathematical physicist in England and one of the 
two best lecturers he had ever known; James Clerk 
Maxwell, another great physicist but a poor lec- 
turer; Arthur Cayley, lawyer and mathematician, 
inventor of the theory of matrices and the geom- 
etry of n-dimensional space; and Isaac Todhunter, 
who had by that time published his History of the 
Mathematical Theory of Probability. The examina- 
tion paper set by Todhunter provided a turning 
point in Pearson’s career. A demonstration which 
he submitted in this examination was attached, 
with an approving comment, by Todhunter to the 
unfinished manuscript of his History of the Theory 
of Elasticity (1886-1893). After Todhunter’s death 
Pearson was invited to finish and edit this History. 
This task was the beginning of his vital associa- 
tion with the Cambridge University Press, whose 
proofs were, for the next half century, rarely 
absent from his writing table. 

Besides mathematics and the theory of elasticity, 
his interests during his Cambridge years included 
philosophy, especially that of Spinoza; the works 
of Goethe, Dante, and Rousseau, which he read 
in the original; the history of religious thought; and 
a search for a concept of the Deity that would be 
consistent with what he knew of science. Deeply 
concerned with religion but resenting coercion, he 
challenged the university authorities, first by his 
refusal to continue attendance at compulsory di- 
vinity lectures and then by his objection to com- 
pulsory chapel. He won both fights, and the uni- 
versity regulations were altered, but he continued 
to attend chapel on a voluntary basis. 

After taking his degree with mathematical 
honors at Cambridge in 1879, he read law at Lin- 
coln’s Inn and was called to the bar in 1881. There 
followed travel in Germany and a period of study 
at the universities of Heidelberg and Berlin, where 
he balanced the study of physics with that of meta- 
physics; of Roman law with the history of the 
Reformation; of German folklore with socialism 
and Darwinism. After returning to England he was 
soon lecturing and writing on German social life 
and thought, on Martin Luther, Karl Marx, Mai- 
monides, and Spinoza, contributing hymns to the 
Socialist Song Book, writing papers in the field 
of elasticity, teaching mathematics in King’s Col- 
lege, London, and engaging in literary duels with 
Matthew Arnold and the librarians of the British 
Museum, 

One of the friendships which had a deep in- 
fluence on his life was with Henry Bradshaw, li- 
prarian of Cambridge University, to whom Pearson 
referred in “Old Tripos Days” as “the man who 
most influenced our generation.” In a speech, he 
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described Bradshaw as “the ideal librarian, but 
something greater—the guide of the young and 
foolish” and added that the librarian showed him 
what the essentials of true workmanship must be. 
So deep was their friendship that Bradshaw could 
reprove the younger man for excessive ardor and 
lack of wisdom in intellectual controversy. 

His first publication, at 23 years of age, was a 
little book, which must have been largely auto- 
biographical, entitled The New Werther (1880), 
written in the form of letters from a young man 
named “Arthur” to his fiancée. It foreshadows The 
Ethic of Freethought (1888) and The Grammar 
of Science (1892). Arthur writes: 


I rush from science to philosophy, and from philos- 
ophy to our old friends the poets; and then, over- 
wearied by too much idealism, I fancy I become 
practical in returning to science. Have you ever at- 
tempted to conceive all there is in the world worth 
knowing—that not one subject in the universe is 
unworthy of study? The giants of literature, the mys- 
teries of many-dimensional space, the attempts of 
Boltzmann and Crookes to penetrate Nature's very 
laboratory, the Kantian theory of the universe, and 
the latest discoveries in embryology, with their won- 
derful tales of the development of life—what an 
immensity beyond our grasp! . . . Mankind seems on 
the verge of a new and glorious discovery. What New- 
ton did to simplify the various planetary motions must 
now be done to unite in one whole the various isolated 
theories of mathematical physics. (Quoted in Egon S. 
Pearson, 1938, p. 8) 


All that the young writer wanted was a complete 
understanding of the universe. Thirty years later, 
the first issue of Biometrika carried as its frontis- 
piece a picture of a statue of Charles Darwin with 
the words: Ignoramus, in hoc signo laboremus. 
Those five words ring out like the basic theme of 
Pearson’s life: “We are ignorant; so let us work.” 

In 1884 Pearson became professor of applied 
mathematics and mechanics at University College, 
teaching mathematics to engineering students as 
well as courses on geometry. He occupied himself for 
the next few years with writing papers on elasticity, 
completing Todhunter’s History of the Theory of 


Elasticity, lecturing on socialism and free thought, . 


publishing The Ethic of Freethought, writing in 
German (Die Fronica 1887) a historical study of 
the Veronica legends concerning pictures of Christ, 
collecting material on the German passion play 
which later formed the substance of The Chances 
of Death (1897), completing a book called The 
Common Sense of the Exact Sciences (see 1885), 
which had been begun by W. K. Clifford, and tak- 
ing an active part in a small club whose avowed 


purpose was to break down the convent 
riers which prevented free discussion of 
tions between men and women. 

In 1890 Pearson was invited to lect 
ometry at Gresham College, with freedom 
the subject matter on which he would lec 
March 1891 he delivered his first course 
lectures on “The Scope and Concepts of 
Science.” In 1892 he published the first 
The Grammar of Science, and in 1893 he 
article on asymmetrical frequency curves 
It is apparent that a very important 
taken place in his concepts of scientific 
that he had reached a new conviction 
statistical aspects of the foundations of kno 
that problems of heredity and evolution 
quired a new urgency, and in short, that a) 
change had taken place in his profess 

Influences on Pearson's thinking. 
W. F. R. Weldon was appointed to the 
biology at University College. Weldon wi 
acquainted with Francis Galton and en 
statistical research. In 1890 Weldon pul 
paper on variations in shrimp, in 189: 
correlated variations, and in 1893 a thi 
which contains the sentence “It cann 
strongly urged that the problem of a 
tion is essentially a statistical problem 
that point of view was heresy. The import 
science of the intense personal friend: 
soon sprang up between Pearson and Weldo 
both in their early thirties, can scarcely 
gerated, Weldon asked the questions 
Pearson to some of his most significant | 
‘tions. Weldon’s sudden death from pneu! 
the age of 46 was a heavy blow to sciem 
great personal tragedy to Pearson. y 

In 1889 Galton, then 67 years old, P 
Natural Inheritance, summarizing his res 
between 1877 and 1885 on the subject 
sion. This work moved Weldon to undert 
studies of regression in biological popula 
moved Pearson to arithmetical researches 
minated in 1897 with the famous producta 
correlation coefficient r. The elaboration 
relations led, in other hands than Pearson's, 
diverse statistical inventions as factor an 
the analysis of variance. The stimulation 
received from Galton and the devotion he ! 
ward the older man show on every page 
four volumes of The Life, Letters and L 
Francis Galton (1914-1930), one of the y 
great biographies. On this work of over 
quarto pages and about 170 full-page pla! 


son lavished some twenty years of work and much 
of his personal fortune. 

Development of a science of statistics. The 
year 1890 represented not only a turning point in 
Pearson’s career; it marked the beginning of the 
science of statistics, Antedating this development 
and preparing the way for it had been a long period 
of slowly increasing interest in the statistical way 
of thinking. In 1890 this interest was still sporadic, 
restricted in scope, and shared by very few people. 
It exhibited itself primarily in the collection of 
such public statistics as population data, vital sta- 
tistics, and economic data. It was also evident in 
actuarial work and in the adjustment of observa- 
tions in astronomy and meteorology, particularly 
least squares adjustment. Outside these areas, this 
development was hampered not only by lack of 
interest but also by paucity of data and the ab- 
sence of adequate theory. Statistical theory was 
almost entirely that which had been developed by 
the great astronomers and mathematicians con- 
cerned with mathematical probability related to 
errors of observation. It related chiefly to the bi- 
nomial distribution or the normal distribution of a 
single variable. 

The gathering of public statistics by govern- 
ments and semipublic agencies was well estab- 
lished. After about 1800 most of the industrialized 
countries had instituted the official national census. 
Several nongovernmental societies had been set up, 
chiefly for the purpose of improving the quality of 
public statistics, for example, the Statistical Society 
of London (now the Royal Statistical Society) in 
1834 and the American Statistical Association in 
1839, Actuarial work had become a fairly well- 
developed and respected profession. The 25-year 
period from 1853 to 1878 was the era of the great 
international statistical congresses. Economic sta- 
tistics moved ahead greatly in this period, with 
notable improvements in methods of gathering 
data, Governments were beginning to take physical 
Measurements of their soldiers and were making 
these data available to anthropometrists. 

Among Pearson’s predecessors were men who 
made significant contributions to the mathematical 
theory of probability in relation to gambling prob- 
lems, but they never tested that theory on data 
and never proposed its application in any other 
area, Early theorists of this kind were Pierre de 
Fermat, Blaise Pascal, Christian Huygens, and 
Abraham de Moivre. Other mathematicians wrote 
ial the possible application of probability theory 
0 social phenomena, but they had no data: Jakob 
(Jacques) Bernoulli wrote on such possible appli- 
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cation to economics; Daniel Bernoulli on inocula- 
tion as a preventive of small pox; and Niklaus 
(Nicholas) Bernoulli, Condorcet, and Poisson, 
among others, on the credibility of testimony and 
related legal matters. Before 1800 William Playfair 
had invented the statistical graph and published 
many beautiful statistical charts from quite du- 
bious data. 

None of these men, then, either cared to test 
his theories on data or had appropriate data to 
work with, and contrariwise, many other men 
worked in statistical agencies tabulating data with 
very little idea of how to analyze them. 

Two groups of persons, the actuaries and the 
mathematical astronomers, possessed both mathe- 
matical acumen and relevant data for testing the- 
ory, but neither group proposed a general statis- 
tical approach outside of its own field. The great 
mathematical astronomers of the first half of the 
nineteenth century, notably Laplace and Gauss, 
did lay the foundations for modern statistical the- 
ory by developing the concept of errors of obser- 
vation and an impressive accompanying mathe- 
matical theory, and the ferment of ideas which 
they stimulated spread over Europe. Important 
contributions were made by Friedrich Wilhelm 
Bessel and Johann Franz Encke in Germany; Gio- 
vanni Plana in Italy; Adrien Marie Legendre, 
Poisson, Jean Baptiste Fourier, Auguste Bravais, 
and “Citizen” Kramp in France; Quetelet in Bel- 
gium; George Biddell Airy and Augustus De Morgan 
in England; and Thorwald Nicolai Thiele in Den- 
mark, Only a very few persons before Pearson had 
thought of the statistical analysis of concrete data 
as a general method applicable to a wide range of 
problems; one such was Cournot, whose extensive 
writing, both on the theory of chance and on such 
matters as wealth and supply and demand, laid 
the foundations for mathematical economics. And 
more than any other person in the nineteenth 
century, Quetelet brought together mathematical 
theory, the collection of official statistics, and a 
concern for practical problems and fused the three 
into a single tool for studying the problems of life. 
Finally, of course, there was Galton, whose work 
had a great impact on Pearson and whose close 
friendship with him had an incalculable influence. 

Although these men had put a high value on 
concrete data, the amount of data to which they 
had access was paltry beside what soon began to 
be collected by Pearson and his associates. Pearson 
always insisted on publishing the original data as 
well as the statistics derived from them. His pri- 
mary aim was to develop a methodology for the 
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exploration of life, not the refinement of mathe- 
matical theory. Whenever he developed a new piece 
of statistical theory, he immediately used it on data, 
and if his mathematics was cumbersome, this did 
not concern him. 


Major contributions 


Frequency curves. One of the problems on 
which Pearson spent a great deal of time and en- 
ergy was that of deriving a system of generalized 
frequency curves based on a single differential 
equation, with parameters obtained by the method 
of moments. Quetelet seems to have believed that 
almost all social phenomena would show approxi- 
mately normal distributions if the number of cases 
could be made large enough. Before 1890 J. P. 
Gram and Thiele in Denmark had developed a 
theory of skew frequency curves. After Pearson 
published his elaborate and extremely interesting 
system (1894; 1895), many papers were written 
on such related topics as the fitting of curves to 
truncated or imperfectly known distributions and 
tables of the probability distribution of selected 
curves. 

Chi-square. Having fitted a curve to a set of 
observations, Pearson needed a criterion to indi- 
cate how good the fit was, and so he invented 
“chi-square” (1900). Quetelet and others who 
wanted to demonstrate the closeness of agreement 
between the frequencies in a distribution of ob- 
served data and frequencies calculated on the as- 
sumption of normal probability merely printed 
the two series side by side and said, in essence, 
“Behold!” They had no measure of discrepancy and 
were apparently not made uncomfortable by the 
lack of such a measure. Pearson not only devised 
the measure but he worked out its distribution and 
had it calculated. He himself never seems to have 
understood the concept of degrees of freedom, 
either in relation to chi-square or to his probable- 
error formulas. Yet chi-square is an enormously 
useful device with a range of applications far 
greater than the specific problem for which it was 
created, and it occupies an important position in 
modern statistical theory. 

Correlation. The idea of correlation is due to 
Galton, who published a paper entitled “Co-rela- 
tions and Their Measurement Chiefly From An- 
thropometric Data” in 1880 and another entitled 
“Regression Towards Mediocrity in Hereditary 
Stature” in 1885, and who gave a more widely read 
statement in Natural Inheritance (1889). The 
mathematics of the normal correlation surface had 
been derived earlier in connection with errors made 
in estimating the position of a point in space. In 


1808 Adrain gave the first known derivation of 
the probability that two such errors will occur to- 
gether but dealt with uncorrelated errors only, The 
density function for two related errors was given 
by Laplace in 1810 and for n related errors by 
Gauss in 1823 or perhaps earlier. Plana in 1819, 
studying the probability of errors in surveying, and 
Bravais in 1856, that of errors in artillery fire, each 
obtained an equation in which there is a term 
analogous to r. Being concerned about the prob- 
ability of the occurrence of error and not with the 
strength of relationship between errors, these men 
all studied the density function and paid no atten- 
tion to the product term in the exponent, which is 
a function of r. They applied their findings only 
to errors of observation, and the relation of their 
work to the correlation surface was noted only long 
after the important works on correlation had been 
written. In a study made in 1877 of the height, 
weight, and age of 24,500 Boston school children, 
Henry Pickering Bowditch published curves show- 
ing the relation of height tc weight but missed 
discovering the correlation between two variables. 

Galton had been seriously hampered in his study 
of correlation by both lack of data and lack of an 
efficient routine of computation. He did have data 
on sweet peas and on the stature of parents and 
adult offspring for two hundred families. While 
Pearson began to lecture on correlation, Weldon 
began to make measurements on shrimp for cor- 
relation studies. 

In Pearson’s first fundamental paper on correla- 
tion, entitled “Regression, Heredity and Panmixia 
(1896), he generalized Galton’s conclusions and 
methods; derived the formula which we now call 
the “Pearson’s product moment” and two other 
equivalent formulas; gave a simple routine es 
computation which could be followed by a person 
without much mathematical training; stated ae 
general theory of correlation for three variables: 
and gave the coefficients of the multiple regression 
equation in terms of the zero-order correlation co- 
efficients. p 

There followed a series of great memoi me 
various aspects of correlation, some by ee 
associates of Pearson, such as G. Udny Yule oie 
W. F. Sheppard, but most of them from his © a- 
hand. These dealt with such matters aS one T- 
tion in nonnormal distributions, tetrachoric 7 oa 
relation between ranks, correlation when ma 
both variables are not scaled or regression is të 
linear. There were many papers presenting jety 
results of correlation analysis in a great v e 
of fields. A large amount of labor went of these 
derivation of the probable error of each 0 


rs on 


various coefficients and the tabulation of various 
probabilities related to correlation. It is fitting 
that the product-moment correlation coefficient is 
named the “Pearson r.” 

Individual variability. Variation among errors 
made in observations on the position of a heavenly 
body had been studied extensively by the great 
mathematical astronomers. The list of those who 
before 1850 had written on the “law of facility of 
error,” derived the formula for the normal curve, 
and compiled probability tables would be a long 
one. The term “probable error” had come into wide- 
spread use within a few years after Bessel em- 
ployed the term der wahrscheinliche Fehler in 1815 
in a paper on the position of the polar star. 

The concept of true variability among individ- 
uals is very different from the concept of chance 
variation among errors in the estimation of a single 
value. The idea of individual variability is promi- 
nent in the writings of Quetelet, Fechner, Ebbing- 
haus, Lexis, Edgeworth, Galton, and Weldon, but 
it was not commonly appreciated by other scien- 
tists of the nineteenth century. Pearson’s emphasis 
upon this idea is one of his real contributions to 
the understanding of life. In that first great paper 
on asymmetrical frequency curves (1894) he in- 
troduced the term “standard deviation” and the 
symbol ø, and he consistently used this term and 
this symbol when discussing variation among indi- 
viduals. However, when writing about sampling 
variability, he always used the term “probable 
error,” thus clearly distinguishing variability due 
to individual differences from variability due to 
chance errors. 

Probable errors of statistics. Pearson himself 
probably considered that one of his greatest con- 
tributions was the derivation of the probable errors 
of what he called “frequency constants” and vari- 
ous tables to facilitate the computation of such. 
His method, already well known in other connec- 
tions, was to write the equation for a statistic, take 
the differential of both sides of that equation, 
Square, sum, and reduce the result by any algebraic 
devices he could think of. The process of reduction 
was often formidable. Even though he was not 
much concerned with the distinction between a 
Statistic and its parameter and he frequently used 
the former in place of the latter, these probable 
errors marked a great advance over the previ- 
a lack of any measure of the sampling varia- 
We most statistics. In this era new statistics 
is re being proposed on every side, and the amount 

energy which went into the derivation of these 
ie errors was tremendous. With the success- 

search for exact sampling distributions that has 
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been under way ever since the publication of 
Student’s work in 1908 and R. A. Fishers 1915 
paper on the sampling distribution of the correla- 
tion coefficient, better methods than Pearson’s prob- 
able error formulas could provide are now in many 
cases available. 

Publication of tables. An editorial in the first 
number of Biometrika (unsigned, but always at- 
tributed to Pearson) referred to the urgent need 
for tables to facilitate the work of the statistician 
and biometrician and promised that such tables 
would be produced as rapidly as possible. Such 
tables as were then available were in widely scat- 
tered sources, some of them almost impossible to 
obtain. By 1900, tables of the binominal coeffi- 
cients, the trigonometric functions, logs, and anti- 
logs were readily available. A large table of the 
logs of factorials computed in 1824 by F. C. Degen 
was almost unknown. There were tables of squares, 
cubes, square roots, and reciprocals, of which 
Barlow’s is the best known, and there were multi- 
plication tables by Crelle and by Coatsworth. 
Legendre had published a table of logarithms of 
the gamma-function, but copies were very scarce. 
The normal probability function had been ex- 
tensively tabulated but always with either the prob- 
able error (.67450) or the modulus (a/ V2) as 
argument, never the standard error. Poisson had 
not tabulated the distribution which bears his 
name, but Bortkiewicz had done so in 1898 (in his 
Gesetz der kleinen Zahlen). 

A list of the tables which have been issued in 
Biometrika from its second issue in 1902 until the 
present or which have appeared in the separate 
volumes of Tables for Statisticians and Biometri- 
cians (1914) or in the Drapers’ Company Series 
of Tracts for Computers would be a very long one. 
Some of these tables are no longer used, others 
appear to be timeless in value, even after the ad- 
vent of the electronic computer. The Tables of the 
Incomplete Beta-function (1934) was among Pear- 
son’s last contributions to science, published when 
he was 78 years old. 


Controversies 

The frequent controversies in which Pearson 
was embroiled cannot be disregarded. In his youth 
he did battle for such unpopular radical ideas as 
socialism, the emancipation of women, and the 
ethics of free thought. A few years later he was 
involved in a long struggle for the unpopular idea 
that mathematics should be applied to the study 
of biology. Much bitterness arose over this ques- 
tion, and the Royal Society, while ready to accept 
papers dealing with either mathematics or biology, 
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refused to accept papers dealing with both. That 
refusal was one of the circumstances that led to 
the founding of Biometrika in 1900, and this in 
turn gave great impetus to the young sciences of 
biometry and mathematical statistics: now there 
was a journal in which mathematical papers on 
the biological sciences could be published. 

In 1904 Galton established the Eugenics Record 
Office to further the scientific study of eugenics. 
It became known as the Eugenics Laboratory two 
years later, when Galton turned it over to Pearson 
so that he might operate it in connection with his 
Biometric Laboratory. The Biometric Laboratory, 
whose existence Pearson dated back to 1895, was 
a center for training postgraduate workers in this 
new branch of exact science. In 1911 these two labo- 
ratories were united to form the department of ap- 
plied statistics in University College, with Pearson 
as its first professor. 

Beginning in 1907 the Eugenics Laboratory pub- 
lished numerous very substantial statistical papers 
on three of the most controversial issues of the 
day: pulmonary tuberculosis, alcoholism, and men- 
tal deficiency and insanity. These appeared in two 
series entitled “Studies in National Deterioration” 
and “Questions of the Day and of the Fray.” 

In contrast to the idea then current that tuber- 
culosis could be eradicated by improving the en- 
vironment, Pearson’s statistical studies indicated 
that the predisposition to tuberculosis was more 
hereditary than environmental and that there was 
no clear evidence that patients treated in sanatoria 
had a higher recovery rate than those treated else- 
where. 

Another common assumption at that time was 
that alcoholic parents produce children with mental 
and physical deficiencies. The first studies on this 
subject coming from the Eugenics Laboratory 
found no marked relation between parental alco- 
holism and the intelligence, physique, or disease 
of offspring (1910). Later papers concluded that 
alcoholism is more likely to be a consequence 
than a cause of mental defect. Pearson commented 
that “the time is approaching when real knowledge 
must take the place of energetic but untrained 
philanthropy in dictating the lines of feasible social 
reform” (quoted in Egon S. Pearson 1938, p. 61). 

After the American Eugenics Record Office an- 
nounced in 1912 that mental defect was almost 
certainly a recessive Mendelian character and ad- 
vised that “weakness in any trait should marry 
strength in that trait and strength may marry 
weakness,” Pearson or his associates marshaled 
statistical evidence to refute this pronouncement. 


Each time Pearson took up such an issue, the 
reaction of medical authorities and public officials 
was angry and violent, and their personal attacks 
on Pearson were prolonged and vituperative; open 
conflict also developed between the more tradi- 
tional Eugenics Education Society, of which Galton 
was honorary president, and the Eugenics Labora- 
tory, of which he had been the founder, 

The young sciences of biometry and statistics 
may well have profited from these major struggles 
with organized groups that allowed them to break 
the restraining bonds of apathy, of ignorance, of 
entrenched authority. Pearson was something of a 
crusader, and among the qualities a crusader needs 
are self-confidence, the courage to fight for his con- 
victions, and a touch of intellectual intolerance. 
He was a perfectionist and had scant patience with 
ideas or work which he considered incorrect, More- 
over, he was trained for a legal career and from 
childhood had in his father the example of a suc- 
cessful trial lawyer. However, his first thought 
was to get at the truth, and, if intellectually con- 
vinced of an error, Pearson was ready to admit it 
He once published in Biometrika a paper called 
“Peccavimus” (“We Have Erred”). 


Although Pearson made contributions to statis- 
tical technique that now appear to be of enduring 
importance, these techniques are of less impot- 
tance than what he did in rousing the scientific 
world from a state of sheer uninterest in statistical 
studies to one of eager effort by a large number 
of well-trained persons, who developed new theory, 
gathered and analyzed statistical data from every 
field, computed new tables, and re-examined the 
foundations of statistical philosophy. This is an 
achievement of fantastic proportions. His labora- 
tory was a world center in which men from p 
countries studied. Few men in all the history © 
science have stimulated so many other people to 
cultivate and to enlarge the fields they themselves 
had planted. He provided scientists with the con: 
cept of a general methodology underlying s 
science, one of the great contributions to mode. 


thought. 
* Heren M. WALKER 


[For the historical context of Pearson's elie: 
STATISTICS, article on THE HISTORY OF SrA sr 
METHOD; and the biographies of the Bene 
FAMILY; CONDORCET; COURNOT; FISHER, R. us 
Ton; Gauss; LAPLACE; MOIVRE; POISSON; 
for discussion of the subsequent developme ANAL- 
ideas, see GOODNESS OF FIT; MULTIVARIATE STA- 
YSIS, articles on CORRELATION; NONPARAMETRIC 


tistics, article on RANKING METHODS; STATISTICS, 
DESCRIPTIVE, article on ASSOCIATION; and the biog- 


raphy of YULE.] 
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PEASANTRY 


Studies of the peasantry in different places and 
in different eras have been made by historians, so- 
ciologists, economists, anthropologists, and other 


504 PEASANTRY 


scholars, all of whom use a wide variety of defini- 
tions and concepts. Some writers have employed 
the term “peasant” to characterize entire societies; 
others have dealt with the peasantry as a part- 
society within a larger whole. Prevailing practice 
includes analysis of peasant behavior at the levels 
of whole social systems, nations, sectors, villages, 
households, and individual cultivators. 

Peasants are usually seen as forming part of a 
structured society, within which they fall between 
the aristocracy or great landholders, on the one 
hand, and the landless, on the other. However, 
this definition has been stretched at both ends. 
Some writers include among peasants groups of 
cultivators with no class of landlords above them; 
others refer to landless peasantry. 

Again, peasants are normally conceived of as 
settled agriculturalists. Yet in tropical Africa, 
Central and South America, and parts of southeast 
Asia, there are groups of peasants who rely on 
shifting cultivation of the slash-and-burn type. 

With regard to the land which they till, the legal 
status of peasants may be that of proprietors, 
tenants, or crop sharers. They may or may not be 
free to leave the land. 

In a broad sense, the peasantry has constituted 
the most numerous social group in all organized 
states, from ancient to modern times, that have 
rested on traditional forms of agriculture. Even 
in western Europe during the throes of nineteenth- 
century industrialization the peasant population 
constituted the largest segment of society. The 
progressive emancipation of the peasantry from 
serfdom and other forms of bondage has furnished 
favored themes for discourses by politicians and 
treatises by historians. Even after World War 1 the 
agrarian problem, land reform, and peasant move- 
ments (“green risings”) continued to evoke public 
and scholarly interest. Most recently, however, the 
characteristic differences between urban and rural 
economic activity and social life have faded, and 
the peasantry as such has been in the process of 
disappearing. Meanwhile, in other areas—for ex- 
ample, sub-Saharan Africa—there are some indi- 
cations of a movement from tribal to peasant so- 
cieties, By contrast, the United States, Australia, 
and New Zealand have never had a peasantry. 

The ending of European rule over many parts 
of Africa and Asia has led to an enormous increase 
in the number of studies devoted to the peasantry 
of the so-called “underdeveloped” areas, le tiers 
monde. This literature is primarily of an applied 
nature and is concerned with efforts to modernize 
peasant agriculture. Among the chief subjects are 
reforms in systems of land tenure and other pre- 


requisites for rapid economic development. There 
are also a number of examples of fundamental 
research aimed at a comprehensive understanding 
of the structure and functioning of peasant society 
in these countries. 

Rather than trying to define peasants or peas- 
antry in any restrictive sense we shall indicate the 
main lines along which significant work on peas- 
ants has been carried forward since the 1920s, We 
shall quickly survey the issues which have been 
raised in relation to the peasantry in the following 
geographic and historical order: feudal Europe, 
modern Europe in the industrial age, tsarist Russia 
and the Soviet Union, Japan, Indonesia, and India. 
Turning to more general topics, we shall take up 
the economics of the peasantry with particular ref- 
erence to (a) recent “microanalysis,” (b) the posi- 
tion of the Russian school associated with the name 
of Chayanov, and (c) “subsistence” economics; 
then the political role of the peasantry, including 
its part in risings and revolutions; and, finally, the 
question of folk, or peasant, cultures. 

Feudalism. One of the richest sources of lit- 
erature about peasant societies continues to be the 
study of the agrarian aspects of feudalism in Eu- 
rope from the eleventh century to the fifteenth cen- 
tury. There has been a remarkable flow of studies 
in recent decades on the kinds and conditions of 
the peasant classes; their ties and obligations to the 
seigneurs or feudal lords above them; the interplay 
between the peasants’ work on their own fields and 
their work for the lord on his demesne or home 
farm; systems of field cultivation; why and to what 
extent village lands were periodically consolidated 
and redistributed among the cultivators; phases of 
emancipation of the peasants from bondage and 
periods of “re-enserfment”; peasant revolts; and zez 
lations of moneyed people from the towns with the 
peasantry, whether by way of moneylending, buy- 
ing of crops, or taking over of village land throug 
purchase or foreclosure. [See FEUDALISM; MANO 
RIAL ECONOMY, |] 

Age of industrialism. Another 
of research in recent years is tha 
position of the peasantry in Europe since the mi 
ing of the industrial revolution. Interest has © m 
tered on such subjects as the differences fror 
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many, by contrast, continued to be lands of small- 
scale peasant agriculture up through World War n. 

The application of modern biology and chemistry 
to farming led to what has been called an agricul- 
tural revolution, with unprecedented increases in 
productivity. The education, scientific training, 
technical knowledge, capital equipment, and finan- 
cial resources required for carrying on the new 
agriculture were obviously more accessible to the 
large farmers than to the petty peasantry. A ques- 
tion which has been much discussed is how so 
large a number of small peasants managed none- 
theless to remain in existence. The evidence sug- 
gests that they owed their survival to long hours 
of back-breaking toil and minimal levels of living. 

The small-scale peasant agriculture of the Low 
Countries was distinguished for its specialization 
and high technical proficiency. Denmark became 
famous for its rural education and cooperative 
movements. Rack rents, poverty, crop failures, de- 
pressed prices, crowding on the land, and subdivi- 
sion of holdings have been cited as factors in the 
great peasant migrations from Ireland, Germany, 
Scandinavia, and Italy. 

Russia and eastern Europe. In eastern Europe 
the progress of industry was slower and its impact 
on peasant society less. The aristocracy and large 
proprietors held on to enormous estates until the 
Russian Revolution in 1917, while the peasantry 
remained illiterate and backward in its agricultural 
techniques. 

In its efforts to assure a low-cost food supply for 
the urban working classes, the new Soviet regime 
came into conflict with the middle ranks of the 
peasantry as well as the more prosperous “kulaks.” 
The eventual solution in the form of a thorough- 
going, rapid collectivization of agriculture, carried 
out from 1928 to 1934, constituted the most trau- 
matic blow to the structure of peasant society in 
any country in modern history. The historical back- 
ground to collectivization, the campaign itself, and 
its consequences for Soviet society and economy 
may be studied in a vast literature of absorbing 
interest, under such heads as: class composition of 
the Russian peasantry in the 1920s; the character 
of collectivization and its advantages and disad- 
vantages in the light of subsequent agricultural 
Problems; role in a socialist economy of private 
garden plots, peasant family livestock, and free 
peasant markets. 

_After World War n the advent of socialist re- 
Bimes in the countries of eastern Europe initiated 
a series of campaigns designed to collectivize peas- 
pl ae These attempts were less drastic 

those in the Soviet Union but have nonethe- 
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less been pushed very far in Bulgaria, Hungary, 
Czechoslovakia, and Rumania. By contrast, they 
have been abandoned in Yugoslavia and Poland. 

Japan. Recent studies of Asia have largely em- 
phasized the present and the immediate past. The 
most reliable data and the widest range of studies 
of any peasant society in Asia pertain to Japan. A 
striking feature of Japan’s rural experience has 
been the impact, since 1945, of the great expan- 
sion of urban factory employment, with rising 
wages and improved amenities. The attraction of 
the towns for the country youth, especially the 
young men, has become very powerful. As they 
leave, the average age of the farm population moves 
up steeply. To indicate who does the work in rural 
areas today, the Japanese have coined the expres- 
sion “grandfather-grandmother—daughter-in-law 
agriculture.” The demand for hired labor has be- 
come so great that agricultural wages have set new 
records, To hold his laborers at the busy season the 
employer has to feed them well and speak to them 
politely. 

Even for small farms of barely one hectare peas- 
ant families in central Japan have been buying 
many small machines—multipurpose two-wheel 
tractors, chaff cutters, polishers, tiny pickup vans. 
Also, these families are each likely to have a tran- 
sistor radio, a television set, a washing machine, 
and a refrigerator. These items are acquired not 
solely for economic reasons but as part of a deter- 
mined effort to lighten labor and make farming 
more attractive to young people. What is happening 
is the industrialization of agriculture and the mod- 
ernization of rural life. Peasant agriculture and 
peasant society, in central Japan at least, are dis- 
appearing. 

Southeast Asia. In southeast Asia, Indonesia is 
the area which has been studied the longest and 
the most systematically. It was with reference to 
Indonesia that the theory of “dual economy” was 
developed at the beginning of the twentieth cen- 
tury. J. H. Boeke, in a series of writings from 1911 
on, argued that the peasant sector, or what in co- 
lonial times was called the “native sector,” lived a 
life apart, largely unaffected by the market-oriented 
modern urban sector. He held that the peasants 
zealously guarded their own values, institutions, 
and way of life. He believed that this situation was 
desirable on grounds of imperial policy and urged 
that it be encouraged as much as possible. [See 
ECONOMY, DUAL.] 

This theory of dual economy was sharply criti- 
cized by other Dutch authors for seriously under- 
estimating the impact of Dutch rule, foreign trade, 
and market-oriented production upon the peasant 
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sector. Nonetheless, Boeke’s point of view has been 
given wide popularity in the English-speaking world 
by the works of Furnivall (e.g., 1939). In his 
studies of Burma and Indonesia, Furnivall expand- 
ed Boeke’s theory of dual economy to a concept of 
“plural society,” which explicitly allowed for three 
or more main elements. In this way Furnivall pro- 
posed a framework for the relations between the 
Burmese peasants, the British rulers and business- 
men, and the Indian laborers and moneylenders in 
Burma; or, for Indonesia, between the Dutch, the 
Chinese, and the Indonesian peasantry. 

Although the term “plural society” is widely used, 
it does not, in the opinion of the present author, 
throw much light on the peasants covered by it. 
The compartments into which the main elements 
of society are divided turn out, in practice, to be 
much less watertight than Boeke or Furnivall would 
have us believe. What is more, almost all of the 
great societies that have existed in history have 
been composed of a number of principal elements 
among which economic or social relations may 
have been severely restricted. In this sense, “plural 
society” becomes almost a universal phenomenon 
of history. What is gained in our understanding of 
Norman England if we call it a “plural society”? 

India. Of all the countries in the nonsocialist 
world, India has the largest peasantry and the most 
deep-rooted social obstacles to agricultural develop- 
ment. The mixture of ancient Indian, Muslim, and 
British notions and practices about landholding has 
given the country a peculiarly complex structure 
of land tenure. There are no significant reserves of 
good land to be brought under cultivation, and a 
number of regions are already very densely popu- 
lated. Agriculture is largely dependent on the mon- 
soons, which are fickle. Many areas are chronically 
short of water. 

All economic and social institutions in the vil- 
lages are deeply affected by the divisions and sense 
of hierarchy connected with caste. This gives a low 
value to manual labor. In many regions of India 
those who do the bulk of the agricultural work are 
the most disadvantaged and the most looked down 
upon—the “Untouchables.” 

Since the attainment of national independence 
in 1947, there has been remarkably free debate as 
to what should be done for, with, or about the 
Indian peasantry, and how to implement the large 
number of governmental measures relating to the 
rural population. In addition to the more narrowly 
technical projects involving, for example, irriga- 
tion, use of artificial fertilizers, or improvement of 
seeds, there has been much land reform legislation, 
an impressive extension of cooperatives and local 
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self-government schemes, and a vast program of 
“community development,” designed to achieve the 
goal of over-all betterment of living conditions in 
the villages. Whatever the success of these various 
endeavors, there can be no doubt of the interest in 
the discussions generated at each phase of develop- 
ment. Since India can boast a number of first-class 
economists, there are a great many high-level anal- 
yses of peasant problems and suggested solutions, 

Meanwhile, the way of life of India’s peasants 
has been affected not only by the array of govern- 
mental actions undertaken for such purposes, but 
also by the great growth of industry and over-all 
modernization of national life since the launching 
of the first five-year plan, covering 1951-1956. The 
proliferation of factories and workshops and an 
immense building program—new factories, govem- 
ment offices, schools, housing, roads, bridges, dams 
—have given jobs or supplementary income to 
millions of peasants. 

Villagers are more aware of what is going on in 
the outside world, and they enjoy the spectacle of 
the city people coming to solicit their votes at elec- 
tion time. The hegemony of the upper-caste fam- 
ilies within the village has been challenged. In parts 
of south India the Brahman landlords have been 
humiliated and forced out of their holdings. 
Throughout the countryside the spread of educa- 
tion has brought Untouchables’ sons and in some 
areas even their daughters into the schools. The 
foundations of traditional peasant society in India 
are being shaken. 


Peasant economics 


Most of the studies of peasant economic behavior 
have been carried out by persons trained in the 
classical and neoclassical economics developed in 
England, on the Continent, and in the Ce 
States. Quite naturally, the economists broug! 
along with them the tools of their trade, the oa 
gories and concepts which they were used to Me 
ing with. The underlying assumption—made f 
plicit by a considerable number of wee af 
the prevailing economic theories and net a 
the Western world are universally applicable. a 
suitable modifications, the argument goes, they T 
be utilized to explain the behavior of indivi nh 
economic units in any society that has ever ie 
Thus Firth, in his well-known analysis a have 
Tikopians in Polynesia, first explains that they a 
no market, no money, no cash nexus, no Pie 
interest, and no “entrepreneur” class as SUC a i 
theless, in default of any other suitable i, opians 
as though they were entrepreneurs engaged i 


undertakings. If one man gets his neighbor to help 
him build a house, the first is taken as an entre- 
preneur who is employing the second. Similarly, 
Sol Tax writes of the petty traders of the Guate- 
malan highlands as “penny capitalists.” 

When the “farm business” method is applied to 
analysis of peasant agriculture, the peasant’s land 
and livestock, equipment, and other goods are 
equated with those of a small firm. The peasant’s 
behavior is then treated in terms of the theory of 
the firm as developed for business enterprises. It is 
taken for granted that the peasant’s aim is to 
rationalize his operations so as to obtain the max- 
imum profit. Accounts are drawn up for the agri- 
cultural year. The field work of the peasant’s wife, 
his children, his parents, and other relatives is 
evaluated at prevailing wages paid to hired la- 
borers. Receipts from the sale of farm products, 
including an estimate of the value of food kept for 
the family, are totaled. Against these are set the 
costs incurred for agricultural purposes, which 
have been carefully separated out from the ex- 
penses of the family as a consumption unit. 
These costs of production include working ex- 
penses, rent actually paid or calculated from the 
value of the land owned, interest that could other- 
wise have been earned on the capital invested, and 
wages imputed for family labor. If these costs turn 
out to be greater than the receipts, the farm is said 
to be operating at a loss, If this situation goes on 
year after year, it is said to be an uneconomic 
farm. The problem then becomes one of trying to 
explain how peasants in countries like India, for 
example, go on for decade after decade engaging 
in so-called “uneconomic farming.” 

This approach to peasant agriculture was chal- 
lenged in the opening decades of the twentieth 
century by a group of Russian economists. The fol- 
lowers of the “organization and production” school 
—Kablukov, Kosinskii, Chelintsev, Makarov, Stu- 
denskii, and Chayanov—argued that the assump- 
tion that the peasant family should be treated as 
if it were a business enterprise was unsound and 
misleading. The primary aim of the peasant family 
is to feed itself and somehow manage to make 
whatever payments are due to the landlord, the 
moneylender, the merchant, or the state. The mem- 
bers of the peasant family have to eat throughout 
the year, whether or not they work. The actual 
work which they perform cannot be measured in 
money. If the family’s consumption needs and the 
Other usual requirements are not satisfied, the 
members of the household will put in more labor, 
even if the additional product obtained is very small 
indeed. Similarly, the peasant family may rent or 
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buy land at a very high price which would not be 
justified, according to normal business standards, 
by the value of the output they could hope to obtain 
from it. The peasants do this in order to be able 
to use the surplus labor of the family, which is 
freely available without any extra cost. 

Chayanov and his colleagues insisted on treating 
the total returns garnered by family labor as indi- 
visible. They analyzed the decisions of the head of 
the peasant household in terms of a balance be- 
tween his judgment based on experience as to the 
total needs of the family and his subjective eval- 
uation of the drudgery involved in agricultural 
work. After 1930 the work of this school was cut 
short in Russia, and it seems to have had little in- 
fluence elsewhere except in Japan. 

Subsistence. A number of somewhat different 
approaches to the study of peasant economics have 
utilized “subsistence” as the key concept. Whereas 
both the farm business or small entrepreneur meth- 
od and the “organization and production” school 
concentrate on the individual peasant, subsistence 
has been examined on the “macro” as well as the 
“micro” level of analysis. Peasants who produce 
wholly or mainly for their own consumption are 
characterized as subsistence farmers, in sharp con- 
trast to agriculturalists who produce for the mar- 
ket. Groups of villages, regions, or even whole coun- 
tries (particularly with reference to the past) are 
presented as subsistence areas or subsistence econ- 
omies. Sometimes we read of modern and subsist- 
ence sectors (often identified with different ethnic 
elements) within a country or an economy. It is 
also common to find discussions keyed to a three- 
part scheme: subsistence agriculture or economy, 
semisubsistence, and modern. 

Generally speaking, writers who employ the term 
“subsistence” take as their standard of comparison 
the highly organized, mechanized, market-oriented 
agriculture of the great industrial nations. Subsist- 
ence tends to be defined negatively, by the complete 
absence of markets and accordingly of all commer- 
cial relations or incentives for increased produc- 
tion. But examples, of this purely subsistence agri- 
culture are hard to find. 

Semisubsistence fills in the gap as a transitional 
form between pure subsistence and the most ad- 
vanced industrial economies. In practice, the largest 
number of historically known societies fall into this 
essentially teleological middle category. 

Criteria of peasant societies. The present au- 
thor has suggested another possible framework for 
studying peasant economy and society at the macro- 
level. My interest is to identify whole states (either 
current or past) which can usefully be classed as 
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predominantly peasant in nature. The next step, 
and one I believe would be fruitful, is to examine 
the process by which such societies have come into 
being, their life history, and the manner in which 
some have passed out of the ranks of peasant 
societies. 

For the purpose of this analysis I have set forth 
five quantitative criteria that apply to the national 
level. (1) One half or more of the total production 
must be agricultural. (2) More than half of the 
working population must be engaged in agriculture. 
(3) There must be a state of at least a minimum 
size, and it must be organized on a territorial basis 
rather than as a tribal, kinship, or clan order. The 
administrative structure of such a state must com- 
prise a total of at least five thousand officers, minor 
officials, flunkeys, and underlings. (4) A peasant 
society presupposes the existence of towns and a 
break between these towns and the countryside that 
is simultaneously political, economic, social, and 
cultural. The total urban population of the state 
should amount to at least half a million persons; 
alternatively, at least 5 per cent of the entire pop- 
ulation of a peasant society should reside in towns. 
(5) The typical and most representative units of 
production must be family households which grow 
crops on their lands primarily by the physical effort 
of the members of these families, The household 
may include a slave or two, a domestic servant, or 
even a hired hand. But the total contribution of 
these nonfamily members to actual crop production 
must be much less than that of the family mem- 
bers. Half or more of all the crops grown in the 
society must be produced by households relying 
mainly on the labor of their own family members. 

For the period of the past two hundred years at 
least 25 countries can be shown to have societies 
which meet all of these criteria. Many other na- 
tional units, taken at different periods, will corre- 
spond only partially to this definition, but their 
deviations from it may also shed light on the dy- 
namics of the constitution and dissolution of peas- 
ant societies. 


Political role of the peasantry 


The characteristic subjection of the peasantry in 
late medieval Europe gave way to sporadic uprisings 
and even some sustained revolts. This facet of peas- 
ant behavior has also been noted (to mention a few 
instances) in Tokugawa Japan, Manchu China, 
modern Mexico, and tsarist Russia, where major 
movements were led by Stenka Razin and Pugachev. 

Almost always peasant uprisings have been 
marked by fury, desperation, and brutality. Once 


in motion the peasants have usually tried to destroy 
records, burn mansions, and, not uncommonly, put 
to death the landlords and their families. The up- 
per classes have struck back ruthlessly. Because of 
their narrow horizons and limited resources, edu- 
cation, and military experience, peasants have been 
ill-fitted to organize and carry through successful 
revolts. By contrast, princes and landlords have 
been accustomed to the arts of politics and war, 
They have known how to divide a large peasant 
movement or overwhelm and destroy a small one, 
On the whole the peasants have paid dearly for 
their violent efforts to break their shackles, 

After the French Revolution there arose a school 
of “romantic” political thought which idealized the 
peasantry as the center of conservatism in society 
as a whole, the stronghold of religion, and the seat 
of traditional values. Supporters of monarchy who 
wanted to stop the spread of democratic ideas ex- 
alted the peasant way of life as a counterpoise to 
the radical tendencies of the urban populations and 
called for its preservation. 

Marxist views. From the opposite point of view, 
that of wishing to hasten the process of change, 
Marx and Engels concurred in conceiving of the 
peasantry as a bastion of reaction. Marx went $0 
far as to term peasant life “rural idiocy.” 

Marx and Engels bequeathed this prejudice to 
the socialist parties of western Europe. They also 
bequeathed to their followers the doctrine that 
peasant cultivation was inefficient because of the 
small size of the units and that it was foredoomed 
to be replaced by large-scale agriculture. In Marx's 
eyes the growth of capitalism was certain, in the 
course of time, to break up the class of peasant 
proprietors, each with his own land, work animals, 
and implements. Some few of these independent 
proprietors would become capitalist employers 1E 
lying upon hired labor to carry on their agriculture. 
Many more of them would lose their land and tum 
into agricultural proletarians who would have to go 
out and seek work as wage laborers. 

As the socialist movement grew in German 
France, the problem of its relation to the peasantry 
became pressing. To some of the socialist leac 
the process of the breakup of the peasantry whic 
Marx had foreseen seemed to be occurring ve 
slowly, if at all. Nonetheless, in the early 18 : 
after Marx’s death, Engels insisted on the souns 
ness of Marx’s analysis and his prediction. a a 
firmness on this point, Engels cut the groun i 
from under the socialist politicians of the ae 
were hoping to make an alliance with the peas 
against the landlords and industrialists. 
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The logical corollary of Marx and Engels’ posi- 
tion was that it would be wrong for the socialist 
parties to support measures designed to aid peas- 
ant agriculture. Such tactics could serve only to 
prolong artificially the existence of a stratum which 
was economically outmoded, socially backward, and 
politically conservative. Instead, the socialists 
should encourage the class demands of the agricul- 
tural laborers, who were the natural allies of the 
urban industrial workers. 

Debate between Narodniks and Marxists. In 
tsarist Russia during the fifty years before the rev- 
olution of 1917, there took place a historic debate 
between the Narodniks and the Marxists. The 
Narodniks (Populists) believed that the mir, the 
typical Russian village community, was still a vital 
and flourishing institution. A striking feature of 
the mir, they held, was an old tradition of group 
activity in agriculture. It would be possible, the 
Narodniks argued, to build upon this already exist- 
ing foundation and thus make a direct transition 
to a kind of Russian agrarian socialism without 
passing through a stage of capitalism. 

Plekhanov and Lenin retorted that the unity of 
the village was a myth, since the rich and poor 
peasants were sharply divided by their class in- 
terests. In any case, capitalism could not be avoid- 
ed, since it was already the dominant tendency in 
the Russian countryside as well as in the towns. 
The peasantry, according to Lenin, represented a 
form of petty-bourgeois independent production 
within the framework of an increasingly capitalist 
economy, Therefore, the class which was called 
upon by history to lead the way to the new socialist 
society was the militant industrial proletariat of the 
cities and not the peasants in their villages. 

The peasantry as a revolutionary base. Recent 
decades have witnessed the development of a new 
and dramatically different Marxist evaluation of 
the political possibilities of the peasantry, partic- 
ularly in reference to the Far East and Latin 
America. Mao Tse-tung and other communist 
leaders learned from their bitter experience in 
China in the mid-1920s how difficult it was to sus- 
tain a revolutionary movement that was centered 
on the city. In a country so large and so rural as 
China, the working classes of the towns were rela- 
tively feeble and thereby exposed to massive re- 
prisals. The road to power, according to the theory 
which the Chinese communists elaborated, was no 
longer urban but rural. The base areas for the 
Socialist revolution must be located in the country- 
side. There the discontented peasantry were to be 
organized, a politically conscious people’s army 
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created, and guerrilla warfare waged until the time 
was ripe to shift to mobile warfare, encircle the 
big cities, and capture them. This new policy was 
not only enunciated by Mao Tse-tung but it was 
also effectively put into practice by the Chinese 
communists, who succeeded in carrying through 
a revolution relying much less on the industrial 
proletariat of the towns than on the peasant masses. 

Influenced by the ideas of the Chinese and, 
closer to home, by the example of Fidel Castro in 
Cuba, guerrilla movements that are primarily peas- 
ant in composition have been organized by revolu- 
tionary parties in the rural areas of many countries 
in Latin America, notably Colombia, Venezuela, 
Guatemala, and Peru. 


Folk society and folk culture 


An important new series of contributions to the 
study of peasantry comes from the ethnologists and 
social anthropologists. In its early years as a disci- 
pline anthropology tended to emphasize the search 
for social origins, Field investigators accordingly 
directed their efforts toward studying the most 
primitive and most isolated human groups which 
could be discovered. Since World War 11, in partic- 
ular, anthropologists have greatly widened their 
sphere of interest and have produced literally hun- 
dreds of studies of peasant communities on all 
continents. 

The shift from dealing with secluded entities 
about which little or no previous information was 
at hand to peasant villages embedded in countries 
with well-known cultures and long histories posed 
new problems of methodology and theory. One key 
question that the anthropologists found they had 
to grapple with was the place of the small commu- 
nity in the context of the society as a whole. They 
began to explore the extent to which the peasants 
shared in the “high” culture of the areas in which 
they lived and the extent to which they could be 
said to have a folk society, folk culture, or folk 
religion of their own. Redfield, in his work on 
Mexico, was one of the first to call attention to 
this relationship between the little community and 
the larger world, the small and the great traditions. 
Later studies have explored more complicated pat- 
terns of interconnection of village and national 
culture, as found, for example, in India. 


The spread of modern industrial society as a 
world-wide phenomenon is changing the conditions 
of life for the peasantry every day. It may be 
unwise, however, to expect any early disappearance 
of the peasants from the international scene. For 
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more than one hundred years we have been told 
that the peasantry is doomed, and we have been 
presented with exhaustive lists of the hardships 
which plague peasants. Nonetheless, they and their 
problems are still very much with us. The lasting 
power of the peasantry should not be under- 
estimated. 

DANIEL THORNER 


[See also ASIAN SOCIETY, article on SOUTHEAST ASIA; 
Economy, DUAL; LAND; LAND TENURE; MODERNIZA- 
TION; VILLAGE; and the guide under AGRICULTURE.] 
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PEIRCE, CHARLES SANDERS 


Charles Sanders Peirce (1839-1914), the great- 
est of America’s scientific philosophers, was born 
in Cambridge, Massachusetts, the second son of 
the famous Harvard mathematician and astron- 
omer Benjamin Peirce (1809-1880). Peirce was 
coached by his father in mathematics, physics, 
and astronomy, and was later to revise his father’s 
Linear Associative Algebra of 1870. After receiving 
his bachelor’s degree from Harvard in 1859, and 
a master’s degree summa cum laude, he joined the 
United States Coast and Geodetic Survey in 1861. 
During the thirty years he worked there, he became 
internationally famous for his pendular measure- 
ments of gravity and of starlight intensity (1878). 
Between 1879 and 1884, he taught at the Johns 
Hopkins University, his only university teaching 
position. He was elected a member of the National 
Academy of Sciences in 1877 and, even before that, 
to the American Academy of Arts and Sciences. 

Peirce published an improvement of Boole’s logic 
(1867), which was later further developed by H. M. 
Sheffer in his work on the stroke function. He 
also improved on De Morgan’s notation (1870) and 
developed further the logic of propositions, classes, 
and relations (1883). This was Peirce’s chief con- 
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tribution to logic and helped pave the way for 
twentieth-century developments in symbolic logic. 
Just as the work of Giuseppe Peano in Italy on the 
foundations of mathematics influenced such Italian 
philosophers as the “logical pragmatist” G. Vailati, 
so Peirce’s research in logic and critical philosophy 
(especially his studies of Kant, Reid, Bain, Schel- 
ling, and Hegel) led to his own formulation of the 
pragmatic theory that the meaning of a conception 
lies in the sum total of the conceivable conse- 
quences which the object of that conception can 
possibly have on the conduct of “an indefinite com- 
munity of investigators” (1877-1878, vol. 12, 
pp. 286-302). The final opinion of that ideal com- 
munity would constitute the truth, and the object 
of that ultimate opinion would be “reality.” 

This social criterion of meaning and truth mis- 
takenly has been thought by critics to reduce philos- 
ophy to mere public opinion and group prejudice. 
But Peirce clearly limited his ideal community to 
scientific investigators. He insisted, furthermore, 
on the fallibilism of all beliefs, even if common 
sense and science do require that some premises 
be taken tentatively as indubitable until experience 
or experiment shows that a disparity exists between 
the accepted consequences of these premises and 
observation. Applying the test of performing speci- 
fiable procedures for verifying the calculable con- 
sequences of hypotheses has become known as 
operationalism; Percy Bridgman advocated it in 
his study of the logic of physics, and John Dewey 
in his study of the methodology of the social 
sciences. 

Peirce formulated his early statements of prag- 
matism in two essays, “The Fixation of Belief” and 
“How to Make Our Ideas Clear,” the first two of 
six essays in “Illustrations of the Logic of Science” 
(1877-1878). Although in these early publications 
he did not call his philosophy “pragmatism,” he 
did use the term in his discussions about it around 
Harvard. Many of these discussions (mainly on the 
significance of the Darwinian controversy for meth- 
ods of reasoning in the physical and social sci- 
ences) took place in an informal club to which he 
belonged in the early 1870s and whose members 
also included William James, Chauncey Wright, 
John Fiske, Oliver Wendell Holmes, Jr., Nicholas 
St. John Green, and Joseph B. Warner—the last 
three being law students at the time (see Wiener 
1949). 

Peirce’s survey of methodological theories in the 
first of the two essays was a critical analysis of these 
theories based on social psychology and intellec- 
tual history. He listed the four chief methods used 
to settle “doubts” (by which he meant externally 
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caused disturbances of mind rather than the more 
subjective kind of Cartesian doubts): tenacity, 
authority, apriorism, and the scientific method. 
The scientific method is the most satisfactory in 
the long run because it is the only one of the four 
that is not in principle inflexible; it is self-correc- 
tive, whereas proponents of the other three meth- 
ods can claim “infallibility’ only by resting on 
premises other than those encountered in exper- 
ience or experimental situations. Peirce also re- 
jected intuitionism as being a form of a priori 
rationalism, 

He believed in a kind of metaphysical evolution- 
ism, Originally there was chaos, but as man con- 
tinually used reason to order his experience, there 
was a “growth of concrete reasonableness” (1891; 
1893). His three main metaphysical categories 
(that is, modes of representing all that is known) 
are exemplified by chance qualities (firstness), 
brute existence (secondness), and generality or 
order (thirdness) (1877—1878, vol. 12, pp. 604— 
615, 705-718; vol. 13, pp. 203-217). In his later 
works, Peirce insisted that generality is objectively 
real, that it exists independently of our beliefs, and 
that it is given in perception of the particular. 

Although Peirce defended a frequency theory of 
inductive probability, he also proposed other mean- 
ings of probable hypotheses or likelihood (1883). 
He thought all reasoning processes could be divided 
into three types, depending on their consequences: 
purely explicative deduction (for example, mathe- 
matics and formal logic), ampliative induction (for 
example, empirical generalization ), and conjectural 
abduction, or reasoning culminating in a probable 
hypothesis. This third type of reasoning (in his 
logic of hypothesis) is also called retroduction and 
is exemplified by cryptography, medical diagnosis, 
historical inference, and detective work. For all 
scientific theory he advocated (at the same time as 
Ernst Mach, but independently of him) a principle 
of economy, or practical simplicity in the consid- 
eration of hypotheses, and a statistical conception 
of the laws of nature as predictive, fallible, and 
subject to modification in time. 

In social matters, Peirce was intensely opposed 
to the rugged individualism or “philosophy of greed” 
of the political economy of Simon Newcomb and of 
other social Darwinists of his day. He also deplored 
the lack of faith in the gospel of love on the part 
of those theologians who practiced the “higher crit- 
icism.” In general, Peirce was concerned with the 
neglect of humanistic ethics in social or political 
matters. 

During the last thirty years of his life he contin- 
ued his studies in the logic and philosophy of the 


sciences, although he lived almost like a hermit 
at Milford, Pennsylvania, with his second wife, a 
French widow who spoke little English. His contri- 
butions to logic, to the philosophy of science, and 
to the theory of signs were, nevertheless, very influ- 
ential in the development of the views of people 
who participated more actively in academic or 
political affairs, for example, William James, John 
Dewey, George Herbert Mead, C. I. Lewis, Charles 
E. Morris, Morris R. Cohen, F. P. Ramsey, Ernest 
Nagel, Sidney Hook, and others who call them- 
selves pragmatists. 

Recent commentators (Feibleman 1946; Thomp- 
son 1953; Wiener & Young 1952; Goudge 1950; 
Murphey 1961) have examined Peirce’s philosophy 
“as a whole” and have discerned a latent system 
in his diverse writings, a system that, although in- 
complete, is architectonic in its professed aims. 
Kant built an architectonic system of categories 
that was based on what he regarded as a finished 
classical logic, but Peirce went far beyond the class- 
ical syllogistic logic in his “logic of relatives.” 

Murray G. Murphey, in The Development of 
Peirce’s Philosophy (1961, p. 432) has outlined 
four major phases of Peirce’s philosophy: a Kantian 
phase, 1857—1865; the development of the irreduc- 
ibility of the three syllogistic figures corresponding 
to deduction, abduction, and induction, 1866-1869; 
the logic of relations, 1870-1884; and quantifica- 
tion and set theory, 1884-1912. These phases are 
not actually sharply demarcated in Peirce’s work, 
Murphey is correct, nevertheless, in emphasizing 
the fact that Peirce’s philosophical categories of 
firstness, secondness, and thirdness underwent 
changes as his logical theories developed; for like 
Kant, Peirce believed one’s philosophy should fol- 


low one’s logic. 
Purp P. WIENER 


[For the historical context of Peirce’s work, see. Post 
TIVISM; SOCIAL DARWINISM; STATISTICS, article o 
THE HISTORY OF STATISTICAL METHOD. For di 
sion of the subsequent development of his ideas, E 
Errors, article on NONSAMPLING ERRORS; ps 
TICS AND SEMIOTICS; and the biographies of COHEN: 
Dewey; Homes; JAMES; MEAD.] 
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1, THE FIELD 
1. PROBATION AND PAROLE 


1 
THE FIELD 


Penology is the branch of criminology con- 
cerned with government policies and practices in 
dealing with persons convicted of crimes. Its ety- 
mology, from the Latin poena, meaning “pain” or 
“suffering,” reflects the early conception of punish- 
ment as the primary objective of state action 
toward criminals. However, in modern times in- 
terest in punishment has been largely replaced by 
concern with changing the abilities, interests, atti- 
tudes, and emotions of criminals. Accordingly, as 
we move farther into the second half of the twen- 
tieth century, we often find the term “corrections” 
being used instead of “penology.” 

The treatment of criminals has always been mo- 
tivated by (1) the feelings of hostility and desires 
for revenge that criminals arouse in their victims 
and in those who sympathize with the victims; (2) 
abstract philosophies, ideologies, and religious be- 
liefs regarding punishment; (3) prevailing theories 
of crime causation. Usually all three of these mo- 
tives and rationales operate simultaneously, but 
the sequence in which they are enumerated above 
corresponds to the historical shift that has oc- 
curred in the relative power of each to determine 
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penal objectives. These objectives are epitomized by 
four words: revenge, restraint, reformation, and 
reintegration. All four have always been pursued to 
some extent, but there have been marked changes 
in the frequency with which each has been explic- 
itly set forth as the primary penal goal. 


Revenge and restraint 


Evidence that government concern with punish- 
ing persons who commit crimes developed primar- 
ily to replace private vengeance-seeking by the 
victims and their kin is provided not only by his- 
torical accounts of early judicial agencies but also 
by objective statistical scaling of societies on the 
basis of their culture traits. Freeman and Winch 
(1957), in an analysis of 48 societies, found that 
where governments do not assume this private 
punishment function, the society rarely is able to 
maintain full-time priests; without full-time priests, 
they do not have full-time teachers; unless they 
have full-time teachers, they lack full-time bureau- 
crats; and so on up the scale of societal complexity. 
Thus, it seems well established that penal activity 
is a prerequisite to much elaboration of other func- 
tions by government and religious institutions; 
when there is no penal program, the regulation of 
society by church or state is continually subject to 
restriction by the anarchy of private feuding among 
offenders and victims. 

Until the eighteenth century in Europe, and 
much later in many other areas, the structuring of 
penal policy by the prevailing religious and political 
status hierarchies was explicit. The nobility was 
subject to a criminal law different in its specifica- 
tion of penalties from that which prescribed pun- 
ishments for commoners. The latter were dealt 
with most harshly, especially when the victims of 
their offenses were of the nobility. This was justi- 
fied by an explanation of crime as the consequence 
of a base nature, a condition presumed to be most 
frequent in the “low-born.” An alternative theory, 
that crimes result from the possession of one’s 
spirit by the devil, through witchcraft and wizardry, 
justified both the imposition of some punishment 
by clerical courts and a strong church influence on 
the penal policies of secular courts. 

Two major influences on punishment that may 
be latent at all times were blatantly manifest in 
Europe until the end of the eighteenth century. 
These influences were the social distance between 
the punisher and the punished and the economic 
interests of the punisher. When the punisher could 
perceive the offender as both alien and inferior, 
there was little reluctance to impose death, torture, 
physical mutilation, severe corporal punishment, 
or some combination of these. When punisher and 


punished were peers, the most frequent penalties 
were forfeiture of property, forced labor, and ban- 
ishment. Rusche and Kirschheimer (1939) ex. 
haustively document an argument that the supply 
of labor also determined penal policy with respect 
to lower-class offenders: when labor was scarce, 
punishment was most often forced work in galleys 
or in houses of correction, but when there was a 
surplus of labor, punishment by death or by trans- 
portation to the colonies prevailed. [See PUNISH- 
MENT.] 

Beccaria and modern penology. There devel- 
oped in the Enlightenment, partly in reaction to the 
severity of punishments imposed by the nobility, 
a conception of all men as equal. This was equal- 
ity, not in possessions or status, but in rights and 
in potential amenability to guidance by reason. In 
1764 these conceptions were brilliantly used by 
Cesare Beccaria as postulates for a new penal pol- 
icy. His Essay on Crimes and Punishments had an 
immediate and tremendous influence throughout 
Europe and America and is often considered the 
beginning of modern penology. In the half century 
that followed, Beccaria’s arguments were extended 
by Voltaire, Bentham, Romilly, and others, reflect- 
ing the philosophy of utilitarianism. This resulted 
in so-called “classical” criminal law, which still pro- 
vides the framework for our penal codes. This legal 
perspective calls for punishments based on the of- 
fense, rather than on the offender, and calculated 
to inflict no more pain than suffices to offset the 
satisfaction that a crime might yield to its perpe 
trator. 

Imprisonment and forced labor, because their 
ostensible severity can be precisely graduated in 
terms of time, were well suited to the employment 
by judges of what Bentham called a “felicific cal- 
culus.” This is the prescription of punishment in 
precise degrees of severity, so that the pain thé 
court imposes balances the presumed gratifications 
from the crime. The high valuation of liberty Fi 
the respect for all humans which were fostere $ 
the democratic revolutions also promoted 4 He z 
erence for imprisonment or labor as penalties, 
rather than torture or death. Furthermore, ts i 
lic felt protected when felons were confined. on 
result of these several factors, during the half cit 
tury which followed publication of Beccaria vate 
imprisonment replaced capital and corporal p ‘red 
ment in western Europe for over two hun 
felonies. 


Reformation and the reformatory meee. 

“Classical” legal philosophy stressed a hap- 
tives: “individual deterrence” was to ee his 
piness that an offender supposedly gains 


crime, and “general deterrence” was to make non- 
offenders perceive crime as not conducive to hap- 
piness, However, a different justification for im- 
prisonment was promulgated around the time of 
the American Revolution in the Pennsylvania col- 
ony, many of whose Quaker leaders had been im- 
prisoned in Britain for their religious beliefs. 

Like some European Roman Catholic penal pio- 
neers in the same period, the Quakers were inspired 
by the monastery cells in which monks sought 
spiritual reformation through isolation and medita- 
tion. Prevailing penal construction congregated 
groups of inmates in large rooms, but the Pennsyl- 
vanians advocated a separate cell for each offender, 
cutting him off from communication with his fel- 
lows. They claimed that this promoted more peni- 
tent thought, especially when these “penitentiaries” 
limited the reading matter in each cell to a Bible 
and religious tracts, with little or no work and 
with visits only from preachers and pious laymen. 

The “Pennsylvania system” spread rapidly. It 
became the predominant prison pattern in conti- 
nental Europe during the nineteenth and twentieth 
centuries. However, it was opposed as costly, and 
its claims of reformation were challenged by advo- 
cates of a system developed at a prison in Auburn, 
New York, during the first half of the nineteenth 
century. The Auburn system, which soon prevailed 
in the United States, emphasized solitary confine- 
ment at night, but congregate work during the day, 
under a rigid rule of silence. Later in the nineteenth 
century the striped suit and lock step also were 
identified with the Auburn system; these features 
largely disappeared only after World War u. De- 
spite their ostensible contrast, both the Pennsyl- 
vania and the Auburn prison systems were based 
ona conception of crime as learned in communica- 
tion among criminals, rather than as a purely 
individual calculation of alternative utilities. Both 
prison systems also claimed that they trained of- 
fenders in legitimate modes of thought and be- 
havior, instead of merely deterring them. 

Emphasis on training increased in the latter part 
of the nineteenth century with the “reformatory 
movement,” which is usually traced to the Elmira 
Reformatory in New York. This special institution 
for young felons developed distinctive features in 
the 1870s under the leadership of Zebulon R. Brock- 
way. It borrowed from Alexander Maconochie’s 
Norfolk prison colony in Australia the “mark sys- 
tem” of awarding inmates numerical credits for 
each period of good behavior, with loss of these 
marks” for misconduct. A specified total number 
of marks was required for each small reduction of 
Testrictions or increment of comfort in the institu- 
tion, as well as for release. Elmira added to this 
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an emphasis on education and vocational training. 
Most major prison systems established reforma- 
tories during the subsequent half century. However, 
they generally replaced the mark system by less 
mechanically assigned hierarchies of rank and 
small privilege for inmates, known as “honor” sys- 
tems in the United States, and elsewhere as 
“graded” or “step” systems, 

Classification and individual treatment. Con- 
comitant with the reformatory movement were the 
separate developments of probation and parole 
[see PENOLOGY, article on PROBATION AND PAROLE]. 
The simultaneous emphasis on institutional pro- 
grams for reform and on conditional modes of 
release to test how well reform was achieved re- 
flected the development of the biological and social 
sciences during the nineteenth and twentieth cen- 
turies [see CRIMINOLOGY]. These fostered a con- 
ception of crime not as something to be morally 
condemned but as a phenomenon having natural 
causes and, hence, controllable if these causes are 
identified and altered. However, a great diversity of 
causes seemed to be involved. Early criminological 
researchers, notably Cesare Lombroso, ascribed 
crime to biological inheritance; psychologists first 
emphasized imbecility, and later the emotional 
disturbances of early childhood, as causes of crime; 
sociologists called attention to deviant subcultures 
and to the influence of social support in delinquent 
and criminal activity. The most prevalent explana- 
tion for crime, therefore, became “multiple causa- 
tion” [see CRIME, article on CAUSES OF CRIME]. 
This justified the major prison management em- 
phasis in the second quarter of the twentieth cen- 
tury, “classification” and “individual treatment” of 
inmates. 

The classification process in correctional insti- 
tutions begins with medical and psychological 
examination of the newly received prisoner, plus 
a caseworker’s preparation of the inmate's life his- 
tory. These reports are discussed by the institution’s 
classification committee, which usually consists of 
the head of its custodial staff, plus the director of 
its school, its physician, chaplains, and case- 
workers. They first assess risk of escape, of vio- 
lence, or of other misconduct by the inmate. Then, 
within the limits posed by these custodial concerns, 
they recommend a program of training and treat- 
ment by which the institution may help to correct 
the inmate’s deficiencies. As an incentive to his 
compliance with this treatment program, the in- 
mate is advised that such a self-improvement effort 
will enhance his prospects for parole. 

Two major problems have been widely recog- 
nized in these reformation efforts. The first is the 
fact that society has seldom been willing to invest 
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in its treatment programs funds sufficient to cor- 
rect serious educational, vocational training, psy- 
chological, and other deficiencies of most offenders. 
The second is the demonstration by research that, 
for many offenders, such treatment does not pre- 
vent a return to crime, especially when their social 
acceptance and their conception of themselves 
as successful has been and continues to be much 
more immediate and frequent in criminal than in 
noncriminal pursuits. Accordingly, the second half 
of the twentieth century has seen the penal objec- 
tives of deterrence and treatment, inherited from 
prior periods, augmented by concern with changing 
the social relationships of offenders. 


Social reintegration 


The distinctive feature of prison management 
in the second half of the twentieth century has 
been the extent to which the social boundaries 
within the institutions have become permeable 
while the isolation of these institutions from the 
outside world has been reduced. This appears to 
have been stimulated, in part, by a series of socio- 
logical analyses of prisons as communities, as well 
as by increasingly rigorous research on the conse- 
quences of alternative correctional programs. 

Traditionally, prisons have had castelike relation- 
ships between staff and inmates. This is especially 
pronounced in prisons where custody and control 
are the primary managerial concerns. Devices to 
maintain the authority of staff include quasi-mil- 
itary movement of inmates in mass formations and 
the maintenance of social distance between staff 
and inmates by requiring that communication be- 
tween the two be only formal. Clemmer’s pioneer 
sociological study of the prison as a community 
(1940) described the tendency of inmates to be- 
come “prisonized” by learning a new language and 
acquiring a way of life appropriate to the regulation 
of their lives by staff. Sykes (1958), and the various 
authors who contributed to Cressey’s collection of 
sociological essays on the prison (1961), described 
the special roles inmates develop to cope with each 
other, as well as to corrupt the authority of staff. 
Most prevalent, especially in highly repressive 
prisons, is the “right guy” role, which emphasizes 
loyalty to other inmates in opposing staff efforts 
to learn about illicit activities in the institution, and 
hostility to the inmate “stool pigeon” or “rat,” who 
purveys such information to staff. Most influential, 
however, are those inmates who attain the “politi- 
cian” role by acquiring some measure of trust from 
both inmates and staff. This permits them to gain 
from the staff and to dispense to inmates some 
extra “privileges,” comforts, and “inside” informa- 


tion and, reciprocally, to enforce inmate coopera- 
tion and order in the performance of essential 
prison maintenance and production work. Cross- 
cutting these categories, especially in womens 
prisons, according to Ward and Kassebaum (1965) 
and Giallombardo (1966), are social types based 
on role differentials in homosexual relationships. 

In the treatment-oriented penology of the 1930s, 
psychotherapy was recommended for most of- 
fenders, but the psychiatric and clinical psycholog- 
ical staff available in correctional institutions never 
sufficed to provide therapy for more than a minute 
proportion of the inmates. After World War m, 
group therapy was seen as a solution to this need, 
not only because one clinician could deal with 
several subjects simultaneously in group sessions 
but also because a change in the social relation- 
ships among inmates was now seen as essential to 
their reformation. Increasingly, emphasis then 
shifted to changing the “social climate” of the insti- 
tutions by co-opting both nonclinical staff and in- 
mate leaders into the therapeutic process [com- 
pare MENTAL DISORDERS, TREATMENT OF, article 
On THE THERAPEUTIC COMMUNITY]. 

In the post-World War 11 years, concern for min- 
imizing social distance between staff and inmates 
developed, and gained increasing concessions from 
the custodially oriented components of correctional 
management. Techniques for social distance reduc- 
tion described by Glaser (1964) included assign- 
ment of line correctional staff to lead inmate groups 
meeting regularly for counseling; replacement of 
traditional institution classification committees by 
classification “teams” of lower-level staff; appoint 
ment of inmates to advisory committees on insti- 
tution operations; and changing from disciplinary 
practices of solitary confinement for definite n 
riods to periods of confinement with intensive stal 
counseling, the confinement terminating only ye 
there is some gain in staff—inmate consensus. Street, 
Vinter, and Perrow (see Street et al. 1966), com- 
paring inmate leadership attitudes in youth correc: 
tion institutions of diverse social climates, ss 
inmate support for staff objectives highest WA"" 
social distance was minimized. Nevertheless, es 
trolled experiments involving random asia i 
of inmates to heavily counseled and to ae s 
seled penal units failed to demonstrate market n 
consistent recidivism reduction by these eee. 
they appeared to promote adjustment to ins ae 
tional life more than preparation for the P 
release world. i 

Reduction of inmate isolation. Penal e 
in the 1960s is distinguished by a rapid Pa ae a 
tion of programs to reduce the isolation © p 


populations from the outside world. This was 
achieved, first, by inviting outsiders into the insti- 
tutions, both on a voluntary and on a paid basis, 
as part-time instructors, as participants in inmate 
discussion or hobby clubs, and simply as personal 
visitors to inmates out of contact with outsiders. 
Mutual aid organizations, such as Alcoholics Anon- 
ymous, have been especially active in these func- 
tions, and they establish social relationships with 
inmates which continue in the free community. 
Also, several prison systems markedly reduced their 
restriction and censoring of inmate correspondence 
with outside parties. Conjugal visits, allowing a 
married prisoner to have sexual relations with his 
spouse, have long been permitted in Latin America, 
Scandinavia, eastern Europe, and some countries 
of Asia, but only in the state of Mississippi within 
the United States. 

The second mode of reducing the isolation of 
prisoners has been to permit their temporary re- 
lease before final departure from the institution. 
These releases include furloughs for a few days or 
a week when their parole or discharge dates are 
near, to permit them to make home and job arrange- 
ments for the future, and simply to make their ad- 
justments to complete release less drastic. More 
dramatic has been the rapid spread in America 
and Europe of so-called “work release,” whereby in- 
mates of a prison are permitted to depart daily, to 
hold jobs in adjacent communities. 

Work release, as a means for achieving the social 
reintegration of offenders into the community, has 
been limited by the fact that prisons usually are 
in locations remote from the major metropolitan 
areas from which most of their inmates come and 
to which they return. Therefore, new types of penal 
institutions developed in the 1960s, small residences 
in large cities to which inmates are transferred for 
the last few months of their prison terms. The 
inmates in these graduated release centers depart 
daily to seek employment or to work on jobs in the 
community, and as their release date approaches, 
they may leave the centers in their leisure hours 
more often, for recreation and to visit their pros- 
pective postrelease homes. Indeed, they often may 
Move out of the center in their last few weeks, 
although they return several times per week to 
discuss their outside experiences. 

_ The counseling of inmates which occurs in con- 
junction with these releases is much more relevant 
to their real problems of community adjustment 
than counseling generally can be under traditional 
forms of imprisonment. Under the new programs, 
discussion deals with immediate circumstances as 
they are encountered, while counseling during long- 
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term confinement necessarily deals primarily with 
selective recollections of the past, problems of in- 
stitutional life, and speculations regarding a hypo- 
thetical future, Furthermore, staff assessment of 
the risk involved in recommending a prisoner's 
complete and permanent release is presumably 
much more competent when the considerations on 
which the assessments are based include impres- 
sions gained following the prisoner's temporary 
release, rather than being restricted to observations 
in prison. 

Concern with the integration of offenders into 
legitimate roles in the free community has promot- 
ed more reluctance to remove them from the com- 
unity in the first place. Accordingly, there has been 
greater use of fines and of weekend or evening con- 
finement only, presumably as deterrent sentences 
for prisoners with stable employment and family 
relationships, both of which might be jeopardized 
by confinement. Indeed, the increased flexibility of 
release from penal confinement has meant that 
traditional differences between imprisonment and 
alternative correctional practices have diminished. 
It is noteworthy that in the 1960s there is a gen- 
eral decline in the number of people in prison in 
most countries for which statistics are available, 
despite an increase in total population and in prose- 
cutions for major crimes. A major factor in this 
diminution has been the increase in the principal 
alternative to penal confinement—release on pro- 
bation or parole. 

DANIEL GLASER 


[Directly related are the entries CRIME; CRIMINAL LAW; 
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GANIZATIONAL GOALS; UTILITARIANISM; and in the 
biographies of BECCARIA; BENTHAM; LOMBROSO; 
Rusu; VOLTAIRE.] 
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I 
PROBATION AND PAROLE 


Probation and parole are procedures for rel 
of convicted criminals or adjudicated del 
on a conditional basis in order to assist tl 
pursuing a noncriminal life, with the pro 
they may be committed or returned to a 
tional institution if their behavior after relea! 
to meet standards of the releasing authori 
granted by a court as an alternative to in 
tion, this release is generally called “probatic 
granted by an administrative agency to 
who already has served part of a term of 
ment, this release is usually called “parole” 
United States and “license” in Britain. The 
or other equivalent of the term “conditiona 
tion” is used in non-English-speaking CO 
However, in much popular speech and 
some legal statutes within the United States 
bation” and “parole” are used interchangeabl 

Both probation and parole represent a shift 
the classic penal goal of deterrence to the mi 
goal of encouraging the offender to fin 
legitimate alternative to crime, while continu 
restrain him for the protection of society. Pi 
and parole also involve postponing a final 
in dealing with the criminal; the offender's 
treatment is left uncertain until he is 
further. 

History of probation. Probation evolved 
the common-law procedure of suspended S 
by which a judge first imposes a specific Re 
then orders that the penalty not be enforced 
offender behaves as directed. Probation differs 


someone to assist the convicted person in 
ing a law-abiding life, as well as to check wh 
the conditions of release are obeyed. ' P. 
The first probation officer is often said to h 
been John Augustus, a Boston cobbler, who ini 
volunteered to assist offenders if the court | 
release them to his care. By his death in 1 
and his friends were credited with providin 
service for over 2,000 persons with remarkabl. 
failures. These voluntary services first 06 
official government functions in Massachu 
where an 1869 law required the Board of Chari 


“visiting agent” to arrange family placement for 
children charged with offenses and where an 1878 
law authorized court employment of a probation 
officer to serve any offender considered reformable 
without punishment. 

In Britain and elsewhere it is claimed that some 
use of voluntary supervisors in suspended-sentence 
cases occurred well before the work of John Au- 
gustus. However, the first British law on condi- 
tional suspension of sentence was enacted in 1879, 
and the modern form of British probation, provid- 
ing state assistance and supervision, was not estab- 
lished by law until 1907. 

Laws for the conditional suspension of sentences 
were enacted in Norway in 1887, Belgium in 1888, 
and France in 1891. While this concept spread 
rapidly, government employment of probation of- 
ficers was less readily accepted. The latter practice 
is said to have first occurred on the Continent at 
Neuchatel, Switzerland, in 1891, but it did not pre- 
vail in most of western Europe until after World 
War 1, and it spread even more slowly in eastern 
Europe. 

Probation was introduced into most of the British 
colonies in Asia and Africa early in the twentieth 
century and was then adopted in some of the ad- 
jacent countries. Its entrance into Latin America 
came with the establishment of juvenile courts 
there in the 1920s and 1930s; but it was not ac- 
cepted in most Latin American courts for adults 
until much later. 

Probation in juvenile courts, Almost every- 
where, the most extensive use of probation has 
occurred in juvenile courts, Here children are also 
placed on “unofficial probation,” by continuing their 
hearings for perhaps six months or a year, then 
dismissing their cases if they and their parents 
have cooperated with the probation officer and be- 
haved as the court directed during this period. In 
this way the child never is officially adjudicated 
delinquent. Unofficial probation is the most fre- 
quent disposition of cases in many juvenile courts, 
while in others it is either unknown or infrequent. 
This confounds efforts to compare delinquency 
rates from one jurisdiction to the next. An anal- 
ogous practice in some adult courts is known as 

deferred prosecution.” 

These procedures for avoiding official decrees 
have been criticized because the court may thereby 
impose stringent behavior restrictions on an ac- 
cused person without fully investigating the valid- 
ay. of complaints against him. Fear that noncooper- 
ation would result in commitment to a correctional 
institution inspires compliance with unofficial pro- 
bation suggestions as though they were formal 
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court orders, However, such practices have been 
defended as imposing less stigma on the accused 
than would result from full legal proceedings 
and as no more likely to be abused than formal 
procedure. 

History of parole. The term “parole” can be 
traced to parole d'honneur (word of honor), a type 
of oath known in medieval days whereby military 
prisoners or offenders of noble birth were released 
on their promise not to escape or not to resume 
conflict. However, the practice to which the term 
generally is restricted today evolved from programs 
to motivate desired behavior in prisoners. 

In 1840, Alexander Maconochie, head of a 
British penal colony in Australia, instituted a sys- 
tem whereby convicts earned “marks” for industry 
and obedience and lost them through misconduct. 
A specified number of marks had to be accumu- 
lated before a man could enter each of four “stages” 
of decreased restriction in confinement. In the last 
stage the offenders received a “ticket of leave,” 
which granted them freedom for the balance of 
their sentence provided they had no further con- 
flict with the law. 

Sir Walter Crofton, in 1846, introduced a ticket- 
of-leave system in Ireland under which the released 
prisoners had to report regularly to the nearest 
police station. This system was extended to Eng- 
land by the Penal Law of 1857, where it was called 
“conditional release on licence.” The term “parole” 
was applied to the Irish system by its proponents 
in the United States, who finally had it instituted 
in 1877 at the state reformatory in Elmira, New 
York. It had been adopted in 22 states by 1900 and 
in 44 by 1922. In almost all parole systems in the 
United States, government-employed parole officers 
supervise the releasees, 

In Britain, assistance for released prisoners, 
whether on “licence” or fully discharged, is known 
as “after-care” and is performed primarily by pris- 
oners’ aid societies. These originally were purely 
yoluntary organizations, but they have received 
government financial assistance since 1862, The 
British local organizations federated in 1936 as the 
National Association of Discharged Prisoners’ Aid 
Societies. Officers of this organization, by law, are 
also officers of the Central After-Care Association, 
a government agency created by the Criminal Jus- 
tice Act of 1948 and gradually enlarged until, after 
1961, it handled most compulsory after-care cases. 
Rather than engage a large staff, this association 
frequently appoints local probation officers as its 
“associates” to supervise licensees in their area. 
Several plans for decentralization of British after- 
care have been widely debated. 
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Preprobation and parole investigations. Proba- 
tion officers generally are employees of the courts. 
Their work differs from that of parole officers in 
that, in addition to supervising released offenders, 
they serve as advisers to judges. They do so through 
their presentence investigations (called “prehearing 
investigations” in juvenile courts), In these in- 
quiries the probation officer usually interviews the 
accused and his family and any other persons or 
agencies having relevant information, such as em- 
ployers or the armed forces. Psychological and 
psychiatric examinations may also be arranged. 
The report on such investigations summarizes the 
life history of the subject, analyzes his personality 
and his offense, describes his current situation, and 
assesses his probable future behavior under alterna= 
tive possible court dispositions. 

Investigations—pro and con. In some jurisdic- 
tions a presentence or prehearing investigation is 
required by law for every case; elsewhere it is often 
required by court policy. In dealings with con- 
victed adults presentence reports are useful to a 
judge even in cases for which probation is not 
under consideration, for they may guide decision 
as to the most appropriate type of prison sentence. 

There has been some objection to mandatory pre- 
hearing investigations in juvenile courts on the 
grounds that inquiry by a court employee publicizes 
the status of the accused in a way that impairs his 
acceptance in his neighborhood and at his work- 
place or school. Such damage is of particular con- 
cern in cases where the court concludes that the 
subject has been unjustly accused. Mandatory in- 
vestigations also are alleged to damage some juve- 
niles by delaying court action for days or weeks 
even when information adequate for a sound dis- 
position of the case should be provided by persons 
available at the initial hearings. 

There has been continual controversy regarding 
the right of legal counsel to examine the presen- 
tence or prehearing report, to question the proba- 
tion officer on it, and to present witnesses or evi- 
dence in opposition to the findings of this report. 
In support of such rights, it is charged that the 
probation officer frequently presents hearsay evi- 
dence as fact and obtains testimony from preju- 
diced sources without systematically determining 
what others might say in rebuttal. Denial of these 
rights is defended on the ground that exposure of 
sources would limit the range of relevant character 
information submitted to the court and that prop- 
erly trained probation officers are competent to 
make a thorough and impartial investigation. A 
compromise on this issue frequently prevails, with 
the judge making the factual conclusions of the 


probation officer’s report available to counsel, 
and hearing testimony in rebuttal of these con. 
clusions, without revealing the officer's sources of 
information. 

Purpose of investigations. Parole officers in- 
vestigate an inmate's proposed after-prison home 
and employment to ascertain whether they are 
actually available or may involve situations con- 
ducive to crime. Usually this inguiry is undertaken 
only after the inmate’s parole is approved; the 
parole board stipulates that parole will be deferred 
if postrelease arrangements are not satisfactory, 
Officers may also endeavor to arrange housing and 
employment for a prospective parolee. 

Men from prison, however, are so often deficient 
in work skills and experience, and there is such 
prejudice against hiring them, that many promises 
of postrelease employment actually are only favors 
granted the prisoner’s family in order to hasten his 
release. Such jobs usually prove only temporary or 
nonexistent. Consequently, many parole authorities 
advocate less insistence on prelease job arrange- 
ments and more assistance to newly released 
prisoners while they seek their own jobs. 

Factors in release decisions. Probation frequent- 
ly is prohibited by law to persons found guilty of 
so-called heinous crimes, such as murder, rape, or 
narcotics offenses. Advocates of rehabilitation as 
the primary objective in dealing with criminals ob- 
ject to such statutory restrictions on the grounds 
that they hamper the state’s adjustment of its actions 
to meet individual differences among offenders. 

Both concern with rehabilitation and interest in 
protecting the public require that probation and 
parole decisions be based on prediction of the 
offender's future conduct, This prediction nS 
not just assessment of the risk of further crime Í 
the offender is released immediately but also judg- 
ment whether these prospects will be better Or 
worse after a period of confinement. Since the p% 
tentialities of humans and the situations they e 
encounter can never be completely known in al 
vance, considerable error is inevitable in all ne 
and correctional prediction. Furthermore, the val 
tice of making each decision on a purely no ie 
basis and the lack of systematic analysis © of 
accuracy of past predictions impair the ne. 
judges or parole boards to learn from rain ae 

Prediction research and release procedure. nae 
its inception in the 1920s, probation and P 2 
prediction research has repeatedly been A 
as a basis for scientific release decisions. i 
figures as Ernest W. Burgess and Shela het 
Eleanor Glueck in the United States, eee a 
and his students in Germany, Leslie T. wil 


Britain, and many others developed statistical 
tables relating prerelease information on offenders 
to their postrelease behavior. There have been 
numerous demonstrations that these tables gen- 
erally lead to more accurate predictions than is 
achieved through prognosis from case impressions 
by correctional officials, psychiatrists, psychologists, 
or sociologists. Nevertheless, for about thirty years 
the Illinois parole system was the only agency sys- 
tematically making statistical prediction data avail- 
able to makers of release decisions. Resistance to 
this practice reflected exaggerated claims for the 
tables by some of their proponents, which aroused 
an opposition fearing that use of the tables would 
dehumanize casework or would make relative vio- 
lation risk the sole factor in release decisions. This 
situation changed in the early 1960s when Britain 
and several jurisdictions in the United States, nota- 
bly California and Wisconsin, sponsored research 
leading to improved tables. They treated the tables 
as an adjunct to other data facilitating release de- 
cisions, rather than as the sole basis for decision 
[see PREDICTION]. 

Supervision of probationers and parolees. The 
conditions under which probation or parole is 
granted generally are expressed as a set of rules. 
These usually specify a county or other area in 
which the releasee must remain, unless granted 
special permission to go elsewhere. They require 
regular employment or school attendance, if pos- 
sible, and avoidance of known criminals. There are 
rules against carrying weapons and other more or 
less specific injunctions. Usually the probationer or 
parolee is required to report to his supervision 
officer at regular intervals and to cooperate with 
the officer in other ways. 

Restitution and compensation. Frequently in 
probation, but rarely in parole, the offender is re- 
quired to make restitution to the victim of his of- 
fense. Occasionally the judge may require some 
other compensatory action, such as unpaid labor 
at public agencies, Such acts have been interpreted 
Psychologically as essential to rehabilitation, for 
good works give the offender a conception of him- 
self as a good person and facilitate his identifi- 
cation with anticriminal persons. However, for 
optimum rehabilitation from such experience the 
offender must be inspired to undertake these acts 
on his own initiative, rather than just perform 
them involuntarily. 
> Role dilemma of the probation officer. Proba- 
tion and parole officers often experience a conflict 
in their work between the requirements of assist- 
ance to a releasee and the requirements of rule 
enforcement. There is increasing emphasis on such 
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positive aid as counseling, obtaining employment, 
and mediating family conflicts for the releasee. 
Such assistance may require a relationship of 
mutual respect and trust, while effective rule en- 
forcement requires surveillance, inquiry, and other 
expressions of distrust. Solutions for this role 
dilemma vary from leaving surveillance exclusively 
to the police to counseling primarily by threat, 
but most practice is intermediate between these 
extremes. 

Rehabilitation and the public interest. The 
granting of probation or parole implies that the 
state has no interest in punishing the offender but 
is concerned only with assuring his achievement 
of a legitimate way of life. Most jurisdictions, how- 
ever, prescribe a minimum term of confinement 
before parole may begin, and some judges fre- 
quently require that a jail term precede probation. 
Argument for this practice asserts that prior ex- 
perience of the punishment to be imposed if re- 
lease conditions are violated deters the releasee 
from violation. An opposing argument is that con- 
finement, particularly in the idleness characteristic 
of jails, is a criminalizing experience. It is also con- 
tended that a major advantage of probation for 
many cases is noninterruption of the subject’s em- 
ployment and family relationships. Of course, the 
latter arguments are not relevant for the large pro- 
portion of offenders who are unemployed at the 
time of their trial, nor for the many who lack 
satisfactory family relationships. 

Revocation of parole or probation generally oc- 
curs if the releasee is convicted of a felony, or if he 
commits serious or repeated rule violations. Usually 
a court hearing is required to revoke probation, and 
a parole board hearing to declare a parolee a vio- 
lator. However, these hearings seldom can be ar- 
ranged before some weeks after the releasee has 
been in custody. 


Convergence of penal practices 


In concluding the article on the field of penology, 
I noted that under certain new types of prison 
management, which are rapidly spreading, many 
prisons are allowing inmates who are soon due to 
be paroled to leave the institution during the day 
for private employment and to return to prison at 
night, as well as to depart for several days at a 
time on leaves or furloughs. Thus, the contrast 
between conditions of imprisonment and those of 
parole are diminishing, so that the adjustments 
required in making a transition from one to the 
other are encountered less abruptly. 

A contrasting but convergent development con- 
sists in the establishment of counseling centers or 


522 PENOLOGY: Probation and Parole 


residential hostels to which offenders placed on 
probation or parole must report when not at work 
or at school. The residential hostel, or “halfway 
house,” has most often been operated by private 
charitable groups, usually church-affiliated, but 
sometimes they are also established by govern- 
ment agencies. Halfway houses provide temporary 
shelter and counseling services for parolees or pro- 
bationers for whom a satisfactory family home is 
not available. Those operated by private groups 
often also aid discharged prisoners. 

A series of highly publicized experiments in 
special counseling programs at such shelters during 
the 1950s and 1960s set patterns that subsequently 
were widely copied. Highfields, an estate given to 
the state of New Jersey by the flier Charles Lind- 
bergh, was made a residence to which delinquents 
could be sent for up to four months, as a condi- 
tion of probation. There they participated in “guided 
group interaction” for several hours daily, a type of 
discussion oriented toward their helping each other 
reach an understanding of why they behaved in a 
delinquent manner and of how they might change 
this behavior. H. Ashley Weeks, Lloyd W. McCorkle, 
and numerous other social scientists have published 
analyses of this innovation (see bibliography for 
the article on the field of penology), Other centers 
followed. Some—notably those at Provo, Utah, and 
Essexfields, New Jersey—served only as counseling 
establishments, rather than as residences, so that 
probationers assigned to them returned daily to 
their homes. Under this arrangement the daily 
conduct of the participants in community life could 
become the subject of the group's discussion. The 
discussion at these centers often includes the topic 
of whether a particular participant's behavior war- 
rants either revocation of his probation or altera- 
tion of its conditions, and the group’s advice fre- 
quently is followed by authorities. 

Paralleling these developments have been nu- 
merous experiments with intensive parole serv- 
ices in California. Random assignment of parolees 
to smaller than usual parole caseloads—even as 
few as 30 cases per officer—was found associated 
with little or no reduction in parole-violation rates. 
However, a more dramatic series of experiments 
involved establishment of nonresidential “commu- 
nity treatment centers” for daily counseling and 
tutoring of randomly selected delinquents paroled 
within a month after their commitment to institu- 
tions. These parolees were assigned to caseloads of 
only eight to twelve per officer, with different styles 
of supervision specified according to a classification 
of the offenders on a number of psychological and 
sociological variables. Thus, those called “encul- 


turated” and “manipulative” received “firm but fair” 
supervision, while those called “immature” received 
highly “supportive” counseling. For most types of 
offenders, these programs have been dramatically 
more successful, as well as less costly, than those 
of institutionalization, and they have spread rapidly, 
An additional feature of many community counsel- 
ing centers, especially those for adults, has been the 
employment of clearly rehabilitated ex-offenders as 
counselors. These persons have shown unusual 
communication skills and influence as role models 
for offenders whose status they once shared. 

The major thrusts in penal development have 
been the expansion of all services in the community 
and the recommendation of distinctive correctional 
Strategies for particular types of offender. In pe- 
nology, as in medicine at a much earlier period, the 
use of controlled experiments and the objective 
analysis of experience is leading to more specific 
prescriptions for remedial action. The function of 
the state in coping with delinquents and criminals 
has become less a matter of physical containment 
and more a matter of continuous corrective effort 
within the settings where these offenders must 
achieve a legitimate way of life if they are no 
longer to be a problem for the state. 

DANIEL GLASER 


[See also CrimiNoLocy; DELINQUENCY, article on THE 
STUDY OF DELINQUENCY.] 
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PENSION FUNDS 


Pension funds are collections of financial assets 
—government bonds, corporate bonds, corporate 
Stock, etc.—held against the obligations for re- 
tirement payments incurred by pension plans. If 
the entity running a pension plan could look for- 
ward to perpetual life and continual growth or if 
it were armed with power to levy taxes or print 
money, it could, if it wished, conduct the pension 
Plan purely as a transfer operation. Payments to 
beneficiaries would be wholly financed by contri- 
butions and earnings, and no fund would be 
necessary. Business firms, labor unions, and other 
Private sponsors of pension plans do not, of 
course, meet these conditions. Hence, as a general 
rule, their pension schemes include a supporting 
fund, available to meet the plan’s promises, al- 
though in the normal course of events the fund 
would not be drawn down. Thus, the assets in the 
fund typically enter into the fiscal operations of 
Pension plans, through the flow of interest and 
dividend income they generate. 

As a rule, funds of the kind just described are 
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also accumulated under plans run by governments 
for their own employees. However, there is great 
variation between countries in the funding prac- 
tices of governmental social security programs. 
The latter, not of direct concern for this article, 
range from transfer operations that have no fund 
to programs in which social security and private 
pensions have been integrated and which accumu- 
late a fund, like private plans. Characteristically, 
however, the funds of social security programs fall 
far short of the expected value of the program's 
obligations. 

A rapid growth of pension plans and their funds 
has occurred in many countries in the last quarter 
of a century. Among the factors responsible for 
this increased demand for pensions have been the 
growth in the number of the aged, the increase 
in théir relative importance, and the sharp rise in 
the number of years a person over 65 will spend 
in retirement. The last is due in part to an increase 
in life expectancy but more importantly to a 
shorter working life. Also encouraging the growth 
of pension plans has been the favorable tax treat- 
ment generally accorded them. This in effect per- 
mits averaging of income over the taxpayer's life, 
with a consequent diminution of tax liability—a 
privilege made more valuable by the sharp increase 
in tax rates that occurred in most countries in the 
late 1930s and has persisted since. Specifically, 
in the United States the number of plans increased 
rapidly when pensions came to be used to provide 
the equivalent of an increase in compensation in 
the face of wage stabilization during World War 11 
and after the decision by the National Labor Rela- 
tions Board in 1948 that pensions are an appro- 
priate item for collective bargaining. 

The dynamics of the “typical” pension fund are 
something like this: In the early stages there is 
rapid accumulation, as contributions (both on 
account of past and present service) attuned to the 
level of future benefit payments are made, while 
current benefit payments are low. As the fund 
accumulates, earnings grow and benefits close the 
gap on contributions, At some point benefits ex- 
ceed contributions, but for a while earnings take 
up the slack, so pension fund assets still rise but 
by smaller amounts. Finally, with a population 
whose age, employment, benefits, and wage char- 
acteristics are fixed, the time comes when benefits 
just balance contributions plus earnings. The plan 
has matured; the fund no longer grows. By defi- 
nition, of course, the plan of a growing firm will 
never mature. More workers will enter employ- 
ment than will retire; wage scales will rise; and 
benefit formulas will be liberalized. Pension re- 
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Table 1 — Growth of private industrial pension and deferred profit-sharing 
plans and funds in the United States, 1940-1964 


Coverage (millions of persons) 
Contributions (billions of dollars) 
Beneficiaries (millions of persons) 
Benefit payments (billions of dollars) 
Earnings* (billions of dollars) 

Assets (billions of dollars) 

Increase in assets” (billions of dollars) 


1940 1945 1950 1955 1960 1964 
41 6.4 9.8 15.4 21.2 24.6 
0.3 1.0 2.1 3.8 5.5 6.9 
0.2 0.3 0.5 1.0 1.8 2.5 
0.1 0.2 0.4 0.8 1.8 2.8 
0.2 0.3 0.5 0.7 1.7 3.2 
2.4 54 121 27.5 52.0 77.2 
0.4 1.1 2.2 37 5.4 7.3 


a. Calculated as a residual from rounded numbers, hence subject to relatively substantial error. 


b. In the given year. 1940-1950 are rough estimates. 


serves of the plan of a growing firm will increase 
each year, at a “rapid” rate when the plan is young 
and rather “slowly” as it ages. 

Should pension plans be growing in number or, 
more to the point, should membership in all pen- 
sion plans be growing, there would be an added 
reason for the private industrial pension plan 
structure to show rapid growth over a number of 
years. And since the plans composing this struc- 
ture have come in serially over time, the period of 
rapid fund growth should be rather extended. For 
the United States, whose data are used in this 
article for specific illustration, the stage of rapid 
accumulation began about 1940 and will probably 
persist until at least 1980. The data of Table 1 
illustrate past growth. Prognosis rests on estimates 
(Holland 1966) that put the value of private in- 
dustrial pension and deferred profit-sharing funds 
at $125,000 million by the end of 1971 and 
$200,000 million by the end of 1981. 

Supplementing the data of Table 1 and round- 
ing out the picture on private pension funds in 
the United States are (1) the funds of plans for 
employees of state and local governments, whose 
reserves rose from $1,500 million to $32,000 
million between 1939 and 1965 and whose annual 
accumulation increased from $200 million to 
$3,200 million over the same period, and (2) the 
funds of plans for federal civilian employees, 
which increased from $600 million in 1940 to 
about $15,600 million in 1965 and are growing 
each year by an amount in excess of $1,000 
million. 

Effect on saving. By almost any measure the 
annual asset accumulation of private pension 
plans is substantial. In the United States in 1964 
it came to about $11,000 million a year, $7,000 
million accounted for by private industrial and 
nonprofit-organization plans, the rest by plans for 
government employees. More striking, this annual 
addition to holdings appears to account for over 
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one-third of total personal saving (national in- 
come accounts definition) or net financial savings 
by individuals (Securities and Exchange Commis- 
sion definition). But this by itself is just a numeri- 
cal statement. Whether in fact pension fund 
“saving” has real economic significance depends 
on whether these annual asset accumulations re- 
sult substantially in net new savings or are offset, 
in large part, by compensating adjustments in the 
savings that individuals would have made under 
other arrangements or in other forms. 

Arguments have been put forth on both sides 
of this question. On the side of displacement are 
both theoretical arguments and the evidence of 
past experience. If the price of one particular com- 
modity in a set of close substitutes is reduced (the 
favorable tax treatment accorded pension saving 
is like a cut in price), its sales will increase, a 
will the sales of the “industry.” But in general 4 
large part of this increase will be in sales of 
the now cheaper commodity (pensions) at the 
expense of the rest of the commodities in the 
“industry” (other forms of saving). Pointing to 
the same conclusion is the Modigliani-Brumbelt 
“life-cycle” hypothesis (1954). Thus, most of 3 
rapid growth of pension funds might have ee 
financed by reductions in other forms of paai 
Consistent with this possibility is Goldsmith's ple 
dence (1955-1956) that savings (including ae 
sumer durables) ratios have remained stable ov! 
more than half a century and the Ando W 
Modigliani (1963) finding of a stable ratio se i 
sumption to labor income and wealth from a 
on. Rough constancy in the proportion of A 
income saved and in the distribution of eee 
ing among the personal, corporate, and aoa 
ment sectors in the face of strong Anse ae 
changes suggests that this pattern will p ais 
into the future—with pension fund savii O 
placing other forms of saving. [See ConsU: 
FUNCTION. ] 


To support the view that pension fund accumu- 
lation augments the total of saving, one can note, 
first, that many of those participating in pension 
plans did not save in other forms; for them there 
is nothing to offset pension saving against. Fur- 
ther, pension rights or expectations are at best 
imperfect substitutes for other forms of saving; 
and since benefits are highly contingent, it may 
be that potential beneficiaries heavily discount the 
benefits promised, and accordingly do not reduce 
other forms of savings on becoming members of 
a pension plan. Moreover, it is possible that the 
new alternatives made available to people—in this 
case a modest pension (from both a private plan 
and social security )—could induce a change in 
consumers’ tastes that would encourage potential 
pensioners to seek to add to income in old age by 
keeping up or even expanding their other savings. 

It is clear that none of these arguments on 
either side seems very convincing a priori. More 
weighty is some evidence developed by Cagan 
(1965) which suggests very strongly that the 
annual accumulations of pension funds should be 
considered, in large part, to represent a net in- 
crease in personal saving. His findings are based 
ona large sample (11,000) of replies to a ques- 
tionnaire which covered pension status, savings 
in various forms, income, etc. After standardizing 
for income and age, Cagan discovered that those 
covered by private pensions saved at least as much 
in all other forms as those not participating in 
Such plans. It would appear that employer and 
employee contributions to pension plans are not, 
in general, substitutes for other kinds of personal 
Saving, And because the government's recoupment 
of the tax revenue deferred by the exemption of 
employer contributions and fund earnings is not 
likely to cut corporate or personal savings by 
tee Cagan concluded that on balance the in- 
mee in aggregate saving is almost as large as 
tts a. growth of pension funds. The fact that 
diareo ple was drawn from the subscription list 
Ree aS magazine might seem to leave the 
pe eee annii of his findings open to ques- 
E Hie rom another survey, based on represent- 
(1965 mples of all American consumers, Katona 

a ) reached substantially the same conclusion. 
Rete in the United States, private plans 
fie oun Teserves and enhance the flow of sav- 

Ke ap = the social security program, which 
tena 7 a of redistributing income from high- 
saving B ow-income recipients, tends to deter 
fie p D in all likelihood the composite struc- 
tot da we for income support in retirement is 

in its effect on total savings. The ac- 
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celeration of saving by private pension plans 
appears to be considerably greater than the dis- 
couragement of saving because of the social secu- 
rity program (Cagan 1965; Carroll 1960, p. 149). 

Effect on investment. The annual excess of con- 
tributions and earnings over benefits paid out by 
pension plans is invested. This process could be 
viewed in terms of its effect on the demand either 
for capital goods or for the financial assets which 
are their counterpart. The discussion that follows 
concentrates on the latter. 

Pension funds are only one class of participants 
among many in the capital markets, and any 
analysis of their investment activity would ideally 
be undertaken in that broad context. However, 
without attempting formal completeness, it is 
possible to discuss some of the more pressing 
problems posed by the portfolio policies of pension 
funds, including the rapid rate of stock acquisition 
by noninsured funds. (By way of background, this 
set of funds held $47,000 million of assets as of 
the end of 1963, while reserves of insured plans 
were $23,000 million and those of federal and 
state-local government employee plans were about 
$41,000 million. But more important, of all pen- 
sion funds only the noninsured funds purchase 
sizable quantities of stock. Indeed, the net pur- 
chases of stock by noninsured funds from 1960 to 
1963 equaled over 70 per cent of net additions to 
total corporate stock outstanding; so, directly or 
indirectly, they provided most of the finance for 
new stock issues. ) 

For pension funds the particular merit of stock 
lies in its tendency to move with the general price 
level, Thus, if the general price level moves up, 
the capital gains experienced on stock could serve 
either to lower the employer's cost of operating a 
stipulated pension plan or permit him to provide 
more generous benefits at no additional cost. But 
the prospect that pension funds will substantially 
increase their holdings of stocks year after year 
raises questions of corporate control and effects on 
stock prices—particularly since pension funds, to- 
gether with other institutional investors, have 
tended to concentrate their purchases on a rela- 
tively small group of stocks of the highest quality. 

Under the trenchant heading “A Drift Toward 
a ‘Paraproprietal’ Society,” Harbrecht (1959) has 
examined the potential problem of control exer- 
cised by pension funds over corporations whose 
stock they own when the relationship between 
beneficial interests in the fund and control of the 
fund itself remains in some respects ill defined. 
Tilove (1959), recognizing a basis for concern, 
argued that ownership of stock by financial insti- 
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tutions is not a new problem; that in this connec- 
tion pension funds do not constitute a “clear and 
present danger”; and, finally, that mechanisms to 
obviate the longer-run danger now exist and can 
be further developed. With industrial pension fund 
assets projected at about $200,000 million by 
1981, a reasonable estimate of the assets of non- 
insured plans alone would be $140,000 million. 
Stocks are now 41 per cent of the assets of non- 
insured funds (on a book-value basis), and it 
hardly seems likely that the funds would choose 
to have more than 60 per cent of their assets in- 
vested in stock by 1981. Now, make the extreme 
assumption that the value of all stock outstanding 


in 1981 will be no greater than it is today— , 


$650,000 million, (It is most likely, of course, that 
because of retained earnings the value of stocks 
outstanding in 1981 will be considerably higher. ) 
Even under these assumptions, which tend to 
overstate their relative share, by 1981 noninsured 
pension funds would still own less than 15 per 
cent of all stock outstanding. 

But directing attention to the aggregate amount 
of stock misses an important element of the prob- 
lem, for it fails to cope with concentration in pen- 
sion funds’ stock purchases, i.e., their penchant 
for choosing their holdings from a small number 
of “blue chips.” Here relevant data are hard to 
come by, However, it is known that, as of 1954, 
noninsured pension funds under trusteeship of 
New York banks (about half, by value, of all non- 
insured pension funds) held more than 3 per cent 
of stock outstanding in only 17 companies, and in 
only three of these instances did their ownership 
exceed 5 per cent, Also, of the ten stocks in which 
their total absolute holdings were greatest, they 
held, on the average, only 1 per cent of shares out- 
standing, Moreover, pension fund trustees could 
be expected to search for alternatives to these 
“favorites” should their yield relative to other se- 
curities be bid down (Lintner 1959, p. 199). And 
in addition, the companies whose stock is heavily 
in demand are likely to increase their supply, thus 
tending to reduce the proportionate importance of 
the pension funds’ holdings. This does not mean 
that danger of control of particular companies 
does not exist. But it does mean that extrapolat- 
ing the growth of one entity—noninsured funds— 
and leaving everything else unchanged is likely 
to overstate the danger, for other things could very 
well change in a way that would moderate this 
trend. 

The argument that the increased demand for 
stock represented by pension funds, plus the 
general tendency of the funds to buy and hold, 


should constitute a force pushing stock prices up 
is subject to the qualification that a trustee would 
search vigorously for alternatives rather than pay 
an “unreasonably” high price for an asset. On bal- 
ance, however, the educated guess is that stock 
prices are somewhat higher because of pension 
fund demand than they otherwise would have 
been. For one thing, to a long-term investor the 
variability of stock prices is not as much of a de- 
terrent and the yield on stock is more of an attrac- 
tion than to the general investor; therefore, as 
Murray (quoted in Seligman & Wise 1964, p, 200) 
holds, “stock is worth more to pension funds than 
it is to most investors.” For another, the aggre- 
gate demand for stock has probably increased. If 
Cagan’s findings are correct, this gain in stock 
market funds has been at the expense of consump- 
tion. But even were the effect of pension funds’ 
accumulation to be a decline in other savings, 
Andrews (1964, p. 464) noted that, since the two 
most likely suppliers of the candidates for dis- 
placement—life insurance companies and sav- 
ings banks—characteristically invest in corporate 
bonds and stocks, the mortgage market would 
lose funds and the stock market would gain them. 
Finally, this increased demand concentrated on 
“blue chips” could be expected to maintain or per- 
haps strengthen the “relatively more favorable 
access which large well-established firms have had 
to the equity capital markets” (Lintner 1959, 
pp. 196-197). 


The economic effects of pension fund accumula- 
tion are significant and can be expected to grow 
stronger as the scope of private pension plan fiscal 
operations widens. But it is hard to say anything 
more precise than this. While pension schemes 
will continue to be liberalized—bigger benefits, 
lower retirement age, earlier vesting, ett S 
consequent changes in contributions, benefits, ani 
net fund accumulation, the pace of liberalizen i 
could vary widely. In the United States (and 
number of other countries) private pension pion 
are part of a larger structure of arrangements A 
income support in retirement, the other main a af 
ponent being social security. While designe a 
supplement or complement one another, the i 
strands of the composite structure are also Ma 
stitutes, to some degree. If one is made mi x 
“adequate,” the other will not be relied ee 
heavily as it would have been failing this. To ae 
extent, then, the role of private pension pea 
funds will be determined by decisions made al 


social security. D 
DANIEL M. HoLLAN! 


[See also AGING and WacEs, article on FRINGE BENE- 
Fits.] 
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I 
INTRODUCTION 


The study of perception is the attempt to under- 
stand those aspects of observations of the world 
of things and people that depend on the nature of 
the observer, Such understanding is obviously im- 
portant to the physician, to the physiologist, and, 
it was once thought, to the philosopher concerned 
with the question of how we can be sure about the 
truth of our ideas. Despite these different interests, 
perceptual study remains predominantly psycho- 
logical. 

Research in perception requires the most so- 
phisticated controls of the motivational, judgmen- 
tal, and learning processes, i.e., the use of the 
techniques of experimental psychology. More im- 
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portant, the touchstone of perceptual research is 
perceptual experience: the fact that the prism 
bends light energy of different wave lengths by 
different amounts is, of course, a physical dis- 
covery; the fact that white-appearing light thus 
spreads out into the myriad colors of the spectrum 
is a matter of perception, Similarly, the fact that 
a sensory end organ, or receptor, has been found 
that is electrically responsive to a particular wave 
length remains of unknown import until its effects 
on experience (or on suitably selected discrimi- 
natory behaviors, which is an equivalent statement 
for most purposes ) are demonstrated [see HEARING; 
LEARNING; MOTIVATION; SENSES; SKIN SENSES AND 
KINESTHESIS; TASTE AND SMELL; VISION]. 

The structuralist theory. The oldest and most 
complete theory of perception, now known as 
structuralism, held that simple elementary expe- 
riences, or sensations, recur in various combina- 
tions to compose the world we perceive; sensations 
presumably result from the excitation of individual 
sensory cells, or receptor neurons, each contrib- 
uting a characteristic signal, called a specific nerve 
energy, to the central nervous system. Whether 
acting alone or with a host of others, any receptor 
would produce the same sensation, although the 
memories of past experiences evoked by each dif- 
ferent context in which the sensation is embedded 
would usually conceal from the untrained observer 
(but not from the observer trained to ignore those 
memories, to practice analytic introspection) the 
fact that the same elementary sensation has oc- 
curred. 

Not only were the physical qualities of ob- 
jects (their sizes, distances, shapes, etc.) to be 
explained in terms of these basic simple sensations 
but also our perceptions of people, of their expres- 
sions and intentions, of their social relationships. 

This was much more than a theory of percep- 
tion, of course; it was central to a comprehensive 
program for understanding man’s mind, since all 
ideas about the world must (it was held) be 
composed of just these sensations, plus memories 
of previous sensations, in various combinations. 
Structuralism has had to be abandoned as far as 
any such unitary program is concerned, and a 
variety of competitive schools have attempted to 
replace it (behaviorism, gestalt theory, and func- 
tionalism ). Nevertheless, structuralism remains im- 
portant. Most knowledge about perception has been 
gathered either within or against the structuralist 
program, which has provided the framework of 
problems around which present research continues 
and within which we have to evaluate more recent 
alternative approaches [see TITCHENER; Wonpt]. 


In this article we shall first consider the elemen- 
tary analytic sensations and then those properties 
of the perceived world that structuralism hoped to 
explain as compounds of sensations and to which 
the opposing schools hope to address themselves 
more directly. This survey will be restricted to 
visual perception, which has been most fully 
studied and which has produced most of the criti- 
cal problem areas that characterize the disciplined 
study of perception. 

Structural analysis. As we examine our ob- 
servations of the world, we note separate sensory 
channels, or modalities; closing our eyes, for ex- 
ample, leaves our hearing relatively unaffected, 
and closing our ears leaves our sight relatively 
unaffected. Although not all of the senses are so 
immediately separable, this division gives us a first 
step in an analytic system. 

Sound energy (pressure waves) at the ear stim- 
ulates specialized receptor cells in that organ, and 
their specific nerve energies (their effects on the 
brain) result in the sensations of hearing. Light 
energy stimulates the receptor cells in the retina 
of the eye, and sensations of sight result. This was 
the model of perceptual analysis self-consciously 
followed by the sensory physiologists and sensory 
psychologists at the turn of the century, and most 
of our perceptual problems derive from the failure 
to obtain elements of analysis within modalities 
even approximately as well defined as the differ- 
ences between modalities. 

We can do much more than observe whether 
it is dark or light, silent or noisy. We also see, 
hear, feel, etc., an infinitely variegated world of 
objects and happenings. What analytic methods 
can we apply to this problem? How does a seen 
apple differ from an orange, a hostile face from 
a friendly one? 

Within each modality there are evident differ- 
ences in quality and quantity that also seem H ; 
independent of each other in the same Way t i 
the modalities are independent of each other. Ligh 
is not only off or on; it is red, blue, helion g 
maize, and these qualities seem to recur ae 
again, in different combinations, in the worl ni 
see. It is extremely tempting, therefore, to eer 
that—as between modalities, so within mod see 
—there are elementary sensations within ge 
sense, each of which can be aroused by the Ta 
of a particular stimulating energy on recep E 
specialized to receive it, and that these eleme ose 
sensations combine in various ways to peas 
the world we perceive. How are we to dis 
them and their physiological bases? eo 

In theory, we might vary any conceivable 


of stimulation to discover these elementary com- 
ponents. In historical practice, most inquiry has 
centered on a few physical dimensions, partially 
because they can be precisely manipulated with 
available apparatus but mainly because they seem 
to be sufficient to account for (or add up to) the 
world we perceive. Within vision, for example, any 
static scene at all, no matter how complex and 
exotic, can be duplicated by an array of spots (or 
points) of color, varying only in their wave-length 
composition and intensity, to which variables the 
simple experiences of hue and brightness seemed 
to correspond and to which simple receptor cells 
seem to be sensitive. 

Color sensation and visual perception. Consider 
an object before our eyes. This is a distal stimulus; 
it affects our senses only by the energy, called 
the proximal stimulation, which is actually trans- 
mitted to the sense organ. This energy presents 
the eye with an optic array, and the eye’s optical 
system focuses some selected region of that array 
to form an image on the retina which is a mosaic 
of light-sensitive cells, connected by a network of 
lateral cells, whose output goes eventually to the 
optic cortex of the brain. (The selection depends 
on the eye’s direction, which in turn depends on 
the state of tension of a vast number of muscles. ) 
Usually, the retinal image is considered to be the 
proximal stimulation, but, almost always, the optic 
array is analyzed in its place. The optic array can 
vary in only certain measurable ways that are 
known to affect vision. At each point (if we take 
points that are small enough—say, less than about 
one minute of visual arc), the array can effectively 
vary only in its energy at each wave length of the 
Spectrum. That is, given any distal object at all, 
we can duplicate its effect on the nervous system 
of the observer by producing the same intensity 
Pattern of wave lengths in the optic array. Such 
fidelity is unnecessary, however. For the normal 
eye, we also can duplicate any (or all) colors in 
ihe spectrum by any set of only three wave lengths 
we conditions. Here, then, is our 

8 point: we need only find elementary ex- 
be rata corresponding to each of three colors at 
Point in a two-dimensional array. 
ied of color vision. The traditional Young- 
te 0 theory of color vision is based on the 
arte a a of three kinds of retinal receptors, OF 
eee oe kind being maximally sensitive to one 
which ar e spectrum (in addition, there are rods, 
one ie rap brightness-sensitive). When only 
tals of cone is stimulated, the appropriate 
T Sensation (red, green, or blue) is experi- 
enced. These h : 
ues were chosen because they are 
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the purest-appearing colors that correspond to a 
set of three wave lengths, 650, 530, and 460 milli- 
microns (my), sufficient to mix all of the other 
colors of the spectrum. When more than one cone 
is stimulated, we perceive a mixture of the two; 
for example, 480 my stimulates the blue cone and 
the green cone about equally, so that we see blue- 
green; 460 my stimulates the blue cone and 
530 mp, the green cone by about the same 
amount, so that with the combination 460 my and 
530 mp we see the same blue-green [see HELM- 
HOLTZ; VISION, article on COLOR VISION AND COLOR 
BLINDNESS]. 

One way in which this traditional theory might 
be wrong is particularly instructive. It may have 
chosen the wrong elements. For example, at 580 mp, 
the hypothesized red and green cones are stimu- 
lated, so that we should see a reddish green (!); 
we do not, of course: at 580 we see yellow. Thus, 
even though yellow seems to be just as pure and 
unitary as red or blue, it is supposedly not a sensa- 
tion at all, but a mixture of two other sensations, 
so welded together by the many experiences we 
have had with that pair that we can no longer 
distinguish the underlying red and green sensa- 
tions. This is but one example of what is encoun- 
tered repeatedly in the structuralist system, namely, 
that the elements combine to produce something 
unexpected in terms of the parts and the system 
is saved only by declaring that some of the sensa- 
tions (which are supposedly elementary observa- 
tions) are unobservable. 

In this case, however, an alternative set of ele- 
ments may extricate us from the apparent para- 
dox, Although three colors are indeed sufficient 
to explain the psychophysical facts of color mixing 
(ie., what mix of wave lengths will match the 
appearance of any other wave length[s]), a much 
more complete perceptual account is given by the 
opponent-processes pairs proposed by Ewald Her- 
ing and recently developed in impressive detail by 
Hurvich and Jameson (1957), The elements here 
are three paired opponent processes, one yielding 
red or green, one yielding blue or yellow, one 
yielding black or white. With this analytic system, 
one can predict the appearance of any mixture 
simply from the appearances of the components, 
whereas discrepancies occur when we attempt to 
add up the elements in the traditional system, [See 
the biography of HERING.] 

Contrast and constancy. With the first attempt 
to step beyond single small patches of color, far 
greater discrepancies appear; for example, if you 
keep two identical patches of light constant but 
change the illumination of the region surrounding 
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one of them, the two patches no longer match. 
Exactly the same energy falls on each patch, and 
presumably the specific nerve energies activated 
in the retina remain unchanged; but the appear- 
ances now differ. This is the familiar illusion of 
induced color (contrast); several recent attempts 
to find quantitative laws to predict the effects of 
one colored region on another have met with rea- 
sonable success; thus, it may eventually be possible 
to predict the apparent color of each region of any 
optic array, but we certainly cannot do so yet. 

There is another, not unrelated class of dis- 
crepancy called color constancy. Consider a piece 
of paper in the shade and a piece of coal in sun- 
light; both may present exactly the same amount 
of light energy to the eye, yet they do not appear 
equal. Now this seems to be an achievement, not 
an illusion (we see the world as it is, even though 
the proximal stimulus variable on which our sight 
is presumably based would mislead us), but it is 
nonetheless a discrepancy to be explained. Some 
of the preferred explanations are clearly false 
(e.g., the explanation that we see the colors as 
we know them to be must be false, since good 
color constancy obtains with unfamiliar objects); 
other explanations, such as those which subsume 
color constancy and contrast under the same for- 
mula (Helson 1964, pp. 260-297; Wallach 1948), 
are only partially complete. 

This much is clear, however: the colors that we 
perceive by stimulating one small homogeneous 
region at a time are not observed unchanged when 
those regions are viewed in other contexts. 

Spatial direction and motion. Assume that we 
had accounted for color. Next we must ask, How 
does the eye analyze space? Here is one simple 
model: If one rod or cone is stimulated, we would 
see one spot; if two or more are stimulated, a larger 
spot; if the two are separated by an unstimulated 
receptor, two separate spots. The smallest such 
separation that we can see—our visual acuity— 
would set the limits of the spatial differences we 
can see, that is, no two perceived objects could 
differ in size, shape, or position by less than this 
difference. Although this simple model of acuity 
still lies behind most casual thinking about per- 
ception, the actual discriminations that we can 
make are, optimally, even finer than can be ac- 
counted for in terms of the retinal image and the 
known sizes of the receptors. 

We must also, of course, be able to tell where 
the points are in relation to one another—left, 
right, up, down. Does each stimulated retinal point 
produce not only an elementary sensation or im- 
pression of color but a sensation of place as well 


—a “local sign”? The structuralist answer was that 
the qualities by which each receptor in the mosaic 
contributes to our perceptions of space and form 
are not simple sensations, but are themselves com- 
posed of the memories of those eye movements 
that were involved in the movement of a particular 
distal stimulus from one retinal point to another, 
Thus, a straight line would differ in appearance 
from a curved one only because the straight line 
falls on a row of retinal receptors that have in the 
past been stimulated by a set of eye movements 
that carry the same retinal point along the retinal 
image of some object's edge that is straight to the 
sense of touch. Since a single spot of light in an 
otherwise dark room has a very variable apparent 
locus, this theory cannot be tested by studying the 
local sign of each retinal point. In 1896, Wundt 
showed that after prolonged wearing of prismatic 
spectacles that produce curvature distortions in 
the images of straight lines, the distortions dis- 
appear and straight lines appear straight—until 
the spectacles are removed, after which straight 
lines appear curved in the opposite direction. This 
showed, Wundt argued, that the eye-movement 
components of the local signs had been relearned. 
After much confusion caused by the discovery that 
similar aftereffects could be produced purely intra- 
visually (i.e., by the prolonged inspection of curved 
lines, even without eye movements), curvature 
aftereffects have recently been demonstrated that 
are in fact contingent upon movements made 
while wearing prismatic spectacles (indeed, that 
are contingent only on active movement by the 
observer, which is somewhat different from what 
Wundt had in mind). Many different kinds of in- 
verting, distorting, and reversing spectacles have 
since been worn, for different periods of time, but 
no clear summary statement can yet be abstracted, 
In some kinds of distortion rapid perceptual change 
occurs, while in others it is not clear whether it 
is the visual world that changes or the motor 
habits (or the “body image”) of reaching and w 
ing that accommodate themselves to a chang 
relationship between vision and action. It is cer 
tain, however, that visual direction depends as 
much on muscle systems as on retinal stimulation Š 
We scan the world continually, and the eye move 
ments that have been made must be taken into 3 
count in deciding the directions and motions K 
perceive. As the eye moves in its socket (or a 
head and body move in the world), a ee 5 
distal object must produce a moving retinal be 
yet we usually see the object's position A 
stant, even though very different local signs t 
be involved. This direction constancy means 


a visual local sign, if it is to be retained as a 
component of observation at all, would have to re- 
main unconscious (an “unobserved observation” ) 
while it is somehow coupled with information 
about movements of the eye (and position of the 
body). This kind of reliance on unconscious sensa- 
tions, necessary if this analytic system is to be 
saved, will become unbearably cumbersome when 
we turn to more ambitious—but still rather primi- 
tive—syntheses of our elements of analysis into 
the objects of the perceived world. 

Perception of shape. How does a circle differ 
from a square? Are any additional specific nerve 
energies needed to account for either one? 

In structuralist syntheses the perception of some 
particular square consists of its immediate visual 
sensations of color, plus all of the tactual mem- 
ories of direction—of straightness, rectangularity, 
etc.—that were initially discerned by the exploring 
movements of the hand and eye, plus memories of 
the whole host of different sensations of color and 
direction that would be produced as the square is 
displaced slightly laterally, or rotated from the 
vertical by the tilting of the head, or as its retinal 
image increases or decreases in size as we move 
toward or away from the distal object, (All of these 
must be part of the same perceptual structure 
since we do not even recognize their changes in 
stimulation when they occur.) A “shape” is, thus, 
a submerging of a vast number of now unnotice- 
able sensations, by the effects of a great deal of 
Past experience, into a single named perception. 

Given this principle, there is no need to elabo- 
Tate it by considering more complex shapes, such 
as letters, people, or even the perception of mo- 
tion. All of these would merely be further examples 
z which the elementary observations of color and 

rection or position are smoothed together and 
oan out into an apparently unitary structure, so 
pau cemented by the memories of previous 
2 gps that even the most practiced observer 
ad seg the ; sensations that presumably 
KAR ose memories. (If we try to expand this 
in Siar include any of the geometrical illusions, 
as oes to the illusions of color—the 
Han at is Seen is discrepant from the shapes 
tion oi hee in the retinal image, the explana- 
mains ee becomes much more involved but re- 
ae atisfactory; the illusions are still without 

quate explanation. ) 
me eee approach. Gestalt criticism was 
appear mo in questioning whether shapes really 

(Seah 4 way structuralists claimed. 
ails aa the figure-ground phenomenon, in the 

‘ase-and-faces Rubin figure in Figure 1. 
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Figure 1 — Vase or faces 


At any region of the contour, one can see the vase 
or one can see a face, but not both simultaneously. 
One can easily construct other puzzle-pictures in 
which even a long search may not reveal the 
“hidden” shape. Since the sensations must be the 
same on both sides of the contour—in figure and 
in ground—the structuralist explanation would 
have to be that one sees whatever shape is most 
familiar, Against this argument, gestalt psychol- 
ogists demonstrated that thoroughly unfamiliar 
shapes can absolutely conceal such long familiar 
shapes as letters and numbers. For example, con- 
sider the perfectly good 4’s concealed in the scrib- 
ble in Figure 2. Not familiarity but the over-all 


Figure 2 — Concealed 4's 


configuration determines which way a contour will 
face, what function a part will play. The whole is 
not the sum of its parts, and we cannot say what 
shape we will perceive by adding up the experi- 
ences we have with each individual point. 

“The whole is not the sum of its parts.” A catchy 
paradox; if taken seriously, it would mean an end 
of analysis and of science. The gestalt psychol- 
ogists, of course, offered to search for other parts 
—other units of analysis. Implicitly, the “laws of 
organization” are such units: rules by which we 
can predict what will be figure and what will be 
ground in a given stimulus pattern. For such a 
major challenge, these “laws” have, until recently, 
received little direct experimental examination; 
hence it is still too early to evaluate the more 
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positive gestalt proposals. The effects of gestalt 
criticisms, however, were considerable. The cen- 
tral support of the structuralist enterprise, which, 
as we have seen, was badly weakened anyway, was 
destroyed. We simply do not see the “underlying” 
sensations as we should be able to do. However, 
in rebuttal, we should note that the properties of 
such figures (hard, shaped, with an edge that is 
in front of, and not shared by, the ground) are 
suspiciously like those of real objects, from which 
they might have been learned. And the laws of 
organization may themselves have been learned. 
When the familiar shape, the 4, is concealed in 
the presumably unfamiliar scribble, it may simply 
be because the observer's expectations about con- 
tours (that a contour that proceeds without inter- 
ruption is generated by a single object or surface, 
i.e. the “law of good continuation”) are even 
more deeply ingrained than are his expectations 
about the figure 4 [see GESTALT THEORY]. 

We cannot now evaluate this possibility directly, 
because testing it would depend on a better under- 
standing of the process of perceptual learning than 
we have and on making the kind of ecological 
surveys, as Egon Brunswik (1947) has proposed, 
that would tell us what the environment is likely 
to have taught us perceptually. However, the very 
possibility that the laws of organization are the 
result of learning opens new vistas. Laws do not 
have to be innate in order to be widespread; they 
can be the result of regularities in the environ- 
ment in which we have developed and be nonethe- 
less usable for that reason [see BRUNSWIK]. 

This possibility offers a different line along 
which we might take up again the structuralist 
program, if not its set of elements. 

Cell assemblies and phase sequences. D. O. 
Hebb proposed (1949) that there are in some 
fashion “specific nerve energies” that correspond 
to the gestalt phenomena; he suggested that cor- 
ners, contours, and edges have their neural coun- 
terparts in organized networks of cellular connec- 
tions in the brain, called cell assemblies, which 
have come to act as single functional units as a 
result of perceptual learning. From these cell as- 
semblies (which would consist of counterparts of 
frequently encountered fragments of shapes, etc.) 
still larger temporally extended units, called phase 
sequences, might be built up—for example, the 
succession of cell assemblies that would be pro- 
duced by repeated sweeps of the eye over the same 
pattern (say, in picking out the corners of a 
square). This theory makes explicit provision for 
obtaining different responses to the same retinal 
stimulation. Which cell assemblies would fire and 


what we should see are functions not only of which 
receptors are stimulated but also of what other 
cell assemblies are simultaneously firing (i.e., the 
effect of the whole on the part) and of what other 
cell assemblies have just been firing. This last 
point provides for the effects stemming from the 
observer's attention and expectations, an immensely 
important component of the perceptual process, 
which has been missing from our previous discus- 
sions. As a new structural theory, Hebb’s proposal 
is still in process of testing and revision, and al- 
though it permits us to expect that elementary 
units to analyze shape and form might be found, 
it does not tell us how we should start to find them. 

Classifying shapes. One obvious starting point 
is to classify shapes and forms in accordance with 
their apparent similarities and differences, that is, 
to obtain the dimensions of variation of shapes. 
This task is more difficult with shape than with 
colors simply because shapes can be sorted or 
classified in so many different ways. Several sys- 
tems for classification have been proposed, includ- 
ing procedures for the random generation of “non- 
sense shapes.” While these procedures may help 
us standardize our discussions, it is not yet clear 
what they do as a choice of subject matter (that 
is, to what extent they are representative of the 
kinds of shapes about which it is important to be 
able to generalize ). 

Alternatively, we can start from ways in which 
form and shape are already known to be important 
and try to develop functional elements with these 
important ways in mind. What are the elementary 
distinctive shapes involved in reading (i.e. what 
features do children, learning to read, single out 
when discriminating letterlike shapes from each 
other)? Complexity is an important formal vari 
able in any attempt to systematize the gestalt iene 
of organization.” What features determine subjects 
judgments of the complexities of shapes? More 
directly, the gestalt thesis is that whenever alter- 
native organizations can be perceived, We bes 
whichever is simpler, and where patterns are less 
complex in two dimensions than in three, We see 
them as flat. 

It is too soon for a general evaluation © z 
tempts to find elements and combining iav m 
the study of form perception, but it is quite 1 
that attempts are worthwhile and that they g 
had some limited success. 

Perception of distance. 
there is no real difference between shap! 


f at- 


estalt psychology 
Tae e percep- 


l å same 
tion and space perception: both express w 
si neither nec 
organizational processes, and distance 


depends on learning. To structuralism, 


perception must be learned, since each individual 
retinal receptor cell can only indicate by its spe- 
cific nerve energy whether or not it is stimulated, 
not whether the energy stimulating it comes from 
near or far. There are, it is true, patterns of 
stimulation that differ with near and far objects, 
and many of these had already been discovered 
by artists, but if points are the only “basic” sensa- 
tions, these patterns would themselves have to be 
the results of learning and could evoke depth per- 
ceptions only by their previous associations with 
walking, reaching, and touching. 

This argument becomes far less critical if we 
abandon point sensations as the only elementary 
experiences. Once we believe that the individual’s 
early personal history can build up neural net- 
works that respond to patterns (Hebb’s cell as- 
semblies), we can also believe that millions of 
years of evolution may have selected individuals 
born with these (or equivalent) connections “pre- 
wired,” 

What about this issue as a question of fact? 
On the one hand, some animal species will, with 
no prior visual experience at all, avoid the edge 
of a “visual cliff’—i.e., that side of a sheet of 
plate glass under which there is an abrupt drop, 
to which the only cues are visual ones. On the 
other hand, cats, at least, seem unable to respond 
to such visual depth cues if they have received 
visual stimulation but have been deprived since 
birth of active visuomotor behavior. Recent hu- 
man research, necessarily much less direct, has 
centered mainly on the visual distance cues and 
has been less conclusive, Clear-cut consequences 
of either alternative are not easy to come by. Any 
given distance cue might be learned early and 
remain irreversible, or it might be innate and yet 
subject to relearning (which is why the wearing 
of distorting spectacles—discussed in the section 
on direction—cannot tell us anything at all about 
the ontogenetic bases of space perception). The 
Poner is no longer central to all of psychology, 
i pres was, but it does remain a technical ques- 

on, vital to the study of growth and development. 
ee aside for the moment the age-old pre- 
ey ‘ion with whether distance perception is 
es o what stimuli can we find that might ac- 
ante our perceptions of distance? Some of the 
thee cues have been known for centuries, but 

In ie ey many more still to be discovered. 
uaa ast decade, Gibson (1950; 1959) has 
EER at therc are static and dynamic pattern 
Me ia (gradients, to be precise) that are at 

ieee informative not only about the 

of objects and surfaces but about their 


PERCEPTION: Introduction 533 


sizes and proportions as well. Surfaces at some 
slant in relation to, the observer produce a constant 
rate of change of texture density in the retinal 
image, and if the eye is sensitive to such gradients 
(innately or by learning), these would indicate 
the relative distances of two objects by the site of 
their intersections with the ground plane’s gradient 
and would indicate their relative sizes by the num- 
ber of texture-gradient units their images subtend, 
etc. If the observer locomotes in his environment 
as well, the dynamic pattern of textural gradients 
is even more potentially informative. 

Although we do not yet know the extent to which 
this particular set of “higher order” variables can be 
used by the perceptual system, they reveal so much 
information in the proximal stimulation about 
distal space that the question of what in space 
perception is learned becomes a very different kind 
of inquiry. Moreover, these variables suggest a new 
set of units with which to analyze the world of 
perceived space—units of surfaces and their slants, 
of their distances and angles toward each other 
and toward the observer. 

Whether or not this promise is fulfilled, how- 
ever, distance perception is no longer the culmi- 
nating problem for the study of perception (though 
there remains a residual aura about this class of 
research that it is in some sense “basic”), and we 
must not expect that our perceptions of social 
events and personal qualities, for example, are 
built up out of perceived surfaces any more than 
perceived surfaces can be built up out of sensed 
spots of color. 

Perception of people and social events. Much 
research on what is called social perception has 
studied the effects of the observer's needs and 
expectations on his judgments of physical vari- 
ables, for example, the effects of economic class 
on children’s estimates of coin size, or the effects 
of emotional connotations of words on the amount 
of light needed to recognize them. One goal of 
such research has been to demonstrate that un- 
conscious perception, or subception, can occur; 
even if this goal could be achieved—and it is 
doubtful that it can be in any conclusive fashion 
__we have seen in our previous discussion that the 
concept is not a novel one in the history of per- 
ceptual theorizing. The asserted findings of such 
effects are few and still contested after a decade 
of controversy; even were they large and reliable, 
however, there is no clear path by which we could 
apply such findings to the study of the perception 
of people and of social events. This latter inquiry, 
which is still in its infancy, will almost certainly 
require new analytic units and experimental vari- 
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ables different from those of physical size or light 
intensity. 

A primary problem here is that of defining the 
areas of research. We do not yet know how to 
classify the problems most fruitfully or what vari- 
ables are important to those problems. A start at 
classifying problems might be in terms of the per- 
ception of other persons’ permanent qualities (e.g., 
character traits), their temporary states (e.g., emo- 
tions, desires), and their interpersonal relations. 
A beginning at an introspective classification sys- 
tem has been undertaken by Heider (1958), and a 
few studies have been made in which the physical 
proportions of faces were varied (or in which the 
physical qualities of voices were manipulated) to 
discover the effects of such variations on perceived 
social traits. Schlosberg (1952) has reduced the fa- 
cial expression of emotions, as portrayed in photo- 
graphs, to the different combinations that can 
be achieved with two underlying dimensions 
(pleasantness—unpleasantness, attention—rejection). 
However, the manipulation and measurement of 
facial or bodily configurations are almost impossibly 
difficult if the research is to have any relevance to 
nontrivial samples of social events. 

An ingenious and promising solution to this 
general problem is illustrated in several studies by 
Secord (1958) and his colleagues. If two different 
groups of subjects agree in their judgments about 
a set of people (the social objects), there must 
be certain stimulus features of the social objects 
to which both groups of judges are responding in 
the same way, and it is the task of the student of 
social perception to discover what those stimulus 
features may be. (This is, of course, precisely the 
same problem encountered in starting research in 
any area of perception.) Still another group of 
subjects can help us in this task by serving as a 
set of human measuring instruments. All we want 
from this last group is their judgment of such 
features as grooming, mouth curvature, complex- 
ion, and so forth, for each social object. With 
these measures (for which we need not know the 
physical bases), we can then try to discover which 
features or combinations of features are respon- 
sible for the social properties that were reported 
by the first set of subjects—for example, glasses 
impart apparent dependability, industriousness, 
intelligence; relaxed lips enhance a woman’s ap- 
parent sexuality, etc, Eventually, it is to be hoped, 
a set of stable stimulus elements will be found 
with which to analyze more complex social prop- 
erties, 

Why should we try to find such analytic units? 
Units of analysis are needed in social perception 


for the same reasons that they are needed in other 
fields of perceptual inquiry: this knowledge may 
permit us to predict more economically and more 
heuristically what our subjects will do—including 
that particularly important class of behaviors, what 
they say they observe. It is important that we keep 
our purposes clear in undertaking social-perceptual 
inquiry, because in this problem area we do not 
even have the heritage of a previous goal and pur- 
pose (i.e., the structuralist framework) as land- 
marks by which to set our course and gauge our 
progress. 

JULIAN E. Hocuperc 


[See also Senses. Other relevant material may be 
found in GESTALT THEORY; LEARNING, article on 
DISCRIMINATION LEARNING. ] 


BIBLIOGRAPHY 

For an introduction to the attempt to analyze perception 
into sensory elements, Titchener 1896 is probably unsur- 
passed; a more recent general introductory survey of the 
status of this attempt and of its successors appears in 
Hochberg 1964. The problem of spatial direction is enjoy- 
ing a resurgence of interest; three anchors in the flood of re- 
search are Kohler 1962, a popular article; Mikaelian & Held 
1964, a research paper; and Harris 1965, a critical, inte- 
grated literature review. The study of shape perception 
has produced a vast and expanding literature, much ol 
which is devoted to attempts to formulate the problem s0 
that the results of various exercises with digital computers 
will be of evident relevance. The best gestalt treatments 
are those in Koffka 1935 and Metzger 1935-1942. Empir 
cist rebuttals are to be found in Hebb 1949 and Brunswil 
1947. Both classifications and generation procedures, e 
well as analyses of form based on measures of other tuno 
tions to which form is important, are briefly reviewed in Ne 
last chapter of Attneave 1959 and still more briefly in ule 
berg 1964. The perception of distance is best reviewel i 
Gibson 1950; an analysis of the entanglement of this roi 
lem with the more general controversy between the He 
istic and the empiricist views of man, is given in ein 
berg 1962, and a review of the literature is in rae a 
1964, Modern research in the area of social percep! oy 
the restricted sense in which the term is used Li vel 
with a seminal paper, MacLeod 1947. Heider 19! pee 
Secord 1958 form the background for most of the 
discussion. 


ion The- 
ATTNEAVE, FreD 1959 Applications of Information Te 
ory to Psychology: A Summary of Basic 
Methods, and Results. New York: Holt. 
Brunswik, Econ (1947) 1956 eae 
resentative Design of Psychologica j 56. 
2d ed., rev. & enl. Berkeley: Univ. of can a 
EpsTEIN, WILLIAM 1964 Experimental Inves' Z rological 
the Genesis of Visual Space Perception. Psyc! 
Bulletin 61:115-128. 3 
Gmeson, James J. 1950 The Perception oj 
World. Boston: Houghton Mifflin. 
GīBsoN, James J. 
Stimulation. Volume 1, pages 456- S017 ka New 
Koch (editor), Psychology: A Study of @ S¢ 
York: McGraw-Hill. to In- 
Harris, CHARLES S. 1965 Perceptual Adaptation ogical 
verted, Reversed, and Displaced Vision. Psy 
Review 72:419-444. 


f the Visual 


PERCEPTION: Perceptual Development 


Hess, DONALD O. 1949 The Organization of Behavior: 
A Neuropsychological Theory. New York: Wiley. 

HEDER, FRITZ 1958 The Psychology of Interpersonal 
Relations. New York: Wiley. 

HeELSON, HARRY 1964 Adaptation-level Theory: An Ex- 
perimental and Systematic Approach to Behavior. New 
York: Harper. 

Hocuserc, JULIAN E. 1962 Nativism and Empiricism 
in Perception, Pages 255-330 in Leo J. Postman (edi- 
tor), Psychology in the Making: Histories of Selected 
Research Problems. New York: Knopf. 

Hocugerc, Junin E. 1964 Perception. Englewood 
Cliffs, N.J.: Prentice-Hall. 

Hurvicu, Leo M.; and JAMESON, DoroTHEA 1957 An 
Opponent-process Theory of Color Vision. Psychologi- 
cal Review 64:384-404. 

Korrxa, Kurt 1935 Principles of Gestalt Psychology. 
New York: Harcourt. 

Kouuer, Ivo 1962 Experiments With Goggles. Scientific 
American 206, no. 5:62-72. 

MacLeop, RopertT B. 1947 The Phenomenological Ap- 
proach to Social Psychology. Psychological Review 54: 
193-210. 

Merzcer, Wotrcanc (1935-1942) 1953 Gesetze des 
Sehens. 2d ed., enl. Frankfurt am Main (Germany): 
Kramer, 

Marian, Hanurune; and HELD, RicHarp 1964 Two 
Types of Adaptation to an Optically-rotated Visual 
Field. American Journal of Psychology 77:257-263. 

ScuuosBeRc, Harorp 1952 The Description of Facial 
Expressions in Terms of Two Dimensions. Journal of 
Experimental Psychology 44:229-237. 

Secor, PauL F. 1958 Facial Features and Inference 
Processes in Interpersonal Perception. Pages 300-315 
in Renato Tagiuri and Luigi Petrullo (editors), Person 
Perception and Interpersonal Behavior. Stanford 
(Calif.) Univ. Press. 

TircHener, Epwarp B, (1896) 1902 An Outline of Psy- 
chology. New ed., enl. New York: Macmillan. 

Wattacu, Hans 1948 Brightness Constancy and the 
Nature of Achromatic Colors. Journal of Experimental 
Psychology 38:310-324. 

Wonpr, Wirnetm (1896) 1907 Outlines of Psychology. 
Leipzig: Engelmann. > Translated from the seventh, 
revised German edition. 


n 
PERCEPTUAL DEVELOPMENT 


The problem of how man develops his knowledge 
of the world around him in the course of his life- 
span is one of the oldest in the history of psy- 
chology and at the same time one of the newest for 
modern experimental psychology. Historically, the 
een has received attention from philosophers 
hee with the Greeks and became the focus of 
E fete question of nativism—empiricism 
British seventeenth and eighteenth centuries. The 
David eee beginning with John Locke and 
his’ ken lume, espoused the notion that man gets 
Gane of objects and space in the world 
nativisti im through his senses, in contrast to the 
ENEA i assumption of innate, God-given, proto- 
slate” ideas. The concept of the mind as a “blank 

that is written upon by experience was set 
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forth so persuasively by these empiricists that it 
dominated Western thought for centuries. George 
Berkeley carried the arguments into a more psycho- 
logical realm by considering some specific prob- 
lems, especially the perception of visual space and 
the third dimension, and proposed that visual per- 
ceptions of distance are mediated by and dependent 
on habitual connections with touch—an idea that 
persists yet. The principle of association, account- 
ing for the linking together of sensory elements of 
experience, was invoked by David Hartley, and fre- 
quency as a primary principle of such associations 
was proposed by James Mill. Thus the concept of 
associative frequency, assumed by many psycholo- 
gists to be the fundamental principle of learning, 
was born in an attempt to account for the develop- 
ment of man’s complex perceptions of objects, their 
sizes, their shapes, and their distances [see BERKE- 
LEY; HARTLEY; HUME; LOCKE]. 

With the nineteenth century came the first ex- 
perimental work on the physiology of the senses, 
work which led to further elaboration of the theory 
of perceptual learning. Hermann von Helmholtz, 
the genius of this period, carried on the empiricist 
tradition, supplementing it with the notion of “un- 
conscious inference.” He thought that certain 
assumptions were built through repeated sensory 
experience and were drawn on, albeit unconscious- 
ly, to interpret the momentary sensory input [see 
HELMHOLTZ]. 

There were proponents of nativism as well dur- 
ing these centuries: Kant its principal philosophical 
advocate and Ewald Hering its principal physiolog- 
ical advocate. Nativism had little impact on scien- 
tific psychology in the United States and England 
compared with the effects of the empiricists’ 
theories. But it did foreshadow the gestalt psy- 
chology of the early twentieth century. The gestalt 
psychologists’ emphasis on characteristics of the 
“whole”—patterns and relations—that cannot be 
explained as summations of discrete sensory ex- 
periences (for example, a melody retaining its iden- 
tity despite transposition to another key) was 
clearly at odds with the notion that complex per- 
ceptions are constructed by association of elemen- 
tary sensations and ideas. So was the gestalt notion 
of Prägnanz, of forces tending toward a “best” 
structure (for example, the soap bubble), both in 
the physical world and in man’s neural processes. 
Furthermore, isomorphism between neural process 
and perception was assumed. When self-regulating 
processes of organization resulting in one “best” 
structure are postulated, associative learning can 
have no role in perception. In 1921, Koffka pub- 
lished Growth of the Mind, a gestalt psychologist’s 


536 


view of how cognition develops. He conceived of the 
developmental process in general as one of articu- 
lation and differentiation, rather than as a concre- 
tion of experience, and of perceptual development 
as the process by which a view of the world grad- 
ually emerges from early and inarticulate experi- 
ences (1921, p. 280 in 1924 edition). For instance, 
with respect to color, he suggested that first there 
is a distinction made only between “colour and 
non-colour”; after this stage, colors become distin- 
guished as “warm” and “cold”; then, within the 
warm and cold groups, differentiations are made 
of the four principal colors, red, yellow, green, and 
blue (1921, p. 291 in 1924 edition). Similar views 
of perceptual development characterize the work of 
Werner (1926). “Creative learning” and matura- 
tion, as opposed to associative frequency, are the 
key principles [see GESTALT THEORY; HERING; 
KANT; Korrka]. 

During the same period, interest in perception 
in the United States was almost eclipsed by the 
victory of behaviorism, However, the functionalists, 
especially Carr (1935) and his students at the Uni- 
versity of Chicago, kept some interest in space per- 
ception alive. As theorists, they stayed within the 
empiricist tradition, but, influenced by the work of 
George M. Stratton, they introduced a strong em- 
phasis on “localizing” movements of the eyes, head, 
hands, body. These movements become directly 
associated with specific local signs in each sense 
department in temporal contiguity with them 
(Carr 1935, p. 28 ff.). Stratton’s early experiment 
of wearing an inverting lens system (a lenticular 
pseudoscope producing a 180° rotation of the visual 
field) and, in the manner of a learning experiment, 
studying the ensuing changes was an important 
impetus to experimental research (1897). 

During the 1930s and 1940s the collection of 
developmental data, with an emphasis on age 
norms, was also a strong trend (Gesell et al. 1949). 
The study of “sensorimotor development” (for ex- 
ample, the time at which an infant first follows a 
moving light, turns his head to a sound, coordi- 
nates his eye movements) replaced “perceptual de- 
velopment” as an area of concern at this time, as a 
result of the influence of the stimulus—response 
(SR) concept in psychology. 

By the end of World War 11 a number of trends 
combined to produce the re-emergence of percep- 
tual development as a full-fledged experimental 
problem, gradually shaking off its roots in philoso- 
phy. During the war psychologists found that per- 
ceptual skills could be improved by training; and 
comparative psychology and ethology discovered 
once more the importance of the analysis of the 
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sensory aspects of “innate” behavior [see ETHOL- 
ocy]. Lashley, in a 1938 address called “The 
Experimental Analysis of Instinctive Behavior,” had 
already strongly urged an emphatic and systematic 
analysis of real stimulus attributes in studies of in- 
stinct, reflexes, and learning, claiming that psycho- 
logical theories would remain sheer nonsense if the 
stimulus continues to be defined only as what “the 
experimenter puts in front of the animal” (Lashley 
[1938] 1960, p. 380). He spoke of the “innate com- 
ponents of sensory organization” and suggested 
experimental methods for the analysis of the ade- 
quate stimulus of innate behavior. Two of these 
methods, “sense privation” (now generally termed 
sensory deprivation) and successive elimination of 
properties of the stimulus object, are responsible 
for the design of much modern research. 


Methods of studying perceptual development 

In present-day psychology, the nativism-empiri- 
cism, or nature-nurture, controversy has been re- 
phrased. The questions asked —how does perception 
develop? to what extent is it trainable? and to what 
extent is it dependent on innate factors? —are ones 
that can be answered scientifically. Evidence from 
ethology of “species-specific innate releasers” has 
convinced the most die-hard empiricists that cer- 
tain stimulus-to-perception correlations are built in. 
And laboratory evidence of the improvement of 
perceptual skills through training has made dog: 
matic acceptance of a nativistic bias equally impos- 
sible, A number of methods are being pursued in 
an intensive effort to uncover the laws of perceptual 
development. 

Sequential studies. Longitudinal observational 
study, with a comparison of different age ka 
and a search for a definable sequence (employe 
by Gesell for sensorimotor development) has Pa 
utilized in a long-term program of research on c08 
nitive development by Piaget and his cowok 
Piaget’s observations on perceptual developmen 
are summarized in Les mécanismes pacan 
(1961). His theory of perceptual devel oa 
emphasizes constructive activity on the part of y 
child. There are perceptual “mechanisms ani a 
a later stage intellectual processes as well, he 
lieves, operating on the sensory input andin aid 
constructing the perceptual world. The bea 
Piaget's work is enormous, but one examp E; 
development of form perception by toue jects 
suffice (Piaget & Inhelder 1948). Familiar 0?! 
or cardboard cutouts of geometrical 
hidden from the child’s visual field b 
behind a screen to the child’s touch. ad was 
hand movements were observed, and the chi 
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asked to identify or describe the object. Develop- 
mental progress appeared to occur in stages. At the 
begining of Stage 1 (up to four years), the child 
identified only familiar objects, and tactile explora- 
tion was gross and relatively passive. Later in 
Stage 1, geometrical shapes that differed topologi- 
cally (properties such as closed versus open) were 
distinguished. In Stage m (four to six years), the 
child progressed to crude differentiation of linear 
from curvilinear shapes and then to progressive 
differentiation according to angles and dimensions. 
In Stage mm (six years and over) there was ability 
to distinguish between complex forms, with me- 
thodical exploration. Shifts from one stage to the 
next were accompanied by a more active, elaborate, 
and planned tactile search that permitted “con- 
struction” of a whole, complex contour. This motor 
activity Piaget believes to be essential in the final 
abstraction of shape itself. 

Piaget's laboratory has produced much experi- 
mental work on illusions and perceptual constancy, 
usually employing samples of subjects at different 
age levels. Perceptual constancy has often been ex- 
plained as the result of a learning process. To take 
size constancy as an example, the child presumably 
learns (through moving around his environment) 
that objects at a distance are not smaller than near 
ones, and he comes to infer their real size from 
cues to their distance and from familiarity. Experi- 
ments comparing size constancy in children and 
adults performed in Piaget's laboratory (and others) 
generally reveal developmental differences, but their 
interpretation is ambiguous. Adults are often less 
accurate than children; adult errors tend toward 
overcompensation and suggest the operation of 
attitudinal factors rather than growth in perceptual 
Processes as such [see DEVELOPMENTAL PSYCHOL- 
OGY, article on A THEORY OF DEVELOPMENT]. 

Comparative studies. Ontogenetic studies such 
as Piaget's have demonstrated sequential changes 
th Perceptual development, but the relative con- 
tributions of maturation and experience are diffi- 
Bae to assess, Two other comparative methods have 

een used as well: interspecies and cross-cultural 
Comparisons, 
eres comparisons. Comparisons of spe- 
Waele the very young animal have revealed 
tee aaa differences, due either to the 
i eras stage of the neonate or to differences 
Shr ure—in either case the emphasis being 
ae “atts rather than experiential factors. Walk 
Airis (1961) made comparisons of young 
to a aes toa simulated cliff (constructed 
oF the ae only visual information about the depth 
p-off) as soon as the young of each spe- 
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cies were capable of locomotion. All the species 
tested exhibited a preference for the shallow over 
the deep side of the cliff as soon as tests were 
possible. But some animals (for example, chicks 
and ungulates) could be tested at birth, while 
others (such as human infants) were too imma- 
ture for testing until considerably later. Fantz 
(1961) has studied young animals of several spe- 
cies with a selective-attention method and revealed 
the presence of pattern discrimination in young 
primates far earlier than had been supposed by 
most empiricists. 

Cross-cultural comparisons. The other com- 
parative method is the cross-cultural study. Com- 
parisons of responses of different ethnic groups to 
illusions (for example, those done by Allport & 
Pettigrew 1957) have shown some differences, in- 
ferred by the authors of the studies to be the result 
of varying experiences, ecologies, and education. 
Whorf’s hypothesis that man’s perceptual system 
is structured in part by his language is well known 
to social scientists. But the limits of cultural influ- 
ence must not be glossed over. Lenneberg (1961) 
found that young Zuni subjects did show differ- 
ences in naming colors that reflected the linguistic 
peculiarities of their color nomenclature, But when 
psychophysical measures of differential limens 
(thresholds) were taken from the same population, 
there was no corresponding shift; indeed, the li- 
mens were the same as for a native English-speak- 
ing group. 

Laboratory studies. Interesting as the compara- 
tive studies may be, it is clear that manipulative 
laboratory study is necessary for the ultimate 
understanding of how perception develops. 

Deprivation studies. Lashley's method of “sense 
privation” is one way of testing experimentally the 
hypothesis that normal perceptual development is 
a function of accumulated sensory experience. 
Elimination of visual stimulation from birth on- 
ward should, from an empiricist’s standpoint, pre- 
vent the development of adequate perception of 
space, shapes, and objects. However, the findings 
are equivocal in cases of human adults whose con- 
genital cataracts have been surgically removed. 
Visual perception is not normal in these cases 
(obviously, visually perceived objects cannot be 
named correctly), but factors other than absence 
of visual stimulation could be responsible, for ex- 
ample, inferior accommodation, an emotional up- 
set, or interference from previous habits. Lorenz 
(1965) has commented on the limitations of the 
deprivation experiment. 

It is possible, though, that visual apparatus must 
mature in an environment of patterned light for 
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normal pattern vision to develop. Chimpanzees 
reared in darkness or with only unpatterned light 
are incapable of normal visual discrimination of 
pattern when tested after seven or more months 
of such treatment (Riesen 1958). Cats reared in 
darkness likewise exhibit disturbed visual develop- 
ment. On the other hand, rats reared in darkness 
behave like their light-reared controls after being 
taken into the light. Lashley and Russell (1934) 
found that in dark-reared rats the force of jump from 
a platform to a target was related to the length of 
the distance to be covered; and Walk and Gibson 
(1961) found that they avoided a visual cliff as 
well as the normal animal. It would thus appear 
that species differences exist even among the mam- 
mals, since the more visually dominant primates 
and cats require some patterned light (and pos- 
sibly active movement within the lighted environ- 
ment as well) for normal visual development, 
whereas the rat does not. But the mere statement 
that the rat’s spatial vision is “inborn” is insuffi- 
cient. What are the adequate stimuli to which it 
responds so immediately? Walk and Gibson used 
Lashley’s method of successive elimination of prop- 
erties of the stimulus and found that the dark- 
reared rat discriminated depth differences with 
certain isolated stimulus properties (for example, 
motion parallax) but not with all. Isolated texture 
difference in the cliff situation yielded a 100 per 
cent preferential response in the light-reared rat, 
but in the dark-reared animal a chance response 
was found to texture differences. After living in a 
lighted environment for a week the deprived ani- 
mals responded like the normal animals. This ex- 
periment makes it evident that innate and experi- 
ential factors combine in the process of perceptual 
development. Location of the factors and deter- 
mination of their interaction has only begun. 
Enhancement studies. An alternative to the 
method of deprivation is that of rearing the experi- 
mental animal with enhanced opportunities for 
stimulation. The hypothesis tested is that the ani- 
mal will later exhibit superior perceptual perform- 
ance. Gibson and her associates (1959) used this 
method in studying form discrimination in rats. 
Shapes (circles and triangles) were cut out of 
metal and hung on the walls of the experimental 
animal’s living cage. After three months of such 
daily exposure, the animals were presented with 
a triangle-circle discrimination task. Discrimination 
learning in these experimental animals appeared 
at first to be facilitated in comparison with control 
groups, but further experiments suggested that 
selective attention rather than literal “visual learn- 


ing” of form was responsible. Transfer of selective 
attention from a simple situation to a more com- 
plex one may be one of the most powerful mech- 
anisms of perceptual development. 

Adaptation to distorted environments. Strat- 
ton’s early experiments with inverting optical sys- 
tems have been carried further, especially at the 
University of Innsbruck (Kohler 1951). The hy- 
pothesis underlying these experiments is that pro- 
gressive adaptation to the distorted world thereby 
produced is analogous to the original development 
of space perception in the infant. This reasoning 
is questionable, but the experiments are interesting 
because they demonstrate great plasticity in the 
human subject in compensating for a transformed 
stimulus array. The transformation can be an ex- 
tremely complex one. Not only have spatial trans- 
formations (up-down or right-left reversals, pris- 
matic shifts) been employed but spectacles with 
different colors of glass in different halves (upper 
versus lower, right versus left) have also been 
used, Even in the latter case, the subjects eventu- 
ally adapt, and most astonishingly show, by the 
aftereffects when the spectacles are removed, that 
adaptation has been specifically conditioned to eye 
position. It is essential to remember that all these 
transformations, however complex, are still in 
systematic correspondence with the original spatial 
order. Thus, one cannot infer that because adapta 
tion to the new order is possible the infant in the 
beginning creates spatial order out of a random 
or chaotic fusillade of stimulation. 

The plasticity of the human species in these 
experiments is in remarkable contrast to that of 
the chicken. Hess (1956) placed prisms that dis- 
placed the visual field 7° to the right or left on 
chicks and observed pecking accuracy. Despite four 
days of practice, not a single chick learned to col 
rect for the displacement, This species aie 
again argues for some innate basis of spatial Her 
development, with greater plasticity in (at leas 
the human species. 

Trainability of perceptual skills. 
of the human observer is convincingly pie 
strated also by his trainability in numerous ei he 
tual tasks. Experiments have demonstrate ai 
effects of practice on judgments of pitch, D 
hue, distance, and weight (see Gibson 1902 aE 
ably the first learning curve in experiment a 
chology was obtained by W. F. Volkmar 
found that the two-point limen on the skin Bee af 
progressively lower with practice. An ona ex: 
educated pitch discrimination is found in cient 
periment of Heinz Werner's with “microme’ 


The plasticity 


PERCEPTION: Perceptual Development 


Observers were trained in judging fine differences 
of pitch until they could eventually hear (and 
transpose) melodies embodying pitch differences 
so tiny as to have been indistinguishable previ- 
ously. Both these experiments demonstrate differ- 
entiation of perception, the development of greater 
precision of judgment within a stimulus dimension. 
A rather different case is learning to respond differ- 
entially to stimulus differences never attended to 
before. It has been shown that some animals (for 
example, bats) and many blind persons get infor- 
mation about obstacles in the space around them, 
and accordingly steer themselves, from echoes re- 
flected from objects at different locations. The bat 
emits a high-pitched noise, the blind man taps a 
cane, thus enhancing the opportunities for possible 
differential echoes. That this is a learned skill in 
man has been demonstrated in experiments with 
blindfolded subjects, who progressed from random 
performance to virtuosity after training. 


Modern theories of perceptual development 


It should be clear that for modern psychological 
theory the problem is no longer whether perception 
is innate or learned, but rather how it develops. 
“Enrichment” theories and “differentiation” theories 
are two general approaches to perceptual develop- 
ment (Gibson & Gibson 1955), 

Enrichment theories. Enrichment theories as- 
sume that perception begins with bare sensory in- 
put, and that this is supplemented somehow by 
past experience. The supplementation may be by 
way of an associated context of images and ideas, 
or it may be by inference from “hypotheses” built 
up through experience. The latter position, deriving 
from Helmholtz, has a number of modern propo- 
nents. 

Egon Brunswik’s probabilistic theory held that 
experiences during development reflect the envi- 
ronmental ecology and result in a network of hy- 
Potheses used by the observer to make a best guess 
about the probable distal reality of a present proxi- 
mal stimulus. A number of demonstrations reflect- 
ing this general orientation were designed by Adel- 
bert Ames. They involve stimulus arrays presented 
under nonoptimal viewing conditions and show, 
Presumably, the interpretative effect of the observ- 
ers accumulated assumptions. 

5 Another version of the enrichment hypothesis is 
oR theory, which likens the perceptual proc- 
Ga. categorizing. Schemata are cumulatively 
aie up through experience; new sensory exper 
a s are matched with them, thus explaining 

Ognition and identification of objects by way of 
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past experiences. The matching process has been 
thought of as a successive trying-out of hypotheses. 
The schema presumably influences the perceptual 
process in addition to serving in identification. 

A behavior theory (as opposed to a cognitive 
theory) of enrichment of the sensory process also 
has its adherents. Improvement in discrimination 
is explained as the learning of differential re- 
sponses; these in turn, by way of response-produced 
stimulation, add distinctiveness to the sensations 
produced by the original stimuli. The hypothesis 
is generally referred to as “acquired distinctiveness 
of cues.” The opposite process, acquired equiva- 
lence, should occur when common responses are 
learned. 

Differentiation theories. Differentiation theo- 
ries of perceptual development contrast with 
enrichment theories by assuming that the environ- 
ment, by way of the stimulus array, provides a 
wealth of potential information, and that develop- 
ment is a process of “getting” more and more of 
this information. It is not a matter of adding some- 
thing to a sensory process but, rather, of respond- 
ing more selectively to the variables and invariants 
of stimulation. It is a matter of learning to attend 
to distinctive features of the stimulation, as op- 
posed to adding distinctive responses. Development 
is characterized by reduction of primary generali- 
zation (unselective response to a wide range of 
similar stimuli) and increased refinement and pre- 
cision of discrimination. Emphasis is thus on learn- 
ing discriminations, rather than equivalences (as 
in a schema theory). The differentiation process 
may be one of progressive splitting of large dimen- 
sions, or it may be one of training the attention to 
critical dimensions, or it may be both. Any increase 
in “equivalence” responses, such as those produced 
by perceptual constancies, is explained as learning 
to respond to higher-order invariants in stimulation, 
rather than as the learning of common mediating 
responses, or interpretations. 

Models of perceptual development. Neurologi- 
cal models of perceptual learning (see, for exam- 
ple, Hebb 1949) are becoming fairly numerous, 
and so are computer models (Uhr 1963). It is to 
be expected that these models will influence psy- 
chological theories (and vice versa) and that 
achievement of a satisfactory merger of all three 
will lead to faster progress in our knowledge of how 


perception develops. 
ELEANOR J. GIBSON 


[See also SENSORY AND MOTOR DEVELOPMENT. Other 
relevant material may be found in DEVELOPMENTAL 
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PSYCHOLOGY; HEARING; SKIN SENSES AND KINES- 
THESIS; VISION.] 
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m 
DEPTH PERCEPTION 


The term “depth perception” may mean both the 
perception of the distance of an object and the 
perception of its solidity (the slant or curvature of 
its surfaces). For reasons to be explained, these 
properties taken together are known as the third 
dimension. The term may also mean “space per- 
ception,” on the theory that perception of an array 
of objects at various distances from the observer 
and from one another implies the apprehension of 
abstract space. It is thus related to still other kinds 
of perception, such as direction from the observer 
and orientation relative to gravity. The term is 
obviously connected with object perception and 
with constancy of perception. 

The phenomena of depth perception are highly 
various and have been described in different ways 
for centuries, cutting across the disciplines of 
geometry, physics, philosophy, and psychology 4S 
well as the arts of painting, sculpture, and archi- 
tecture. Discussion has been frequently plagued by 
confusion of the concrete with the abstract and 0 
phenomenal properties with physical variables. The 
relation of the visual to the tactual perception of 
depth has never been clear. 4 

That many kinds of visual depth perception do 
exist is indicated by the behavior of animals, no- 
tably their avoidance of obstacles or precarious 
places and their choice of the nearest goal or De 
shortest path to a goal. The precision flying 3 
birds and insects certainly suggests that they per 
ceive space in some meaning of the word. This type 
of behavior is studied more and more as por 
mental psychology becomes less anthropomorP A 
But the whole topic of depth perception 12 di- 
centered on the classical problem of how an mr 
vidual can perceive in the visual third ae 
when the sensations on which the perception ™ 


be based are in only two dimensions. It should be 
noted that the problem has been thought to be 
visual, not auditory or tactual, and that it arises 
from human introspection. 


The problem of visual depth perception 


The puzzle of three-dimensional perception arises 
from the following very old assumptions about 
vision, depth, and perception: 

(1) The eye works like a camera, and the retinal 
image is comparable to a flat picture. The informa- 
tion delivered by the visual sense is therefore 
bidimensional. 

(2) When depth is perceived, a plane form is 
converted into a solid form in the same way that 
plane geometry is related to solid geometry. When 
distance is perceived, a third dimension is added 
to the two dimensions of the frontal plane just as 
the third axis is added to the two axes of Cartesian 
coordinates. 

(3) Perception is a mental construction imposed 
upon the data of the senses; the nature of this con- 
struction is to be discovered. 

These assumptions seemed unquestionable as 
early as the eighteenth century. In the first case, 
the gross anatomy of the eye was known by then; 
the projection of an image by a lens had been 
studied; and the inverted image on the retina of an 
excised eye had been observed, along with the 
image on the rear wall of a camera obscura. New- 
ton could speak with confidence of “the picture 
painted on the fund of the eye.” Second, the power 
of analytic geometry in supplementing plane ge- 
ometry had been demonstrated, and the geometer- 
painters were daily using the laws of perspective 
Projection on a picture plane to create astonishing 
illusions of reality. Since a space of three dimen- 
sions could be converted into two, the natural ques- 
tion was how a retinal picture might be recon- 
verted into three dimensions by the mind. Third, 
men had been made conscious of their sensations 
and their dependence on the corresponding sense 
organs by the discovery of curious phenomena like 
the blind spot of the eye. The doctrine of sensation 
as a necessary precondition for perception, or even 
knowledge, had become established. The question 
ne how perceptions could arise from these bare 

pressions, or, for vision, how the experience of 
Pajects in space could arise from the data found 
f a picture, e.g., colored spots, lines, Or plane 
orms, 

r empra solutions of the problem. For two 
a some of the greatest minds in Western 

tion struggled with the problem of depth 
Perception. It was the chief battleground for com- 
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peting theories of human nature, such as nativism 
versus empiricism and rationalism versus sensa- 
tionalism. Many solutions have been offered, but 
efforts to solve the problem continue into the pres- 
ent. The history of experimental psychology is 
permeated by it, as Boring (1942) has shown. The 
interest has been so great that diluted versions of 
certain explanations have become popular knowl- 
edge. Two of the most common beliefs are that 
depth perception depends on binocular vision and 
that it is learned (i.e., it is an acquired skill). The 
actual explanation, in contrast to popular belief, 
is not at all simple. The main efforts over the cen- 
turies are outlined below. 

Binocular vision. According to the binocular- 
vision hypothesis, tridimensional vision, now often 
called stereovision, comes from the ability to use 
two eyes, either one alone being insufficient. The 
idea goes back at least to Berkeley (1709). The 
first form of the hypothesis asserted that the mind 
(or the brain) computes the depth of a fixated 
object by “feeling” the directions of the two eyes 
and then triangulating and also by “feeling” the 
amount of focusing necessary for each lens. There 
is still argument about whether the eye muscles can 
register and report such convergence and accom- 
modation, A later, more sophisticated addition to 
the binocular hypothesis declared that the disparity 
of the two retinal pictures was transmitted to the 
brain and converted into perceived depth, The 
validity of this lawful geometrical incongruence, 
the difference in perspective due to the differing 
parallax of the eyes, was impressively shown by 
the discovery of the stereoscope (see Boring 1942, 
p. 282) and was popularized in the late nineteenth 
century by the manufacture of parlor stereoscopes. 

Binocular disparity does provide information 
about depth if the ocular equipment can register 
it. But not all animals have eyes that can do so. 
Moreover, questions of how two separate pictures 
could be transmitted to the brain and how they 
could be compared by anything short of a man 
in the brain have never been answered. In gen- 
eral, the hypothesis is not sufficient to explain 
depth perception, for it does not take into account 
the animals that have lateral eyes instead of the 
forward-pointing eyes with compulsory conjuga- 
tion of eye movement of humans, or the men 
who have only one eye or who, having two eyes, 
are unable to see any special sort of depth in a 
stereoscope. [See VISION, article on EYE MOVE- 
MENTS.] 

Past experiences. A second general hypothesis 
states that depth perception consists of learning 
to perceive the flat retinal images as objects. The 
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original sensations are converted into perceptions 
by an accumulation of experiences with the outer 
world. This has been by far the most appealing 
explanation over the centuries. One formula sug- 
gests that a given sensation calls up from memory 
all the other experiences—such as approaching, 
touching, and handling the object—with which it 
has been regularly associated. Vision gets its mean- 
ing from touch. Experiments with artificial inver- 
sion or displacement of the visual field are tests 
of this hypothesis. With upside-down vision do we 
eventually learn to see things where we feel them 
to be? The answer is no, despite a popular under- 
standing to the contrary. A more defensible for- 
mula suggests that certain properties of retinal 
images are signs, indicators, or criteria of depth 
in the corresponding objects and that the child 
learns to interpret these signs. This notion was 
elaborated by Helmholtz in 1867 in the theory of 
unconscious inference, which asserts that the origi- 
nal act of interpretation drops out of consciousness 
in the adult, who therefore believes he sees the 
depth of the world immediately. [See the biogra- 
phies of HELMHOLTZ and LOTZE.] 

The usual list of the clues for depth (or “cues,” 
to play down the rational implication) consists of 
linear perspective, the known size of familiar ob- 
jects, the heights of the form in the visual field, 
the covering of one object by another, aerial per- 
spective, and motion parallax. All of these except 
the last can be represented by a painter and are 
supposed to occur in the retinal picture. The cues 
are difficult to isolate experimentally or to specify 
exactly, and no listing can be considered exhaus- 
tive. And the so-called cue of motion parallax is 
not so much a motion of an object as a complete 
transformation of the entire field of view of an 
observer during locomotion (Gibson 1950). 

Intuition. Another hypothesis holds that the 
sensations are converted into spatial perceptions 
by the intrinsic capacities of the mind. The simple 
doctrine of innate ideas—of the soul that enters 
the body at birth—lost ground during the Enlight- 
enment, but to some thinkers the abstract ideas 
of space, motion, and form could not be derived 
from sensations. A capacity for other-than-sensory 
intuition was demanded, and a highly sophisticated 
form of nativism evolved in the writings of Kant. 
He asserted that depth is perceived because the 
very nature of perception requires a preconception 
of tridimensional space. 

In general, scientific psychologists have been un- 
sympathetic to any appeal to innate capacities and 
continue to search for an explanation in terms of 


learning whenever possible. Experimental evidence 
keeps suggesting, however, that animals and chil- 
dren do not experience sheer sensation even when 
they have had no opportunity to learn the cues for 
depth, to associate vision with other experience, or 
to acquire visual conditioned responses. This is 
perforce interpreted as evidence for nativism. The 
issue persists therefore as a dilemma: Empiricism 
will not explain all the facts, yet nativism is un- 
welcome, 

Gestalt theory. A fourth general hypothesis, 
proposed in the early 1930s, purports to provide 
an alternative to either nativism or empiricism. It 
suggests that depth perception depends on a spon- 
taneous process in the brain called “sensory or- 
ganization.” Bits of color are transmitted to the 
brain and converted into phenomenal objects, but 
not by associative connections and not by intuition. 
Instead, it was suggested that there exist field 
forces that yield some electrophysiological entity 
like the object. Since the brain is tridimensional, 
the neural substrate of experience is also tridimen- 
sional, and the experience is tridimensional. 

It was hoped that the laws of visual sensory or- 
ganization could be discovered by presenting spots, 
lines, angles, and curves to an eye and simply 
noting that these pictorial elements, depending on 
their relations, tended to yield perceptions of ob- 
jects in space. They did in fact do so, and the 
study of this seeming process of unit formation 
was a considerable advance over the classical list- 
ing of the cues for depth. These investigations 
(Koffka 1935) have not yet developed into a scien- 
tific enterprise, but they did establish the fact that 
the components of a picture need not be repre- 
sentational in order to arouse spatial experiences: 
They suggested, or now suggest, that the physical 
structure of light entering an eye may carry 4 
much richer load of information about the environ- 
ment than had previously been supposed. But aa 
did not serve to display the hoped-for laws = 
neural organization. Moreover, the independen 
neurological evidence for field forces is dubious. 

The expectation that a field theory of brain ac- 
tivity would resolve the dilemma of nativism- 
empiricism has thus not been realized. Most psy- 
chologists consider it a new form of nativism, a 
the grounds that if we must assume a brain T 
automatically produces fields that are like objec a 
we might as well assume an intuition of oe i 
the first place. There have been attempts to $ oe 
how a brain might automatically produce for 
by the residues of eye movements 
peatedly traced the contours (€.8-, Heb 


but these attempts are motivated by empiricism 
and, in any case, do not account for depth percep- 
tion. [See GESTALT THEORY; NERVOUS SYSTEM, arti- 
cle on STRUCTURE AND FUNCTION OF THE BRAIN; 
and the biography of KOFFKA.] 

Summary. None of these four efforts at solu- 
tion of the problem of depth perception has been 
successful. To summarize, binocular vision is only 
a partial explanation; past experience and intui- 
tion pose dilemmas; and gestalt theory does not 
really provide an alternative. 

Contemporary trends of thought. Since the 
1930s there has been a shift of interest away from 
the classical problem of tridimensional vision and 
toward another problem that is directly related to 
it but is a little closer to life: the puzzle of the per- 
ceived constancy of the size and shape of an ob- 
ject despite the variation in the size and shape of 
the retinal image. If sensations of retinal size and 
shape are prerequisite to perceptions of objective 
size and shape, then a perception of distance must 
enter into any perception of size and shape. The 
classical problem is not avoided by this formula; 
one puzzle is only substituted for another. How- 
ever, the emphasis on objects instead of abstract 
space points to the possibility that the essence of 
the problem may lie in the perception of environ- 
mental surfaces and their layout, This is the writer's 
belief. 

Another recent trend is the avoidance of cate- 
gorizing perceptions as either innate or learned 
and the study of the development of environment 
perception as it actually occurs in children or ani- 
mals. The contribution of learning to development 
can be assessed by observations of the development 
of spatial perception when there is no opportunity 
to learn. The results vary according to the spatial 
situation and the species of animal, but recent re- 
sults give little encouragement to theories advocat- 
ing that organisms learn cues for depth or the 
association of vision with touch. 

Still another tendency is a growing recognition 
that the perception of space cannot be considered 
apart from the perception of time. In fact, it is 
Possible that the first and simplest perceptions do 
not involve either abstract space or time but are 
instead direct detections of motion, change, and 
transformation in the world. If it were true that 
transformations were themselves stimuli, the for- 
ER of an entirely new theory of stimulation 
ie ope to take place, as well as a re-examina- 
rena e senses, and a rejection of the theory that 
Beare on is based on the classical data of sense. 

ps the difficulty of the old theories lies in 
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their conceptions of the nature of sensations and 
their assumption that sense organs could do no 
more than provide them. [See Time, article on 
PSYCHOLOGICAL ASPECTS.] 


Is the problem one of depth perception? 

The writer believes that the problem of depth 
perception has been insoluble because the assump- 
tions that gave rise to it can be challenged. First, 
it is possible that the information delivered by 
vision is not bidimensional, since the retinal image 
is not really analogous to a picture and since the 
ocular equipment does not actually work like a 
camera; the information obtained by vision is 
multidimensional. Second, it is likely that what 
animals and men perceive is not depth (the third 
dimension) but is the layout of environmental 
surfaces and their own movements relative to this 
arrangement. The information for this perception 
is mathematically and optically complex, but the 
resulting experience is simple. Third, it is possible 
that perception is not a mental construction im- 
posed upon the data of the senses but a direct 
registration of the information available in am- 
bient light, in the direction of gravity, and in the 
pressure of the surface of support. The visual sen- 
sations are symptoms of perception, but they are 
not entailed in perception, If so, the need to explain 
how they might be converted into perceptions dis- 
appears. The problem of depth perception vanishes. 
What emerges instead is a new problem, the prob- 
lem of perception as such. 

For this new problem, to be sure, one has to 
assume that there is information in reflected light 
about the layout of surfaces in the world, not just 
energy differences of intensity and frequency. One 
needs a new kind of optics concerned with gradi- 
ents and transitions instead of the old optics con- 
cerned only with points. Similarly, one has to 
assume that the pull of gravity on the weights 
inside the inner ear provides information about 
the direction of gravity, not sensations from the 
receptors, and that the pressures of the earth on 
the skin specify where the earth is relative to the 
body, not just where the pressures are relative to 
the skin. In all these cases the information is in- 
variant with a change of position of the observer, 
whereas, of course, the sensations vary with every 
movement of the eye or the head or the body. 

When the stimulus information is reduced or 
impoverished, as it often is in experiments on sen- 
sation, the remaining perceptual process can be 
fairly described as one of mental construction, in- 
ferring, or guessing. But when multiple redundant 
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information is available, the perceptual process can 
be conceived as one of detecting the invariant in- 
formation in the stimulus flux. If the observer's 
perceptual system is not already attuned to it, he 
can usually learn to extract what is invariant by 
exploration, Or his attention may be educated to 
the regularities that specify the subtle features of 
the world by training or teaching. Exploratory look- 
ing, listening, feeling, smelling, and tasting are 
characteristic of the child, and the development of 
these modes of attention continues in the adult. 
But this theory of the development of perceptual 
systems contrasts with the doctrine that modes of 
sensation are the starting point of perception in 
the newborn child and that learning consists of 
mental operations on these raw data, 

Theories of the conversion of sensations into 
perceptions seem unable to resolve the issue of 
whether this conversion is innate or learned. Hence 
the continuing controversy over evidence showing 
that infants can see depth and evidence showing 
they cannot. A theory of the development of in- 
formation pickup would escape this dilemma, for 
flat pictorial visual sensations do not enter into it. 
The assumption would be that infants see the main 
features of the surrounding environment (differ- 
ing somewhat from one species of animal to an- 
other) and that by maturation and learning the 
finer features of the world come to be discrimi- 
nated. The change is not from flat vision to spatial 
vision but from vague to specific perception. 

So radical a revision of the concept of percep- 
tion implies an unwelcome reconstruction of other 
psychological theories. Few psychologists would 
be so skeptical, although most would agree that 
the problem of depth perception has been misfor- 
mulated in one way or another. The concept of a 
retinal picture is so simple and convincing, how- 
ever, and is so firmly asserted by all the physicians 
who treat our eyes and all the designers of instru- 
ments to make optical pictures that it will die hard. 
Even physiologists who understand that the ocular 
system found in many animals—bees, for example 
—does not provide a retinal image, although it 
registers some of the information in ambient light, 
continue to assume that a picture must be formed 
if an animal is to perceive visually. 


A solution to the problem of the perception of 
the environment by the use of the eyes will prob- 
ably depend on new theories in optics concerning 
the structure of ambient light instead of images, 
new studies of the ocular equipment of insects, 
animals, and men concerned with the system as a 
whole instead of the retina merely, and a new 


theory of the perceptual process starting from a 
position of epistemological realism instead of sub- 
jectivism. 

JAMES J. Gipson 


[Other relevant material may be found in GESTALT 
THEORY; SENSES; SENSORY AND MOTOR DEVELOP- 
MENT; SKIN SENSES AND KINETHESIS; VISION.) 
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IV 
PERCEPTUAL CONSTANCY 


It is a well-known fact of everyday exp 
that such characteristics of objects as their color, 
form, and size tend to remain invariant iia 
changing conditions of stimulation. Thus 4 whi d 
sheet of paper looks white in bright sunlight ‘ak 
in deep shadow, a coin still appears round kh ig 
it is turned out of the frontal-parallel plane an T 
image on the retina is elliptical, and a man appe: fe 
to be about the same height at twenty feet a 


perience 


does at ten feet although the retinal image is re- 
duced to one-half in linear dimensions in the for- 
mer case. The constancy of object properties is 
not limited to visual perceptions but is found also 
in other sense modalities; for example, an orches- 
tra sounds equally loud in all parts of an audi- 
torium that has good acoustic properties, and 
different temperatures feel “right” or comfortable 
within wide limits. If the brightness of objects were 
determined solely by the amount of light they send 
to the eyes, then on an overcast day a piece of 
chalk would appear as dark as a lump of coal on 
a sunny day; and in the course of one day the 
same objects would take on all lightnesses between 
white and black. It is evident that objects as per- 
ceived tend to remain fairly stable in contrast to 
the changing conditions of stimulation. The funda- 
mental nature of visual constancy was noted by 
Helmholtz (1867) and Hering (1872-1875), who 
saw that it posed important problems for physio- 
logical optics. We now know that perceptual con- 
stancy has even greater importance in that it is 
part of the broader problem of biological adjust- 
ment and survival. 

Invariance of object properties. That percep- 
tual constancy has its origins in basic mechanisms 
and is not wholly a product of memory or intellec- 
tual inferences about the way things ought to look 
is proved by experiments with very young children 
and adults. The results with children of different 
ages, while conflicting, show that they do not per- 
ceive the colors, shapes, and sizes of objects merely 
in accordance with the images projected on the 
retina but rather in accordance with the invariant 
properties of the objects (Woodworth [1938] 1960, 
chapters 14-16). Experiments with animals below 
man on the phylogenetic scale are decisive. Fish 
taught to get food in troughs painted a certain color 
continued to choose the same troughs even when 
the intensity and color of the illumination were 
changed radically (Burkamp 1923). Hens taught 
to discriminate white rice grains from yellow in 
daylight illumination continued to pick the white 
grains in strongly colored yellow light (Katz & 
Révész 1921). Finally, chimpanzees that chose 
food from containers having high or low reflec- 
tance (white or black) under normal conditions of 
illumination also chose the same containers when 
the amount of light reflected from the containers 
ee high reflectance was actually less than that 
ees containers with low reflectance ( Kohler 

Psychological principles. Much mystery has sur- 
Tounded the ability of organisms to respond to in- 
variant properties of objects, but a consideration 
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of certain facts: enables us to bring phenomena of 
perceptual constancy within the framework of 
well-known physiological and psychological proc- 
esses, thus relating them to the wider realm of 
biological adaptations. These facts may be sum- 
marized in a number of principles having general 
applicability (Helson 1964). 

Organisms adjust to level of stimulation. The 
first of these principles is that organisms adjust 
their level of response to the level of stimulation. 
If the average level of stimulation is high, as in 
bright sunlight, the eyes rapidly adapt to the high 
energies reaching the retina and thereby reduce the 
net effectiveness of the bright light. Conversely, if 
the level of illumination is low, adaptation quickly 
makes the eyes more sensitive to incoming stimu- 
lation through regeneration of photopigments and 
also by the much faster action of less understood 
neural amplification mechanisms, thereby increas- 
ing visual effectiveness. The role of adaptation in 
constancy, while recognized, was not sufficiently 
appreciated by nineteenth-century workers in visual 
science because they thought only in terms of what 
Walls (1960) has called the “cold molasses” ki- 
netics of photopigment concentration, which could 
only account for the slow, intensive sorts of adapta- 
tion. Modern work (e.g., Schouten & Ornstein 
1939) has demonstrated that there is a rapid, 
almost instantaneous, adjustment of visual proc- 
esses to take care of sudden changes in stimulation 
and that there are mechanisms in other sense 
modalities for rapid adjustments to changed stimu- 
lation to bring the organism into equilibrium with 
the environment. 

Organisms respond to ratios of stimulation. 
The second principle at work to preserve constant 
properties of objects is that organisms respond to 
ratios of stimulation as well as to absolute amounts 
of energy. When the general illumination is raised 
or lowered, the relative amounts of light coming 
from different objects to the eyes remain the same: 
white objects still reflect about thirty times as much 
light as black objects, and hence the former look 
white and the latter look black in both bright and 
dim illumination. Contrast effects, whether simul- 
taneous or successive, which depend upon ratios 
rather than absolute amounts of stimulation, are 
largely responsible for the stability of the visual 
world. 

Constancy rarely perfect. A third, often over- 
looked, principle operative in the perception of 
object properties is paradoxical from the point of 
view of the concept of constancy. Constancy is 
hardly ever perfect; it is almost always approxi- 
mate and partial. It is more proper to speak in 
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terms of “approximation” to constancy than in 
terms that imply unchangeable perceptions in the 
face of all changes in physical input to sense 
organs, Let us suppose for a moment that con- 
stancy were perfect. Then we would not recognize 
differences between objects under bright and dim 
illumination; objects at a distance would appear as 
large as those close by; and we would feel no dif- 
ferent on a cold day from the way we feel on a 
warm day. There is biological utility in the percep- 
tion of changes in general illumination and in size 
and form of objects and in the discrimination be- 
tween far and near sources of sound. Actually 
there is never perfect constancy but only what 
Thouless (1931) called perceptual “regression to 
the real” object. To be sure, a white sheet of paper 
looks white in low illumination, but the white is 
dimmer than in bright light and hence gives an 
indication of the lower amount of light it sends to 
the eyes; similarly, a man at twenty feet is per- 
ceived to be about as tall as he is perceived to be 
at ten feet, but he is seen in a perspective that 
places him at a greater distance and so is a some- 
what different perceptual object. Thus we have 
constancy with change, the one giving information 
about invariant object properties, the other giving 
information about changes in the relations of ob- 
jects to the organism. Technically the facts reduce 
to this: while some dimensions of perception re- 
main constant with changing stimulation, others 
do not, with the result that we are able to recognize 
objects as the same in altered environments. 
Constancy and homeostasis. Perceptual con- 
stancy has a parallel in the concept of homeostasis 
(Cannon 1932), according to which physiological 
mechanisms act to preserve “normal” values of cer- 
tain critical constants such as 98.6° F. body tem- 
perature, pH of 7.40 acid-base equilibrium of the 
blood, normal blood-sugar levels, etc. The range 
over which objects are perceived as normal, or the 
same, is, however, much greater than the range 
over which physiological constants may change 
while maintaining a normal state of health. How- 
ever, it should be remembered that much of the 
constancy found in behavior depends upon the 
action of the physiological homeostatic mechan- 
isms, for example, the ability to withstand fairly 
large changes in external temperature and the re- 
duction in oxygen supply at high elevations. 
Factors affecting constancy. While the discus- 
sion of constancy to this point has made it seem 
like a fairly simple, univocal phenomenon, it is by 
no means so when all the facts are considered. 
Constancy may be reduced and even made to dis- 
appear both by altering the field conditions under 


which objects are seen and by instructions to ob- 
servers to adopt various attitudes in judging at- 
tributes of objects. Thus if one looks at objects 
through a long, narrow, black tube, their color, 
size, and shape are seen in accordance with the 
properties of the retinal image and there is little 
or no constancy. If observers are asked to judge 
actual physical size as against apparent size, then 
the perceived size increases with the distance of 
the object from the observer (superconstancy or 
overcompensation ). If, on the other hand, observ- 
ers are asked to judge analytically, or in accord- 
ance with “apparent size in perspective,” then 
perceived size decreases with the distance of the 
object from the observer (Carlson 1960). For some 
purposes one attitude is better than another: the 
painter must view a scene analytically and depict 
objects neither as they are physically, nor as they 
are perceived naturally, nor entirely as they are 
seen analytically, but somewhere between these 
appearances in order that his picture may have 
some likeness to the object as perceived. 
Personality and social factors. Studies purport- 
ing to show correlations between degree of percep- 
tual constancy and intelligence or self-esteem 
(Coopersmith 1964) and psychotic states (Weck- 
owicz 1964) may only be measuring the degree to 
which observers can adopt an objective or analyti- 
cal attitude as against the natural way of looking 
at things, and hence should not be interpreted as 
direct correlations between the mode of perception 
and complex personality traits or states. Similarly, 
cross-comparisons, with respect to the degree of 
constancy, between children and adults, subhuman 
and human subjects, and subcultures and our cul- 
ture should be interpreted with caution in view o! 
the large part played by field conditions and atti- 
tudes on perceptual constancy. The complex nature 
of objects and the many factors determining how 
they will be perceived, that is, whether constancy, 
superconstancy, or no constancy is found, a 
to the many resources organisms have at th e 
command for responding adequately to one 
under the changing conditions of the externa 
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ILLUSIONS AND AFTEREFFECTS 


Tlusions 


P According to a dictionary, illusion is defined as 
4 perception of a thing which misrepresents it or 
Ea it qualities not present in reality.” This is 

common-sense definition, not reflecting the in- 
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sights of scientists who study perception and in- 
sufficient for the purposes of this discussion. 

The main implication of the term “illusion” 
centers on what is real and what is mistaken per- 
ception. The question of reality has engaged the 
best minds among philosophers and scientists for 
centuries, and for them reality turns out to be 
something different from the common-sense belief; 
but the ordinary man is well satisfied with his 
own conception of what is real. The traditional 
common-sense belief is that objects and their prop- 
erties exist independently of perception and thus 
lie outside of perception and that these objects 
constitute absolute reality. It is believed that per- 
ceiving consists in becoming aware of these ob- 
jects and their properties. Present-day science, on 
the other hand, holds that there are two “worlds,” 
the world of the physicist and chemist, which is 
the energistic universe of molecules, atoms, and 
subatomic particles, and the perceptive and cogni- 
tive world that the human organism constructs, as 
a result of various forms of impingement, from 
the physicochemical world (Bartley 1950; 1958). 
The two worlds do not consist of the same kinds 
of items at all. Perception is the experience of the 
world of objects and things. Perception is induced 
by energy impinging upon sense organs but does 
not tell us anything directly about the physico- 
chemical world. Perception must enable the or- 
ganism to operate in this world. This operation 
is not on an absolute basis but on a probabilistic 
basis. Various sense modalities work together to 
this end. The probability of the perceiver’s effective 
action is greatly increased by the combined actions 
of several senses. 

Illusions, then, are instances in which direct 
perception does not promote effective motor rela- 
tions with the physical world or in which two 
sense modalities do not provide consistent in- 
formation. The trouble is not one of perception 
misrepresenting something that it should mirror, 
represent, or copy, as is ordinarily supposed. In 
illusion there is no failure or breakdown in the 
lawfulness of nature, as is implied in the common- 
sense definition. 

The classical example of two sense modalities 
conveying inconsistent information is provided by 
the wearing of glasses that invert the images pro- 
jected on the retina (Snyder & Pronko 1952). The 
immediate result is that what is up visually is in 
effect down motorically. Obviously in this case 
vision is no guide to motor action, and great con- 
fusion and distress arise. However, with practice 
the motor and visual functions become harmo- 
nized, with the result that the direction in which 
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the arm reaches for an object tallies with the visual 
appearance of the object's position. Both the initial 
disharmony and the final harmony occur accord- 
ing to natural law. 

The perceived location of objects need not bear 
any relation to where these objects may be reached 
by a motor act, such as extending an arm. The 
laws of optics determine the direction of rays of 
photic radiation (“light”) reaching the eye from 
objects. The only directional determinant given 
the viewer is the direction in which the rays enter 
the eye. Lenses, mirrors, and prisms can be inter- 
posed in various ways between photic source and 
eye so as to change the direction of the rays enter- 
ing the eye, and as a consequence objects are seen 
in a new position (Bartley 1950). Every apparent 
position is as lawful as every other, although some 
positions will be unexpected and called illusory. 

Kinds of visual illusions. It was shown in 1885 
that intermittent lines are seen as longer than 
continuous ones. A short time later it was reported 
that vertical lines tend to be seen as longer than 
horizontal ones. 

One of the earliest illusions involving angles was 
the Poggendorf illusion, reported in 1860. This 
was formed by a vertical rectangle or column with 
a diagonal line lying behind it. The two visible por- 
tions of the line do not seem to be segments of 
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Figure 1 — The Zöllner illusion 
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the same line. The Zöllner illusion was reported 
at the same time. It is depicted in Figure 1. The 
long diagonal lines are metrically parallel but ap- 
pear to be nonparallel. The short cross-hatch lines 
appear to be either horizontal or vertical lines in 
planes that represent the treads and risers of a 
stairway. The whole figure seems to represent a 
three-dimensional situation, but it is not consist- 
ently drawn for that purpose. Therefore, the figure 
disintegrates into several portions, each of which 
does represent internally consistent elements of a 
three-dimensional situation. Consequently, the long 
diagonal lines do not seem to be parallel. In an 
over-all three-dimensional situation they do not 
represent parallel elements. Some pairs converge 
to a far vanishing point, and some to a near van- 
ishing point. 

Figure 2 represents another illusion, one of 
columns. At the left the three rectangles are parts 
of a frankly three-dimensional depiction. At the 
right they are items in a homogeneous frontal 
plane ground. In the three-dimensional situation, 
the illusion of the right-hand column being longer 
than those to its left emerges. In the unstructured 
situation, the columns look equal in size, which 
they should be if lying in the same frontal plane. 
However, for those observers who may see the 
three columns as if in a three-dimensional situa- 
tion, some illusory effect results. The number of 
specific geometrical patterns in which unexpected 
(illusory) results are perceived is virtually endless. 

Theories of illusions. A number of theories 
have been advanced to explain (account for) il- 
lusory effects. These have been dealt with by 
Boring (1942) and by Woodworth ({1938] 1960). 
None of the theories appears to be adequate in 
respects, and many of them are even far-fetched. 
One of the more recent attempts to explain illu- 
sions involves the concepts of size and shape 
constancy (see Teuber 1960). It does not seem 
to be particularly convincing. 

In addition to the constancy theory the! 
five major theories attempting to accoun 
illusions: (1) the eye-movement theory; (2) the 
empathy theory; (3) the good-figure (Prignan2) 
theory; (4) the confusion theory; and (5) the Per 
spective theory. 

The eye-movement theory 
the observer does overtly, i.e., moving hi: 
the lines of a figure. Movement in one 
assumed to be more strenuous than in t 
therefore, lines lying in such a direction ie 
overestimated. The empathy theory relies on A 
elicitation of an aesthetic or emotional bias 1n 
observer. The good-figure theory assumes 
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Figure 2 — A typical visual illusion 


figures may either suggest or fully express some 
characteristic and that the observer tends to see 
the figure in a way that makes for the full expres- 
sion of this characteristic. The confusion theory 
suggests that the observer is unable to abstract 
fully the item in question from the presentation 
as a whole, hence some of the properties of the 
whole cling to the item itself. The perspective 
theory assumes that line drawings are not simply 
seen as items in a single plane (the frontal plane) 
but suggest three-dimensional scenes, with the 
law of perspective operating. [See GESTALT THEORY: 
SYMPATHY AND EMPATHY; VISION, article on EYE 
MOVEMENTS.] 

In all cases, the theories assume something 
about the way the observer operates and, if the 
assumptions are correct, do provide at least partial 
explanations of why visual presentations are seen 
in the ways called illusory. 

It must be seen that for the result to be called 
an illusion, a comparison of some sort has to be 
implied. The present appearance of the specific 
item is compared with that on other occasions. Some 
sets of circumstances are taken to be veridical 
(truth telling), and others are taken to be incorrect 
or deceptive. It would require considerable space 
to discuss this matter fully, but what seems to be 
needed to account for the way items appear is a 
Principle applicable as nearly universally as pos- 
sible. One needs to move as far as possible away 
from explanations that are as numerous and 


Variable as the examples (illusions) to be ac- 
counted for, 


It would seem that to come to grips with the 
essential features of the unexpected, the illusory, 
in vision, one might resort to the use of Gibson’s 
texture-gradient concept of the visual field ( 1950). 
This concept has already provided an over-all, uni- 
tary understanding of the macroscopic visual field 
and would therefore seem to hold possibilities for 
the smaller portions of the field that are usually 
dealt with in illusion. This concept might preclude 
making explanations as various as the illusions to 
be explained. 

The texture-gradient theory of space perception 
is, in a way, an elaboration of the perspective 
theory already mentioned, but it has gone so far 
beyond the earlier, abortive form that it merits 
special attention. Implicit in the theory is the idea 
that the perceiver tends to see things as three- 
dimensional presentations. The manner and the 
degree to which this occurs depend on the par- 
ticular presentation. Some presentations are geo- 
metrical figures with very few details and, of 
course, can be seen either as plane figures or as 
components of three-dimensional scenes. Some of 
these presentations are not consistent with actual 
three-dimensional scenes and cannot be so seen 
as totals. The figures tend to break into compo- 
nents, each of which can be seen as a component 
of a different three-dimensional depiction. Vari- 
ous inconsistencies result, and the illusory effects 
emerge from this. 

Actually, the term “illusion” loses much of its 
traditional meaning once the operational factors 
underlying perception are understood. 
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Color illusions and nonvisual illusions. The 
principles involved in geometrical illusions in 
vision also pertain to such visual qualities as color. 
Color is not inherent in the physicochemical world. 
It is what the individual sees. It is not absolute 
and is not inherent in the wave length of the photic 
impingement (stimulus). Factors other than wave 
length, such as intermittence of the stimulus, also 
affect the color seen by the observer. Shifting rates 
of intermittence, while wave length remains con- 
stant, result in the sensation of different hues, 
saturations, and brightnesses. The entire range of 
connections between stimulus input and resulting 
color perceptions is so varied as to preclude any 
meaningful discussion about true colors and illu- 
sory ones. The only way one can proceed at all is 
to specify the spectrum source (such as color), the 
temporal features of stimulus presentation, parts 
of the retina involved, intensity of photic input, 
residuals of past experience, and the adaptive state 
of the visual system. Thus, expectations about 
what color will be seen are totally operationally de- 
fined, and although some results are thought of as 
unusual and possibly illusory, they are all the law- 
ful products of the factors involved. [See VISION, 
article on COLOR VISION AND COLOR BLINDNESS.] 

Not all illusions are visual. Some are auditory. 
Some are haptic, involving the sense of touch, In 
fact, haptic parallels with most of the visual illu- 
sions are already known. They are not always 
recognized for what they are but are often dis- 
missed as transfers from visual experience rather 
than inherent operations of the sense modalities 
involved. 


Figural aftereffects 


Figural aftereffects are but one class of mani- 
festation of the fact that sensory inputs change 
the state of the responding system. Some figural 
aftereffects are quite ably demonstrated by repeat- 
ing a given presentation of a stimulus and com- 
paring the results. In vision, this is done by apply- 
ing an image to the retina for a brief time and 
then applying it again, while concurrently apply- 
ing the same pattern to a previously unstimulated 
portion of the retina for comparison, The two iden- 
tical images will not produce identical perceptions. 

Physiological residue from previous response is 
not at all unusual. In fact it is the rule. It so 
happens, however, that some kinds of sensory re- 
sults stemming from residual physiological effects 
have proved of more interest than others, Those 
classed as figural aftereffects provide one of the 
best examples. Visual afterimages provide another. 

Not all figural aftereffects are visual. They exist 
in other modalities, such as the tactual-kinesthetic 


complex. One of the early instances of visual fig- 
ural aftereffects was reported by Gibson (1933); 
in this case, geometrically straight lines were per- 
ceived as curved ones. He also reported a counter- 
part to this in tactual—kinesthetic perception, one 
produced by the subject’s rubbing his finger along 
a curved edge. As he did this repeatedly, the ap- 
parent curvature diminished. Subsequently, when 
he rubbed his finger along a geometrically straight 
edge, the edge appeared curved, with the curvature 
opposite in direction from that of the original. 

In Gibson’s example we are dealing with two 
phenomenal aspects of the situation—the effects 
of repetition, or previous exposure and response, 
and the fact that response is illusory. Whereas 
illusions and figural aftereffects are, in general, 
independent, we can see that they may be inti- 
mately interrelated in given examples. 

Theories of figural aftereffects. The explana- 
tion of figural aftereffects has taken a somewhat 
curious turn. Köhler (1940), who (with his col- 
leagues) undertook the most extensive study of 
figural aftereffects, chose to resort to simple 
physics to account for them. He utilized what he 
calls the principle of satiation. (Since he was, by 
intention, dealing with a physical situation, he was 
dealing with what physicists and chemists call 
“saturation,” not “satiation.” Satiation has to do 
with phenomena at the personalistic or psychologi- 
cal level, matters of appetite, boredom, etc.) _ 

Köhler and Wallach (1944) assumed that “satia- 
tion” occurs in the area of the sensory-projection 
fields of the brain as a function of the geometrical 
pattern of their involvement during inspection of 
a visual form. Thus, gradients of satiation, €x- 
pressed as electrical potentials in a volume con- 
ductor (the brain), form the isomorphic bases for 
the perceptions of distance and shape involved. 
The satiation theory with its oversimplified con- 
ception of isomorphism between visual target and 
visual perception rides roughshod over the mass of 
detailed information and understandings of 
day neurophysiology. It also ignores the inadequacy 
of isomorphism, which has been pointed out on a 
number of occasions (De Laguna 1930). 

The phenomena of figural aftereffects have Te; 
ceived a very different explanation by Osgood an 
Heyer (1952), who utilized conventional neuro- 
physiological concepts. Neither of these, nOr any 
other current theory for explaining figural ae 
effects, accounts for very much; hence, it is no 
profitable to describe them in detail. 

S. Howarp Bart 
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vI 
PERCEPTUAL DEPRIVATION 

A relatively recent development in experimental 
psychology has been the study of the effects upon 
human behavior of a severe reduction in the level 
and variability of sensory and perceptual stimula- 
tion. The attempts to achieve such a reduction in 
environmental stimulation are often referred to by 
such terms as sensory isolation, stimulus depriva- 
tion, sensory deprivation, and perceptual depriva- 
tion, Whatever the terminology, this condition can 
produce marked behavioral changes, for example, 
disturbances in perception, thinking, emotions, 
motivation, and, occasionally, the appearance of 
hallucinatory-like phenomena. Furthermore, these 
behavioral alterations are often accompanied by 
long-lasting disturbances of the electrical activity 
of the brain. 

Although experimental research on this topic is 
of recent origin, similar phenomena have long been 
reported. Observations by mystics, prisoners in soli- 
tary confinement, miners trapped underground, 
polar explorers, and solitary sailors have drawn 
attention to pronounced changes in behavior of 
people exposed to isolation (Brownfield 1965). 
Common to many of these are reports of disorienta- 
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tion, delusions, hallucinations, and a variety of 
emotional and cognitive disturbances. More recently 
it has been found that prisoners subjected to politi- 
cal indoctrination or “brainwashing” (a term first 
used in the Korean war) may experience similar 
reactions (Brownfield 1965). Finally, the medical 
literature provides numerous examples of behav- 
ioral disturbances associated with reduced sensory 
stimulation—patients with deafness, cataracts, Or 
detached retinas, and orthopedic cases. 

Although the existence of these phenomena has 
long been known, little or no scientific attention 
was paid to them until shortly after World War 1, 
when a convergence of influences from four major 
sources resulted in an unusually great interest in 
the effects of isolation. The first source of interest 
came from the highly publicized “confessions” ex- 
tracted by communist interrogators. What little 
information was available suggested that the re- 
sults were obtained by techniques which often em- 
ployed solitary confinement and the deliberate 
impoverishment of the prisoner’s perceptual envi- 
ronment. Drugs and physical torture were appar- 
ently not used. A second important source of 
interest came from the arrival of the space age. 
The crew of a space vehicle will not only have to 
live in very restricted quarters under relatively mo- 
notonous conditions but also, more importantly, 
they will be subjected to prolonged separation from 
society. Other technological advances, as reflected 
in increased use of submarines, isolated radar and 
meteorological stations, and of automated equip- 
ment in general, also provided considerable impetus 
to the initiation and development of research pro- 
grams dealing with reactions to restricted sensory 
and social environments. 

A third major source of interest originated in 
certain advances in neurophysiology, particularly 
the discovery of the brain-stem reticular formation, 
which is important in producing a general state of 
“arousal” or alertness in the organism. Since this 
alerting action seemed to be dependent upon con- 
stant exposure to sensory stimulation, experiments 
appraising the behavioral consequences of sensory 
deprivation were required, Finally, certain develop- 
ments within academic psychology were instru- 
mental in directing attention to this topic. One of 
these was in the area of motivation, in which at- 
tempts were made to show that animals have an 
active need for experience, i.e., they possess an 
exploratory drive or curiosity (Fiske & Maddi 1961, 
pp. 175-226). A study of human reactions to an 
impoverished environment might clarify the mech- 
anisms underlying this need for experience, change, 
or novelty. Another development within psychology 
came from studies of early sensory deprivation in 
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animals. The experiments of Austin Riesen, D. O. 
Hebb, and their associates had shown that labora- 
tory animals reared under various forms of isolation 
subsequently exhibited strikingly abnormal behav- 
ior (Beach & Jaynes 1954; Fiske & Maddi 1961, 
pp. 57-105). These findings suggested the possibili- 
ity of extending this work to humans. But, because 
of ethical considerations, only adults and short pe- 
riods of deprivation could safely be employed. 

The first experimental work on this topic began 
in 1951 at McGill University under the direction 
of Hebb (Bexton et al. 1954). Its purpose was to 
further our understanding of the mechanisms un- 
derlying brainwashing and of the lapses of atten- 
tion noted under monotonous environmental con- 
ditions, such as watching a radar screen, The 
results of this research were quite startling. The 
subjects, who were paid to do nothing for several 
days except lie in a cubicle and wear translucent 
goggles, soon reported vivid hallucinations, delu- 
sions, impaired intellectual efficiency, and an in- 
creased susceptibility to propaganda. This pioneer 
work, which was terminated in 1954, soon precipi- 
tated similar research at a large number of institu- 
tions in the United States, Canada, and England 
(Schultz 1965). 

Experimental procedures. A variety of proce- 
dures have been used to reduce environmental 
stimulation. Generally speaking, they fall into two 
main categories, namely, sensory deprivation and 
perceptual deprivation, In the first category, efforts 
are made to reduce sensory stimulation to as low a 
level as possible. This is usually accomplished by 
the use of a dark, soundproofed room in which the 
subject, wearing gauntlet-like gloves, is instructed 
to lie quietly on a cot or a mattress. Earplugs or 
earmuffs may be used to reduce further the level 
of sensory stimulation. Communication between 
subject and experimenter is kept to a minimum, 
thus reducing social stimulation. Under this cate- 
gory one can also place the water-immersion tech- 
nique, in which the subject, wearing an opaque 
mask, is suspended in a large tank of water and 
instructed to inhibit all body movements (Lilly 
1956; Shurley 1960). Because of its severity and 
numerous methodological problems, this method 
has received limited attention. Maximum endur- 
ance is approximately six hours. In the second 
category, perceptual deprivation, an attempt is 
made to reduce the patterning and organization of 
sensory stimulation while maintaining its level 
near normal. This is the procedure used in the 
work at McGill. The subject typically lies on a cot 
in a cubicle, wearing gloves and translucent gog- 
gles which permit diffuse light to enter the eyes 
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but eliminate all patterned vision. A masking sound, 
usually white noise (a hissing sound like escaping 
steam), is directed into both ears. The intensity of 
the light and noise is maintained at a constant 
level. A less commonly used variation, employed 
for durations of less than a day, involves placing 
the subject in a tank respirator used for poliomye- 
litis patients and exposing him to the blank walls 
of a screen and the repetitive drone of a motor. 

The experimental literature on sensory and per- 
ceptual deprivation has been characterized by wide 
differences in the quality of the studies, ranging 
from carefully designed experiments employing 
precise psychophysical measures to vaguely formu- 
lated studies using a handful of subjects, and no 
controls, and relying entirely on the subject's oral 
reports, which are often unchecked for their re- 
liability. The presence of numerous contradictory 
findings is, therefore, not surprising. Large differ- 
ences also exist in the duration of the reduced 
sensory stimuation, ranging from several minutes 
to two weeks. In this article, little attention will be 
paid to studies employing durations of less than a 
day, since doubts exist as to whether many of the 
effects are due to reduced sensory stimulation. 
These short-term studies, however, have been sum- 
marized by Kubzansky (1961) and Fiske & Maddi 
(1961, pp. 106-144). 

Affective changes. There seems little doubt that 
exposure to either sensory or perceptual depriva- 
tion is a stressful experience (Zuckerman 1964). 
The subjects often report severe boredom, restless- 
ness, irritability, anger, unrealistic fears and anx- 
ieties, depression, disorientation in time, and 
vague physical symptoms that are only rarely re- 
ported by control subjects. Also, their dreams, 
which are often exceptionally vivid, are largely of 
an anxiety nature whose main theme often con- 
cerns death or restricted spaces. Finally, several 
instances of euphoria of the type sometimes ex: 
perienced by deep-sea divers and high-altitude 
flyers have been reported, but usually only after 
prolonged periods of deprivation. 

Hallucinatory-like experiences. Undoubtedly the 
most dramatic finding of the original McGill study 
was the report that a variety of vivid hallucinatory- 
like phenomena, similar to those described for 
mescal intoxication, could be produced in the m4 
jority of normal subjects by exposure to sever 
days of perceptual deprivation. These phenomen? 
were largely visual and ranged in complexity ae 
dots of light, lines, or simple geometrical Pace 
to meaningful integrated scenes of a picture: 3 
nature. There were also reports of hallucinate 
involving other senses, as well as occasional delu 


sions and disturbances of body image. The pres- 
ence of these psychotic-like reactions excited the 
interest of numerous investigators, particularly 
clinically oriented groups, since it appeared that a 
new experimental approach to the study of various 
abnormal or pathological states which had long 
resisted scientific analysis had been discovered. 
Unfortunately, the subsequent research on these 
phenomena has generated more problems than it 
has solved. 

In general, the early research, largely using 
short-term isolation, supported the McGill findings. 
Lilly and Shurley reported similar phenomena after 
water immersion, except that they occurred in a 
much shorter period of time. Other investigators 
also commented on the prevalence of complex vis- 
ual and auditory imagery, delusions, and other 
unusual reactions. More recent experiments, how- 
ever, lasting from 2 to 14 days, have indicated that, 
in general, these experiences are much less preva- 
lent and complex than was first believed, particu- 
larly under conditions of perceptual deprivation. It 
also appears that numerous variables may influ- 
ence the complexity and frequency of these phe- 
nomena. Among these are the attitudes, suggestion, 
or “set” resulting from the instructions given a 
subject prior to isolation, the degree of motor ac- 
tivity permitted during isolation, and the time at 
which the self-reported experiences are obtained 
from the subject. Since these variables can influ- 
ence the results to varying degrees, they may be 
responsible for some of the contradictory findings. 
It is also becoming increasingly evident that vari- 
ous unusual experiences, similar to those already 
described, are quite common even under relatively 
normal environmental conditions, for example, 
while lying quietly for several days in an ordi- 
nary room. This suggests the necessity of control 
groups to provide a base line of the incidence of 
normally” occurring hallucinatory-like reactions 
against which the deprivation phenomena can be 
evaluated. Unfortunately, this has rarely been done 
in the past, presumably because it was felt that 
control subjects would not report “seeing” things. 

In conclusion, it would appear that a variety of 
hallucinatory-like experiences can occur during iso- 
lation. However, the extent to which they can be 
attributed solely to reduced sensory stimulation re- 
mains to be established. Furthermore, it is doubt- 
ful whether these phenomena can be looked upon 
as mental aberrations resembling those occurring 
in various pathological and psychotic states. They 
seem to possess few of the characteristics of the 
teactions of the mentally ill; rather, they show a 
Greater resemblance to certain types of normal 
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imagery often seen during certain periods of re- 
duced awareness, just prior to sleep, for example. 
In view of this, it is conceivable that future re- 
search may show that these seemingly unusual dep- 
rivation phenomena largely represent an accumu- 
lation, over time, of a variety of essentially normal 
experiences. 

Perceptual and motor abilities. Another dra- 
matic aspect of the McGill studies was the pres- 
ence of gross disturbances of the perceptual 
environment. Subjects, upon emerging from 
several days of isolation, reported pronounced 
movement of the visual field; changes in size, 
shape, vividness, and brightness of objects; after- 
images; and distortions of human faces. Although 
these postisolation phenomena usually disappeared 
within half an hour, some subjects still experienced 
them a day later. Subsequent research, however, 
also of 2 to 14 days’ duration, has not verified the 
existence of these perceptual distortions. Although 
various objects are seen as much brighter and more 
vivid in color, they seem to undergo no gross 
changes in size, shape, or movement. What distor- 
tions do occur are both minimal and transitory in 
nature and could be due to temporary disturbances 
of eye movements and equilibrium. The reasons for 
this discrepancy in results are difficult to find. Per- 
haps the McGill results were produced by some 
unique interaction of several variables of a pro- 
cedural, personal, or motivational nature. 

In contrast with the subjective reports, the re- 
sults of objective tests of perceptual and motor 
functions show greater agreement. There seems 
little doubt that various aspects of visual—motor 
coordination are impaired, e.g., rail walking, hand- 
writing, rotary pursuit, mirror tracing, and various 
measures of simple eye-hand coordination. Color 
perception is also uniformly impaired, On the other 
hand, certain basic perceptual processes, such as 
depth perception and size constancy, are immune 
to even prolonged periods of deprivation. 

A few measures, surprisingly, are facilitated. 
Auditory vigilance, as measured by speed of reac- 
tion to infrequently presented tones, is improved. 
An increase in tactual acuity and in pain sensi- 
tivity also occurs. Since this cutaneous supersensi- 
tivity can result from visual deprivation alone, the 
possibility exists that a severe reduction in sensory 
input from several modalities may not be essential 
for the appearance of certain deprivation phe- 
nomena. Some of these may be specific to a par- 
ticular sense modality or, alternatively, may be 
produced by interference with any one modality. 
Finally, there are suggestions that a wider variety 
of behavioral measures are impaired by perceptual 
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than by sensory deprivation. This particularly 
applies to prolonged periods of deprivation. For ex- 
ample, both visual and auditory vigilance are dis- 
turbed by perceptual deprivation, while only visual 
vigilance is impaired by sensory deprivation. Again, 
rate of reversal of ambiguous figures is significantly 
affected by perceptual but not sensory deprivation. 

Cognitive abilities. The extent of cognitive 
changes also seems to be related, to some degree, 
to the type of deprivation condition employed. 
Again, there are indications that disturbances of 
intellectual functioning are greater after percep- 
tual deprivation than after sensory deprivation. In 
the original McGill research on perceptual depriva- 
tion, nearly all of the subjects reported an inability 
to concentrate, lack of clarity in thinking, and dif- 
ficulty in organizing their thoughts. These subjec- 
tive reports were confirmed by results derived from 
a battery of objective tests, Significant decrements 
were observed on tests of word making, anagrams, 
and various numerical abilities. Subsequent re- 
search has not only confirmed these results but 
has also indicated that other abilities, such as ab- 
stract reasoning and space visualization, can be 
impaired. On the other hand, the cognitive effects 
of prolonged sensory deprivation are not particu- 
larly severe, Few abilities are impaired and these 
only moderately, Furthermore, certain mental abili- 
ties appear to be facilitated. Memory for a prose 
passage and immediate memory span for digits 
tend to be superior. Certain types of verbal learn- 
ing also seem to be facilitated. Thus it would ap- 
pear that the performance of isolated subjects is 
not always impaired. Certain perceptual measures 
as well as some forms of learning and memory can 
improve in an impoverished sensory environment. 
The reasons for this differential effect, however, 
are not known as yet. 

Susceptibility to propaganda. The evidence in- 
dicates that the beliefs and attitudes of isolated 
subjects can be altered by appropriate propaganda 
material. For example, the experimental subjects 
at McGill showed a greater belief in a variety of 
psychic phenomena, after listening to recorded lec- 
tures advocating the existence of these phenomena, 
than did nonisolated controls listening to the same 
material. They also made more requests for the 
lectures, Furthermore, these propaganda effects 
were still present to some degree two weeks after 
termination of isolation. Recently, other labora- 
tories, appraising attitudes toward certain national 
groups, have also reported a greater frequency of 
requests for propaganda recordings by the isolated 
subjects, Attitudinal changes were also observed; 
but their presence and extent were dependent upon 
a number of variables, for example, the subjects’ 


initial attitudes. If these are neutral, the propa- 
ganda material is quite effective; but if their initial 
attitudes are either positive or negative, little or no 
effect may occur. Intellectual level is another vari- 
able, with susceptibility to propaganda being some- 
what greater in subjects with lower intelligence, 
Although the results in this area are as yet meager, 
they are important in furthering our understanding 
of the so-called brainwashing phenomena and their 
dependence on a limited sensory environment. 
[See ATTITUDES, article on ATTITUDE CHANGE; 
BRAINWASHING; HYPNOSIS; PERSUASION; PROPA- 
GANDA; SUGGESTION. ] 

Therapeutic effects. Various researchers have 
claimed that therapeutic benefits can be achieved 
by merely exposing psychiatric patients to brief 
periods of sensory or perceptual deprivation. Among 
these beneficial effects are a reduction in the inten- 
sity of hallucinations, an increase in ego strength, 
less rigid utilization of defenses, a greater recog- 
nition that their difficulties originated in them- 
selves, and an increased desire for social contacts 
and therapeutic relationships. Unfortunately, these 
results are difficult to evaluate, since control groups 
are rarely used and agreement on results has not 
always occurred, Despite these shortcomings, thera- 
peutic benefits probably do occur, particularly since 
certain perceptual and intellectual processes are 
known to be facilitated during isolation. It is also 
possible that even greater benefits might result from 
the insertion of therapeutic material into the dep- 
rivation situation.’This method might prove to be 
extremely effective with psychiatric patients, since 
it would capitalize on the isolated subjects’ in- 
creased need for and receptivity to environmental 
stimulation. 2 

Physiological changes. Until recently, physio 
logical changes associated with deprivation have 
received little attention. Furthermore, what studies 
are available are largely concerned with changes 
in the electrical activity of the brain occurring dur 
ing a week of isolation (Zubek & Welch 1963). 
These have revealed a progressive decrease in a 
cipital lobe frequencies with increasing Jengi 
deprivation. They have also shown that this de- 
crease is more pronounced after exposure to a 
ceptual deprivation than sensory deprivation, & ag 
that may be related to the greater behavioral pee 
pairments which are known to occur after percep 
tual deprivation. The disturbances of brain Mae 
activity are even more striking during two W d 
of isolation. Not only is there a progressive a 
crease in frequencies in the alpha range but E 
decrease also appears to be approximately twice 
great during the second week as during the her 
Cumulative effects seem to be indicated. Furt 


more, follow-up records reveal that some degree of 
brain wave abnormality may still be present ten 
days after termination of isolation. Accompanying 
these physiological changes, long-lasting motiva- 
tional losses of up to eight days’ duration may 
occur. In the light of these results, one can only 
speculate about the possible physiological and 
psychological state of prisoners of war and others 
who, in the past, have been isolated for months or 
even years. 

Large individual differences, however, are pres- 
ent, Some subjects show considerable disturbance 
of brain wave activity and behavior, while others 
reveal an almost normal record and few behavioral 
changes even after 14 days of deprivation. Similar 
results have also been observed in prisoner isola- 
tion practiced by Russian and eastern European 
state police. Most prisoners develop symptoms of 
disorganization within three to six weeks; but some 
have been known to endure this for many months, 
while others have succumbed within days. Large 
differences also exist in the capacity to withstand 
the effects of sleep deprivation. Some individuals 
can endure 100 hours with their functions largely 
intact, whereas others become disorganized and 
ineffective after only 48 hours of sleep deprivation. 
The reasons for these large individual differences 
are not fully understood, but it is believed that such 
factors as the subjects’ genetic make-up, person- 
ality, attitudes, and perception of the immediate 
situation may all play some role. 

Data on physiological responses other than the 
electrical activity of the brain are sparse. Several 
workers have measured skin resistance, circulatory 
and respiratory changes, and the output of the 
adrenal glands, but little of a positive nature has 
emerged (Schultz 1965; Zubek 1964). 

Tolerance of prolonged isolation. Not all volun- 
teers can endure prolonged periods of deprivation. 
Some can endure the condition for many days, 
While others terminate it within a few hours. De- 
Spite these large individual differences, no satisfac- 
tory predictive measures of isolation tolerance have 
oe developed. Various paper-and-pencil 
aoe of personality, together with certain perceptual 
ihe have been tried but with little or no pre- 
Rae success ¢ Hull & Zubek 1962), Perhaps other 
E possibly of motivation, attitudes, and 
edi kan prove to be more rewarding. Although 
‘tie ang success in advance of isolation has been 
Aii ctiye; certain behavioral responses occurring 
Naa isolation are excellent predictors of later 
R S, example, volunteers who exhibit con- 
tation z ; oredom, restlessness, and time disorien- 
the sod in isolation almost invariably terminate 

tion prematurely. This would suggest that 
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preisolation tests utilizing such measures could 
prove to be excellent predictors of isolation 
tolerance. 

Various factors can make isolation more tolera- 
ble and also minimize the impairments, for exam- 
ple, prior exposure to isolation, specification of 
duration in advance of the experiment, and the 
introduction of physical exercise while in isolation. 
Of these factors, exercise seems to be the most 
effective. Under this condition, virtually all impair- 
ments of behavior and of the electrical activity of 
the brain may be eliminated. This finding, inci- 
dentally, substantiates the reports of some ex- 
plorers and prisoners of war who have claimed that 
performance of calisthenics is an effective method 
of combating the effects of isolation. 

Theoretical explanations. A variety of theories 
exist as to how reduced environmental stimulation 
exerts its effects (Schultz 1965). Briefly, three 
general types of theories have been formulated : 
(a) The psychoanalytically oriented interpretations 
postulate changes in the relationship between the 
functioning of ego and id or a weakening of the 
ego for reality testing. (b) The theories of a psycho- 
logical nature attribute the effects to the organism's 
continuous search for order and meaning in an 
unstructured perceptual environment, or they pos- 
tulate a disruption of the process of evaluation by 
which the models and strategies used in dealing 
with the environment are monitored and corrected. 
(c) Finally, neurophysiological theories emphasize 
the function of the reticular system because of its 
importance in attention, perception, and motiva- 
tion. According to them, a decrease in the level and 
variability of sensory stimulation coming into the 
reticular activating system, via collateral fibers 
from the sensory systems, disturbs its normal rela- 
tion with the rest of the brain, thus producing both 
behavioral and physiological effects. The neuro- 
physiological theories seem to be the most prom- 
ising but, as yet, we are still in search of an ade- 
quate theory of stimulus deprivation. 

J, P. ZUBEK 


[Other relevant material may be found in ATTENTION; 
NERVOUS SYSTEM; SLEEP; STIMULATION DRIVES. ] 
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VIL 
SPEECH PERCEPTION 


Speech perception is the identification of pho- 
nemes, that is, the vowels and consonants of lan- 
guage, largely from acoustic clues, and the recog- 
nition of the phonemes in combination as a word. 
Other clues that bear upon the perception of speech 
relate to rules of syntax, probabilities within the 
language, precision of utterance, interfering or 
competing noise or messages, and the listener's 
familiarity with the language. 

The effects of perceiving speech are both imme- 
diate (for example, an ongoing conversation) and 
long range, even affecting the development of lan- 
guage. That language is constantly changing is a 
truism, but that this relates to the perception of 
speech is reason to give the social scientist pause. 
The listener who fails to detect a phoneme, who 
erroneously hears a phoneme from afar as being 
of his own pronunciation dialect, or who seems to 
hear a sound where none was intended is on the 
verge of precipitating a change of language when 
he speaks. 


Physical clues to speech perception 


English utilizes approximately a dozen vowels 
and twice as many consonants, no more than one- 
third of the sounds that are represented in the 
International Phonetic Alphabet. Each phoneme, in 


turn, is a singular distribution of acoustic ener, 
among selected frequencies below 7,000 cycles per 
second (cps). In the instance of vowels this energy 
is concentrated in two or three bands of frequen- 
cies, termed “formants” (Fant 1960). 

Vowels have been successfully synthesized on 
the basis of two formants (Dunn 1950), Synthetic 
vowels have been made more intelligible by the 
addition of a third formant, approximately 600 cps 
above, and of the same intensity as, the second 
formant. This is particularly true of the front 
vowels, the ones of seat, sit, met, mate, and sat 
(R. L. Miller 1953; Delattre 1951). The formant 
concept is applicable to some consonants as well 
as to vowels, for example, the four consonants of 
the word unrolling. In general, “locus” refers to a 
more useful concept than does formant in describ- 
ing the crucial concentrations of energy in a con- 
sonant. Locus is a “fixed frequency position” and 
an important clue for the perception of individual 
consonants in the context of vowels. Another useful 
concept is “transition,” that is, a gradual shift from 
the locus to the steady state of the vowel in a 
syllabic environment (Delattre et al. 1955). These 
characteristics provide important clues for the per- 
ception of the consonant, sufficient in the repro- 
duction of synthetic vowels to suggest which con- 
sonant might have preceded a vowel. 

There is further evidence that a consonant and 
a subsequent vowel interact. Individual phonemes 
have been extracted from recorded speech and set 
aside as “building blocks.” When reassembled to 
form new words, some consonants retained their 
intelligibility; others, such as those in pig» fwe, 
make, and thrash, were frequently misundersto 
in the new phonetic environment (Harris 1953). 

Distortion and attenuation. A common po 
dure for studying the determinants of recognizable 
speech is the introduction of distortion and a 
ation of physical properties of the signal. The arti 
ulation index was developed in this manner, ae 
it, in turn, provides spectrum and acoustic ee 
further clues for the perception of speech (Frenc 
Steinberg 1947). 

The study of the duration of a vowel ae 
for recognition led Gray (1942) to present ee at 
of 11 vowels, as spoken by a variety of pee 
different pitches, to a panel of listeners. TTE 
fered in their ability to identify the short $% an 
and found some vowels more perceptible on 
others; some observers identified vowels a 
hearing no more than one-quarter of a single van- 
wave. At the other end of this continuum no & fag 

f ech perception attend the prolong! 
tages for speech pi rian 1953): 
of a vowel beyond a normal length ( 


Discrete segments of running speech have been 
eliminated by as much as one-half without serious 
effects (G. A. Miller & Licklider 1950; Garvey & 
Henneman 1950). The same outcome is accom- 
plished through compressing speech by a procedure 
that avoids the usual shifts in pitch (Fairbanks 
1956). 

The intelligibility of speech is resistant to the 
effects of peak clipping, a process that alters the 
wave form and reduces the irregular profile of 
sound waves to a series of square waves (Lick- 
lider & Pollack 1948). 

Multidistorted speech presents an opportunity to 
test whether or not the perceptual effects of dis- 
tortion are predictable, for example, as the joint 
probability or joint intelligibility of the single dis- 
tortions (Black & Agnello 1964). 

In summary, acoustic clues for the perception of 
speech sounds and words relate to audible frequen- 
cies below 7,000 cps and to the relative amount 
of acoustic energy momentarily associated with 
selected frequencies within this range. As speech 
is attenuated or is distorted (e.g., through filtering) 
the perception of speech is impaired. However, it 
is more resistant to distortion than would be ex- 
pected, an outcome that suggests the importance 
of clues other than acoustic ones in the perception 
of speech. 


Nonacoustic clues to speech perception 


l Articulatory clues have been held to be crucial 
in the perception of speech sounds, resulting ina 
motor theory of speech perception. Perception of 
consonants is categorical and not continuous. This 
categorizing is in agreement with the distinctive 
features of articulation, especially the place at 
which the consonant is formed (Delattre et al. 
1955). However, such mediation of articulatory 
correlates has been questioned because of lack of 
sufficient and integrated acoustic clues in synthetic 
speech (Fant 1964). 
i Probability and familiarity. Another nonacous- 
ic feature of speech relates to probability: the 
more likely an event of oral language, the more 
readily it is perceived. For example, under circum- 
stances where the first one thousand most fre- 
oT used words were 66.9 per cent intelligible, 
aa one thousand most frequently used words 
As 9.5 per cent intelligible, and the intelligibility 
ail : intervening categories decreased systemati- 
ae i the most common to the least common 
eee lack 1952; Pollack 1964). The familiar 
Tate: ary itself may differ from person to person. 
thei ample, listeners were grouped according to 
it values as shown by the Allport-Vernon scale, 
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Figure 1 — Intelligibility of vocabularies of various sizes 
(2-256 words) as a function of signal-to- 


noise ratio 
Source: From Miller, Heise, & Lichten 1951. 


and commonly used words were categorized simi- 
larly (Carlton 1953). Each group of listeners iden- 
tified a higher proportion of words that related to 
its interest than words that did not. This can 
readily be interpreted as representing a listener's 
greater familiarity with the vocabulary of his 
interest. 

Vocabulary size. Closely akin to the probability 
that arises from disproportionate usage is one that 
derives from vocabularies of different sizes. The 
more limited the number of words, the higher is 
their intelligibility (G. A. Miller et al. 1951). Fig- 
ure 1 shows the relative intelligibility of words in 
vocabularies of 2, 4, 8, 16, 32, and 256 words 
heard in different signal-to-noise (S/N) ratios. The 
S/N ratio states in decibel units (db) the amount 
by which the speech signal exceeds the noise. 
Should the two have the same power the ratio 
would be zero. Words in a 2-word vocabulary were 
80 per cent intelligible in a 12 db S/N ratio, while 
words of a 32-word vocabulary were only 40 per 
cent intelligible. The typical noise used for control- 
ling conditions of listening is thermal, or white. 
All frequencies are present. The sound resembles 
that of a jet aircraft. 

Context. A third aspect of probability that pro- 
vides clues for speech perception derives from con- 
text and includes both meaning and the rules of 
syntax. Figure 2 depicts the intelligibility of words 
heard both in isolation and in sentences. The ad- 
vantage provided by context may be compared 
directly to differences in intelligibility that accom- 
pany two S/N ratios (G. A. Miller et al. 1951; 
O'Neill 1954). The beneficial effects of context is 
even apparent in groups of words representing 
second-order word approximation; for example, in 
“eat mushrooms please speak to drive there” speak 
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Figure 2 — The effect of sentence context upon the 
intelligibility of words in various signal- 
to-noise ratios 

Source: From Miller, Heise, & Lichten 1951; O'Neill 1954. 


is more intelligible than when it is spoken alone 
(Traul & Black 1965). 

Phonemes and words. The study of phonemes 
and words reveals further clues to speech percep- 
tion. First, the different speech sounds have unique 
values of intelligibility. Some sounds enhance the 
discrimination of words that contain them; other 
sounds reduce the likelihood of correct perception. 
The first phonemes of ought, oat, pat, fat, think, 
cake, and love are in the latter category. Such a list 
is dependent on the system in which the tests are 
made. For example, the phoneme /s/ has been re- 
ported as both a deterrent and an aid to intelligi- 
bility. The difference lies in the two systems over 
which words were heard (Black 1952). 

The phonetic complexity of a word relates to its 
perception. In the system cited above that yielded 
the intelligibility values of more and less familiar 
words, one-syllable words of two sounds were 
55.9 per cent intelligible, and words with 3, 4, 5, 
and 6 sounds were 56.2, 58.3, 63.0 and 67.7 per 
cent intelligible, respectively. Similarly, 2-syllable 
words were more intelligible than one-syllable words 
of the same number of sounds; 4-sound words of 
one and two syllables were 58.3 and 66.5 per cent 
intelligible, and 5-sound words, 63.0 and 68.9 per 
cent, respectively. Thus, a second syllable was more 
than the equivalent of an additional sound (Black 
1952). 

The relation of the intelligible phoneme to the 
perception of the word is complex, affected by the 


frequency of usage of the word and the phonetic 
complexity of the word. Nonsense syllables provide 
a different model: the intelligibility of the syllable 
is the joint intelligibility of the constituent pho- 
nemes (Fletcher & Steinberg 1929). Applied to 
words, this gives a highly spurious result: the 
longer the word, the lower the predicted intelligi- 
bility—an outcome not in keeping with the facts, 
Attempts to predict the intelligibility of a word 
from the known values of the constituent phonemes 
have led to the tentative formula: “The intelligi- 
bility of the word is approximately equal to the 
product or joint probability of the intelligibility of 
the first two phonemes of the word.” This does not 
imply that all of the phonetic value that contributes 
to intelligibility lies in two phonemes or that the 
total potential of the two is utilized (Black 19652), 


Negative and positive reinforcement 


Attempts to alter speech perception led to the 
development of infinite peak clipping, on the one 
hand, and jamming, on the other (Licklider & 
Pollack 1948). Some impediments to perception 
are built into speech and relate to human frailty. 
For example, in one study listeners were asked to 
identify the final 3 words of oral sentences from 
3 to 17 words in length. Even in quiet, the listeners 
were progressively less successful in processing the 
longer sentences, and the detrimental effect of sen- 
tence length increased systematically as more and 
more noise was introduced into the system. Correct 
scores associated with 5- and 15-word sentences in 
four conditions of noise (10, 8, 6, and 4 db sA 
ratio) were, respectively: 94 and 84 per cent, r 
and 76 per cent, 73 and 47 per cent, and 50 ani 
27 per cent (Black 1961). 

Sound similarity, Similarity in sound (rhyme) 
may affect speech perception. The letters soun 
similar to others in the same set but different from 
those in the other set: (b, c, t, d, v) and (f, ™ H 
s, x). As the letters were heard and nemed ai 
errors of identification were preponderantly wi ie 
a set, not between sets. This also characterized i 
confusions in recall of the oral materials (Cont! 
1964). Such interference was also evident T 
listeners who were asked to identify, from @ bee 
multiple-choice format, words that they had ents, 
moments earlier. Recall was poorer when per p 
ulus word was similar in sound to the mu tip» 
choice foils (Kresheck & Black 1964). has 

Unfamiliarity. An unfamiliar langna a fe 
built-in interferences to speech perception. Eng: 
students are consistently less able to per 3 
lish words than are American listeners, ee 
marking responses on a simple multiple- 


form. Interestingly, the relative skill of these stu- 
dents in perceiving oral English words relates to 
other aspects of their orientation to English, for 
example, the amount of time they devote to pauses 
in their English speech (Black et al. 1965). In turn, 
American students have difficulty identifying Eng- 
lish words spoken by foreigners; however, with 
sixty minutes of practice in hearing simple English 
prose read in the pertinent foreign dialect the 
listeners improve markedly in their perception of 
the speech of foreign students (Black & Tolhurst 
1955). With respect to American pronunciation 
dialects, when listeners in a military training pro- 
gram were categorized according to the regional 
units in which they were inducted into service, the 
groups varied in their ability to identify words as 
spoken by their peers from the various regions 
(Mason 1946). 

Interference and delay. An intriguing charac- 
teristic of speech communication is that a listener 
can single out one talker from a group and converse 
with him in the face of an overridingly unfavorable 
S/N ratio. This might be termed the “cocktail party 
effect.” Despite this fortunate circumstance, a 
competing message may interfere disastrously with 
speech perception. A control center for air traffic 
may be a babel of competing messages. An experi- 
mental approach led to several recommendations 
for such a center: separate the sources of speech 
(loud-speakers) by at least 10 degrees on a hori- 
zontal plane; introduce a singular filtering in the 
most-used circuits; identify message sources with 
visual lights (Spieth & Webster 1955). 

Another investigation treated a primary message 
and an interfering message that were presented 
either simultaneously or separately with slight tem- 
poral disparities. Phonetic similarity between the 
two messages was one deterrent; particular tem- 
poral differences, another; the primary message 
Was least well received when it was initiated ap- 
proximately one-third of a second subsequent to the 
initiation of the interfering message (Peters 1954). 
Many gimmicks have been tried for reinforcing 
a For example, speech was presented doubly 
oe ae line and separate lines to a listener's 
ae ne rendition was delayed in varying amounts 
wo: 330 msec, Compared to normal reception, 
nee s pere heard as well, but no better, with the 
viento separated by 150 and 300 msec; 
TA values of delay yielded detrimental effects. 
TE heed vein, the lower frequencies of the 
higher cosa have been put to one ear and the 
rane sta amplified, to the other, delayed 
iltea. o improvement in speech perception re- 

ted (Camp 1958). 


PERCEPTION: Speech Perception 55 9 


Repetition and instruction. Messages them- 
selves have been tailored for reinforcement. Repe- 
titions gave slightly improved intelligibility but were 
costly in terms of time (G. A. Miller et al. 1951). 
Listeners have been warned against highly prob- 
able errors, that is, they have heard, “Write faith, 
not face.” This procedure produced a marked gain 
in intelligibility; however, face continued to be one 
of the written responses, although an infrequent 
one (Traul & Black 1965). The transmission of 
digits has been enhanced by the repetition of the 
initial sound of the digit, as in t-two, th-three, 
f-four; this is termed “voluntary stuttering” (Moser 
et al. 1956). 

Listeners’ agreements in errors. This article 
focuses on correct perception of speech. An inter- 
esting pattern emerges from the incorrect percep- 
tions: listeners tend to agree in their errors. Thus, 
where words were intelligible to 50 per cent of the 
listeners, another 25 per cent nonetheless agreed 
on one wrong response, and about 12 per cent 
agreed on a second wrong response. As a rule the 
second-error response occurred with one-half of 
the frequency of the first, and the third-error re- 
sponse, 0.6 of the frequency of the second. More- 
over, the same proportions were present, whether 
in a closed message set with a specified and limited 
response form or in an open message set in which 
a listener could write any word he thought he heard 
(Black 1965b). That the ratios among the errors 
were the same in open and closed message sets is 
taken as an instance of the principle of the con- 
stant ratio (Clarke 1957). 


This summary of representative work on speech 
perception has treated particularly (a) the deter- 
mination of responses by acoustic clues and (b) by 
nonacoustic clues, (c) interferences and reinforce- 
ments that may affect perception, and (d) the tend- 
ency of listeners to make the same responses, even 
in error. Listening behavior generally directs speak- 
ing behavior and thus relates to changes in lan- 
guage, which are of interest and importance to the 


social scientist. 
Joun W. BLACK 


[See also HEARING; LANGUAGE; LEARNING, article on 


VERBAL LEARNING.] 
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PERSON PERCEPTION 


‘ocesses 
Person perception concerns the study of pr 


by which we come to know and think about wa 
persons, their characteristics, qualities, an ocial 
states. This area has been variously oe ee 
perception, person cognition, interperson@ E some 
tion, connaissance d'autrui, to mention OMY 


of the phrases used. It overlaps, but is not cotermi- 
nous with, psychodiagnosis by the expert. In the 
phrase “person perception” used here, the term 
“perception” is used in a very loose way, most often 
meaning apperception and cognition. 

As a physical object a person is, of course, not 
different from other stimuli, and the processes of 
perception and cognition of social and nonsocial 
objects are probably basically the same, as Egon 
Brunswik repeatedly maintained ({1947] 1956). In 
the sense that we perceive or infer primarily inten- 
tions, attitudes, emotions, ideas, abilities, purposes, 
traits, thoughts, perceptions, memories, conscious- 
ness, and self-determination—events that are, so to 
speak, inside the person and strictly psychological 
—persons are doubtless special objects. Person per- 
ception is unique in that the similarity between the 
perceiver and the object perceived is greater than 
in any other case. This fact has far-reaching con- 
sequences in that the perceiver is probably maxi- 
mally inclined and able to use his own experience 
in perceiving, judging, or inferring another's state 
or intentions. 

The process by which we know people did not 
receive formal and separate attention until the lat- 
ter part of the nineteenth century. It was the work 
of Darwin (1872) on emotional expressions and 
their recognition that gave scientific impetus to this 
problem area. At the beginning of this century the 
approach was extended to the process by which we 
know any characteristics of another and to deter- 
mining what are the characteristics of the “good 
judge” of other persons. How these processes are 
related to action was seen as basic to an under- 
standing of interpersonal behavior. 

The general structure of this problem area be- 
comes apparent when the various elements involved 
are set out for consideration. In the basic situation 
of person A’s perception or cognition of person Bs 
characteristics or states, the major components 
involved in the process are: (1) person B’s charac- 
teristics or state (say, fear, courage, intelligence, 
happiness, attractiveness to others, or intention to 
help); (2) the concomitants of B’s characteristics; 
on Toa cues or manifestations of B’s charac- 
in oa at are, so to speak, available to A (these 
a See include cues from the environment and 
aes ernal to the person); (4) the proximal cues 
eer ee of B’s characteristics that are 
lize a A; (5) the cognitive processes that uti- 
ie: ee Cues; and (6) the percept or 

These a A of B’s characteristics. A 
arena e a and their relationships corre- 
ETEA e various lines of investigation that 

eloped around this problem area: whether 
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emotional or mental states and personal character- 
istics have consistent expressions, whether these 
manifestations yield usable cues, the question of 
which cues the observer uses, and by what process 
he forms his judgments, and, finally, whether there 
is a valid connection between the state or charac- 
teristic of B and the judgment of it by A. 

Common to all these approaches is the problem 
of finding good ways of systematically and objec- 
tively specifying the “stimulus,” the object of the 
perception or cognition, which is often a covert and 
distal variable, such as an emotion, trait, or inten- 
tion. This presents difficulties not usually encoun- 
tered in the more traditional regions of psychology. 
For example, the judgment that a person is “kindly” 
is not related to one specific stimulus constellation 
but is based on greatly diverse observations, such 
as appearance, gestures, and actions that radically 
differ from one another, and on contexts, simul- 
taneous and sequential, that are equally diverse. 
The events that produce the judgment of “kindli- 
ness” in any single situation are most often too 
complex to be described in terms of physical dimen- 
sions, although the physical dimensions of another 
person and of his behavior, such as his facial 
features or movement in space, are regularly used 
by the beholder as cues of the psychological char- 
acteristics and state of the person. One line of at- 
tack is to use a consensus of judges as the criterion. 
In some cases these might be “experts.” Still an- 
other way of defining the criterion, especially when 
studying the perception of certain internal states 
or attitudes and feelings, is to use self-reports by 
the stimulus person. One may also resort to psycho- 
logical test results as the criterion (e.g., when cer- 
tain personality traits are judged). One other pos- 
sibility is to start with the perceived stimuli and 
coordinate these with the further cognitive judg- 
ments they generate. By taking this approach, 
Heider (1958) and Jones (Jones & Davis 1965), 
among others, have been able to make remarkable 
progress in explicating how we understand others. 
Still other approaches have been used. 

These diverse definitions of the criterion need 
not be equivalent, and the comparability of studies 
performed on the basis of such different operations 
cannot be assumed. 

There seem to be two major directions of effort 
in modern studies of person perception. In one, 
questions are asked about the process of perceiving 
or knowing another person, about stimulus and 
perceiver characteristics and their interactions. In 
the other, the processes leading to the percept or 
judgment are not of primary interest. The focus is, 
instead, on the veridicality of the judgment and its 
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correlates, on what is sometimes called empathic 
ability, or “accuracy.” In specific instances these 
two streams of work blend, and, of course, there 
are contributions that fit poorly into this simple 
dichotomy. It is, however, a useful one for survey- 
ing the work done. 


The “ability” to judge others 


The interest in empathic ability and its generality 
stems principally from an amalgam of two trends 
of thought: on the one hand, the work of such 
writers as Ludwig Klages, Theodor Lipps, Wolfgang 
Kohler, and Max Scheler, who were trying to ex- 
plain how one person understands another in terms 
of such concepts as intuition, inference, and em- 
pathy; and on the other hand, the work of Charles 
H. Cooley, James Angell, George H. Mead, Sigmund 
Freud, William McDougall, Harry Stack Sullivan, 
and Leonard Cottrell, who, among others, were 
stressing the importance of recognizing others’ feel- 
ings for the process of socialization and evolution 
of the self. The quantitative approach, added to the 
fusion of the above two lines of thought, led to a 
search for simple operational indices of empathy 
and accuracy. Thus, the ability to understand others 
was defined as the discrepancy between a judg- 
ment and a criterion, for example, that between 
the self-ratings by a subject and the ratings attrib- 
uted to him by a judge. Many variations of this 
operational definition of empathic ability or accu- 
racy have been used in a substantial number of 
investigations [see SYMPATHY AND EMPATHY; see 
also Bruner & Tagiuri 1954; Taft 1955; Sarbin et 
al. 1960; Allport 1961; Kaminski 1959; 1963; 
Shrauger & Altrocchi 1964; Cline 1964; Secord & 
Backman 1964, for reviews of this literature]. 

Characteristic of these approaches is the accu- 
racy or empathy score or rating that results from 
the comparison of the judge’s performance with the 
criterion. It soon became apparent, however, that 
these scores were far from being pure measures of 
the ability under investigation. In addition to dif- 
ferences in accuracy dependent upon the particular 
judgmental task, which suggested that several 
types of abilities might be involved, several com- 
plications made the interpretation of these scores 
difficult, For example, the tendency toward assumed 
similarity (i.e., the general inclination to attribute 
to others responses one would give oneself) gen- 
erates high accuracy scores for judges who happen 
to be similar to the persons judged and low scores 
for judges who are not; thus, accuracy as an ability 
is confounded with a fortuitous event. Other re- 
sponse set phenomena and sampling characteristics 
of judges and judged were observed to influence the 


results. In 1955 Cronbach analyzed these “dyadic” 
global scores into a number of components, He 
showed that in considering a judge’s response in 
social perception, the theory needs to take into 
account “the traits being perceived, the constant 
tendencies in this perceiver with respect to those 
traits and finally the effect of the particular other 
as a social stimulus to this perceiver” (Cronbach 
1958, pp. 375-376). He argued that instead of a 
global score covering all traits or themes, the pro- 
cedure ought to allow for differential results de- 
pending on the characteristics involved. In the light 
of his analysis prior work on accuracy as an ability 
and on its correlates proved very difficult to inter- 
pret, and new approaches had to be sought. 
While it has been impossible to design a study 
that takes into account all the elements, effects, and 
artifacts that have been identified, several investi- 
gators with more appropriate methodologies have 
since made new and vigorous approaches to the 
problem. Their reports support the view that the 
ability to judge others is complex; that, for instance, 
“stereotype accuracy” (ability to judge the general- 
ized other) and “differential accuracy” (the ability 
to judge individual differences) are two of the 
components; and that they are to some extent inde- 
pendent (e.g., see Bronfenbrenner et al, 1958). 


The process of knowing others 


The difficulties with the problems of accuracy 
and of the “good judge,” and the hope that better 
understanding of these matters might be reached 
indirectly, strengthened interest in the process by 
which we perceive and think about others, irrespe™ 
tive of its veridicality. How can we know the per- 
sonality or inner states of others when we do not 
have access to their experience? In a review of the 
major approaches to this broad issue Allport (1961, 
chapter 21) considers several explanations. One 


rr rare ts 

theory, founded in British empiricism, suaa 
that we know about others by a swift process 

e are seldom 


inference, or analogy, of which we ar< a 
aware. We infer the state or characteristics yeh 
other person because the circumstances, the e A 
ior, the sequence of events, are similar so ji 
with which we ourselves have had experience 
previous situations. 

The inference theory of interpersonal knowlede” 
accounts well for the findings that simile Be 
tween judge and other, and breadth of li a 
perience, aid in the accuracy of jndemen i fo 
according to some writers, inference theory aa 
account for several other aspects of the ae ce 
particularly for the fact that we seem t0 e others 
hend transitory moods or states of mind ©} 


without necessarily having had relevant personal 
experiences—a phenomenon often seen in chil- 
dren, This suggests that additional processes need 
to be postulated to account for such observations. 
One of these is Einfiihlung, or “empathy,” intro- 
duced by Theodor Lipps in 1907 and defined as 
“objective motor mimicry.” By slight movements we 
partially imitate the other, thus creating for our- 
selves cues that give us an understanding of his 
feelings or characteristics. It may be argued that 
this process can be explained, at least in part, in 
terms of inference theory. We draw inferences from 
cues derived from our own mimicry. Yet the mim- 
icry itself cannot be accounted for in this manner, 
and it is worth considering it as a possible com- 
ponent of the process in its own right. 

Finally, many writers, especially those in the 
phenomenological and gestalt traditions, insist that 
inference, even modified and supplemented by em- 
pathy as defined by Lipps, does not account for the 
observation that certain external objective patterns 
or configurations (gestalten) have an immediate 
meaning, which is mediated by physiological and 
psychological processes that do not necessarily in- 
clude inferences but are of a more direct nature. 
According to this view, the state of the other per- 
son's mind is related to a patterned expression on 
his part that directly produces in the beholder a 
patterned sensory and central excitation that repre- 
sents or conveys the person’s state of mind. 

A combination of the explanations reviewed 
above may be necessary to account for the phe- 
nomenon of how we know and understand others. 
We do it through inference and analogy, through 
sensory cues derived from empathic responses, and 
through the immediate response to external con- 
figurations and patterns that are expressions of 
qualities of a person “out there.” And all of these 
elements are intensified by the unusually demand- 
ing effect that a person has upon our attention, 
perception, and thought. 

Yet explanations in terms of inference processes 
have been gaining ground, partly as a result of the 
a a clarification of cognitive processes, as, 
ae example, in the approaches of Brunswik, Tol- 
ol and Bruner. The most extensive and thor- 
eras application of this genre of cognitive 
A A oA how we know others has been presented 
én hee Taft, and Bailey (1960) in their work 
EE inference. These writers argue forcefully 
eia intuition and empathy as necessary ele- 
ieee in explanations of how we know others; they 
a fae that the process of clinical inference and 
byta hae cognition can be adequately represented 

roadened syllogistic inference model with 
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both major and minor premises stated in prob- 
abilistic terms. 

On certain properties of the process. Regardless 
of the general explanation or combination of ex- 
planations they favor, students of person cognition 
seem to be in some agreement about certain char- 
acteristic features or properties of the process and 
about certain factors that influence it. 

As mentioned, studies of accuracy of social per- 
ception indicate that the process involves several 
different types of judgment that had not been dis- 
tinguished in most of the earlier empirical studies 
of empathic ability. The data used by the perceiver 
may correspondingly differ, and the question arises 
as to what are the major cues, information, or con- 
cepts that people use in perceiving or thinking 
about others. We should really know a great deal 
about this, since literature, the theater, movies, as 
well as our daily thoughts and conversations, are 
replete with such content. Some writers have begun 
to treat this aspect of the problem systematically, 
but little as yet is known about it on the basis of 
empirical work. The task is a difficult one, since 
it seems that the categories of social perception 
and cognition and the inferential sets mobilized 
vary not only with the particular nature of the 
judgment but also with the purpose of the social 
interaction under which the judgment takes place 
and with the role of the judge. It is also unques- 
tionable that people can arrive at some evaluation 
from almost any data about the other person and 
that they do so with a high degree of consensus. 
This occurs in spite of the great variety of cues that 
have been used, such as actual persons, photo- 
graphs, schematic representations of persons, voice, 
trait information, and paths followed by a person, 
to mention just a few. 

As Brunswik ([1947] 1956) has made clear, cues 
are highly interchangeable, and a great variety of 
them can lead to the attribution of a trait or dis- 
position. It is the attributed distal, covert disposi- 
tion of the other that serves to guide behavior in 
relation to him, and it is with this level of event 
that the perceiver is concerned, Yet some disposi- 
tions and states are more important for interaction 
than others, and cues to these may be objects of 
special attention. (See Jones & Davis 1965 for a 
discussion of attributing dispositions from acts.) 

Among the aspects of the other to which a person 
particularly attends are his intentions, especially 
when the action is directed toward the beholder. 
Indeed, there is a general tendency to see others as 
origins and responsible agents of actions. Two 
other aspects that seem to have strong demand for 
the beholder’s attention are the other person’s good- 
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bad qualities and his relative power. These distal 
characteristics of the other—intention and respon- 
sibility, goodness—badness, and relative power— 
can be seen as crucial dimensions along which a 
person orients himself vis-à-vis the other. From a 
functional point of view, special sensitivity to these 
aspects of the other may be assumed. 

To continue with considerations of the process, 
there are two sources of information external to the 
beholder about the states, feelings, attributes, and 
intentions of the other: the object person and the 
situation or context of the object. The other person 
himself, insofar as we can divorce him from the 
context, seems to allow the formation of impres- 
sions on which there is a measure of agreement 
among judges. Discriminations of emotions from 
facial expressions alone are possible, for example, if 
the categories used are broad enough, such as those 
proposed by Robert S. Woodworth: love, happiness, 
and mirth; surprise; fear and suffering; anger 
and determination; disgust; contempt; and a re- 
sidual category (see Bruner & Tagiuri 1954 for a 
summary). 

While there is a clear determinant of the impres- 
sion in the configuration of the stimulus person 
himself, the interpretations of the other’s state may 
not be unique. In fact, judges will frequently offer 
two or three possible interpretations, although there 
are other possibilities they will not advance. The 
situation or context by itself is often sufficient to 
permit indeterminate estimates of the state of an- 
other person. We do not need to have information 
on the behavior and appearance of a person who 
has lost a loved one to make a good guess at how 
he feels. 

Either the person or the situation, then, taken 
separately, allows nonrandom but indeterminate 
judgments. We “know” both that certain situations 
tend to evoke feelings A, B, or C and that a certain 
expression or behavior reflects feelings B, D, or E. 
So we judge that in this particular situation, given 
the person’s expression, the feeling is probably B. 
By combining cues derived from the person with 
those derived from the situation, the observer typi- 
cally arrives at highly consensual and functional 
judgments in the great majority of ordinary situa- 
tions. These two sources of cues, of course, are not 
often independent and are frequently redundant, 
thus increasing the likelihood of a veridical re- 
sponse. It may be that the transaction between the 
other person and the situation operates as a higher- 
order informational variable and is closely and 
parsimoniously attended to. 

The interpersonal judgment depends not only 
upon the way in which person and situation cues 


are jointly utilized but also upon certain other oper- 
ating characteristics of the typical judge himself, 
For example, the tendency toward “assumed simi- 
larity” is characteristic of a great deal of social 
perception, as is the tendency to attribute to the 
other a high degree of invariance by thinking of 
him in terms of enduring dispositions. 

As in all complex cognitive processes, in the 
case of person objects there is a tendency to max- 
imize cognitive balance and to avoid dissonance of 
elements. Thus, the characteristics of other persons 
are viewed as more homogeneously good or bad 
than they can be shown to be when independently 
measured. The other person is thought of as a 
configuration of highly integrated characteristics 
(traits, emotions, etc.). Knowledge of one or two 
traits, for example, leads to a strong expectation 
that other particular traits are present. The “halo 
effect” and the “logical error” are well-known in- 
stances of this more general phenomena. [See Coc- 
NITIVE THEORY and THINKING, article on COGNITIVE 
ORGANIZATION AND PROCESSES. | 

The tendency to “make sense” of the other per- 
son’s behavior is seen again in the inclination to 
attribute intentionality to others even where such 
attribution is objectively unwarranted, to see others 
as origins of actions, thus forcibly integrating the 
person and the situation in which the events are 
taking place. Indeed, a very strong determinant of 
how the behavior of a person is understood is 
whether the cause of his actions is believed to be 
internal or external to him and, consequently, 
whether he is or is not responsible for it (Heider 
1958; Pepitone 1958; Jones & Davis 1965). Thus, 
the same behavior may be interpreted as malicious 
or clumsy, depending upon the perceived locus 0 
causality. d 

Also well known is the general inclination towar 
quickly placing a person into categories according 
to some easily identifiable characteristics—Se* oid 
ethnic membership, nationality, occupations 
then to attribute to him, correctly or incorrec l 
qualities believed to be typical of members of T 
category. The term “stereotyping” is often use F 
refer to this form of categorizing and should i 
be regarded as necessarily misleading, for it T 
been shown that it sometimes leads to mote ed 
rate inferences about others than does en 
information about each individual person a 
PREJUDICE and STEREOTYPES; see also Sarbin €! 
1960, chapter 9]. 

Neglected in empirical studie 
served, is the tendency to adopt a b: 
teristic of the other person, against 
ments are made; to deal, that is, WI 


s, but easily ob- 
ase-line charac- 
t which judg: 
th variations 


and gradients rather than with absolute values. A 
mean man is judged to feel kindly toward someone 
if he does not hurt him. Another form of utilization 
of a base line is carefully described by Jones and 
Davis (1965). who point out that the certainty 
with which a disposition is inferred from observed 
actions and their consequences increases with their 
atypicality and unexpectedness, 

These general tendencies can be viewed as prop- 
erties of the typical implicit personality theory: the 
assumptions we make about the nature of other 
persons. These assumptions presumably affect the 
way we perceive and understand others, much in 
the way conceptions of any type of event influence 
what we perceive and the manner in which we per- 
ceive and understand it. [See PERSONALITY, article 
on THE FIELD.| 

How the other is perceived also depends with 
some regularity upon the interpersonal relationship 
between him and the judge. The qualities he is 
seen to have depend upon whether the perceiver 
likes him or not; when the perceiver does like him, 
the tendency toward assumed similarity is strength- 
ened (see Secord & Backman 1964, p. 81). Differ- 
ences in the relative status of the perceiver and 
the other person influence the attributed qualities, 
usually adversely (Pepitone 1958). There is also a 
connection between how we feel toward others and 
our perceptions of their feelings for us. We seldom 
feel liked by people we do not like ourselves ( Tagi- 
uri & Petrullo 1958, pp. 316-336; Moreno et al. 
1960, part 11 C). The role of the beholder vis-a-vis 
the object person (e.g., colleague versus superior) 
affects the dimensions in terms of which the other 
is apprehended and evaluated (see Secord & Back- 
man 1964, pp. 85-87). 

The tendencies to attribute intentionality and 
causality, to categorize and stereotype, to integrate 
discordant information, to attribute particular dis- 
positions on the basis of diverse and variant cues— 
all ae be seen as efforts at simplification and sta- 
bilization of a highly variable human environment. 
These efforts are comparable in function to such 
eo as the constancies in visual perception, 
a © invariance is achieved in spite of drastic 

anges in the stimulus characteristics. 
ee studies have been carried out on devel- 
nie NPS person perception. A baby seems 
eure sae facial expressions made by its 
Aine though only extreme differences are rec- 
iia early in life. As the child matures, the 
ing ne z describe persons in terms of outstand- 
EE A 7 d eatures—big, old, thin—decreases, and 
while a city to discriminate increases rapidly, 

ysiognomic perception, the immediate re- 
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sponse to stimulus configurations, tends to recede 
[see Bruner & Tagiuri 1954, pp. 638-639, for re- 
view of main references; see also DEVELOPMENTAL 
PSYCHOLOGY, article on THE FIELD; INFANCY]. 

Sex differences in person perception have been 
well established. Among children, females describe 
adult figures in a less differentiated and more favor- 
able manner than do males, Boys focus more on 
aggression, nonconformity, and physical recreation 
in describing others, while girls refer more to nur- 
turant behavior, physical appearance, and social 
skills. Women seem to use more stereotyping than 
men, to be less analytical and more intuitive, and 
to use more psychological (as opposed to physical) 
terms than men (see Shrauger & Altrocchi 1964). 
Women seem to focus visually on those with whom 
they interact more than men do, perhaps because 
they rely on visual cues more than men do. 

By and large, investigators have reported sex 
differences irrespective of research approach or 
judgmental task, and it is clear that the sex of 
judge and judged should always be taken into ac- 
count in the design of research and analysis of 
data in this area. [See INDIVIDUAL DIFFERENCES, 
article on SEX DIFFERENCES.] 

Individual differences. The processes men- 
tioned so far are typical of how people in general 
perceive, think about, and form impressions of 
others. Even within the same cultural environment, 
however, there are demonstrable individual differ- 
ences (see Shrauger & Altrocchi 1964). 

There is some evidence that individual cognitive 
styles apply to both person and nonperson concepts, 
such as the degree to which people integrate cog- 
nitive elements, the cognitive complexity with 
which they conceive of human and nonhuman ob- 
jects, and the extent to which they prefer analytic 
approaches to global ones. [See SYSTEMS ANALYSIS, 
article on PSYCHOLOGICAL SYSTEMS.] 

In addition to formal differences in cognitive 
approach, variations have been shown in the 
“theories” people have about human nature and 
personality. More often than not implicit, such 
notions affect what is attended to in others, as well 
as the inferential process itself. There is, for in- 
stance, considerable evidence that individuals give 
importance to, and tend to use, different traits in 
their perceptions and thoughts about others. Some 
persons, for example, tend to describe others in 
terms of external and physical traits; some do so 
in terms of internal and psychological traits. 

The patterns of trait relationships assumed by 
the individual perceiver have been shown to have 
consistency and individuality, although there is also 
high consensus on trait interdependencies. Further- 
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more, people seem to have available a range of 
“personality theories” upon which they draw, de- 
pending upon variations in the stimulus person and 
the situation. Methods appropriate to the analysis 
of such multidimensional processes and interac- 
tions have only recently been developed (Jackson & 
Messick 1963). 

The matter of individual differences in ability in 
understanding others has already been treated 
above in the course of discussing accuracy of per- 
son perception (see also Shrauger & Altrocchi 
1964), The issues involved, such as the effects of 
response sets, the likelihood that interpersonal ac- 
curacy has several independent components, and 
the failure to show that accuracy is stable over a 
variety of object persons, make it difficult to draw 
general conclusions on the basis of past empirical 
studies, Some earlier interpretations of this liter- 
ature (Bruner & Tagiuri 1954; Taft 1955) are 
based on evidence that now seems equivocal. All- 
port (1961, pp. 497-522), writing with awareness 
of the post-1955 literature on the complexity of the 
accuracy concept, still concludes that there is evi- 
dence that “good judges” have particular character- 
istics, such as breadth of personal experience, 
intelligence, cognitive complexity, self-insight, social 
skill and adjustment, detachment, aesthetic atti- 
tude, and intraceptiveness. The relationship of per- 
sonal characteristics to accuracy, however, is prob- 
ably complex; different types of persons may be 
accurate, depending upon the form of interpersonal 
judgment and the type of object persons involved. 
This does not necessarily deny that some individu- 
als may be better judges than others. It does show, 
however, that one probably cannot describe a per- 
son’s accuracy in terms of a single dimension. 
Instead, accuracy may have to be expressed in 
terms of variations of a profile of components, 
where scatter and shape must be taken into ac- 
count. Those who seem to be extraordinarily good 
judges may well be individuals who happen to be 
accurate on most of the major components. Such 
individuals may be quite rare, and most studies so 
far have not been designed to identify large num- 
bers of them so that they can be compared with 
average persons. 


Overview 


In 1872 Charles Darwin published The Expres- 
sion of the Emotions in Man and Animals, in which 
he discussed not only the manifestations of emo- 
tions but also the recognition of emotional expres- 
sions. Although the matter of recognition was 
secondary in Darwin’s work, his was the best doc- 
umented naturalistic treatment of this subject up 


to that time. It placed prior contributions i 
category of speculative work. The problem ha 
expanded to encompass the recognition o 
tically all psychological states and huma 
ties. For about a decade the issue of acct 
person perception and its correlates p 
many investigators. It became clear in 
1950s, however, that a much better under: 
was needed of the processes involved in thii 
of judgment, an approach that had been advi 
and followed by some investigators from the 
ning. Much progress has been made in this 
tion. Serious attention has been given to 
ments, factors, and artifacts that enter in 
process, to fresh analyses and reformulati 
the phenomenon, and to the development 
ods more suitable for its study. No one he 
seriously underestimated the subtlety and d 
of the process whereby we come to know 
minds and personalities. However, empiri 
ralistic, and theoretical work has indicated th 
even more complex, and that more elemi 
variables enter into it, than had been si 
What is needed next? 

Generally speaking, three directions of 
ment would seem particularly beneficial 
types of theoretical formulation, greater si 
certain forms of inquiry, and intensification 
change and contact with other relevant 
knowledge. i 

Many theories, both in psychology and p 
phy, are more or less directly relevant to the 
of knowing others. Darwin’s genetic-learni 
of recognition of emotions, Scheler’s theory 
rect understanding, Brunswik’s lens theory 
variations, Cronbach's theory of dyadic and 
urational components, the theory of clinical 
ence of Sarbin, Taft, and Bailey, and K 
taxonomic formulations and integrative pre 
should serve as examples. There is, ho 
dearth of theories spanning the available da 
retaining contact with neighboring or N 
erally relevant fields. The contributions of 
(1959; 1963), Rommetveit (1960), ane | 
Taft, and Bailey (1960) illustrate, in very € 
ways, the kinds of theoretical effort needed | 
clarify and integrate the large quantitie 
observations and to point to new questo 
research, while keeping in contact with 
lines of inquiry. New methodologies, more 1n 
tion, and better analytic tools are need 2 
work has to be done tc analyze what people 
do when forming impressions of, and und 
ing, others. $ 

The treatment of person perception has | 


suffered from separation from other areas of psy- 
chology with which it overlaps. This relative isola- 
tionism is diminishing, however, and more atten- 
tion is being given to the relationship between 
knowing other persons and more general processes, 
such as cognition, perception, and theories of per- 
sonality, and to more specialized subjects, such as 
clinical diagnosis of personality, as well as to ap- 
propriate quantitative tools. 

RENATO TAGIURI 


[Other relevant material may be found in EXPRESSIVE 
BEHAVIOR; INTERVIEWING, article on PERSONALITY 
APPRAISAL; PERSONALITY MEASUREMENT; SYM- 
PATHY AND EMPATHY; and in the biographies of 
Brunswik and WoopwortTH.] 
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IX 
SOCIAL PERCEPTION 


Social perception is a field of study notoriously 
difficult to define. It is generally concerned with 
the effects of social and cultural factors on man’s 
cognitive structuring of his physical and social 
environment. This very broad statement is needed 
as a first approximation in order to encompass the 
variety of interests represented in the study of the 
social context of human perception. 

To the psychologist concerned with perceptual 
theory, social perception is of interest for two rea- 
sons. First, cultural and social differences in per- 
ceptual functioning can serve as demonstrations 
of the role of experiential factors in perception. 
Second, the importance of motivational processes 
has been stressed in many recent theories of per- 
ception. Perceptual selectivity emerges as a com- 
mon denominator of much of this work, especially 
when the distinction is made between what can be 
perceived by an organism and what is perceived in 
a variety of conditions. A selection exercised at any 
given moment depends upon a great number of an- 
tecedent factors, including interests, needs, values, 
and goals. Many of these are of social origin; and 
thus, “social” has often come to mean, explicitly 
or implicitly, motivational factors in perception. 

In much of social psychological writing, social 
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perception has acquired yet a different meaning: 
it is the study of “person perception” or “percep- 
tion of people.” Heider wrote: “We shall speak of 
‘thing perception’ as ‘nonsocial perception’ when 
we mean the perception of inanimate objects, and 
of ‘person perception’ or ‘social perception’ when 
we mean the perception of another person” 
(1958, p. 21). 

The field so defined extends well beyond the con- 
ventional use of the term “perception”; it includes 
highly complex inferences about characteristics of 
persons. It is true, however, that such a blanket 
use of the term is based on more than a mere 
transplantation of its current everyday meaning. 
The point of transition from perception to other 
cognitive activities, such as inferring, recognizing, 
categorizing, and judging, is difficult to specify. In 
addition, the role of inference and categorization 
in perceptual activities has been a central preoccu- 
pation in several theories, and from this an inter- 
est has developed in showing that basic similarities 
of process exist at various cognitive levels. 

Some phenomena of person perception have 
been used as contributing evidence in these at- 
tempts at unification. Brunswik was interested in 
the inferences about personal attributes made on 
the basis of physiognomic cues and in the validity 
of such inferences under various conditions (1947). 
In the application of his theoretical premises to the 
perception of people, he was able to provide some 
evidence for the unity of process in phenomena 
that differed in their degree of complexity. Thus, in 
a view such as Brunswik’s, social perception ap- 
pears as perception of people, and the use of the 
term perception finds its justification on systematic 
grounds rather than as a metaphor [see BRUNSWIK]. 

In social psychology, efforts were also made to 
demonstrate that perceptual phenomena embedded 
in a social context do not require for their explana- 
tion a set of principles distinct from those used in 
general perceptual theory. The building of bridges 
between general and social psychology became 
particularly important as increasing stress was 
laid on the determination of social behavior by the 
manner in which an individual perceives and 
interprets his social environment. 

This trend of thought found one of its most 
prominent expressions in the textbook published by 
Krech and Crutchfield in 1948. The frequent equa- 
tion of the study of motivational factors in percep- 
tion with the study of social perception can quite 
possibly be traced to the influence of the move- 
ment of which this book was an expression. Value 
judgments and social norms are powerfully present 
in nonveridical perception, and thus social, in the 


area of perception, has tended to become identi- 
fied with various forms of perceptual biases and 
distortions. 

And yet, efforts toward a theoretical integration 
of social with nonsocial in perception need not rely 
exclusively on this particular link. If it is true that 
many perceptual activities “depend upon catego- 
rizing activities and upon the construction of an 
adequate system of categories against which stimu: 
lus inputs can be matched” (Bruner 1957, p. 127), 
then we may expect that such a category system 
will strongly influence the perception of social ob- 
jects and events that, in their complexity, offer to 
the perceiver a large number of alternative possi- 
bilities of assignment. This has important implica- 
tions for the “perception of the social.” It means, 
for example, that social stereotyping and related 
phenomena can be analyzed on the same lines as 
the categorization of complex sensory inputs. 

So far, the term social perception has been seen 
to subsume in the psychological literature three 
principal varieties of empirical content: it has been 
used to designate some aspects of nonveridical per- 
ception; it has referred to the perception of people; 
and it has served as a link in the theoretical uni 
fication of a wide range of cognitive phenomena. 
Any ex-cathedra pronouncement on the proper 
definition of this field encounters further dificul- 
ties when one considers the interests of social 
anthropologists in cultural differences in cognition. 
For example, in his excellent review of the rela- 
tions between culture and behavior, Kluckhohn 
(1954) devoted most of his section on perception 
to a discussion of culturally distinct modes of 
thought concerning time and space. In one of his 
earlier papers, Hallowell wrote: “If a culture does 
not provide the terms and concepts, spatial atui 
butes cannot even be talked about with precision: 
Individuals are left to fend for themselves, as it 
were, on the level of elementary discriminatay 
reactions. This limits the possibilities for the na: 
manipulation of more refined and developed a 
cepts that require symbolic representation in Sie 
form” (1942, p. 77). It is just possible that i 
quotation defines our problem. There is no do y 
about the cultural determination of conceptu? a 
tems concerning the view of the world, but is E 
also true of what Hallowell calls the “elemen 
discriminatory reactions”? 

A discussion of the problems 
tion must rely on the uses of the 
pirical literature. But not all such usage one erm 
cluded, since the consequent stretching of e soci 
would finally lead to a consideration © i < about 
and cultural determinants of all knowledg 


of social percep 
term in the em 
an be in 


the world. There is not much difficulty in the speci- 
fication of the antecedent term: the influence of 
society and culture. The dependent variable must 
be a category of human behavior that can be shown 
to be affected by these cultural and social determi- 
nants and that at the same time falls within a 
roughly acceptable range of perceptual phenomena. 

For these reasons, social perception will be un- 
derstood here to include those aspects of discrimi- 
nation, identification, recognition, and judgment 
that satisfy two conditions: (a) they refer to the 
sensory information received at the time of re- 
sponding, and (b) the response does not consist of 
complex and abstract inferences based on con- 
scious choice and deliberation. The second char- 
acteristic of this range of phenomena—their sub- 
jective immediacy—points to one of the main 
reasons for their importance and interest when 
they are considered in a cultural and social con- 
text. We know that systems of values and beliefs 
are to a large extent a product of an individual's 
culture and society. We know much less about the 
limits of these influences, about their capacity to 
reach those regions of human experience that ap- 
pear as immediately given, as the incontrovertible 
evidence of our senses. The intervention of culture 
and society in our experience of aspects of the 
world that appear unquestionably as “out there” is 
not clearly established; but a good deal of relevant 
research is available, and some tentative con- 
clusions can be drawn. 


Social context of motivational factors 


The frequent use of the term “social perception” 
to indicate the effects of motivational factors on 
perception has already been mentioned. But in 
many cases, the effects of needs on perception can 
be considered independently of their social source 
or context, Similarly, in the studies on individual 
Consistencies in perceptual and cognitive styles, 
personality becomes a determinant of a set of per- 
ceptual responses, not in virtue of conditions com- 
Mon to a social or a cultural group but rather in 
terms of individual differences superimposed on a 
aes social background. And yet, many of these 
ee are, or could be, directly relevant to the in- 
Nes ee of social or cultural contexts of percep- 
eine peeing group membership that deter- 
a ifferences in the subjective importance of 
o aah ae and needs could conceivably lead 
T or social regularities in recognition phe- 
erie which are usually discussed under the 
ate of perceptual deferise and perceptual sen- 
ae n. But research in this field tended to focus 

tracultural phenomena, although cultural dif- 
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ferences in recognition related to a variety of 
nonmotivational factors have been extensively 
studied. 

Emotional and motivational factors determine in 
a great many ways our inferences about person- 
ality attributes and traits of other people. They also 
sometimes influence our perception of their phys- 
ical characteristics. With the appropriate adapta- 
tion of experimental procedures employed in some 
studies of perceptual illusions, it can be shown that 
the perception of people who are familiar and/or 
important to the perceiving individual tends to be 
relatively resistant to various distortions. The use 
of visual characteristics of persons as stimuli pre- 
sents obvious advantages in the attempts to dem- 
onstrate the effects on perception of emotional in- 
vestment in the stimulus (Ittelson & Slack 1958). 
It is true, however, that although the effect of fa- 
miliarity in resisting distortion finds a confirmation 
in these experiments, the role of emotional factors 
is not so clearly demonstrated. The attempts to 
investigate the effects of emotion as distinct from 
familiarity (Ittelson & Slack 1958) yield results 
that are suggestive and promising, but an unam- 
biguous separation of the two effects is not always 
easily achieved. 

In addition to the above, there exists another 
group of studies that is concerned with the per- 
ception of visual characteristics of people and that 
can also be related to the general trend of findings 
on motivational factors in perception. Under some 
conditions, it seems that the association of needs 
or values with a stimulus leads to a perceptual 
overestimation of its magnitude. An alternative in- 
terpretation of these findings is possible: when 
differences in magnitude between stimuli are as- 
sociated with differences in their value or in their 
emotional significance to the subject, the stimuli 
tend to be perceived as physically more different 
from each other than they really are. This interpre- 
tation relates the phenomenon directly to one of 
the basic aspects of social stereotyping—the accen- 
tuation of differences between members of different 
social groups in those characteristics that are sub- 
jectively related to the: criterion for classification 
(Hochberg 1957; Tajfel 1959). There is some evi- 
dence that this accentuation of relevant differences 
is not confined to judgments of personality char- 
acteristics but also extends to physical attributes 
of human groups (e.g., Secord et al. 1956). Similar 
polarization has also been found to operate in the 
identification of racial membership of human faces 
presented in pairs under conditions of binocular 
rivalry (Pettigrew et al. 1958). 

The sensitivity to selected visual cues and their 
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use in the identification of a human being as be- 
longing to an ethnic group was well known before 
World War 1 in some European countries with a 
strong tradition of anti-Semitism and a large Jew- 
ish minority. One of the difficulties experienced by 
Jews who attempted to pass for non-Jews under 
Nazi occupation was the ease with which they were 
identified. An experimental equivalent of this can 
be found in a few studies concerned with identi- 
fication of Jews from photographs. Despite incon- 
sistencies in the findings, the trend of evidence 
provides some justification for Allport’s conclusion: 
“It is a striking fact that prejudiced people are bet- 
ter able to identify members of the disliked out- 
groups than are nonprejudiced people” (1954, 
p. 133). Here also it is difficult to decide to what 
extent these identifications are based on familiarity 
with relevant cues rather than on selection of cues 
because of emotional involvement. At any rate, 
it is likely that emotional involvement breeds fa- 
miliarity and thus leads to increased efficiency in 
the detection of identifying cues. 

The implications for social perception of studies 
on the role of motivational factors of perception 
are threefold. The first, and most general, can be 
stated as follows: Much of human motivation is of 
social origin; it can be shown that motivation 
affects some perceptual responses; therefore it can 
be expected that social factors are part of the 
causal chain. Second, if it is true that appropriate 
motivational antecedents vary predictably from one 
social group to another, then some corresponding 
social regularities of perceptual responses would be 
expected to occur. Finally, there is some evidence 
that motivational factors intervene in the percep- 
tion of physical features of the social environment. 


Cultural differences in perception 


Cultural determination of cognitive systems and 
of behavior relating to them ranges all the way 
from views about man, nature, and the universe to 
the organization of sensory data. The limits of cul- 
tural relativism are bound to vary according to 
whether one considers a cultural system or the 
range of variability in the behavior of individual 
members of a culture. From the psychological point 
of view, the main problem in this field is to dis- 
cover techniques that would allow the transition 
from the analysis of a cultural system to that of 
behavior relating to this system. When this is done, 
it is often found, as Asch (1952) pointed out, that 
it is easy to underestimate the extent of free play 
that a cultural system of values and beliefs gives 
to individual variability. 


But regularities do undoubtedly exist, and it may 
be expected that cultural influences become more 
marked as the individual has fewer opportunities 
to engage in his own independent checking of what 
comes to him from social sources. Thus, it is fairly 
obvious that supernatural beliefs or aesthetic pref- 
erences will show more evidence of uniform cul- 
tural impact than, for example, perceptual recog- 
nition or identification. However, there are many 
intermediate cases, and because of them the em- 
piricist-nativist controversy loses much of its 
meaning in the context of social perception, The 
capacity to receive sensory information is not, in 
general, culturally determined; what tends to be 
selected and how the selection is interpreted will 
be so determined to varying extents. It would bea 
bold investigator who would undertake to decide 
for each case whether it should properly be called 
perceptual or not. . 

Within the rough boundaries previously speci- 
fied, it is possible to distinguish three classes of 
variables determining some of the known cultural 
differences in perceptual responses. They are the 
functional salience of selected aspects of physical 
environment, the familiarity with the material 
products of a culture, and the communication 
systems employed in a culture. 

Functional salience. The general relationship 
between functional salience and various communi- 
cation systems used in a society turns out to be, 
more often than not, a case of the hen-andegg 
situation. For example, functional importance o! 
some aspects of the environment presumably ra 
to the growth of an appropriate and specialize 
terminology. The existence of a terminology may 
become in turn a facilitating factor in the efficiency 
of discrimination or recognition; and it is oe 
very difficult to decide whether a system of labels 
has an independent causal status in the production 
of a set of perceptual responses. The problem 0 
the direction of causal relationship becomes es 
more complicated when one considers that a 
times the existence of a terminology is a. 
reason for the need to learn a set of perceptual 3 A 
criminations. Then the communication system 
comes itself a property of the environment, W 
determines the relative salience of a set of w 

It is true, however, that in some cases j of 
tinction is possible between the direct at 
functional salience and the effects of such S$ a 
mediated through a set of linguistic Le te 
problem is that of conditions for learning: dis- 
case of the Eskimos and Laplanders von A 
criminate” between more kinds of snow tha 


members of other cultures, it may well be difficult 
to decide whether this is due to the learning of 
labels or to the direct consequences of mispercep- 
tion, although it is likely that the consequences of 
a perceptual mistake concerning snow are fairly 
independent of linguistic labeling. The phenome- 
non is in no way different from those usually 
discussed under the headings of acquired distinc- 
tiveness of cues and perceptual differentiation. 
Professional competence based on learning to make 
discriminations that are very difficult for the lay- 
man is a familiar phenomenon; the reading of 
X-ray plates or wine tasting are two examples 
among many. 

It is in this area of functional salience that there 
exists the only experimentally established case of a 
culturally determined regularity in discrimination 
between simultaneously presented stimuli. Liber- 
man and his colleagues (1961) found clear evi- 
dence of the effects of learning on the distinctive- 
ness of speech sounds. They were able to show 
that, with objective acoustic differences kept equal 
and the stimuli (synthetically produced speech 
sounds) presented in pairs, the subjects perceived 
as the same those phonemes that belonged to the 
same phoneme category in English, and as differ- 
ent those that were on different sides of a boundary 
between two phoneme categories. 

This is an important phenomenon because it 
probably provides the most striking example of 
cultural differences in discrimination that we are 
ever likely to discover. The range and selectivity 
of sounds in various languages impose probably 
more opportunities for alternative modes of experi- 
ence than is the case for any other aspects of the 
sensory information that we receive. As a result, 
cultural regularities occur in the perception of 
simultaneously presented stimuli as “same” or “dif- 
ferent’—a phenomenon not easy to replicate for 
any other culturally divergent aspects of experience. 

Familiarity. Cultural variation that might in- 
fluence perceptual responses can also be found in 
the general type of man-made products peculiar to 
a culture. One example is the use of straight lines 
and right angles in urbanized societies. Is it pos- 
sible that the accumulation of corresponding habits 
of visual inference would lead to differences in per- 
ceptual functioning? In order to explore this pos- 
sibility, Allport and Pettigrew (1957) used one of 
the illusions devised by Adelbert Ames, the rotating 
trapezoid window, in which the inferred object- 
quality of the stimulus is in conflict with its objec- 
tive features. If the visual inference of a window- 
like object is made by the observer, the rotation is 


PERCEPTION: Social Perception 571 


perceived as a swaying to-and-fro movement. 
Therefore the illusion should be less marked in 
cultures where there is less familiarity with the 
type of object represented by the trapezoid. 

Allport and Pettigrew tested this prediction, 
using as their subjects groups of rural and urban 
Zulus and a group of South Africans of European 
origin. They found that under conditions optimal 
for the experience of the illusion “virtually as many 
primitive Zulus report the trapezoidal illusion as 
do urban Zulus or Europeans” (p. 111). But under 
marginal conditions the subjects familiar with the 
products of Western culture tended to experience 
the illusion more frequently. We may perhaps note 
that in this case evidence concerning the existence 
of cultural differences in perception seems to be 
confined to marginal conditions within an already 
marginal and fluid situation. 

In the study by Allport and Pettigrew the nature 
of past experience emerges clearly as a determinant 
of cultural differences. In many cases it is more 
difficult to distinguish between the possible causal 
factors. This is so in the investigation by Segall, 
Campbell, and Herskovits (1966), who under- 
took the study of the extent of geometrical illu- 
sions experienced by subjects from nearly twenty 
different societies. For the Miiller—Lyer and the 
Sander parallelogram illusions, the authors pre- 
dicted more susceptibility in the urbanized than in 
the nonurbanized societies. The opposite prediction 
was made for two versions of the horizontal-—vertical 
illusion. 

Three hypotheses underlie the study. The first of 
these, referred to by the authors as the “carpentered- 
world hypothesis,” is similar to that put forward 
by Allport and Pettigrew. The second relates the 
extent of horizontal—vertical illusion to “aspects of 
the physical environment of peoples, specifically 
the presence or absence of broad, horizontal vistas.” 
The third hypothesis concerns cultural differences 
in the amount of experience with two-dimensional 
schematic representations of reality. This can be 
related to familiarity, but it can also be assigned 
to the category of cultural differences in communi- 
cation systems. It seems, however, preferable to 
distinguish research designs that use directly a 
system of notation employed in a culture (such as 
language or conventional visual representations ) 
from those in which cultural differences in percep- 
tion are attributed to such a system but not directly 
shown to be related to it. 

Segall et al. were able, on the whole, to provide 
support for their hypotheses despite some incon- 
sistencies in the pattern of findings. It seems that 
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on the basis of assumptions about the kind of visual 
experience prevailing in a society, predictions can 
be made about susceptibility to different types of 
illusions. It should be noted that at the turn of the 
century similar findings were reported by Rivers 
(1905). 

Three unresolved problems in the results of 
Segall et al. should be mentioned: for example, 
some discrepancies between the ranking of socie- 
ties in terms of susceptibility to the illusions and 
the ranking that should be expected on the basis 
of the hypotheses; the difficult issue of perceptual 
development emerging from the ambiguous data 
on children; and discrepancies with the findings of 
other investigators who used slightly different pro- 
cedures. These are discussed in detail in the 
original monograph. 

Another trend of evidence concerning the role 
of past experience in cultural differences in per- 
ception is provided with techniques of binocular 
rivalry. Bagby (1957) presented stereoscopically 
pairs of photographic slides to groups of American 
and Mexican subjects. In each pair one photograph 
was of a typical Mexican scene and the other of a 
typical American scene. The subjects reported what 
they saw. The differences between the two groups 
were highly significant, each group identifying 
more easily the culturally familiar scene. Bagby 
makes a fairly convincing case that his results are 
due to genuine perceptual dominance of the scenes 
characteristic of one’s own culture and not just to 
selective reporting. 

The range of evidence concerning cultural dif- 
ferences in perception determined by familiarity is 
fairly limited. Indeed, it would be surprising if 
large and important differences were discovered; 
human beings in different cultures share the most 
fundamental aspects of experience of their physical 
environment. The marginal differences that have 
been discovered simply provide a supplementary, 
and perhaps minor, source of evidence for the per- 
ceptual adaptability of man. There is a simple but 
instructive example of this in an experiment con- 
ducted by Tresselt (1948), who found discrepan- 
cies between groups of weightlifters and watch- 
makers in their initial judgments of heaviness of 
a series of weights. These discrepancies gave way 
very rapidly, however, to a convergence of judg- 
ments because of common experience of both 
groups with the experimental stimuli. It can safely 
be assumed that the cultural differences discussed 
in this section would share the same fate under 
conditions of cultural change and intercultural con- 
tact. This has already been shown in some of the 
previously described experiments in which urban- 


ized members of non-Western cultures serv 
subjects. These studies do not really show | 
what we “see” is determined by our cultur: 
vironment; their importance lies in stressin; 
capacity to make, under conditions of perce 
choice, an appropriate selection of those aspect 
the environment that matter. s 
Communication systems. Languages are 
most important cultural notation systems, A 
deal of controversy has been created by the 
put forward by Whorf (1956), but also exp 
previously by several eminent anthropologist 
linguists, that the structure of a language d 
influences the view-of-the-world characteristic 
culture. The main psychological problem is to de 
termine whether nonlinguistic cognitive beh 
can be related to language. Within the approxi 
domain of perception some knowledge about | 
simpler relationships must be gathered indeps 
dently of conclusions that may be drawn 
organizations of higher order. The possible ef 
of language on the conceptual organization 
world must be distinguished from its effects oi 
organization of the available sensory data, 
Most of the relevant experimental work 
done with the view of determining the effe 
linguistic labels on perceptual recognition. Thi 
search is best represented by the well-known 
of Brown and Lenneberg (1954). Different 
guages tend to impose different groupings on 
physical continua as the range of colors. 
groupings may be described in terms of three 
cipal characteristics: (a) the number of label 
signed by a language to a continuum, (b 
regions on the continuum where the assignm 
one label tends to shade over into the assi 
of another, and (c) the consistency with wh 
particular label is assigned to a particular stimi 
on the continuum. Brown and Lenneber} 
termined the relative linguistic “codability 
number of colors and found that the de 
codability related to the ease of recognition 0 
color, but only when the conditions of perfo a 
the task were rendered a little difficult. This Ang 
raises a number of problems. Is it really th 
more codable colors are better remember 
recognized, or is it possible, as Lenneberg 
p. 379) suggests in a later paper, that “con: 
with the color context, S would now tend 
member the word ‘green’ but not its exact 3 
and would therefore ask himself ‘what is | 
green here?” In addition, it is found that 
that show “singularly low communality" in lal 
also tend to be recognized with relative ae 
In the Netherlands, Frijda and van e 


(1961) were able to confirm Brown and Lenne- 
bergs results and to take them one step further. 
They showed that what is true of recognition of 
colors is also true of recognition of facial expres- 
sions of emotion. Those that are more consistently 
codified in a language are also more accurately 
recognized. 

Some work has been done on communication sys- 
tems other than language. Conventional representa- 
tions of the visual world vary not only from culture 
to culture but also from one historical period to 
another. They present two problems, both of them 
common to the art historian and to the psycholo- 
gist: (a) the development of stylistic conventions 
and (b) the problem of communication, or the 
matching of the reproductive or creative intentions 
and their interpretation by the viewer. The first of 
these is not really perceptual. Albrecht Diirer’s 
woodcut of a rhinoceros showing the beast as 
armor plated does not allow us to draw the con- 
clusion that Diirer would not have been capable of 
perceiving a real rhinoceros without these dramatic 
embellishments. As Gombrich (1960) points out, 
there are many examples of drawing or painting 
from nature in which distortions occur. These dis- 
tortions are not of a random nature, otherwise we 
could not talk about styles in art. Presentations of 
the visual world are guided by the stereotypes con- 
cerning the object represented and by a choice in 
which some sets of relationships are conveyed 
among many possible ones. 

Many experiments on reproduction point in the 
same direction. But we cannot conclude that there 
is a parallel between perceiving and building a no- 
tation system out of what is being perceived. It can 
hardly be assumed that the ritualistic Egyptian 
manner of drawing human figures corresponded to 
the Egyptians’ perception of human shape. There 
remains, however, the second problem: How are 
these cultural artifacts perceived? 

Conventional representation of visual aspects of 
the world becomes in many cultures an important 
part of an individual's visual environment. There 
is little doubt that within a culture these visual 
Messages communicate the information that they 
are supposed to convey. Therefore, perceptual in- 
terpretation of visual notation systems acquires 
some importance. It provides perhaps the only case 
of sharp discrepancies in the modes of visual ex- 
perience acquired by individuals who are members 
of different cultures. 
tg et of the evidence gathered by art historians 

mcerned with the makers of symbols rather 
be with the receivers of messages; and, indeed, 
would be difficult to infer the modes of percep- 
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tion of paintings in the past from the way these 
paintings appear to us at present. There exists 
here an opportunity for psychological research that 
has not been exploited: namely, investigating how 
the unsophisticated observers of today perceive the 
spatial features of visual representations created 
in the past.» 

Several anthropologists reported difficulties ex- 
perienced by those unfamiliar with photographs in 
recognizing in them thoroughly familiar objects, 
but there are few experimental studies. The clear- 
est evidence of cultural divergences in the percep- 
tion of two-dimensional representations comes per- 
haps from the work recently reported by Hudson 
(1960). Using simple drawings of scenes whose 
content was familiar to his African subjects, he 
found that failure to perceive the scenes three- 
dimensionally was related to the amount of formal 
schooling and intelligence, but even more clearly 
to the general opportunities that the subjects had 
in the past of familiarizing themselves with pic- 
torial material of all kinds. 

It seems fair to conclude that some marginal 
aspects of perception can be shown to differ pre- 
dictably in various cultural contexts, In some cases, 
it seems that the ecology of the environment and 
conditions of survival related to it are responsible 
for some differences in the degree of close inspec- 
tion that various human groups undertake of one 
or another aspects of their surroundings. In other 
cases, the visual environment acquires distinctive 
properties in large measure through man’s own 
activities. Finally—and perhaps in its implications 
this is the most important of these phenomena— 
perceptual interpretations of notation systems are 
not given; they are embedded in appropriate past 
experience. This points to difficulties that may be 
encountered in the introduction of educational 
techniques using unfamiliar notation systems in 
new cultural contexts. It seems that an urgent re- 
search priority in this field is not only the ascer- 
taining of the existence of cultural differences; for 
theoretical as well as for practical reasons, it would 
be essential to concentrate on problems of per- 
ceptual development in children, which until now 
have received very little attention in cross-cultural 


research. 


Social pressures 

In addition to the information we receive through 
sensory channels, we have at our disposal informa- 
tion that we receive from social sources. One of the 
most important aspects of social learning is in the 
reliance of the individual on the consensual in- 
formation imparted to him by other people. 
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The problem is to discover whether our percep- 
tion of the physical world is influenced by social 
consensus. This is undoubtedly so in the case of 
moral and aesthetic judgments or of the interpre- 
tation of highly ambiguous information. Visual 
illusions such as the autokinetic phenomenon, 
which consists of an impression of moyement given 
by a stationary point of light presented in total 
darkness, have been used abundantly to demon- 
strate the effects on perception of socially derived 
norms. In a classical series of experiments, Sherif 
(1935) has shown that individual judgments of 
the extent of subjective movement tend to converge 
when the individual is in a group and also that this 
convergence persists subsequently to the group 
situation. Jacobs and Campbell (1961) used the 
same phenomenon for an elegant demonstration of 
the cultural transmission of social norms. They 
created a laboratory “microculture,” in which they 
first established the convergence of subjective 
movement. Subsequently they eliminated from the 
experiment, one by one, the subjects who had 
served originally and replaced them with new sub- 
jects. The effect of the social norm tended to per- 
sist several “generations” after the last of the 
original subjects had gone. 

In one sense perceptual illusions. such as the 
autokinetic phenomenon lend themselves admirably 
to the demonstration of the effects of social norms 
on perception. The responses are perceptual, and 
there is no difficulty in the quantitative assessment 
of the extent of social influence. But this is an ex- 
treme condition, in which the subject has no possi- 
bility to assess the accuracy of his response. 
Therefore, it is only too easy to draw from these 
cases unwarranted conclusions about the impact 
of social influence in less extreme conditions, The 
need for caution is well shown in a careful series 
of experiments by Luchins (e.g., Luchins & Luchins 
1955). Using sets of ambiguous figures as stimuli, 
he was able to demonstrate the converging effect 
of social influence and also to demonstrate that the 
magnitude of this effect depended on the degree 
of fit between the information received from the 
group and the actual properties of the stimuli. 

This is one of the more obvious variables deter- 
mining the extent of influence of social norms. 
There are others, such as the size of the group, its 
unanimity, its emotional significance to the indi- 
vidual, etc. (for a detailed review, see Graham 
1962). From the point of view of perception, the 
main problems concern the nature of the response 
and the mode of operation of social norms. Are 
converging responses only verbal or do they repre- 
sent a perceptual change? Is the subject aware that 


his responses are socially influenced, or is he con- 
vinced that he is still reporting aspects of his own 
experience in undiluted form? 

These questions are not easy to answer. They 
become even more difficult in the case of drastic 
effects of group pressures on judgment, such as 
those shown in the famous experiments by Asch 
(1952) and many that followed. In these situa- 
tions, a group of the experimenter’s confederates 
announce comparative judgments of length that 
are obviously wrong. Despite this, a sizable propor- 
tion of subjects tends to yield to the influence of 
the group. 

There is little doubt that in many cases this 
yielding is only verbal. Many subjects feel uneasy 
and uncertain about the unexpected differences 
between what they see and what they think the 
others see, but this happens without a concomitant 
perceptual or judgmental change. The conflict in- 
troduced in such cases has important social impli- 
cations, but it is not a perceptual phenomenon. 

In some of the postexperimental interviews there 
appears to be no awareness that the subject's judg- 
ment has been determined by the judgment of the 
group, but this is not sufficient evidence of a change 
in judgment. There is, however, some evidence 
that suggests that this could be the case. It has 
been shown, for example, that judgments in ac- 
cordance with the norms created in the group situ- 
ation tend to persist subsequently. This is not easily 
explained on the basis of verbal yielding alone. 
study by Flament (1958) is illustrative of those 
attempting to provide evidence that some judg- 
mental change is taking place. He argued that 
when group influence of a verbal nature is followed 
by verbal reports of the individual who is subject 
to the influence, a conscious link is established 
between the two. In order to minimize this, he used 
a procedure in which subjects, after being told we 
mean response of a fictitious group in a task con 
sisting of detecting differences in length betwee? 
two lines, were asked to adjust manually the len 
of a variable line to a constant one. In this aey 
tion the effects of a group norm were at least 
marked as in the case of verbal responses: } 

The literature on the effects of group baal 
on judgment is very large, but most of these stu! 
are mainly concerned with aspects of confo nite 
behavior rather than with perceptual chank a any 
of the results do not allow one to infer w? a 
degree of certainty that such change has occu Eo 
On the other hand, some suggestive eviden oe t 
exist. This is not fully convincing, and muc a 
work will have to be done to provide the pom Ait 
and nonperceptual alternatives with @ cle 
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pirical meaning, For the present it seems that the 
occurrence of judgmental or perceptual change as 
a function of group influences, particularly in con- 
ditions of some uncertainty and ambiguity, is at 
least likely. 

HENRI TAJFEL 


[Other relevant material may be found in COMMUNI- 
CATION; LANGUAGE; GROUPS, articles on GROUP BE- 
HAVIOR and GROUP PERFORMANCE; Norms.] 
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x 
UNCONSCIOUS PERCEPTION 


The term “unconscious perception” has two dis- 
tinct and separate meanings with quite different 
implications for a science of behavior. One mean- 
ing assumes that the nervous system is capable 
of differentially responding to stimulus energies 
whose intensity is below that at which a discrimi- 
nated verbal report could be obtained. This mean- 
ing is frequently referred to as subliminal percep- 
tion. The second meaning makes no assumption 
concerning a lower sensitivity for unconscious 
discrimination but is instead concerned with situa- 
tions in which an observer's behavior is governed 
by perceptual stimuli whose presence he is unable 
to report when subsequently interrogated. In this 
situation the cues that guided the behavior would 
have stimulus energies sufficiently great that, had 
they been examined in a standard threshold situa- 
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tion, they would have been found to be above a 
threshold for discriminated verbal report. This 
meaning of unconscious perception is designated 
as incidental perception. 

These two separate meanings of unconscious 
perception have seldom been made explicit. The 
Freudian concept of the unconscious has had a 
major influence in current psychological usage of 
this term, and as with most Freudian theory, the 
concepts are so loosely defined and of such a 
general nature that they provide only a starting 
point for scientific inquiry. It is not surprising that 
both meanings of unconscious perception seem to 
be involved in the Freudian conception of the un- 
conscious with no attempt to separate or distin- 
guish these two distinct possibilities. 

The neurophysiology of our sensory systems pro- 
vides a certain plausibility for the existence of 
subliminal perception. Stimulus energies that ini- 
tiate activity at the receptor have to pass through 
several relay stations in the nervous system before 
reaching the cortex and other higher brain centers 
assumed to mediate consciousness, Thus, at a theo- 
retical level it is plausible that an energy too weak 
to transmit all the way to the cortex might none- 
theless be able to produce nervous activity up to 
several of the lower relay stations. Then if effector 
mechanisms are capable of responding to input at 
these lower way stations, discrimination without 
awareness would be possible [see MENTAL DIS- 
ORDERS, article on BIOLOGICAL ASPECTS; NERVOUS 
SYSTEM, article on STRUCTURE AND FUNCTION OF 
THE BRAIN; SENSES]. 

Some who have advocated the existence of sub- 
liminal perception have characterized it as only 
a gross discriminating system. Thus it is assumed 
that subliminal discriminations are only of rather 
gross emotional or evaluative types, such as danger 
versus no danger and pleasant versus unpleasant. 

Attributing only gross discriminative capacity to 
subliminal discrimination provides the concept 
with somewhat more plausibility. It avoids the 
concept of the “Judas Eye” type of unconscious 
that screens and evaluates incoming information 
before it is allowed to pass into the conscious 
realm. 

However, the assumption of only gross discrim- 
inative capacities for subliminal discrimination has 
certain logical difficulties. The meaning of a stim- 
ulus is conveyed not by one or two gross physical 
characteristics of the stimulus but rather by the 
total aggregate of characteristics, which at times 
can be quite subtle. In the case of words, which 
are frequently used as stimuli in experiments on 
subliminal perception, the correct perception of 
the emotional tone of the word depends upon ac- 


curate discrimination of very slight differences in 
letters and in the sequence of letters. For sublim- 
inal discrimination to detect the affective tone of 
the stimulus would require a very precise, highly 
developed discrimination capacity. 

Before consideration of the evidence on sub- 
liminal and incidental perception, it is necessary 
to define such terms as “awareness,” “threshold,” 
and “subliminal” in order to evaluate the experi- 
mental evidence properly. 

Awareness. Most writers on the subject of 
awareness have distinguished between conscious 
and unconscious processes in terms of verbaliza- 
tion. Conscious procgsses are subject to verbal 
report; unconscious processes are not. Freud, in 
his extensive writings, on several occasions equated 
consciousness with verbalization, and Dollard and 
Miller (1950), among contemporary writers on the 
subject, are most explicit in defining consciousness 
in terms of verbal report or verbalization. 

The scientific value of a definition of awareness, 
or consciousness, in terms of verbalization and 
verbal report depends critically upon the precau- 
tion taken in experimental investigations with 
respect to assuring adequate motivation on the 
part of the subject to report with precision. Care 
has to be taken to make sure that the subject 
understands what is being asked of him, and con- 
sideration must also be given to the effects of the 
questioning upon awareness itself. But most im- 
portant, an adequate schema for classifying the 
subject’s verbal reports along relevant dimensions 
is necessary if progress is to be made in clarifying 
the concept of consciousness. Some promising 
steps toward this objective have been reported by 
Dulany (1962). mdi- 

Although one can use verbalization as an in 
cator of awareness, one must not make the error 
of equating the two or of considering them synony- 
mous. Words are highly abstract symbols and beat 
little or no physical resemblance to the pae 
events, and relationships that they denote. It WER 
be placing an excessive burden upon language F 
expect it to convey fully the richness of ee 
and perceptual experience. Instead, console 2 
or awareness, at our present stage of know 
must be treated as a concept whose properties E 
inferred from the observable verbal reports, ie 
a concept it remains distinct from the respo? a 
from which it is inferred. This use of ese. 
as a concept is similar to the treatment of Panes 
tion as a concept that has been advanced by 
et al. (1956). F A 

ries are many limitations and disadvanitt 
to the definition, or criterion, of awareness © 
above, but it must be recognized that this 18 


PERCEPTION: Unconscious Perception 


area of little knowledge and the above definition 
serves as a beginning point for experimentation 
that will enrich the knowledge of the concept of 
awareness as well as provide a criterion for evalu- 
ating work that has been done in the general field. 

Thresholds, or limens. Before one can talk 
meaningfully about subliminal discrimination or 
discrimination below threshold, it is necessary to 
examine what is meant by the concept of a thresh- 
old. In order to do so, it will be helpful to consider 
in a skeletonized form the psychophysical pro- 
cedures for determining a sensory threshold. 

Let us consider an auditory study in which the 
intensity or energy necessary for the subject to 
detect a thousand-cycle tone is being investigated. 
The subject is in a controlled environment where 
relevant environmental stimuli are under the ex- 
perimenter’s control, The subject would typically 
be presented with a “ready” signal, perhaps in the 
form of a flash of light, to inform him that the 
auditory signal is about to occur. He would further 
be instructed to report “yes” if he heard the tone 
and “no” if he did not. The subject would then be 
presented with a thousand-cycle tone at various 
intensity levels in a random order. Traditionally, 
the threshold has been defined as the stimulus 
intensity that leads to the subject’s report of a 
stimulus 50 or 75 per cent, or more, of the time. 

Two characteristics of such a definition of 
threshold are immediately apparent. First, the sta- 
bility or sensitivity of such a threshold will depend 
critically upon the number of observations made; 
and second, the threshold represents an arbitrary 
Statistical decision. It does not represent anything 
absolute in a neurophysiological sense. 

Statistical nature of thresholds. The statistical 
arbitrariness of such a definition of threshold has 
certain consequences for experimentation on sub- 
liminal perception. The subject often makes correct 
verbal discriminations even below the 50 per cent 
point. How much better his discrimination is than 
chance can be determined if the experimental 
procedure is modified to include control intervals 
in which, unknown to the subject, no signal is 
Presented. The frequency with which the subject 
gives “yes” responses to the zero-intensity signal 
is then a measure of his “false alarm” rate, or his 
tendency to say “yes” in the absence of stimulation. 
Any increase in “yes” responses over this “false 
alarm” rate indicates some capacity for verbal dis- 
crimination. Consequently, one should not be im- 
pressed by an experiment on subliminal perception 
demonstrating that a nonverbal indicator, such as 
a galvanic skin response, will show better than 
chance discrimination at a level below the 50 per 
cent threshold. What must be asked of such an 
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experiment is how often was the verbal response 
a response to the signal at this intensity level. 
Only if it is found that the nonverbal indicator is 
appreciably or significantly more accurate than the 
verbal response is there evidence on the side of 
subliminal discrimination [see PsYcHoPHYSICS]. 

Effects of “set” upon thresholds. Another, but 
less apparent, difficulty with the threshold concept 
is that the subject’s tendency to say “yes” depends 
upon the instructions given to him. If a subject is 
instructed to make certain that he does not miss a 
signal when it occurs, then there is a tendency for 
the over-all number of “yes” responses to increase. 
On the other hand, if the subject is instructed to 
make certain that he does not report a signal when 
none is present, then there is a tendency for the 
over-all number of “yes” responses to decrease. This 
shifting of the threshold depending upon instruc- 
tions reflects the fact that the threshold obtained 
under a given experimental procedure depends 
quite pronouncedly upon the subjective criterion 
the subject adopts for what he will consider a 
signal. In a poorly defined psychophysical situation 
the subjective criterion adopted by different sub- 
jects will vary rather markedly. Some subjects will 
adopt a cautious criterion; others will be more 
radical in terms of the internal standard that they 
adopt for a “yes” response [see RESPONSE SETS]. 

Procedures for eliminating difficulties. Two pro- 
cedures are available that circumvent most of the 
difficulties of these individual differences in sub- 
jective criterion. One procedure requires that the 
subject give a confidence rating with a “yes” re- 
sponse. Thus, if the subject was highly certain that 
he had heard the signal he would assign a rating 
of “1,” whereas if he felt it was just a pure guess he 
would assign it a rating of “5.” The intermediate 
numbers would be used to reflect varying degrees 
of his subjective confidence. 

The other procedure is the “forced choice tech- 
nique” that has been intensively investigated by 
Blackwell (1953). Under this procedure the sub- 
ject is told that the stimulus can occur in one of 
several intervals, marked off by the flashing of a 
light, and he is asked to indicate in which of the 
intervals the stimulus occurred. The stimulus, of 
course, occurs in only one interval. With four inter- 
vals the subject’s chance jevel of accuracy would 
be 25 per cent. Blackwell has shown that this 
method leads to lower thresholds than are ob- 
tained by the more traditional procedure and also 
that thresholds so obtained are more stable and 
less subject to fluctuation on different sessions. 

Subliminality. The considerations just dis- 
cussed abundantly make clear that the term “sub- 
liminal” is devoid of scientific value unless the 
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specific operational context in which it is used is 
specified. It is not sufficient for an experimenter 
to say that a stimulus was subliminal. One needs 
to know the procedure, criterion, and instructions. 

Subjective confidence. Before considering the 
experimental procedures that have been employed 
in studying subliminal perception, one should ex- 
amine a phenomenon that has occasionally been 
construed as evidence of unconscious perception. 
In sensory investigations in which the subject is 
required to give his confidence rating along with 
his report of the stimulus, it is not uncommon to 
find that certain subjects show above chance ac- 
curacy on the responses they rate as pure guesses. 

There are two serious objections to accepting 
this phenomenon as unconscious discrimination. 
First, it violates our definition of unconscious since 
the subject is making a discriminated verbal re- 
port. If consciousness is equated with verbalization, 
then these reports, irrespective of the subject’s sub- 
jective confidence in them, must be considered as 
conscious. A more serious objection is that accept- 
ance of this phenomenon places the criterion of 
consciousness at the mercy of what any subject 
subjectively wants to term a guess. This latter ob- 
jection would not be as serious were it not for the 
marked individual differences that exist in the sub- 
jective standard of guessing. 


Subliminal perception 


Having clarified our concepts and our terminol- 
ogy, we are now in a position to evaluate the ex- 
perimental evidence on subliminal perception. We 
can begin by noting that in terms of our criterion 
for consciousness, the question of subliminal dis- 
crimination resolves into the question of whether 
or not a nonverbal response can be found that will 
be a more accurate indicator of the presence of 
stimulation than a verbal response. Experimenters 
have been quite ingenious in the variety of methods 
they have devised in their attempts to demonstrate 
subliminal discrimination. For the purpose of eval- 
uation, these various experimental approaches have 
been classified into the four basic methodologies 
considered below. 

Subliminal conditioning. In subliminal condi- 
tioning, the experimenter attempts to condition a 
nonverbal response to a stimulus whose intensity 
is lower than that capable of eliciting a discrim- 
inated verbal report. The conditioned response may 
be either a voluntary or an autonomic response, 
although the latter is more typical (for example, 
galvanic skin response, heart rate, pupillary con- 
traction). 

From an over-all evaluation the numerous studies 
that have attempted to demonstrate subliminal con- 
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ditioning have been unsuccessful. In the few in- 
stances in which positive results have been ob- 
tained, other investigators have been unable to 
replicate the findings or threshold criterion, for 
verbal report was inadequate. 

Conditioned nonverbal responses. The method 
for assessing thresholds by generalizing conditioned 
nonverbal responses involves the conditioning of 
some nonverbal response to a suprathreshold in- 
tensity of a stimulus, and then observing whether 
this response generalizes to values of the stimulus 
at threshold intensities. One could condition a gal- 
vanic skin response to a suprathreshold tone and 
then observe the frequency of conditioned re- 
sponses as the tone varies in intensity around the 
threshold level. The frequency of conditioned re- 
sponses as a function of the intensity of the stim- 
ulus could be used to compute a threshold that 
would then be compared with the threshold ob- 
tained by verbal report. 

Experiments in which a conditioned nonverbal, 
but voluntary, response has been compared with 
the threshold obtained by verbal report have gen- 
erally given virtually identical thresholds or detec- 
tion functions. Experiments that have employed 
autonomic responses, such as the galvanic skin 
response, typically have found that the autonomic 
response is a less sensitive indicator of perception 
or discrimination. The instability and error that 
exist in autonomic responses as currently measured 
almost ensure that such responses would be less 
sensitive indicators than a verbal report (see Du- 
lany & Eriksen 1959). 

Subception. The methodology used in subcep 
tion is virtually identical with the preceding one; 
the difference lies primarily in how the results are 
analyzed. To each stimulus presentation the sub- 
ject gives both the conditioned nonverbal response 
and a verbal response concurrently. The subception 
effect consists in the demonstration that the oe 
ditioned nonverbal response shows above Ba 
accuracy, on the average, over those trials in whi 
the verbal response was wrong. fer 

Although the subception effect has been Po 
preted as demonstrating subliminal percep! a 
(Lazarus & McCleary 1951), it can be more ee 
moniously explained in terms of noncorre! a 
sources of error in the two concurrent Be re- 
or response systems. For example, ifara A 
sponse is being studied concurrently with we 
vanic skin response, when the stimulus is pres ri 
there will be some occasions in which ae sponse 
response is in error but the galvanic skin Bee, 
gives the correct discrimination, and one = 
there will be other trials in which the ver 
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gives a wrong response. The less than perfect cor- 
relation between these two concurrent responses can 
be attributed to different sources of error that effect 
one response and not the other at a given moment 
in time for a given stimulus presentation. That 
certain variables will differentially effect one type 
of response and not another has been shown for 
several different response systems (Eriksen 1960). 
Further evidence that the subception effect does 
not represent subliminal discrimination is the fact 
that in these experiments, if the performance in 
the experiment as a whole is examined, it is typi- 
cally found that the verbal response conveys more 
discrimination or more information than does the 
nonverbal response. 

Subliminal stimulation and behavior. Rather 
heterogeneous methodologies are united by the fact 
that subliminal stimulation has indirect effects on 
subsequent behavior. ‘The assumption uniting 
these methodologies is that subliminal stimulation 
does not lead to precise discrimination but rather, 
through the distortion of unconscious processes and 
defense mechanisms, reveals itself in subsequent 
behavior in indirect forms. The subject is presented 
with a subliminal stimulus, and the effects of this 
stimulation are sought in subsequent dreams, fan- 
tasy productions, or mental associations. Repre- 
sentative of such experiments are those of Fisher 
(1954), Hilgard (1962), and Shevrin and Lubor- 
sky (1958). 

Since the effect of the subliminal stimulation is 
expected to be distorted or transformed, or to ap- 
pear in a symbolic rather than the identical form 
of the stimulation, the detection of this subliminal 
influence poses severe methodological problems. 
Numerous sources of artifact are present in such 
experimental designs, as have been pointed out by 
Johnson and Eriksen (1961). Interpretation of 
these experiments is further hindered by the failure 
of experimenters in most cases to ensure that the 
stimulation was indeed incidental or to specify the 
Criterion by which they determined that the stimu- 
lation was subliminal. In many of the experiments 
it is quite obvious that the stimuli were not sub- 
liminal, and the subject’s lack of awareness of the 
stimulation was due to the nature of the experi- 
ment, which directed his attention to other stimuli. 
In this case the experiments are more concerned 
with incidental stimulation than with subliminal 
stimulation and bear a strong resemblance both 
methodologically and conceptually to the traditional 
work on incidental memory. 


Incidental perception 


i Although the evidence for subliminal perception 
s largely negative, the possibility that behavior can 
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be directed by above-threshold cues of which the 
subject is unaware is not only more plausible but 
has somewhat better experimental substantiation. 
We know by common sense that we are constantly 
utilizing cues of which we are unaware in our 
perception of depth and of shape and size con- 
stancy, as well as a number of cues that guide such 
complex motor habits as playing the piano and 
driving an automobile. 

The methodology employed in experiments on 
incidental perception makes use of one or another 
of a few basic techniques. In most cases the per- 
ceptual cue is presented without the subject’s being 
forewarned that it will be present. Typically, it is 
presented at an intensity so low that it does not 
command attention. In other experimental arrange- 
ments the cue may be of a more obvious nature, 
but the subject is prevented from attending it be- 
cause of misdirection about the purpose of the 
experiment or the nature of the experimental task. 
In fact, misdirection of attention seems to be a 
crucial consideration in experiments on incidental 
perception; this suggests that perhaps these exper- 
iments are dealing more with the concept of atten- 
tion than with one of awareness. 

Two separate lines of investigation using inci- 
dental perceptual cues can be distinguished. The 
first is concerned with the effect of incidental cues 
upon the behavior that was learned outside the 
experimental situation. For example, the effect of 
the incidental stimulation of the word “angry” upon 
the interpretation of ambiguous profiles might be 
studied. The other approach is concerned with the 
problem of perceptual learning without awareness. 
The first approach we will term “performance with- 
out awareness” and distinguish it from the problem 
of “learning without awareness.” 

Performance without awareness. There have 
been several experiments demonstrating that cues 
of which the subject is unaware are capable of 
producing perceptual illusions ( Dunlap 1900; Bress- 
ler 1931). Bressler, using the Mueller—Lyer illu- 
sion, found evidence of illusory effects in the 
judgment of the length of the two lines when the 
directional arrows were so faint that the subjects 
reported that they were unaware of their presence. 
Perky (1910) and Miller (1939) projected geo- 
metric forms on the back of a ground-glass screen 
at low intensities and found that the patterns pre- 
sented were influential in determining the subject's 
choice of a geometric pattern. The effect was found 
even when the subjects reported that they had not 
been aware that a pattern was actually being pro- 
jected on the screen. 

In more recent work, Smith et al. (1959) pre- 
sented their subjects with a line drawing of an 
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expressionless human face. The face was continu- 
ously presented to the subjects except for short 
interruptions during which the word “happy” or 
“angry” was briefly flashed on the screen. Although 
nearly all of the subjects reported that they had 
not noticed the flashing of the words, some sug- 
gestive evidence was found that the subjects’ inter- 
pretations of the face were more pleasant following 
exposures of the word “happy” than following ex- 
posures of the word “angry.” Exposures of the word 
“angry,” however, had little or no effect upon the 
subjects’ interpretations of the ambiguous face. 

These experimenters, as well as others, have 
interpreted the results as demonstrating the effects 
of subliminal cues upon perception. In terms of our 
criterion of subliminal, however, none of these 
experiments permit such interpretation. In Bress- 
ler’s experiment even the faintest lines he em- 
ployed, when studied independently of the illusion, 
were found to yield better than chance discrimina- 
tion by his subjects. Vinacke’s data show clearly 
that Miller’s and Perky’s results are not obtainable 
when the geometric forms are shown at an 
intensity level below even a rather crudely deter- 
mined psychophysical threshold (Vinacke 1942). 
And in the study by Smith et al. (1959), observa- 
tions upon a control group of subjects, who had 
been alerted to the fact that the words were being 
flashed, indicated that most of the word exposures 
for the experimental group were above the level 
for recognizability of the word. 

Even when the above studies are considered as 
demonstrations of incidental rather than sublim- 
inal perception, the conclusions must be accepted 
with caution. In most cases the effects are slight, 
and careful replication of these studies is needed 
before confidence can be placed in the effect. 

Although there is an extensive literature on in- 
cidental learning and learning without awareness, 
most of this effort has been concerned with verbal 
learning. Little work has been done on perceptual 
learning without awareness. Brunswik and Herma 
(1951) found that an incidental color cue associ- 
ated with the lighter or heavier weight in a weight- 
lifting experiment resulted in illusory judgments 
when a subject was confronted with equal weights 
that differed in color. Since the subjects were un- 
able to report an awareness of the correlation be- 
tween color and weight, it would seem that they 
had learned the association of the cue uncon- 
sciously. However, a more carefully controlled rep- 
lication of this experiment by Levine (1953) failed 
to confirm the finding. 

Eriksen and Doroz (1963) have reported a series 
of experiments using perceptual tasks containing 
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incidental cues that the subject could use as a 
guide for his perceptual judgments. Although a 
certain percentage of the subjects learned to use 
the cues, all of those who did were readily able to 
verbalize the nature and the presence of the cue, 
The results of these experiments on incidental per- 
ceptual learning seem to be consistent with the 
increasing evidence that human adults do not learn 
unless awareness can be verbalized. 


CHARLES W. ERIKSEN 


[Other relevant material may be found in ATTENTION; 
LEARNING, articles on CLASSICAL CONDITIONING, DIS- 
CRIMINATION LEARNING, and INSTRUMENTAL LEARN: 
ING; PsYCHOPHYSICS.] 
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PERIODIZATION 


Periodization attempts to impart significance to 
the passage of time in history by identifying and 
ordering chronological sequences (periods). As 
practiced by historians, it has a long and varied 
history; as a subject of study, it commands neither 
a formal body of knowledge nor systematic instruc- 
tion. To the historian, although not to the archeol- 
ogist or anthropologist, periodization serves no 
accepted theoretical function. For unlike the con- 
cept of period in the earth sciences or of periodicity 
in the physical sciences, the concept of historical 
period depends more on stipulation than on infer- 
ence from commonly accepted evidence. As to mod- 
ern philosophers of history, both the nominalis- 
tically and the neoidealistically inclined have denied 
that historical periods are “real”: the former be- 
cause a period cannot be said to exist in the sense 
in which a historical event or person exists; the 
latter because they see all ordering of historical 
Materials as a function of the individual historian’s 
ee ee 1927; Croce [1917] 1960, chap- 
Pet eel lends itself to broad typology. In 

e following account of some of the principal 
Periodization schemes in Western history, two 
ees types, among others, are distinguished. They 

ay be conveniently labeled lawlike (historical 
peat are significant as manifesting the operation 

a cosmic, divine, biological, or social force) and 
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pedagogic (historical periods are significant as 
didactic or heuristic devices, the concept of under- 
lying forces being minimized or ignored). 


Early history 


Mythopeic and religious thought stand at the 
origin of lawlike periodization, specifically at the 
origin of the idea that terrestrial events follow time 
cycles. The biological birth-death cycle and the 
seasonal cycles of sun and moon were related to 
cosmic cycles of redemptive significance. Cyclical 
paradigms are found in virtually every ancient 
Middle Eastern and Far Eastern world view (Cairns 
1962). Thus, alongside calendrical and astronom- 
ical periods, a variety of cosmic or divine periods 
were devised: the great cosmic year or divine year 
was a cycle of varying duration from 3,000 to 
36,000 years. Classical antiquity and early Chris- 
tianity adopted and transformed some of these and 
other mythical time divisions, thereby laying the 
foundation for Western thought. 

In classical antiquity, the ancient myth of four 
metallic ages (gold, silver, bronze, and iron) was 
reinterpreted for the Greeks by Hesiod (c. eighth 
century B.C.) and popularized for the Romans in 
the poetry of Ovid and Vergil. The cycle itself 
(pertodos, Gr.; periodus, L.) figured more in phi- 
losophy and cosmology than in history, But at least 
one historian, through whom cyclical notions came 
to be transmitted to Machiavelli and other classically 
influenced writers, utilized the idea of the cycle: 
Polybius (c. 203-c. 120 B.c.). Other influential 
classical conceptions attempted to connect mythical 
ages to calculable chronologies. The Roman Varro 
(116-27 B.C.) created a tripartite scheme: the ob- 
scure, the fabulous, and the historical periods—the 
last-named commencing with the first Olympiad 
(776 B.C.). 

The two principal Christian periodizations, des- 
ignating terrestrial events as successive stages of a 
divinely ordained rhythm, were as follows: (1) The 
interpretation of Daniel's dreams of four kingdoms 
(Daniel 2.31 ff., 7.17 ff.), whose contents resem- 
bled the Hesiodic myth, as four successive empires 
or monarchies. The idea of four monarchies— 
Babylonian, Medo-Persian, Macedonian, and Roman 
__dominated historiography until at least the six- 
teenth century. The Roman Empire, having been 
designated as lasting to the end of the world, was 
necessarily seen as continued by the Byzantine and 
Frankish emperors. Hence the emphasis on dating 
periods within the fourth and final empire by dynas- 
ties and individual rulers, a chain of dating which 
is still routine classroom periodization for much of 
European history. (2) Saint Augustine's addition 
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of three periods to the three 14-generation periods 
from Abraham to Christ which are laid down in the 
Bible (Matthew 1.17). Augustine made it six ages 
in all, corresponding to the six days of creation— 
five ages from Adam to Christ and the sixth from 
Christ to the end of time. The seventh to come was 
the Sabbath day or millennium. This scheme not 
only influenced Christian chronographers and 
chroniclers and, since each age came to be reck- 
oned at 1,000 years, enabled calculation of the end 
of the world; it also produced the modern conven- 
tions of dating. The oldest and longest-lived was 
by years A.M. (anni mundi), since the creation; 
years A.D, (anni [ab incarnatione] Domini) ap- 
peared in the sixth century and years B.C. (anni 
ante Christum [natum]) in at least two works pub- 
lished in the fifteenth century. The currently wide- 
spread belief that years B.c. first appeared in the 
eighteenth century is demonstrably wrong, but it 
was probably not much before 1700 that the idea 
of pure chronological reckoning, or dating by years, 
emancipated itself from the periodization schemes 
which it subserved in the Christian West. 


Modern history 


The renascence and development of secular 
learning from the fifteenth to the eighteenth cen- 
turies, in particular the emergence of history as a 
discipline very nearly independent of moral philos- 
ophy and rhetoric, produced new concepts of peri- 
odization. First, contemporary scholarship in law, 
language, and letters created an awareness of the 
discontinuities in the eternal Roman Empire; post- 
classical Latin, for example, was obviously unlike 
classical Latin. A second period, a medium aevum, 
originally a theological notion, was posited. Chris- 
tophorus Cellarius, 1634-1707, though not the first 
to conceive of it (Pot 1951, pp. 113-122; Bernheim 
1889, vol. 1, p. 77 in the 1908 edition), enunci- 
ated the division of history into ancient history (to 
A.D. 311), medieval history (to 1453), and modern 
history. Despite continuing differences about bound- 
ary dates (A.D. 476 for 311; 1485, 1492, or 1517 
for 1453), Cellarius’ tripartite division became, and 
has remained, the principal accepted periodization 
for Western history. This is undoubtedly so because 
historians have come to regard its significance as 
pedagogic, as offering easily conveyed, broad char- 
acterizations of cultural differentiation, void of 
metaphysical and pseudoscientific speculation. A 
second product of the revival of secular learning 
was the occasional substitution of classical ideas 
of the mutability of fortune for the Christian ethic. 
Aristotelian and Polybian in origin, this notion in- 
volved concepts of cyclical revolutions and succes- 


sions of states; being almost exclusively confined to 
political philosophy, its impact on historiography 
was slight. More persistent was numerical periodi- 
zation, the search for cardinal numbers whose 
recurrence explained the number of years interven- 
ing between significant historical events or their 
duration, Partly of Platonic and Pythagorean origin, 
it was revived by the political philosopher Jean 
Bodin, among others. In his Method for the Easy 
Comprehension of History (1566), he showed the 
duration of lives of famous men to be multiples of 
7 and 9, and elsewhere he used numerology to 
establish the likely duration of states (496 years). 
Numerical periodization also had Biblical sanc- 
tion: counting the numbers of generations to 
obtain chronological divisors goes back to the Old 
Testament. 

By the eighteenth century, the new scholarship 
had prepared the ground for periodizations as law- 
like as those of their Christian and classical pred- 
ecessors but explicitly secular and socially oriented. 
Under the influence of scientific and geographic 
discovery, of the quarrel about the superiority of 
the moderns over the ancients, and of the spread 
of antiabsolutist ideas in politics and in philosophy, 
a variety of forward-looking doctrines had come 
into being. These are conveniently summed up as 
the idea of progress. History—past, present, and 
future—was to be a mirror of the working out of 
the successive stages of this idea. Conversely, the 
large accretions of historical materials coming to 
hand were held to be intelligible only as manifesta- 
tions of the periodic and progressive development 
of one or more of the new, enlightened doctrines. 
Throughout the eighteenth and nineteenth cen- 
turies, histories which rose above the purely nat 
rative or factual level tended to be written on that 
principle. Its earliest significant representative was 
the Italian Giambattista Vico, 1668-1744. He com- 
bined the older, cyclical model with the nma 
progressive-stage model by positing 4 higher le 
cyclical recurrence (ricorso) of his three- : 
cycle (corso). That cycle consisted of the aE 
gods, heroes, and men, each age being charact 
ized by the literary, legal, linguistic, religious, EA 
behavioral developments appropriate to it; m 
ample, the development of thought, which unde 4 
the whole scheme, took the Hepes forms 
sensation, imagination, and rationality. ; 

Progressive French thinkers elaborated nal 
periodization schemes by extending the intel ee 

x rfectibility, 
vocabulary of progress to the idea of pe enlight- 
by showing a Voltairean disdain for the un ane 
ened Middle Ages, and sometimes by predic ‘Alone 
socioeconomic utopia as the final period. 


these lines Turgot, 1727-1781, and Saint-Simon, 
1760-1825, produced three-stage periodizations 
of considerable subsequent influence. Condorcet’s 
Sketch . . . of the Progress of the Human Mind 
(1795) divided history in terms of the progress of 
knowledge: significant cultural inventions deter- 
mined successive stages—for example, the fourth 
stage of Condorcet’s ten-stage periodization be- 
gins with the invention of the alphabet and the 
eighth with printing. In Germany, progressive 
themes were adumbrated by Kant, Herder, and 
Fichte. The climax was reached in the four-stage 
periodization scheme of Hegel's Philosophy of His- 
tory (1837). In its specific details—how the em- 
bodiment of the idea of reason, the so-called world 
spirit, successively realized itself in the Oriental, 
the Greek, the Roman, and the Germanic worlds— 
the scheme has not survived its authors fame. 
(That was also the fate of its great French counter- 
part, Auguste Comte’s Positive Philosophy [1830- 
1842], which divided history into a theological, a 
metaphysical, and a scientific period; this was done 
according to positivist beliefs totally unlike those 
of the idealist Hegel, of whose work Comte de- 
clared deliberate ignorance.) However, Hegel's basic 
thesis, that differing historical periods represent 
successive realizations of an identical principle, so 
that meaningless chronological succession has been 
subjected to meaningful logical (dialectical) order, 
is still with us. Especially in its materialist version, 
with economic forces in place of the world spirit 
(as in Marxism), it is the only species of lawlike 
periodization continuing to make an impact on 
social science. 


Recent attempts 


In the twentieth century, a number of schemes, 
most of them modifications of earlier ones, exist 
side by side. Classical Marxism, claiming to have 
substituted scientific, lawlike periodization for 
Hegel’s rhythm of the spirit, divides history into 
five periods: primitive communalism, classical 
slavery, Western and Asian feudalism, capitalism, 
and socialism (communism). These correspond to 
identifiable stages in the development of productive 
forces and the social relations they create. Contem- 
Porary Marxist scholars allow for more variety 
within this framework and hold that it parallels, 
and explains, the traditional ancient-medieval— 
modern periodization. Various levels of generality 
of periodization are admitted, including so-called 
Private periodization within the accepted larger 
sans (Zhukov 1960). This rapprochement be- 
SET pedagogic and lawlike periodization does not 

scure the fundamentally differing starting points 
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of each in respect to determinism, inevitability, 
prediction, and other consequences of a belief in 
historical laws. 

The other two comprehensive lawlike periodiza- 
tions of our time are the works of Oswald Spengler 
(1918-1922) and Arnold Toynbee (1934-1961). 
Thematically, they hark back to the birth-death 
cycles common to Eastern and ancient Western 
cosmology and revived intermittently since. There 
is a finite number of historical units: 8 cultures in 
Spengler, 21 civilizations in Toynbee. Each under- 
goes—inevitably in Spengler, with qualifications 
and alternatives in Toynbee—four periods of devel- 
opment: birth, growth, aging, and death. This 
periodicity is morphological or physiognomic, de- 
scriptive of states of a cycle and not expressing the 
development of some substantive concept like eco- 
nomic or intellectual advance. Spengler explains 
differentiations within and between cultures in styl- 
istic terms (Newton’s physics was “baroque” ). 
Indeed, cultural historians generally, at any rate 
since the pioneering Swiss historian Jacob Burck- 
hardt, 1818-1897, have had recourse to stylistic 
periodization. Identification of periods in terms of 
style has been used to advantage by literary and 
art historians. To the general historian, this has 
not been persistently useful. To extend such terms 
as “neoclassical” or “romantic” from the aesthetic 
phenomena of a period to social, economic, dip- 
lomatic, and political phenomena may invite verbal 
confusion. It may also give a period a unity it does 
not possess—for example, Germany in the Napole- 
onic period reached cultural, especially literary, 
peaks but was politically impotent. Not even most 
of the art forms in a given period are likely to be 
correctly characterized by applying to them stylistic 
terms applicable to one or two of them, [See STYLE.] 

Numerical periodization, mentioned above, has 
also survived, in form but not in repute, and the 
life-span of patriarchs is no longer its subject. In 
the nineteenth century and in the first half of the 
twentieth century, newer versions of counting bio- 
logical lifetimes to which a numerical value was 
assignable appeared. The once best-known was the 
‘Austrian O. Lorenz “law of three generations” 
(1886). Three generations make up 100 years; 
hence centuries are the spiritual units of history; 
large-scale events tend to occur every 3 x 3 or every 
6 x 3 generations, that is, at 300- and 600-year 
intervals. While rival systems based detailed nu- 
merical-period calculations on the prestigious natu- 
yal sciences of the day—as Henry Adams did 
with physics in his essay “The Rule of Phase Ap- 
plied to History” (1909)—the concept of genera- 
tion as the numerical key to history remained the 
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most diligently expounded theme in the half cen- 
tury after Lorenz (Teesing 1949, chapter 3). 
Whether stated as the father-son problem, as sim- 
ple psychological antitheses and coherences (as 
with Mannheim), or as the discovery of the num- 
ber of years of duration of an intellectual genera- 
tion (as with Ortega y Gasset, whose number was 
15), the concept of generations as the key to bio- 
ideational periods in history has been a topic 
attracting much scholarly effort. [See GENERA- 
TIONS.] 


Contemporary pedagogic periodization 


As already indicated, pedagogic periodization is 
the only scheme generally accepted by modern his- 
torians, however shallow and void of commitment 
to ultimates it may appear to Marxist and theo- 
logical critics. Textbooks and college courses treat 
chronological divisions like national divisions: pri- 
marily as manageable and secondarily as meaning- 
ful slices of a subject that cannot be digested 
whole. Obviously, there are reservations in accept- 
ing the ancient-medieval—modern scheme and its 
many subdivisions—almost as many reservations 
as there are individual historians. Most of those 
fall into two broad classes: (1) Acceptance of con- 
venient periodization tends to imply acceptance of 
established terms denoting periods (like archaic, 
Middle Ages, Enlightenment, Reconstruction) but 
not necessarily of terminal dates. Especially where 
terms—as period-denoting terms—are not of con- 
temporary origin (like Elizabethan, coined two 
centuries later) or denote relatively miscellaneous 
habits of thought (like Renaissance), historians 
using the same terms are likely to assign different 
terminal dates to them. Sometimes, the terms them- 
selves are called into question—but usually to im- 
prove terminology, not periodization (Early Middle 
Ages for Dark Ages, for example). (2) Specialized 
studies tend to substitute a time structure of their 
own, derived from the changes in features of the 
subject studied. The history, say, of price structure 
would not be most advantageously understood in 
terms of the ancient—-medieval—modern or any other 
division designed to make large-scale cultural dif- 
ferentiations more manageable. Hence the fact 
that periodization as a scholarly pursuit is less in 
evidence now than in earlier times, when history 
was studied as nonspecialized universal history 
covering all known mankind and all known ages. 
While all specialism necessarily inhibits this pur- 
suit by devaluing the use of larger periods, some 
specialism strengthens the use of smaller ones. 
What is perhaps most significant about pedagogic 
periodization today is the use of smaller units. 


Concentration on shorter periods of study has 
widened the gulf between pedagogic and lawlike 
periodization. This is not the case because the for- 
mer deals in decades and the latter in millennia— 
a generalization by no means true. It is rather that 
the smaller units employed nowadays make the 
prerequisite of lawlike schemes—namely, the as- 
sertion of a single spiritual, economic, biological, 
numerical, or psychological law or principle—in- 
creasingly unlikely. Recognized period designations 
are drawn typically from ecclesiastical history 
(Reformation). political history (Colonial Period), 
dynastic history (Victorian), chronology (Eight- 
eenth Century), science (Darwinism), and schol- 
arship (Humanism). This variety has strengthened 
the recognition of the purely conventional char- 
acter of periodization. But there seems no widely 
recognized or epistemologically warrantable impli- 
cation that current practice is subjective and law- 
like periodization objective. Variety expressed in 
conventions that are appropriate to it suggests 
greater congruence between study and subject 
studied, Periodization as convention rather than 
ontological proof suggests greater congruence with 
scientific method. Admittedly, there is more loose- 
ness, more dispute, more disagreement on dates 
and other factors affecting precise delimitation of 
the units into which history is broken up. But if 
different historians with different viewpoints ar- 
rive at different configurations of periods, the pre- 
sumption is that they are accurate observers rather 
than accurate believers. At least the presumption 
is greater than if they looked at the complexity 
which is history and turned in identical results. 

GrorcE H. NADEL 


TIME SERIES, 4° 
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PERLMAN, SELIG 


Selig Perlman (1888-1959), American econo- 
mist, was born in Bialystok, Poland. After attend- 
ing the school of commerce in Bialystok from 1900 
to 1906 and the University of Naples for a year, he 
went to the United States in 1908. He gained his 
bachelor’s degree from the University of Wisconsin 
in 1910 and his PH.D. five years later. 

Perlman served as a special investigator for the 
U.S. Commission on Industrial Relations from 
1913 to 1915, and in 1916 he was appointed to a 
Tesearch position at the University of Wisconsin. 
He eventually rose to a professorship in the depart- 
ment of economics in 1927 and remained at Wis- 
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consin until his retirement in 1959. He was the 
author of many books and articles on the labor 
movement. 

Under the influence of John R. Commons, Perl- 
man’s early Marxist views were modified. He con- 
cluded that the ideas of socialism and anarchism 
are the creations of intellectuals (that is, writers, 
teachers, scholars, and even autodidacts) who ap- 
proach social and economic problems from an 
excessively abstract point of view. The attitudes 
and aspirations of the workers may better be under- 
stood by studying the institutions they themselves 
have created: the trade unions. These reflect the 
manual workers’ consciousness of their limited 
opportunities and their need to reinforce their in- 
ferior bargaining position vis-à-vis their employers. 

Labor unions are, in his opinion, essential in an 
industrial society, for some mechanism is needed 
for redressing grievances and equalizing bargain- 
ing positions. Not only do these institutions enlarge 
the freedom of the individual and give him a voice 
in the determination of the conditions of employ- 
ment, but they are an aid to social and political 
stability. Perlman believed that the trade unions 
are conservative institutions; they neither attack 
the basis of private property nor aspire to socialize 
it. He was convinced that the workers would turn 
to more radical means, if they were prevented by 
law from utilizing the trade unions to redress their 
economic grievances. Such legal restriction would 
be an undesirable development, in that it would 
mean a diminution in the influence of essentially 
democratic trade organizations; eventually it would 
lead to a replacement of the trade unions by po- 
litical groups that would reflect less accurately the 
needs of the worker and would not be subject to 
the same kinds of direct pressure as are the trade 
unions. 

From his study of American labor history, Perl- 
man concluded that job-conscious unionism— 
emphasis on immediate demands and day-to-day 
improvements in the place of employment—is the 
result of the thinking and planning of the trade 
union leaders of the 1870s and 1880s, many of 
whom had been schooled in the radical philoso- 
phies of Europe. These leaders, he claimed, trans- 
formed their belief in class consciousness into a 
belief in wage consciousness. He pointed to the 
struggles of these pioneer unionists to support his 
views that the organization and maintenance of 
economic organizations of labor are acts of states- 
manship requiring both a high level of intelligence 
and a willingness to sacrifice. 

Perlman had a wide acquaintance with Euro- 
pean history, especially of the labor movement. He 


586 PERSON PERCEPTION 


was interested primarily in the attempts of Euro- 
pean workers to set up economic organizations 
capable of protecting their interests, but he was 
generally a close student of political trends within 
European labor. With serious and often painful 
concern he watched the destruction of independent 
economic and political organizations on the Euro- 
pean continent in the 1920s and 1930s. He was 
the first to challenge the view propagated by the 
Marxists that fascism was a form of monopoly 
capitalism, and he claimed that in fact many 
fascist doctrines were the staples of lower-middle- 
class opinion. Although this view was presented in 
his classes in the 1920s, Perlman, who was not 
very much concerned with publication, did not 
offer it to a larger audience until much later. 
Perlman was one of the leading classroom 
teachers at the University of Wisconsin. His color- 
ful language, his wit, and his sharp improvisations 
made his classes an important intellectual experi- 
ence. His influence led generations of students to 
examine existing institutions and the attitudes and 
behavior of groups instead of concentrating upon 
general principles, and many of these students 
were able to function more effectively in govern- 
ment and industry as a result. A shy man, Perlman 
was not at home in large gatherings and was rarely 
seen at meetings of learned societies. To some ex- 
tent his caution in publishing, his avoidance of 
debate and argument, and his tenacity in main- 
taining a proposition once proclaimed reduced his 
reputation, at least outside of the University of 
Wisconsin. Nevertheless, Perlman was one of the 
leading students of the labor movement, and his 
teaching and writing made a permanent mark 
upon the field. 
PHILIP TAFT 


[For the historical context of Perlman’s work, see the 
biography of Commons. For discussion of the sub- 
sequent development of his ideas, see LABOR UN- 
IONS, article on THEORIES OF THE LABOR MOVE- 
MENT. ] 
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PERSON PERCEPTION 
See under PERCEPTION. 


PERSONAL PROBABILITY 
See BAYESIAN INFERENCE. 


PERSONALITY 


The articles under this heading describe the field 
of personality and the concept of personality devel- 
opment. For discussions from other than psycholog- 
ical viewpoints see CULTURE AND PERSONALITY: 
NATIONAL CHARACTER; PERSONALITY, POLITICAL. 
Important areas of investigation within the field of 
personality are reviewed in ACHIEVEMENT MOTIVA- 
TION; ANXIETY; ATTITUDES; BODY IMAGE; CONCEPT 
FORMATION; DEFENSE MECHANISMS; DRIVES; EMO- 
TION; IDENTITY, PSYCHOSOCIAL; MENTAL HEALTH; 
MORAL DEVELOPMENT; MOTIVATION; PERCEPTION: 
articles on PERSON PERCEPTION and SOCIAL 
PERCEPTION; ROLE; SELF CONCEPT; STRESS. Theo- 
retical accounts of major personality phenomena 
are found in COGNITIVE THEORY; FIELD THEORY; 
GESTALT THEORY; PERSONALITY: CONTEMPORARY 
VIEWPOINTS; PSYCHOANALYSIS, articles on CLAS 
SICAL THEORY and EGO PSYCHOLOGY; PsyCHOLocY, 
article on EXISTENTIAL PSYCHOLOGY; SYSTEMS 
ANALYSIS, article on PSYCHOLOGICAL SYSTEMS: 
THINKING, article on COGNITIVE ORGANIZATION ae 
PROCESSES. Personality development is discusse! ri 
DEVELOPMENTAL PSYCHOLOGY; INFANCY: INTEN 
TION, article on INTERACTION AND REMON 
Methods of assessing personality are reviewe ue 
FACTOR ANALYSIS, article on PSYCHOLOGICAL ie 
CATIONS; INTERVIEWING, article on PERSONAE N 
APPRAISAL; PERSONALITY MEASUREMENT; pao 
TIVE METHODS; SOCIOMETRY; TRAITS. Ae 
states of personality are described in MENTA 
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1. THE FIELD 
1. PERSONALITY DEVELOPMENT 


I 
THE FIELD 


There are common-sense as well as technical 
uses of the term “personality.” It is common sense 
to observe a person over time, to note that he is 
consistently aggressive or dominant or submissive, 
and to think of personality as the aggregate of 
traits such as these. It is also common sense to 
think of personality as what is referred to when a 
person uses the pronoun “I.” The reference here is 
to the self, an aggregate of feelings, ideas, inten- 
tions, and evaluations that identifies a person to 
himself and distinguishes him from other people. 
The sense of self has continuity over time and per- 
mits one to feel that he is the same person today 
that he was yesterday and—with a little luck— 
will be tomorrow. 

Both of these conceptions have much in common 
with, but neither is the same thing as, personality 
as it is defined by most specialists in the field. Not 
all consistency of behavior can be ascribed to per- 
sonality. The characteristic aggressiveness of a 
person, for example, may be due not so much to a 
persisting disposition of personality as to the fact 
that the person is constantly in a situation so frus- 
trating that it would evoke aggressive behavior in 
anyone. Personality, in its most widely accepted 
technical sense, refers to dispositions in the person 
that help to determine his behavior and that differ 
from one person to another. Again, there may be 
durable dispositions in a person that determine 
some of his behavior but do not enter into his 
awareness, either because he is unwilling to admit 
their existence or because he is unable to formu- 
late them for himself. 

Personality, then, refers not to observable be- 
havior itself but to dispositions that lie behind be- 
havior. Observed consistencies in overt behavior 
and individuals’ reports about themselves are im- 
portant bases for inferences about personality, but, 
as will be shown, the study of personality requires 
that these sources of information be supplemented 
by the use of special techniques. In the view of 
Most students of the subject, the dispositions of 
Personality constitute an organized totality, a more 
or less enduring structure that interacts with an 
environment. 

Personality theory has an important place in 
much psychological and social science work. It is 
highly relevant to practice aimed at improving the 
functioning of people, at curing or correcting mal- 
pectioning, at preventing disorders, or at provid- 
a Conditions favorable to individual development. 

s basic to the diagnosis or psychological assess- 
Ment of problems in individuals, to most forms of 
Psychotherapy, and—since problematic conditions 
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must often be seen in the context of the whole 
person—to research in psychopathology. Social 
actions affecting groups of people are commonly 
based on explicit or implicit assumptions about 
what people in general will do under certain con- 
ditions; it seems highly desirable that the psychol- 
ogy underlying such actions include a conception 
of the person as a whole, for an action meant to 
affect a particular condition can affect various fea- 
tures and functions of the person; and if there 
were no theory to suggest such possible conse- 
quences, the action might do more harm than 
good. Those who undertake social actions must 
also consider not only universal needs and the 
modes of their organization but the differing per- 
sonality dispositions shared by people who have 
been brought up in different social groups. The 
same considerations hold for the efforts of social 
scientists to explain events on the larger social 
scene, e.g., within or among nations or within 
social structures such as industrial, correctional, or 
educational institutions. 

Personality theory can aid substantially in un- 
derstanding the motivations of influential indi- 
viduals and the shared dispositions of collectivities 
of people (e.g., the susceptibilities to different 
propagandistic appeals). Overt behavior, as noted 
above, is heavily dependent upon the situation in 
which people live, and the organized dispositions 
of the person are built up largely through experi- 
ence in the social group, but the matter may also 
be taken the other way around: social processes 
depend to some extent upon personality processes. 
Nationalization of basic industries in Britain, for 
example, does not appear to have radically changed 
attitudes of laborers there toward work. Belief 
systems commonly express personality needs and 
are modified according to principles of personality 
change, e.g., education designed with attention to 
the nature of cognitive structures and of the un- 
conscious determinants of beliefs. Again, people 
learn to use their roles in organizations to serve 
their personality needs; hence, changes in an or- 
ganization’s role structure may involve personality 
changes in individuals. Change in performance 
might require more than the manipulation of eco- 
nomic incentives; individuals might, for example, 
have to be shown different ways of maintaining or 
enhancing their self-esteem. 

Theorists differ about what the elements of per- 
sonality are, how they are organized, what the 
boundaries of personality are, and how personality 
interacts with other phenomena. These problems 
will be discussed in turn; and we may thus identify 
some of the major issues, problems, and lines of 
empirical inquiry in the field of personality today. 
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Finally, discernible trends in thought and research 
are to be considered. 


Elements of personality 


In contemporary psychology, elements of per- 
sonality are hypothetical; they are concepts repre- 
senting something assumed to exist in nature. They 
are inferred from behavior, but there is never a 
one-to-one relation between an observable pattern 
of behavior and an element of personality: All 
observable behavior is a function not only of proc- 
esses in the person but also of the situation in 
which the person exists at the moment. The con- 
ceptualization of elements of personality is neces- 
sary because individual patterns reappear in dif- 
ferent situations and because individuals react 
differently to what seems to be the same situation. 

Since elements of personality are concepts de- 
veloped in an effort to understand a complex struc- 
ture in which there are many levels of organiza- 
tion, and since the discovery or determination of 
any element is difficult, it is not surprising that 
psychologists have offered many and varied pro- 
posals for the analysis of personality. 

The major underlying issues here are (a) 
whether to begin with a conceptual scheme of the 
total personality or to begin with empirical meth- 
ods; (b) whether there is a distinctive psychology 
of personality or a general psychology that includes 
personality; and (c) whether to use a behavior 
theory, which focuses on particular stimulus- 
response relationships, in the hope that the de- 
velopment of empirical laws governing precisely 
measured variables will eventually lead to an un- 
derstanding of complex processes, or to use a 
dynamic-organismic theory, which assumes that 
simple processes such as stimulus—response con- 
nections are always in part determined by larger 
structures within which they occur—structures 
explicable only as wholes. These issues are dis- 
cussed in turn. 

Conceptual approaches. Theorists and investi- 
gators have tried every meaningful way to “divide” 
personality. Often, it is true, they have simply 
made arbitrary cuts, usually in the interest of some 
practical aim, such as the assessment of some per- 
sonality characteristic (e.g., intelligence) deemed 
to be relevant to performance in some social role 
(e.g., that of college student). Where there is con- 
cern with a single individual about whom decisions 
have to be made, it is common practice to “divide” 
by abstraction, breaking down the whole into a 
number of distinguishable properties or features, 
e.g., soundness, stability, or maturity. Although 
making arbitrary cuts tends to fragment the per- 


sonality, and abstracting features of the 
tends to result in vague attributes with lit 
pirical support, these practices can yield 
may later find a place in a theoretical for 
of personality; meanwhile, they help the Į 
ogy of personality to meet some of the p 
demands made upon it. 7 
Many widely studied personality elements 
trate random cuts—e.g., particular interests 
attitudes, or such characteristics of every 
havior as sociability or politeness—and ab: 
features of the whole—e.g., activity level, 
plexity, or sense of well-being. Reflected 
the psychologist’s faith that once suitable 
have been devised, relatively indepen 
ments of personality can be discovered ant 
ured by objective means. Sometimes the 
on the method, on what can be measured, 
than on what is theoretically significant; bi 
method-centered investigator can always find 
port in the argument that until elements 
sonality have been measured, it will not be 
to know how they are related one to anotht 
a personality psychologist, confronted 
task of making full assessments of individ 
sonalities, would not wish to leave out of 
arbitrarily cut or abstracted features of the 
and he would not be seriously troubled 
names given to such elements. J 
Most efforts to divide personality have been 
acterized by some attention to the prin 
structural articulation, according to WI 
theorist starts with a conception of the wht 
then divides it along lines suggested by 
Lewin (1926-1933) and Angyal (1941), f 
ample, approached the whole personality in 
the same way they approached other wholes 
the.use of a model that would hold for any 
system. Although this way of thinking, 
accord with gestalt principles or systems 
has been and is widely influential in 
ogy, it has been so only in a very gener 
it has not been a major source of stimulation 
empirical work. Both Freud (1923) and 4 
(1925-1928) proceeded in accord with 
ciple of structural articulation; they divi 
“psyche” into several major systems, 
actions among which were supposed to expres 


major subsystems—id, superego. 
fined without reference to the oth 
whole personality. Jung's major subsystem” 
persona, the personal unconscious, the col 
unconscious, the ego, the self—seem not to 
caught on with psychologists inte 


ical work, a fact that Jung did not find disappoint- 
ing. (His abstracted features of introversion and 
extroversion have, of course, often been studied 
empirically. ) Freud’s subsystems, on the other 
hand, were defined with considerable reference 
to behavior and have stimulated empirical work 
by research psychologists. No one claims to have 
measured characteristics of the superego with 
precision, but the task of doing this is not differ- 
ent from that of finding behavioral indexes of other 
hypothetical constructs, such as needs. That these 
concepts of Freud’s have persisted and still find a 
place in psychological literature seems due not 
only to their connections to behavior but also to 
the fact that psychologists, particularly those in 
practical work, sometimes have oceasion to think 
of the whole personality and still find Freud's ele- 
gant scheme useful. [See ANALYTICAL PSYCHOLOGY 
and the biographies of FREUD; June.] 

The most common practice in the search for 
elements of personality is to start with a general 
psychological theory and to transpose its basic 
features to personality: habits from stimulus— 
response theories, needs or generalized motives 
from functionalist theories, regions (areas of a 
purely spatial model) from topological psychology, 
and attitudes and traits from trait psychology. 
Although starting with the whole is not the Ameri- 
can way, theorists who use elements from general 
psychology do give some attention to these ele- 
ments’ functional roles within large structures. 
Thus, for example, functionalists who focus on the 
element of striving in human behavior consider 
that a motive is advanced by perceiving or cog 
nizing in a particular way, using a variety of tech- 
niques or devices, and forming attachments to 
particular objects (persons, ideas, things). 

í Personality and general psychology. The prac- 
tice of deriving elements of personality from gen- 
eral theories of behavior, in the belief that there 
is just one psychology, has been severely criticized 
by theorists (e.g., Allport 1937) who believe that 
there is a distinctive personality psychology and 
that units of analysis ought to be derived from a 
consideration of the nature of personality itself. 

In the 1930s it was not difficult to distinguish 
between general psychology and the psychology of 
Personality. At that time general psychology was 
mainly concerned with the generalized adult mind 
a focused largely upon laboratory studies of 
iene and perception. On the other hand, molar 
a vior (i.e., large functional units), motivation, 
ae . wide range of processes of clinical interest 
rs e left to the personality psychologists. Today it 

generally agreed that the most typical phenom- 
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ena of personality—motivation and the relations 
of motives to other phenomena, such as forget- 
ting—are just as general, just as worthy of atten- 
tion from general psychologists, as the phenomena 
of learning and perception. Behavior theorists in- 
sist that personality psychology and the general 
psychology of behavior are coextensive. They have 
been joined by gestalt psychologists, in particular 
those who follow Kurt Lewin, whose A Dynamic 
Theory of Personality (1926-1933) is wholly 
concerned with the effort to establish general laws 
governing processes in a field that contained varia- 
bles involving both the person and the psycho- 
logical environment. 

Some writers, admitting the force of the argu- 
ment for generality but wishing to save something 
for a psychology of personality, have taken the 
view that the field of personality owes its distinc- 
tiveness solely to its concern with individual dif- 
ferences and the unique organization of each indi- 
vidual. Others—the great majority of those fully 
identified with the study of personality—have 
made the claim that an adequate general psychol- 
ogy can be nothing other than a psychology of 
personality. Actually, this is a traditional view of 
the matter. Certainly Freud and Jung considered 
that personality was synonymous with “mind” or 
“psyche” and that their theories were general psy- 
chological theories. The same position was taken 
by Murray (see Explorations in Personality . . . 
1938), Goldstein (1934), and Angyal (1941) 
when, in the 1930s, they espoused the organismic 
point of view. In these older statements, as well as 
jn more contemporary ones by such writers as 
Maslow (1954), Rogers (1959), Cattell (1950), 
and von Bertalanffy (1951), there are two basic 
assumptions: (1) that since all behavior depends 
on varying processes in the person as well as on 
the situation, there can be no general psychological 
laws that do not take into account relatively en- 
during personality processes and (2) that no par- 
ticular process, such as establishing a conditioned 
reflex or perceiving another individual, can be 
fully understood apart from the context of the 
total system of the person. 

Behavior and dynamic-organismic theories. It 
is very generally agreed that there is to be, not a 
general psychology and a personality psychology, 
but one psychology. The question is, what kind of 
psychology is this to be? Here there is a funda- 
mental division between behavior theorists and a 
group of theorists who might be called dynamic- 
organismic. 

The division concerns not only differences in 
general views of what a person is like but also the 
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questions of how the search for psychological 
knowledge ought to be conducted, the nature of 
science, and what kinds of theory are best for 
psychology. 

The behavior theorist puts his faith in methods 
of precise measurement and in theories whose 
concepts are readily referred to observable events. 
He believes that the study of simple processes will 
eventually lead to the development of a set of em- 
pirical laws that will hold for, and permit the 
understanding of, more complex processes. The 
dynamic-organismic theorist doubts this. He as- 
sumes that personality processes are organized on 
different levels and that single processes are always 
in part determined by, or depend for their nature 
upon, the larger organismic patterns and purposes 
within which they have a place. Thus, for example, 
Rogers (1959) argues, and has assembled evidence 
to show, that the way to change a particular pat- 
tern of behavior, such as a refusal to concentrate 
in school, is to change the self concept; that indeed 
a change in self concept might induce various 
kinds of changes in behavior. The behavior the- 
orist (e.g., Farber 1964; Eysenck 1959) would 
argue, with the use of some supporting evidence, 
that the way to change the self concept is to change 
behavior. Rogers would probably not be upset by 
a demonstration that such an effect was achieved, 
for it is his view that the whole and its parts are 
mutually related. He would ask why the behavior 
theorist, who presumably is interested in the ex- 
planation of all behavior, does not accept the idea 
of the determination of a process by the whole in 
which it is embedded. The behavior theorist would 
admit the possibility that a particular behavioral 
event is influenced by various processes in the 
organism, but he would question how to specify 
and measure them. He would argue that until this 
becomes possible, through the slow and deliberate 
advance of empirical science, he would prefer to 
confine his work to studying the effects of external 
stimuli, venturing into the vast complexity of the 
organism's inner processes only when this can be 
done without losing precision of measurement or 
elegance of theory. 

The division is deep, pervasive, and enduring; 
and it has many implications. The concern here 
is with the way it influences the psychologist’s 
search for elements of personality. Consider, for 
example, the matter of the breadth of the analytic 
categories. One might suppose that the choice would 
depend solely upon the kind of problem to which 
the investigator was addressing himself, e.g., small 
categories if he was interested in the effects of a 
visual stimulus, larger categories if he was inter- 


ested in the effects of a college education. One 
might suppose, too, that investigators who used 
small categories would be interested in what they 
might add up to and that those who used large 
categories would be concerned to make sure that 
they lent themselves to analysis into finer units, 
Generally this is not so. Behavior theorists avoid 
large categories, which rarely seem called for in 
experimental work, and postpone the task of build- 
ing their fine and nicely specified units into larger 
structures. Dynamic—organismic theorists, typically 
concerned with long sections of behavior and large 
areas of the person’s functioning, do not hesitate 
to use large categories, such as generalized needs 
or sentiments. While they admit the need to under- 
stand how such categories are compounded of 
smaller units, these theorists seem temperamen- 
tally disposed to neglect this problem and are 
concerned instead with the ways in which a large 
category fits into a still larger one, e.g., the ego 
or the self—and with how these major systems 
relate to the whole personality. 

Again, the differing points of view described 
here are reflected in the degrees to which theorists 
use hypothetical constructs. As indicated, to speak 
of personality at all is to speak in terms of such 
constructs. And all theorists agree that it must be 
possible to connect the construct to observable 
events. The question is how close the connection 
must be. Behavior theorists introduce hypothetical 
constructs but are reluctant to enlarge or compli- 
cate them. They insist upon the most direct ties 
between the constructs and observable behavior. 
This has led some of them to regard personality 
as nothing more than an aggregate of measurable 
performances. Dynamic—organismic theorists, 00 
the other hand, interested in probing the “deepel 
or more “central” aspects of the person, often use 
constructs, such as unconscious motive, that vei 
only highly indirect ties to anything observabl A 
From present indications, it appears that theos 
of this group will continue to make free use A 
hypothetical constructs but, under pressure fro 7 
their more cautious colleagues, will pecon ee 
creasingly concerned about the verification of 3 
concepts and increasingly sophisticated in i d. 
to the methods by which this is to be accomplis a 

Not all preferences and practices in choos: 
units for the analysis of personality depend ea 
the investigator's orientation to theory; they ie 
pend also on the interests of the investiga of 
kinds of problem he investigates, and the kin me 
observations he makes. When, for ann ae 
psychologist addresses himself to practic Yaa 
lems involving people, he is likely to use catego 


that are less abstract than those which appeal to 
the experimentalist or to the devotee of elegant 
theory. Thus the psychotherapist, when faced with 
the necessity for taking action affecting a person, 
often on short notice, has to deal with the relatively 
concrete and particular without stopping to trans- 
late his thoughts into the terms of a general theo- 
retical system. Again, some students of personality 
are interested primarily in persistent differences 
among people, others in changes in the same people 
over time. The former prefer to investigate vari- 
ables, such as abilities or temperamental traits, 
that are presumably either genetically or somati- 
cally linked or else highly generalized and per- 
vasive; the latter prefer elements, such as social 
attitudes, values, and opinions, that, according to 
theory, are related to experience. 

In sum, all agree that elements of personality 
must be conceived of and their attributes or char- 
acteristics measured. However, disagreement about 
the nature of these elements has been marked 
throughout the whole history of personality psy- 
chology; and it shows no signs of ending soon, 
mainly because, being invisible, elements naturally 
are conceptualized differently by many different 
personality theorists and because the measurement 
of elements is methodologically difficult: poorly 
conceived elements cannot be exposed for what 
they are, and psychologists are too often guided 
by what can be measured by existing methods 
rather than by what is essential to theory. Disagree- 
ment has stimulated a large amount of theoretical 
work and empirical research directed to the dis- 
covery and examination of elements of personality. 
Progress will be made as behavior theorists become 
interested in studying more complex structures and 
as dynamic-organismic theorists increase their 
methodological sophistication. 


Structure and organization of personality 


All conceptions of personality embody the idea 
of organization or patterning of its elements. The 
term “structure” is commonly used, as it is here, 
to refer to all relationships within personality, 
while a distinction is made between formal rela- 
tionships and functional ones, and thus between 
formal structure and dynamic structure. 

In this section we shall attempt first to clarify 
the distinction between formal and dynamic struc- 
ture and then proceed to a consideration of some 
a a organizing principles of theorists who seek 
ROE, dynamic structure. Most such theo- 
Manne heavily upon the concept of motives, or 

ing to attain goals; i.e., to reduce the tension 
Senerated by drives, to satisfy needs, or to achieve 
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purposes. In the view of these theorists a single 
drive, need, or purpose organizes various processes 
to serve it, and the various strivings of a person 
are themselves organized, e.g., in hierarchies or in 
syndromes. In classical psychoanalytic and be- 
havioristic theories the goal of striving is reduction 
of tension. Criticisms of this conception are next 
discussed; then other approaches to organization 
are considered; and, finally, some empirical work 
on dynamic structure is briefly discussed. 

Formal structure may be exemplified by a spatial 
model for describing features or parts and their 
arrangement in relation to each other and to the 
whole. Thus Kurt Lewin often began his lectures 
on personality by drawing an ellipse on the black- 
board to represent the person (1936). The area 
outside but adjacent to this figure represented the 
environment. Another ellipse drawn within the 
first separated the perceptual-motor region, next 
to the environment, from the inner-personal region, 
which had no direct contact with the environment. 
By means of crosshatching in the inner ellipse he 
represented regions, €.g., needs or habits, of the 
inner-personal sphere. He was now in a position 
to endow these elements of personality and the 
boundaries separating them with whatever prop- 
erties seemed required by his general theory. It was 
obvious that at a given time the regions of person- 
ality stood in various purely formal relationships 
to one another and to the environment. Some 
regions were connected with many others, some 
with few; some, in the inner-personal sphere, were 
immediately adjacent to the perceptual-motor 
region and were thus peripheral, representing the 
more superficial or transitory features of the per- 
son; other regions, in different degrees central, or 
separated from the environment by intervening 
regions, represented what was deeply important 
to the person, e.g., his self or needs directly related 
to it. Boundaries of regions had varying degrees 
of permeability, represented by different degrees 
of thinness of the boundary lines. In Lewin’s think- 
ing, personality development in children involved 
mainly an increasing differentiation of regions, an 
increasingly definite demarcation between the self 
and the environment, and increasingly imperme- 
able boundaries between regions. Fawl (1963) used 
these conceptions to predict and to find that dis- 
turbances are less frequent in older children (whose 
central regions are better protected from disturbing 
stimuli) but of longer duration (because the 
greater impermeability of boundaries makes it more 
difficult for tension to dissipate). 

Dynamic structure refers to predictable constan- 
cies or regularities of functioning in the person- 
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ality. Dynamic structure concerns, for example, 
which of the regions in Lewin’s scheme act upon 
other regions, and what lawful transformations 
occur in consequence. Personality theorists have, 
in effect, filled in the regions of Lewin’s ellipse 
with various contents—needs, sentiments, object 
cathexes, habits, cognitive dispositions, and so on 
—and then have proceeded to develop theory con- 
cerning the interactions of these contents and the 
mechanisms by which effects are achieved. Such 
theory is used to explain particular formal relation- 
ships among particular contents. Lewin himself 
considered that his formal structures changed 
readily as a result of dynamic forces. For example, 
if a person is angry but exercises self-control, the 
peripheral regions of the inner-personal sphere be- 
come more widely separated from the perceptual— 
motor region, and at the same time the whole 
inner-personal region becomes more unified, i.e., 
less differentiated and more primitive. He allowed, 
however, that some formal structures persist and 
may differ from one individual to another. [See 
FIELD THEORY and the biography of Lew1n.] 

All personality theorists will agree that person- 
ality is more or less differentiated and that differ- 
entiated parts are related, but most of these the- 
orists can be drawn into a debate concerning why 
and how differentiation occurs. 

According to the traditional and widely accepted 
view of functionalism or action theory, the funda- 
mental organizing principle of personality is striv- 
ing. Goals are set by the individual in accordance 
with his motives, and such available resources as 
are necessary to the attainment of these goals are 
mobilized. At any moment of a person’s life, at 
least one motive—need, wish, drive—is active, and 
various other processes are being organized in its 
service. Striving may be induced by a physiological 
drive and may take the direction of reducing the 
tension generated by that drive; it may have the 
purpose of relieving an unpleasant psychological 
State, such as lowered self-esteem or feelings of 
guilt; it may be an effort to reinstate a condition 
previously experienced as pleasant; or it may be 
in the service of some far-flung purpose involving 
a complicated program of action. [See Drives; 
MOTIVATION. ] 

The basic model for stimulus-response (S-R) 
psychology is that of a rat running through a maze 
to get food and thereby reduce his hunger drive; 
for psychoanalysis it is the infant seeking and 
finding the breast and then sucking to get pleasure 
as well as to relieve his hunger. In other dynamic 
theories, such as those of McDougall (1923) and 
Murray (see Explorations in Personality . . . 1938), 


there is emphasis on more general needs, e.g, 
achievement or affiliation, in the service of which 
the person recalls previous successful Strivings, 
selectively perceives relevant objects and instru- 
mentalities, organizes motor activities, and experi- 
ences positive or negative affects, depending on 
the progress and ultimate fate of the striving, 
[See ACHIEVEMENT MOTIVATION; STIMULATION 
DRIVES; the biography of McDovcatt.] 

With experience, more or less durable structures 
develop around motives. Whiting and Child (1953), 
for example, discuss the “behavior system,” a set 
of habits built upon a common drive originating 
in early childhood. For Freud (1916-1917) there 
are “character structures,” e.g., oral, anal; and for 
Murray (see Explorations in Personality . . . 1938) 
there is the “need integrate,” a structure embody- 
ing a need and the images of objects and modes 
of response that have been associated with it. 
Murphy (1947) introduced the concept of “canal- 
ization” to stand for the process by which a need 
becomes focused upon a particular stimulus or 
class of stimuli, e.g., the sex drive is canalized 
toward a specific member of the opposite sex. 

In the view of most writers, personality com- 
prises numerous such motivational systems, and 
theory must deal with their interrelationships. A 
number of principles of organization have been 
introduced. There is the means—end relationship, 
or a series of such relationships, in which imme- 
diate goals are means for attaining more distant 
ones. For Murray (see Explorations in Personality 
. . . 1938) “subsidiation” is the operation of one 
need in the service of another. It is widely agreed 
among psychologists that the classical short section 
of behavior so intensively studied by experimental 
psychologists (e.g., the rat’s successful negotia- 
tion of a maze) may have a role in the promotion 
of long-range objectives. Allport (1961) a 
that a motive which developed because of ike role 
in furthering another motive may become ee 
tionally autonomous,” i.e., functionally nee 
ent of its origins. For example, a young baer A 
goes to medical school in order to please his fa a 
may find enough satisfaction in his ee: 
sustain an interest in medicine. The relations’ p 
of this interest to the original motive may on 
pear; the new interest becomes a strong nee 
its own right. [See PERSONALITY: CONTE ee 
VIEWPOINTS, article on A UNIQUE AND OPEN 
TEM.] j 

From the point of view of psychoanalysis, fi 
behavior of the young man in this example ae 
be overdetermined. If people are going to ae is 
in complex and difficult patterns of activity, 


elementary wisdom to make these behaviors serve 
a multiplicity of needs, including infantile ones. 
Murray (see Explorations in Personality . . . 1938) 
speaks of a “fusion” of needs when multiple needs 
are gratified by a single course of action. 

Various processes for resolving conflicts among 
needs have been conceptualized. Among them is 
scheduling (Murray & Kluckhohn 1948), which 
lessens or resolves conflict by permitting the at- 
tainment of as many goals as possible—one after 
the other. Again, motivational systems may be 
simply coordinated, existing side by side, as it 
were, or one may be a specification or concretiza- 
tion of another. 

A widely used conception of organization is that 
of hierarchy—of which there are different orders. 
In stimulus—response theory, for example, habits 
are said to be arranged in a hierarchy in the sense 
that a given stimulus is most likely to evoke some 
particular response out of several possible ones— 
and so on for a number of stimuli and responses. 
Murphy (1947) has stressed the importance of 
hierarchies of conditioned responses, pointing out 
that attitudes, which he sees as conditioned re- 
sponses, determine what responses may later be 
conditioned. Murray (1951) uses the concept of 
“prepotency” to refer to the attribute of a need that 
leads to its taking precedence over others in a 
hierarchy, Maslow (1954) has developed this idea 
more fully. The most prepotent needs are physio- 
logical deficiencies which must be overcome at 
least to some minimal degree before other needs 
can operate. Then, in order of prepotency, there 
are safety needs, belongingness and love needs, 
esteem needs, and self-actualizing needs. 

Freud has remained the greatest contributor to 
the dynamic psychology of personality. Various 
other conceptions of dynamic structure have been 
developed as alternatives to Freudian concepts or 
are concerned with areas of personality neglected 
by Freud. A discussion of these matters must be 
Ae by a brief examination of Freud’s major 

eas, 

Psychoanalytic principles. Freud's psychoana- 
lytic theory, postulating, among other things, the 
plasticity of motives, is still basic to contemporary 
Psychodynamic views. This theory is probably as 
elegant as any in psychology, in that it uses only 
pra simple principles to explain a wide range of 
poses and seemingly meaningless phenomena: 
hon S, Slips of the tongue, neurotic symptoms. 
ey amassed a great deal of evidence to show 
on importance of sexual (i.e, erotic, libidinal) 
Bk ee drives in the functioning of per- 

ity. These drives, typically, are directed toward 
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other people and therefore become the objectsofy, 
social control. Social control’ is possible because \ 
of the plasticity of the drives. Capable of being | 
satisfied in various ways and with the use of yagi- j 
ous objects, these drives lend themselves to beitg, 
patterned in ways that are more or less _soct 
acceptable. But this social acceptability is 
achieved without a struggle. Drives tend to 
in their original form after the child has attached 
himself to his parents and has learned to speak to 
himself as they would speak. The stage is thus set 
for inner conflict—and for the elaboration of per- 
sonality structures. There are generated highly 
complex arrangements by which the individual 
seeks to avoid anxiety or ward off external punish- 
ment while at the same time obtaining gratification 
of basic needs. 

We deal here with what is at once the most 
characteristic and the most distinctive feature of 
psychoanalytic knowledge: the idea that events 
are not always what they appear to be, that they 
may mean the opposite of what they seem to mean, 
that meaning is to be found through a search for 
the ways in which the underlying sexual and 
aggressive needs have been transformed. 

This transformation is rarely, if ever, complete, 
for infantile needs are often rendered unconscious 
—a state that favors their remaining fixed. Ideas 
or impulses to action are made unconscious as a 
means of avoiding the anxiety occasioned by a 
conflict between some sexual or aggressive drive 
(id) on the one hand and the internalized version 
of society's restraining forces (superego) on the 
other. The key conception here is that content that 
has been made unconscious is unaltered by ex- 
perience, but persists by its own laws, meanwhile 
affecting the conscious. Freud and his followers 
produced many theoretical statements explaining 
how unconscious processes are reflected in con- 
sciousness, or transformed or disguised in such a 
way as to make them compatible with conscious 
aspirations; and how the more conscious, “control- 
ling” systems of personality ward off, defend them- 
selves against, manage, or integrate unconscious 
processes; as well as explaining dreams, jokes, 
slips, and symptoms, and the persistence into adult 
life of childish ways and childish constructions 
that are capable of dominating the individual's be- 
havior in critical situations. 

Whereas id and superego processes are auto- 
matic, inflexible, and repetitive in disregard of 
consequences, other processes are likely to be 
conseious and to have the benefits of the individ- 
ual’s most highly developed abilities. To account 
for these, Freud (1923) introduced the concept of 
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ego. It makes a great deal of difference, in psycho- 
analytic theory, whether a need, with its integrated 
images and affects, is “in the ego” or under the 
sway of the id or superego. In the ego are to be 
found such structures as rational plans for action, 
sustained strivings to carry out promises or to 
uphold principles, and highly differentiated prefer- 
ences, Freud said, “Where id was there ego shall 
be,” thus implying that the direction of personality 
development is toward domination of the whole 
personality by ego processes. This state could be 
represented, according to Lewin’s scheme, as many 
well-differentiated regions with flexible boundaries 
permitting a maximum of intercommunication. [See 
PSYCHOANALYSIS, article on CLASSICAL THEORY.] 

Psychoanalytic ego psychology. Freud, during 
most of his life, considered that the ego was gen- 
erated from the conflict between the drives and 
social forces (or their internal representatives). 
He was never very clear about this formulation, 
nor have many of his psychoanalytic followers been 
altogether happy with it: the theory left psycho- 
analysis open to the criticism that it explained the 
element of fixity in personality far better than it 
explained change or development and that it was 
inadequate to account for man’s highest, most com- 
plicated, and most constructive activities. In 1939 
Hartmann argued that if psychoanalysis was to be 
a general psychology of development, it would be 
necessary to posit ego development outside “the 
sphere of conflict.” He proposed that there were 
“apparatuses of the ego,” the beginnings of abilities 
such as perception, memory, and inhibition, which 
were independent of the drives at birth and devel- 
oped under the impact of external stimuli, inde- 
pendently of determinants in the drive system (id) 
and of conflicts between id and ego (“primary 
autonomy”). More than this, Hartmann argued, 
like Allport (1937), there are ego activities which, 
although they have their beginnings in the id or 
in id—ego conflicts, become functionally independ- 
ent of their origins (“secondary autonomy.”) 

In this effort to develop a “psychoanalytic ego 
psychology” Hartmann was joined by a number of 
psychoanalysts—e.g., Kris and Loewenstein (Hart- 
mann et al. 1946), Pumpian-Mindlin (1959), and 
Gill and Brenman (1959)—and psychologists 
friendly to psychoanalysis—e.g., Rapaport (1958) 
and White (1959). This has tended to bring psy- 
choanalysis closer to academic psychology. For 
example, many phenomena which ego psychology 
now seeks to deal with have been conceptualized 
by psychologists as aspects of the self—something 
about which Freud had little to say. Thus Lecky 
(1945) argued that the individual's conception of 


himself was the nucleus of a system into which 
all the individual’s ideas were organized and with 
which new experiences had to be consistent, or at 
least unthreatening, in order to be assimilated, 
Rogers (1959) and Snygg and Combs (1949) have 
regarded the phenomenal self as the core of the 
personality organization and have empirically stud- 
ied the dimensions of the self, ways in which 
patterns of behavior are related to it, and the con- 
ditions under which it changes. Hilgard (1949) 
holds that the functioning of a wide range of moti- 
vational systems, including the Freudian “mech- 
anisms of defense,” depends upon the activities 
and states of the “inferred self”—a structure under- 
lying the self of awareness. Allport (1953) has 
hailed the development of ego psychology as a 
sign that psychoanalytic writers are at last ready 
to pay some attention to man’s higher achieve- 
ments, [See PSYCHOANALYSIS, article on EGO PSY- 
CHOLOGY; SELF CONCEPT.] 

For most psychologists, it would seem, psycho- 
analytic ego psychology is nothing more than 
psychology; it attends to matters that have always 
fallen within the general domain of psychological 
inquiry. The question, then, is whether the new 
ego psychology really builds any important bridges 
between psychoanalysis and the rest of psychology. 
There has been some agreement to seek agreement, 
and there have been many fresh observations of 
ego functioning, but a true integration of psycho- 
analysis and academic psychology is far from being 
achieved. Loevinger (1965) has cogently argued 
that rather than positing an ego that is autono 
mous from the beginning, it would be better in 
terms of theory construction if a way could be 
found to explain ego development by the same 
principles used to explain other phenomena A 
psychoanalysis. Loevinger proposes that Frew i 
idea of “mastery through reversal of voice” be t 
to supply a dynamic basis for ego evelor a 
Freud had concluded that his pleasure prin w 
was not adequate to explain such phenomena 
the repetitive play of children and the anpa 
need of people to relive traumatic Gaan y 
Underlying this compulsion to repeat must tively 
need to master problems, fundamentally by ag yt 
repeating what one had experienced pe 
as in identification with an actor who has ee è 
against one—and by reliving in the passive A 
experiences in which one had been acar 
inger argues that this principle is at the nde in 
ego's functioning and is the governing p se ulse: 
the initial separation of the ego from 7 Ta 
Thus writers on psychoanalysis continue i: evoke 
the question of whether it is necessary t° 


entirely new, nonpsychoanalytic principles, such 
as a “need for mastery” (Hendrick 1942) or “inde- 
pendent ego energies” (White 1963a), in order to 
account for ego development. For psychoanalysis 
this question is of crucial importance, for it is 
basic to the larger question of whether psycho- 
analytic theory can become a general theory of 
personality development and functioning. 

Critique of tension-reduction theory. In classi- 
cal psychoanalytic theory and in other functionalist 
theories, the general object of striving is to reduce 
tension or to restore equilibrium, As Allport (1960, 
p. 304) has written, “most current theories clearly 
regard personality as a modus operandi for restor- 
ing a steady state.” 

It should be made clear that tension reduction 
is not the same as drive reduction, in Hull's origi- 
nal meaning of this term (1943). In animal psy- 
chology there has been a great deal of discussion 
and experimentation directed to the question of 
whether all striving is in the interest of drive 
reduction. For Hull all drives were aversive. It was 
counterargued that at least some drives are appe- 
titive, ie., the striving was in the direction of 
positive affect. The matter has now been more or 
less settled in favor of the latter position, a major 
blow being struck by Olds and Milners demon- 
stration (1954) that animals exhibited responses 
suggesting that they were very much interested in 
the affects to be experienced through electrical 
stimulation of a certain area of the brain. Most 
personality theorists seem never to have doubted 
that human beings will seek experiences that have 
given them pleasure in the past, that they will go 
to great lengths to find excitement or “have fun,” 
showing signs of tension when gratification is de- 
layed or denied. This successful critique of drive 
reduction, then, does not settle the issue of tension 
reduction; criticisms of the latter, more general 
conception have been directed to the homeostatic 
Principle itself. [See NERVOUS SYSTEM, article on 
BRAIN STIMULATION; STIMULATION DRIVES.] 

One line of criticism has been directed to the 
Older tension-reduction theories that seem to have 
been based on analogies to simple physical sys- 
tems, in which the final state of equilibrium, or 
freedom from tension, was the same as that which 
existed before the disequilibrating circumstances 
arose. Nowadays it seems to be generally agreed 
on final and initial stages of equilibrium are likely 
9 be different. Organisms change in the course of 
their strivings; they grow and develop, and the 
equilibrium that they attain must somehow take 
account of their increased size and complexity. One 
might say that the individual tends to recapture his 
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earlier states of equilibrium, but in attempting to 
do this, he changes; thus, his new stable state is 
on a higher level of integration than before. [See 
HOMEOSTASIS.] 

Another line of criticism is simply that equilib- 
rium, however new or however high its level, is 
not enough; organisms go on being active after 
equilibrium has been reached. In Goldstein’s organ- 
ismic theory (1934), for example, the basic drive 
of the organism is toward “self-actualization”; it 
may use homeostatic mechanisms for this larger 
purpose, but it will maintain a level of tension ade- 
quate to enable it to initiate further activity. The 
same general idea seems to be expressed in Snygg 
and Combs’s accent on enhancement of the phe- 
nomenal field (1949), in the existentialist search 
for meaning (May et al. 1958), and in Maslow’s 
theory of growth motives (1954). Granted that a 
basic tendency of life is to grow and develop, the 
question remains of what makes growth occur. 
What are the mechanisms of developmental 
change? And what prevents development? A study 
of college students suggests that it is just as nat- 
ural for them to remain as they are as it is for 
them to enhance their phenomenal fields or seek 
new levels of order and transaction (Sanford 
1966). It does seem that they need to understand 
that they can develop, that they are not forever 
bound by their childhood conditioning or by a 
closed system of social forces. Teachers may fur- 
ther student development by finding ways to chal- 
lenge them, to upset their equilibrium and generate 
moderate degrees of tension. Favoring the teachers’ 
efforts is the possibility suggested by Goldstein and 
stressed by Murray and Kluckhohn (1948) that it 
is not a tensionless state which is most satisfying 
to the healthy organism but, rather, the process of 
reducing tensions; hence, people will see to it that 
they have tensions to reduce. [See PERSONALITY: 
CONTEMPORARY VIEWPOINTS, article on A UNIQUE 
AND OPEN SYSTEM; the biography of GoLDsTEIN.] 

Finally, it is proposed that some behavior may 
be its own aim (Maslow 1954). Organisms or per- 
sonalities not only do things in order to further 
some objective or other—to reduce tension or to 
raise it—they just do things. Murray and Kluck- 
hohn ([1948] 1953, p. 37) have used the term 
“process activity” to refer to “spontaneous, random, 
ungoverned, but yet expressive cacophonies of 
energy,” which they regard as “more elementary 
and basic than integrated goal directed activity.” 
Self-expression, as well as tension reduction, we 
are told, has to be regarded as a function of per- 
sonality. 

Critics of tension reduction have made their 
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point that there is more to life than reducing 
aversive drives, overcoming deficiencies, coping 
with anxiety, and being blindly conditioned. They 
have shown that there is more inside the person- 
ality than seems to have been dreamed of by 
classical psychoanalysis or contemporary behavior 
theory. A resourceful theorist, however, could still 
conceptualize rational, complex, or self-actualizing 
motivational systems in tension-reduction terms— 
although this might cost him considerable dis- 
equilibrium! 

A third force. To get away from the tension- 
reduction formula is, apparently, to take a more 
optimistic view of man’s nature, And so is the 
postulation of an autonomous ego, which blesses 
man with organizing powers from the beginning. 
Both of these trends in theoretical thinking express 
a new emphasis upon the higher mental processes 
and upon what is distinctively human in man. 
From this point of view man is by nature more 
organized, more organizing, destined for higher 
things than S—R psychology and classical psycho- 
analysis have supposed him to be. This view has 
been gaining ground in recent years, and its advo- 
cates sometimes speak of themselves and their 
work as a “third force” in psychology. Theorists of 
this group center attention on the experiencing 
person, oppose the use of mechanistic models of 
psychological functioning, accent meaningfulness 
rather than methodological elegance in the selec- 
tion of problems, and, in general, are concerned 
with the dignity and worth of man. In general, 
these theorists favor the dynamic-organismic ap- 
proach and oppose behavior theory; they tend to 
lump psychoanalysis and behaviorism together, as 
the two other forces. Sometimes, however, a dis- 
tinction is made between the older classical con- 
ception of psychoanalysis, which is one of these 
other forces, and psychoanalytic ego psychology, 
which, in some of its versions at least, seems to 
qualify as “third force.” 

The third force has drawn attention to areas of 
experience, behavior, and personality that were 
neglected in psychoanalysis and behaviorism, and 
thus has restored some balance to personality 
study. Some advocates show an inclination to reject 
more of behaviorism and psychoanalysis than seems 
necessary or wise. That man is a nobler creature 
than science has heretofore supposed is, of course, 
good news; but endowing man from the beginning 
with a great deal of organizing potential and with 
a tendency to natural growth sometimes seems to 
discourage intensive investigation of just how he 
becomes what he does. 


Other approaches. If the third force in per 
sonality theory accents organizing tendencies in 
the original nature of man while psychoanalysis 
and behaviorism accent what is built up through 
experience, there is another school of thought 
which says that there never is very much organi- 
zation in personality. Situationism (e.g., Cantril 
1947) and certain tendencies in field theory (e.g, 
Lewin 1926-1933), have minimized organization 
in the individual personality and have emphasized 
the social group and the culture as organizers of 
human behavior. In the face of these arguments 
the personality psychologist has usually stood his 
ground, reiterating his preference for studying the 
individual human, who can be as well defined as 
a system—an open one, to be sure—as can social 
and institutional phenomena, and whose organized 
dispositions may bring structure to the situations 
he encounters. Nevertheless, personality psycholo- 
gists have been profoundly influenced by advances 
in sociology and anthropology and are much con- 
cerned about the articulation of personality sys 
tems and social systems, about how the more or 
less stable structures of the personality are Sus: 
tained by surrounding social forces. [See SOCIAL 
INSTITUTIONS. | 

Empirical work. Factual knowledge about the 
dynamic structure of personality has derived mainly 
from clinical work—and from the intensive study 
of individuals by methods originating mainly in 
the clinic, Experimental methods have so far proved 
inadequate for dealing with the complexity of per- 
sonality, with the vast network of meanings In 
which a particular pattern of behavior is likely to 
be entwined. But psychologists, by and large, have 
clung to the view that propositions emerging from 
such “exploratory” clinical work are at best hypoth- 
eses that must be tested by laboratory methods, 
They have devoted an enormous amount of energy 
and ingenuity to this end. A 

First, there is personality research in what mighi 
be called the grand tradition. This is the ee 
started by Murray (see Explorations in Persona ty 
. .. 1938), who, more than any other psychologist, 
devoted himself to the task of devising me 
that were appropriate to an organismic coni m 
of personality. His general approach was poaa 
a variety of tests, interviews, and PoS at 
niques in the study of the same anh eee 
particular findings could be underst hod 
text of a patterned organismic system. The tia 
could reveal patterns of individual functioning 
previously brought to light, as well as gener: 
part-whole relationships. The tradition is $ 


much alive, although the work of those who repre- 
sent it today constitutes only a small part of the 
published investigations in the field of personality. 
The publication of The Study of Lives, edited by 
R. W. White (1963b) and written by 18 psychol- 
ogists who had been influenced by Murray, was 
an important event in the recent history of per- 
sonality psychology. 

Another approach to the study of structure is to 
use a variety of tests designed to measure variables 
operating on different levels of the personality, and 
then to employ correlational techniques in an effort 
to demonstrate dynamic relationships (e.g., Cattell 
1959). Related to this is the effort to delineate 
types or syndromes of personality and on this basis 
to predict behavior in particular situations. Stein 
(1963), for example, used a self-description ques- 
tionnaire to develop a system of typing Peace 
Corps volunteers and then found that the types 
based on this system were predictive of perform- 
ance in the corps; for example, young men who 
ranked high on drives for dominance and achieve- 
ment and low on playfulness and sex interest, 
unlike all other types of subjects, obtained—after 
six months in the field—higher scores than they 
had previously on a scale measuring authoritarian- 
ism. [See Factor ANALYSIS, article on PSYCHOLOGI- 
CAL APPLICATIONS; PERSONALITY MEASUREMENT; 
PRoJECTIVE METHODS; TRAITS.] 

Objective and experimental studies of psycho- 
analytic theories have been appearing in the liter- 
ature since the early 1930s. They have shown 
increasing sophistication both in respect to meth- 
odology and in respect to the derivation of testable 
hypotheses from Freudian theories. Recent exam- 
ples are a study by Hall and Van de Castle (1965) 
which found that male dreamers reported more 

“ms expressive of castration anxiety, while 
female dreamers reported more dreams expressive 
of castration wishes and penis envy, and a study 
by Zamansky (1958), who found that hospitalized 
oo with paranoid delusions spent more time 
sa at pictures of males than did hospitalized 

zophrenic males without paranoid delusions. 

© results of both these investigations are in 
hepa with Freudian theories, from which the 
a hypotheses were derived, but the investi- 
gators admit that their findings might, conceivably, 
explained in some other way. The Zamansky 

Y is representative of a great many in which 
oe is made to “diagnose” the subjects— 
pa a. the use of projective techniques—and 
trolled asis to predict behavior in some con- 
Situation, [See DREAMS; PARANOID REAC- 
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TIONS; PSYCHOANALYSIS, article on EXPERIMENTAL 
STUDIES.) 

The longitudinal study (in which the same in- 
dividuals are measured repeatedly at different 
points in their lives) is the method par excellence 
for studying stability and change in personality 
structures. Such studies have not been widespread 
in recent years, owing chiefly to their expense and 
inherent difficulty; nonetheless, quite a few have 
been reported since 1935. One important longitudi- 
nal study was carried out by Kagan and Moss 
(1962) and focused on a group of young people 
who grew to maturity between 1929 and 1958. 
This study found that many patterns of behavior 
exhibited by children between the ages of three 
and ten were predictors of theoretically related pat- 
terns of behavior in early adulthood, e.g., “involve- 
ment in intellectual mastery,” “sex role identifica- 
tion,” and “spontaneity.” From a review of several 
hundred longitudinal studies, Bloom (1964) con- 
cluded that, although personality may change at 
any age under the impact of a sufficiently powerful 
environment, change in many characteristics be- 
comes increasingly difficult with increasing age 
and that the experiences of early childhood have 
great implications for all that follows in the indi- 
vidual's life. He believes there is considerable prom- 
ise that environmental conditions favorable to the 
child's development will be identified, and made 
operative, in this century, 


Boundaries and interactions 


A complete definition of personality must deal 
not only with elements of personality and their 
organization but also with the problem of distin- 
guishing personality from phenomena that are 
closely related to it. Much theoretical work has 
been concerned with how, if at all, personality is 
to be distinguished from observable behavior, from 
the social environment, and from the physical and 
physiological systems of the organism. 

It has been assumed that elements of personality 
cannot be defined in terms of observable behavior 
because all such behaviors have determinants be- 
sides personality. To consider the boundaries sep- 
arating personality and behavior, and the inter- 
actions of the two, is to confront some important 
issues in contemporary theory, e.g., those concern- 
ing the impact on personality of changes in be- 
havior. These issues have been raised by “behavior 
therapy” and seem worthy of detailed examination. 

Next to be discussed is what Allport (1960) has 
called “the knottiest issue in contemporary social 
science”—how much to stress the separation of 
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man from the social context of his living and how 
to conceive of their interactions. 

At the end of this section we consider person- 
ality in relation to physical structure and physio- 
logical processes. 

Behavior. Some definitions of personality, such 
as those of McClelland (1951, p. 9) and Hilgard 
(1953), strive to keep behavior, rather than sys- 
tems, processes, or dispositions, in the focus of 
attention. These definitions reflect the concern of 
S-R psychology with keeping concepts and observ- 
able phenomena closely tied together, and they 
were offered at a time when S-R psychologists be- 
lieved they would find unequivocal links between 
something observable and the explanatory con- 
cepts of “intervening variables.” This quest seems 
now to have been abandoned. Today it appears to 
be almost universally agreed that a variety of ob- 
jective indicators will correspond to any intervening 
variable; thus the discovery of unique indicators 
of theoretical variables is virtually out of the 
question. 

Behavior therapy. The question of the impact 
upon personality of changes in behavior has been 
the subject of much discussion in recent years, 
owing largely to an extraordinary revival of inter- 
est in using conditioned response techniques as 
means of altering undesirable patterns of behavior. 
In 1958, Wolpe published a book entitled Psycho- 
therapy by Reciprocal Inhibition, in which he 
described the apparently successful application to 
adult neurotic patients of the techniques used in 
the 1920s by J. B. Watson and Mary Cover Jones 
in overcoming children’s specific fears. (A child’s 
fear of an animal could be overcome by repeatedly 
presenting the animal in association with pleasant 
experiences.) Wolpe’s cause was soon taken up by 
Eysenck (1959), a persistent and somewhat free- 
swinging critic of psychoanalysis, who labeled 
Wolpe’s procedure “behavior therapy” and, with 
some disregard of the usual cautions in evaluating 
research on psychotherapy, went on to make rather 
extravagant claims for the “new” therapy. In the 
years since then, there has been a great deal of 
activity on this front (Matarazzo 1965; California, 
Dept. of Mental Hygiene 1966). Various procedures 
for extinguishing maladaptive responses and for 
arousing, through rewards, desirable ones have been 
tried, so that there has now appeared what might 
almost be called a school of psychotherapy. It is 
claimed that these procedures remove symptoms 
more effectively than does any other method of 
treatment, that return of symptoms is rare, and 
that symptoms are not supplanted by others. [See 
LEARNING, article on INSTRUMENTAL LEARNING; 


MENTAL DISORDERS, TREATMENT OF, article on 
BEHAVIOR THERAPY. |] 

Eysenck considers behavior therapy a direct chal- 
lenge to psychoanalysis—both to its practice and 
to its theory. It is, indeed, a challenge to all forms 
of psychotherapy which assume that a person's 
behavioral problems have something to do with 
his subjective experience and that he ought some- 
how to participate as a conscious being in what 
happens to him in the therapeutic encounter, 
Psychotherapists of this turn of mind are extremely 
skeptical of the claims for behavior therapy. They 
admit that almost any kind of behavior can be 
changed by sufficiently powerful, or shrewdly pro- 
grammed, stimuli, but they doubt that such proceed: 
ings have no ill effects upon the person and that 
symptoms do not return in any guise. A psycho- 
analyst, for example, who after a great deal of 
hard work believed that he had discovered the 
meaning of a woman's phobia of going down steps 
(that it meant going out into the street and be 
coming a streetwalker) would probably admit that 
by heroic conditioning she could be induced to walk 
down stairs, but he would still be interested in 
how she managed her unrecognized wish to become 
a prostitute. He would be extremely skeptical of 
follow-up studies by psychologists who do not be- 
lieve in unconscious wishes, or in the possibility of 
a connection between a phobia and a desire to 
become a streetwalker. Behavior therapists could 
not be expected to hunt very diligently for other 
problematic manifestations in people whose behav- 
ior had been changed by conditioning; moreover 
since S—R psychologists use few concepts that rep- 
resent processes in the inner life of the personi 
they might be slow to recognize such changes a 
had occurred. Critics of behavior therapy also ; 
questions concerning the causes of the T ; 
changes in symptoms. One of the oldest and mos 
widely accepted propositions in the whole literature 
on psychotherapy is that almost any kind of thera: 
peutic procedure in the hands of a warmhe his 
and enthusiastic practitioner who has faith na 
method can bring about a reduction in Bee 
S-R therapists, typically, have focused on the e 
ditioning program itself, paying little attention d 
variables that enter into the relationship 
a therapist and his patient. [See CLINICAL PS 
OGY; MENTAL DISORDERS, TREATMENT OF, ae 
On PSYCHOLOGICAL TREATMENT, CLIENT-CENES $ 
COUNSELING, and GROUP PSYCHOTHERAPY; oop] 
CHOANALYSIS, article on THERAPEUTIC METH 

Fundamental to this whole argument IS ae 
sion between S-R psychologists and ona a 
organismic theorists. The former consider 


yCHOL- 


aa a T ee a 


stimulus makes its way into an organism and finds 
its target without disturbing anything in the areas 
about its course, and that an expected response 
then occurs. Dynamic-organismic theorists, on the 
other hand, think of a stimulus as being introduced 
into a system; initiating changes, however slight, 
throughout the whole; and being followed by 
numerous “responses,” most of which are only 
remotely related to the original stimulus. From the 
point of view of dynamic—organismic theory, be- 
havior therapy should be classed with a group of 
procedures, including hypnosis, Alcoholics Anony- 
mous membership, and some drug therapies, that 
induce changes in particular patterns of behavior 
without the affected person’s knowing what is hap- 
pening to him, without there being any expansion 
of his consciousness or of his capacity for further 
development. Adherents of this point of view may 
reject behavior therapy for the same reasons that 
Freud gave up hypnosis: patients did not incorpo- 
rate material uncovered by hypnosis (behavior 
changes) into their conscious selves, and Freud 
did not like to have the kind of power over people 
that hypnosis gave him. 

What the dynamic-organismic theorist can 
learn from the therapeutic activities of S-R psy- 
chologists is that people who are very ill psycho- 
logically are still responsive to the environment of 
the moment; they are responsive not only to pro- 
grams for extinguishing particular maladaptive 
Tesponses but also to programs that expand the 
response repertoire and make for awareness of 
areas of the environment that had previously been 
meaningless. This is an important antidote to those 
Oversimplified versions of Freudian theory which 
assume that everything of consequence in the per- 
Sonality is still tied to processes in the “sphere of 
conflict” and can be modified only by making the 
Unconscious conscious. A change in behavior, in- 
duced by conditioning, might indeed lead to change 
it Personality; it might, for example, lead to 
ae in the way a person was perceived by 
: s, and this might lead to change in the way 
oo conceives of himself, which might then 
as ae more Significant change in behavior, and 
ty ta n short, a single change in behavior induced 
“ee fresh stimuli to bear might set in 
TON a process of change that might in time 
R mplications for large areas of the personality. 
ae oglate; focused on overt behavior and 
Processe y convinced of the separateness of the 
rive E Tog by particular stimuli, do not 

ia E. ullest possible use of their own work. 
fites oF a reason to believe that different struc- 

person change under different condi- 
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tions and through the operation of different mech- 
anisms. The extensive activity of S—R psychologists 
could contribute in an important way to our under- 
standing of these matters, especially if these psy- 
chologists develop some curiosity about, and are 
willing to use some imagination in conceiving of, 
the structures to be changed. 

The environment. If personality tends to be 
fragmented by a psychology that focuses upon one 
or another of its part functions or substructures, 
it also seems in some danger of being swallowed up 
by social structures. There is no question that the 
individual and his social environment constitute a 
dynamic configuration or system, e.g., a family, 
whose parts are so closely related that drawing 
lines of demarcation is often very difficult. Some 
psychologists have joined sociologists in preferring 
to make configurations of this kind, rather than 
personality itself, basic units for study, e.g., for 
Sullivan (1953) there is the “interpersonal rela- 
tionship”; for Sears (1951), the unit is the “dyad.” 
This preference seems due largely to the rapid 
development of social science since the 1930s, but 
it may also owe something to the declining paro- 
chialism of American scientists: Murphy (1958) 
has stated that in Western thought there has been 
an overstress on the separation of man from his 
social context. The traditional view of personality 
psychologists, that personality is a system worthy 
of special study for its own sake, still has its de- 
fenders, e.g., Allport (1960), but there is fresh 
awareness of the importance of saying how bound- 
ary lines are to be drawn. [See INTERACTION, espe- 
cially the article on INTERACTION AND PERSON- 
ALITY.] 

Systems theory is one way of dealing with this 
boundary problem. If personality is conceived of as 
a system that is itself a component of other sys- 
tems, then the boundary around the personality 
and the boundaries of the subsystems within per- 
sonality may be conceived of in the same way. In 
general systems theory ( Miller 1955) the essential 
fact about a boundary is that any exchange of 
energy across it leads to some change in the energy 
form. Boundaries of systems are not always clear- 
cut, and they may change, in their location and 
in their permeability, under the impact of dynamic 
processes. Lewin’s distinction between the person 
and the environment was in accord with these 
principles. [See SYSTEMS ANALYSIS, especially the 
articles on PSYCHOLOGICAL SYSTEMS and SOCIAL 
SYSTEMS.] s 

Lewin proceeded in the same way in subdividing 
what he called the environment. He distinguished 
between the “psychological” and the nonpsycho- 
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logical, or “geographical,” environments, conceiving 
of the boundary separating them in the same way 
that he conceived of other boundaries, i.e., as 
regions that offered resistance to exchange of en- 
ergy between neighboring systems. 

Lewin’s conception of the psychological environ- 
ment has been a contribution of great importance. 
In the early 1930s, when Lewin’s views were just 
beginning to have an impact in the United States, 
American psychologists—except where psycho- 
analysis was influential—were largely involved in 
attempting to construct an objective S-R psychol- 
ogy and were seriously neglecting the organism. 
The psychological environment called proper atten- 
tion to the world in which the individual actually 
lives, the world of his experience, his hopes and 
aspirations, his fears—and thus it made inroads 
into the then current trend. [See FIELD THEORY 
and the biography of LEw1n.] 

In the years since then, Murray, with his concept 
of “beta press”—a stimulus situation as perceived 
by the subject (see Explorations in Personality . . . 
1938); Murphy (1947), with his biosocial theory; 
Rogers (1951), with his “phenomenology”; Kelly 
(1955), with his “personal constructs”; and psy- 
choanalysis, with its “inner world,” have assured 
the psychological environment of an adequate 
place in personality theory. Indeed, so much atten- 
tion has been paid to this environment as to raise 
the question of whether the nonpsychological en- 
vironment was not being neglected. [See MENTAL 
DISORDERS, TREATMENT OF, article on CLIENT- 
CENTERED COUNSELING; PHENOMENOLOGY; the 
biography of HussErt.] 

From the functional point of view, the psycho- 
logical environment is inside the personality. Psy- 
chologists have been interested in the relationships 
between the psychological and the nonpsycholog- 
ical environments and in the relations of each or 
both to the functioning of the person. They have 
recognized that in studying relationships involving 
the nonpsychological (geographical) environment, 
they must specify its stimuli without benefit of a 
particular subject’s perception of it. This is no easy 
matter, particularly in the case of the all-important 
social environment. One can learn a great deal 
about an individual’s psychological environment by 
the relatively comfortable procedure of listening 
to him talk about himself or by the highly respect- 
able procedure of placing him in a number of 
carefully controlled experimental situations. But 
characterizing the real environment in which he 
lives from day to day requires special methods and 
techniques—and a great deal of legwork. It is gen- 
erally recognized today, for example, that when 


personality is assessed, tests and interviews with 
the individual ought to be supplemented by visits 
to the home, interviews with the spouse, observa- 
tions of the subject on the job, and the study of 
other institutions within which he performs social 
roles. The difficulties and expense of such investi- 
gation are great. Nevertheless, some investigators 
have taken important steps in this direction. For 
example, Stern, Stein, and Bloom (1956), in their 
assessment of prospective ministers, made an effort 
to formulate the demands of the subject’s future 
situation and to obtain measures of them—eg., 
the requirements of his roles and the values and 
practices of the institution in which he was to 
work—and Freedman (1956) and Brown (1962) 
have sought to improve prediction of students’ per- 
formance and to enlarge understanding of person- 
ality development in college by specifying relevant 
features of the institutional setting. 

Studies of this kind, however, are concerned 
with improving the prediction of behavior by tak- 
ing into account a range of factors in the person- 
ality and in the social environment. They do not 
satisfy the need for understanding how the social 
system and the individual interact—how the social 
system operates within the individual at any given 
time and how the individual might have an impact 
upon the social system, or systems, within which 
he lives. 

Erikson’s concept of “ego identity” (1950) is if 
contribution here. When a person has ego identity, 
in Erikson’s meaning of the term, his sense of 
identity must be congruent with and based partly 
upon the role structure that actually exists in 50- 
ciety, and it must be confirmed in some degree by 
other people's perception and treatment of him. 
This means that a radical change in these aspects 
of the social system would change the ego identity. 
Bettelheim’s observations in a concentration camp 
(1943), showing some of the ways in which the 
ego could be broken down and some of the pie 
in which it could be preserved under the Nazi pr” 
gram for enslavement, are also relevant here; ant al 
so are Sanford’s observations on how the individ 
superego was altered as a result of the impositio 

SE These 
of a loyalty oath at a university (1966). ae 
studies strongly suggest that relatively stable— A 
might say deeply based—structures of the r 
sonality—whether or not these structures "e 
sustained by previous social stability — ma} sce 
radically altered by extreme social change. 
IDENTITY, PSYCHOSOCIAL; ROLE.] 

If changes in a social structure 
people who occupy its roles, then we $ 
it to be at least as difficult to change SU! 


change the 
hould expect 
cha struc- 


ture by deliberate plan as it is to change an indi- 
vidual person. This indeed seems to be the case, 
as any social scientist who has tried to bring about 
some mild reform in his university or department 
can testify. People vest interest in their organiza- 
tional roles, using them not only for rational ends 
but also for defensive purposes and for the grati- 
fication of unconscious needs. Thus it is that when 
changes in organizations are instituted from above, 
all sorts of seemingly irrational protests are heard. 
The point is that anyone who would change the 
role structure of an organization, say by manipu- 
lating the incentives for desired role performances, 
has to be guided by personality theory, by a con- 
ception of the interactions among the complex 
structures of the person, and not by just a few 
simple assumptions about universal human needs. 
The same consideration would seem to hold for 
social actions affecting masses of people, actions 
that so often fail in their purpose and are followed 
by various unanticipated consequences. Social anal- 
ysis can be aided by assessment of “social char- 
acter” (Fromm 1941), a socially shaped structure 
embodying deep emotional needs, that is common 
to most members of the group in question: for 
example, if the social character of group A is au- 
thoritarian, we should not expect that its prejudice 
against group B would be affected by an exchange 
program. [See Groups; ORGANIZATIONS.] 

In sum, there is a great need for more knowl- 
edge of the articulation of personality systems and 
social systems. To meet this need it is necessary 
to pay more rather than less attention to the rela- 
tively stable personality structures and to engage 
in more searching analysis of social structures in 
Psychologically relevant terms. 

Elna structure and physiological processes. 
A asons for studying the organism as a whole, 

r regarding all the biological and psychological 
Processes that take place under a person’s skin as 
eae of a single system, are at least as cogent as 
Se Susie for regarding the individual and his 
a TS as a system. The question is whether 
ible uld equate personality and organism as a 
anes oe some writers (eg. Eysenck 1953) have 
that ee regard personality as a separate system 

ae ia mn other systems of the organism. 
EA nality psychologists prefer the latter 
Pa amsatae in this field, sometimes the bodily 
treated’ eat ee the personality function, is 
Pelee 1s independent variable. The former 

rain dan is illustrated by studies of the effects of 
Bross RES on personality or by studies in which 
ons in bodily form or function are con- 
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sidered to be perceived by the individual and re- 
sponded to in accordance with his needs and 
values—as would be the case in physical handi- 
cap, for example. Research in which the person- 
ality process is treated as the independent variable 
is illustrated by studies which indicate that pro- 
longed psychological disturbances lead to changes 
in organ systems—ulcers of the stomach, for ex- 
ample, But research is most commonly directed to 
producing correlations between physical and phys- 
iological variables on the one hand and personality 
variables on the other—explanation then being 
sought in terms of complex interactions between 
the two sets of variables. This is true, for example, 
of studies stimulated by Sheldon’s classic and in- 
fluential work on physique and personality ( Shel- 
don et al. 1940; 1954). He emphasized the bio- 
logical basis of his “somatotypes” and has offered 
hypotheses to explain how the individual's bodily 
constitution helps to determine some of his char- 
acteristic behavior. Sheldon admits, however, that 
the somatotypes might change under the impact 
of nutritional or other environmental changes, and 
other psychologists have been ready with hypothe- 
ses to explain how personality processes might in- 
fluence physique. [See MENTAL DISORDERS, article 
on BIOLOGICAL ASPECTS; NERVOUS SYSTEM; Psy- 
CHOLOGY, article on CONSTITUTIONAL PSYCHOLOGY; 
PsYCHOSOMATIC ILLNESS.] 


Trends 

An anthropologist who read Sapir on person- 
ality (1934) in the Encyclopaedia of the Social 
Sciences and then went to the field, not renewing 
acquaintance with the culture of psychology until 
the mid-1960s, would not feel like a total stranger. 
He would note that Freud, Jung, and Adler, to 
whom Sapir devoted the most space, were still the 
subjects of the first chapters of Lindzey and Hall's 
standard text (1965) and that Gordon Allport and 
Gardner Murphy, upon whose work Sapir also 
leaned, were still modifying and enlarging their 
theories of personality. Noting, probably with some 
wry humor, that the behaviorists and the dynami- 
cists were still at it, he might say to himself, “This 
is where I came in.” 

Soon, however, he would note that some changes 
had been made. He would probably be pleased to 
discover that personality is now thought of less 
as a structure of reactivity, essentially fixed in the 
individual by the age of two or three—the “psychi- 
atric” definition favored by Sapir—and more as a 
structure that always interacts with, and is for a 
long time developed by, social and cultural stimuli. 
‘As he looked further into the matter, our anthro- 
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pologist would begin to be impressed by subtle 
modifications of theory, including the classic psy- 
choanalytic version, by strenuous efforts to develop 
concepts linked more closely to behavior and by 
the outpouring of empirical findings. 

‘An attempt is made below to identify some of 
the ways in which the field of personality has been 
developing and what seem to be major current 
trends. 

(1) In the 1960s the trend of the discipline, as 
of the city, is toward a disconcerting sprawl. The 
field of personality has expanded and proliferated 
in all directions, as it were, and of the many dif- 
ferent voices, none can be called dominant. The 
1960s are not a time for grand theory. The major 
theoretical systems of the period were all devel- 
oped before 1950, and most of them date from the 
1930s, or before. Most of the writers represented 
in Lindzey and Hall's text (1965) and in Sahakian’s 
Psychology of Personality (1965) are still expand- 
ing and developing their systems, but on founda- 
tions that were laid down much earlier. 

Personality theorists today seem much more 
aware of the complexity of personality than they 
were in the 1930s, when it was possible to develop 
a system around a few simple and sovereign ideas. 
There are few treatises by one man today; this is 
an age of the symposium and of the collection of 
essays on a single specialized topic. 

Specialization is a natural consequence of ex- 
pansion. The time has long since passed when a 
man could be acquainted with all of psychology; 
today he can hardly keep abreast of developments 
in personality psychology. A social scientist who 
turned to Psychological Abstracts in search of in- 
formation about personality would find little that 
he was tempted to read; an educator interested in 
the topic of learning in relation to personality de- 
velopment would find, on looking into the matter, 
that there are various theories of learning—and of 
learning particular kinds of content and of learning 
in particular situations. 

An educated layman examining the field of per- 
sonality today might be impressed by the multi- 
plicity of possibly fruitful approaches, but he 
would probably be shocked by the spectacle of so 
much research whose sole link to humanly mean- 
ingful issues is a naively misappropriated model 
of science. There are many psychologists who say 
that there is a need today for empirical work on 
hypotheses that have been accumulating over the 
years and for generating more elegant theories and 
testable hypotheses from the rather grandiose con- 
ceptualizations of the past. Without denying this, 
one can say that there is also need for integration 


of the field—for resolution of the major differences 
in philosophy, in strategies of research as well 
as in theory, that are still very much in evidence, 

(2) The controversy between behavior theorists 
and theorists of the dynamic—organismic per- 
suasion continues. This conflict seems to exhibit a 
certain periodicity in which now one side and now 
the other gains the ascendancy. The 1920s were 
a time in which the new behaviorism rose to as- 
cendancy in American psychology. The 1930s were 
a time of great ferment, in which ideas brought 
over from Europe made a great impact on Ameri- 
can psychology. Not only psychoanalysis and re- 
lated schools of thought, such as the theories of 
Adler, Fromm, and Horney, but also gestalt psy- 
chology, topological psychology, and general sys- 
tems theory, combined with native dynamic theo- 
ries (for example, that of Murray) to compose an 
effective opposition to behaviorism. By the end of 
World War 11, a dynamic—organismic approach was 
clearly dominant as far as the psychology of per- 
sonality was concerned. In the late 1940s and early 
1950s, systems theory and holistic views of per- 
sonality, and the concepts of self and ego, came 
very much to the fore in the conversation as well 
as in the research of psychologists. Shakow, taking 
stock of psychology at mid-century, could note as 
a major trend “increasing attention to molar 
studies, accompanied by a diligent search for 
methods to handle the organized complexity in- 
volved” (Shakow & Rapaport 1964, p. 197). 

In the early 1950s there was a marked change: 
Personality-oriented clinical psychology, which had 
expanded dramatically after World War 1, in re- 
sponse to practical needs and with the help of 
funds from the federal government, began to rest 
on its laurels, or to consolidate its gains, and 
havioristically oriented experimental psychology 
had its innings, moving into a position of domi- 
nance in academic departments. There followed & 
split in organized psychology, which became s0 
profound that there was talk of organizing olini 
psychology in its own department or school. Ca 
bach, in his presidential address before the Am 5 
can Psychological Association in 1957, more 4 
less formalized the division by speaking bap 
“two disciplines of scientific psychology se 
While the division was based on the P 
method of investigation— “experimental or ajor 
relational”—it expressed clearly enough the E K 
theoretical, or perhaps ideological, division W: mi 
concerned with here. Cronbach called for, oa je 
moted by positive suggestions, 2 confluence $ ae 
two disciplines, but this still does not appe: 
in sight. 
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No fresh attempts comparable to that of Dollard 
and Miller (1950) to translate Freudian ideas 
into the language of S-R reinforcement theory 
have been made. Research stimulated by the ideas 
of Lewin has fallen into a recession, his hypotheses 
peing called not “researchable” because the con- 
cepts are too remote from anything observable. 
This, however, did not prevent Lewin’s students 
from producing a stream of experimental studies 
on such topics as anger, regression, conflict, ego 
involvement, and the level of aspiration, Lewin, 
like other holistic theorists, is a victim of the times; 
his ideas, according to Hilgard (1963), remain un- 
assimilated to prevailing theories. Stagner (1965) 
has sterner words for his colleagues in clinical 
psychology, many of whom, he says, have aban- 
doned the efforts to understand the whole person 
in favor of the blindly empirical research so 
fashionable in graduate schools, Disagreeing with 
Loevinger (1965), who suggests, sympathetically, 
that research inspired by holistic theory may not 
be able to meet current standards in psychological 
experimentation, Stagner goes on to accuse clinical 
Psychologists of neglecting sound research strate- 


* gies, such as profile analysis and Stephenson's Q 


sort (1953), merely because they are cumbersome 
and do not promise a quick payoff. 

When the chips are down, psychologists still 
seem to have the will to stick together and to re- 
main, for practical purposes, members of one disci- 
pline. (Only one department actually gave up 
clinical training, a decision it soon reversed.) 
But American psychologists do not seem to feel 
any great pressure to integrate their theories. 
Curiosity about people, and the demands of prac- 
tice, will probably lead to another resurgence of 
dynamic-organismic psychology, but this will not 
necessarily bring a confluence with S—R psychology; 
and the new interest in “behavior therapy” sug- 
gests that a polarization has occurred and may be 
continued. 

(3) There is a greatly increased awareness of, 
a. Concern with, social determinants of person- 
od development, and increased concern with 

teractions of personality structures and the con- 
temporary social environment. This change has 
ae developments in anthropology and soci- 
peed it will be generally agreed that virtu- 
he l the distinguishable features of personality 
ed with features of the cultural or the 
sent environment of the individuals remote or 

ent past. Indeed, one might well wonder if bio- 
factors have not been too much downgraded 

EA partea The new accent on the social is re- 

ed in the appearance of new personality varia- 
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bles, e.g., “role dispositions,” “interpersonal reac- 
tion systems,” “social values,” in the personality 
assessor’s scheme of things. There are some indica- 
tions that the collaboration between personality 
psychology and the established social sciences is a 
two-way street. For several decades anthropologists 
have made much use of personality theory, par- 
ticularly Freudian theory, and there have been 
signs of increasing awareness on the part of his- 
torians and other specialists in the humanities of 
their need for an understanding of human person- 
ality (see, e.g., Cohn 1957). Sociology, concerned 
with being a pure science, with empirical laws of 
group functioning, still tends to be somewhat skit- 
tish about psychology. Nevertheless, the over-all 
picture suggests that the prospects for a genuine 
articulation of personality theory and social theory 
are good. 

(4) Recent years have seen rising interest in 
cognitive variables—cognitive structures, ideolo- 
gies, belief systems. In academic psychology, cog- 
nitive structures are seen not only as expressions 
of the individual's striving but also as motives in 
their own right, or as functions that might develop 
independently of motives. In psychoanalysis, as 
we have seen, the new ego psychology places vari- 
ous cognitive functions in the ego, which is con- 
sidered to have its own independent origins and 
course of development. The trend seems to be in 
keeping with a larger current emphasis on the dis- 
tinctively human (as opposed to the animal) in 
man. We may expect the current interest in cog- 
nitive variables to be continued, without any neg- 
lect of dynamic elements such as needs or impulses. 
Despite some effective criticism of tension-reduction 
formulas, the concept of striving remains the 
major organizing principle in personality theory, 
and dynamic elements have remained the most 
generally favored personality variables. [See ATTI- 
TUDES; COGNITIVE THEORY; IDEOLOGY; THINKING, 
article on COGNITIVE ORGANIZATION AND PROC- 
ESSES.] 

(5) Freudian psychoanalytic theories gained 
greatly in acceptability after 1935 and are still in 
relatively good standing. Although few of Freud’s 
physicalistic analogies have survived, save among 
slavish adherents within what is called official psy- 
choanalysis, the essential and distinctively pyscho- 
analytic ideas of the plasticity of motives and of 
the dynamic unconscious seem to have been gen- 
erally accepted, long since, by everyone in the 
intellectual community except some S-R psy- 
chologists. 

Psychoanalytic concepts do not excite the inter- 
est today that they did in the late 1940s. At that 
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time arrangements were being made for psycholo- 
gists to be psychoanalyzed, psychoanalysts were 
finding places in university departments, and psy- 
choanalytic research was being supported. In ex- 
pressing disappointment that the promise of those 
years has not been realized, some sympathetic 
observers have suggested that this is evidence of 
weakness in the theory. But these experiments 
were cut short by the reaction in the early 1950s 
against holistic theories: only a handful of psy- 
chologists were psychoanalyzed, grants for research 
went to psychoanalysts who were not well trained 
in methods, beachheads in university departments 
were not extended. In any event, the objective 
study of psychoanalytic concepts and theories con- 
tinues, psychoanalytically oriented texts in person- 
ality (e.g., Sarnoff 1962) are in use in university 
departments, and excellent theoretical work is be- 
ing done today by psychologists friendly to psycho- 
analysis. Holt (1962), Klein (1956), Loevinger 
(1966), Madison (1961), and Tomkins (1962—) 
take some vague Freudian formulation and make 
it the object of clarifying systematic treatment. 


NeEvitt SANFORD 


(Directly related to the field of personality are the 
entries IDENTITY, PSYCHOSOCIAL; NATIONAL CHAR- 
ACTER; PERSONALITY, POLITICAL; SELF CONCEPT. 
Other material relevant to personality may be 
found in ANALYTICAL PSYCHOLOGY; ANXIETY; ATTI- 
TUDES; CONFLICT, article on PSYCHOLOGICAL AS- 
PECTS; CULTURE AND PERSONALITY; Drives; EMo- 
TION; FIELD THEORY; GESTALT THEORY; GROUPS; 
INDIVIDUAL DIFFERENCES; INTERACTION, article on 
INTERACTION AND PERSONALITY; MOTIVATION; PER- 
SONALITY: CONTEMPORARY VIEWPOINTS; PSYCHO- 
ANALYSIS; ROLE; SOCIAL PSYCHOLOGY; SYSTEMS 
ANALYSIS, article on PSYCHOLOGICAL SYSTEMS; 
Traits; and in the biographies of ADLER; ANGYAL; 
GOLDSTEIN; Horney; LEWIN; RANK; SULLIVAN.] 
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PERSONALITY DEVELOPMENT 


The phrase “personality development” refers to 
a description and theoretical understanding of the 
establishment of those stable response dispositions 
that differentiate adult humans. A closed and un- 
equivocal definition of the behaviors that belong to 
the domain of personality is not possible, for the 
set of responses that bear the label personality has 
always been open to revision. 


The most popular definition regards personality 
as the hierarchical organization of all dispositions 
of which the individual is capable and the relation- 
ships between these response dispositions (Lazarus 
1961). In practice, however, a disposition that is 
regarded as a personality variable usually possesses 
three characteristics: it displays stability over time, 
generality across situations, and considerable inter- 
individual variability in either frequency of occur- 
rence or intensity. The tendency to use the article 
“the” before nouns shows both stability and gener- 
ality, but it is not regarded as a personality dis- 
position because there is minimal] interindividual 
variability in the occurrence of this behavior. Since 
many response tendencies meet these three critical 
requirements, new behaviors are continually being 
added to the already large corpus of personality 
traits. Fortunately, current theories and methods 
have honed this number to one of reasonable mag- 
nitude. The specific personality variables selected 
for the most intensive study between the 1930s 
and 1960s have been determined by the emphases 
of psychoanalytic theory (S. Freud 1932), social 
learning theory (Rotter 1954; Bandura 1962), 
factor analytic procedures (Cattell 1950), and pub- 
lic concern with behaviors that disrupt the individ- 
ual and society. 

Psychoanalytic and social learning theories have 
called attention to childhood experiences that lead 
to the acquisition of particular motives (e.g., ere 
ality, hostility, dependency, social recognition, 
power); sources of anxiety (e.g., social rejection, 
aggressive behavior, sexual behavior and thoughts); 
defensive reactions that reduce anxiety; expectan- 
cies; and standards. In order to make inferences 
about these constructs, empirical studies spawn 
by the psychoanalytic orientation make use of data 
derived from overt behavior, interview material, 
and verbal interpretations of ambiguous stimuli. 

The special emphases of social learning theory 
have been on the acquisition of motives, vapi 
expectancies, and behaviors through the ar 
social reward and punishment, as well as the chi A 
tendency to adopt and practice the a 
desirable models. Investigations derived from A 
orientation have used observations of benard 
controlled experimental situations aS the me 
of choice. i 

The factor analytic strategy has relied, a 

i s ani 
measure, upon answers to questionnaire: ae 
been less concerned with developmental aaa 
The answers that people give on o 
constitute a special kind of behavior, an nerged. 
quently, a different set of variables ie particu- 
Factor analytic investigations have bee: 
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larly concerned with modes of interpersonal be- 
havior (such as introversion—extraversion ); degree 
of introspective awareness of diverse affects, both 
dysphoric and positive; and, finally, the degree of 
ideational concern with the traditional motives of 
dependency, aggression, and sexuality. [See Traits.] 

Psychiatric and social concerns with forms of 
overt pathology have substantially influenced the 
selection of personality variables, as evidenced by 
the use of such labels as psychopathy, neuroticism, 
depression, hypochondriasis, delusional trends, shal- 
low affect, and schizophrenic withdrawal. 

The most recent work in personality research 
has selected responses that have traditionally been 
considered cognitive in nature, such as distribution 
of attention, speed of decision in difficult problem 
situations, performance in perceptual discrimina- 
tion tasks, and preferred type of conceptual cate- 
gory used in classifying diverse stimuli. Investiga- 
tors on this new frontier insist that these variables 
are legitimately in the personality domain because 
they display substantial interindividual variability, 
are stable over time and place, and are theoreti- 
cally and empirically linked to motives, anxieties, 
and defenses [Gardner et al. 1959; see ATTENTION; 
CONCEPT FORMATION; LEARNING, article on DIS- 
CRIMINATION LEARNING; PROBLEM SOLVING; REA- 
SONING AND LOGIC; THINKING.] 

Most of the variables mentioned above describe 
adolescent and adult functioning, and a theory of 
personality development attempts to understand 
the acquisition of the final pattern of interrelation- 
ships among these variables. Since there is no 
unanimity on the variables that have primary sali- 
ence for adult functioning, the developmental the- 
orist must first decide which constructs and be- 
haviors deserve primary emphasis and must then 
trace the conditions of acquisition of the adult per- 
Sonality structure. 


Practical implications 


r There are two major implications that derive 
rom increased knowledge of the psychological de- 
velopment of the child: early diagnosis of nascent 
Psychopathology and a scientific basis for advice 
to parents and educators. 
en diagnosis and prophylaxis. Most profes- 
dist a Epe involved in therapeutic activities with 
ae ed children acknowledge that many of the 
ie $ forms of psychopathology are refractory to 
ene treatment. It is difficult to modify 
id i avior of the adolescent who displays chron- 
eae sere! serious academic retardation, Or 
© fa renic symptoms. However, knowledge of 
Specific behaviors in the six-year-old reliably 
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predict these adolescent syndromes would allow 
rational intervention at a time when the child is 
more receptive to psychological change. For ex- 
ample, if social scientists understood the mosaic 
of sociological, intellectual, and temperamental 
characteristics in six-year-olds that predict delin- 
quency at age 15, social agencies could initiate 
special preventive or therapeutic tactics with the 
young children who possess these prognostic signs. 
Much of development is a cryptograph in which 
early childhood characteristics, such as level of 
activity, ease of anger arousal, capacity for sus- 
tained attention, intensity of affect, and nature of 
affiliative tie to adults, bear complex and disguised 
relationships to later behaviors. Temper tantrums 
in a two-year-old do not necessarily predict ease of 
anger arousal or frequency of aggression in ado- 
lescence, but tantrums may be predictive of other 
meaningful dispositions in early adulthood. 

Advice to social agents. Many parents look to 
the social sciences for guidance on the type of 
discipline to be used in socialization, when and 
how to initiate toilet training, the differential ef- 
fectiveness of breast feeding versus bottle feeding, 
degree of restrictiveness or permissiveness to be 
imposed on the child’s activity, and related prob- 
lems. Parents may change their child-rearing prac- 
tices as a result of formal and informal contact 
with public sources of information. At present, the 
scientific bases for recommending changes in pa- 
rental behavior are usually minimal, and increased 
knowledge about human development might pro- 
vide a sounder rationale for counseling. 


Major theoretical orientation 


Four classes of constructs make up the working 
vocabulary and basic assumptions to be presented. 
These constructs refer to motives, standards, sources 
of affect (including sources of anxiety), and in- 
strumental responses. Moreover, the processes to 
which these constructs refer must be studied in 
relation to the critical developmental periods dur- 
ing which they are being established. 

1. Motives and their characteristics. A motive 
is a wish for a specific class of events. The desired 
events can be external or intrapsychic (i.e., cogni- 
tions or affects). Motives wax and wane in inten- 
sity, for the acquisition of a goal appropriate to 
a motive temporarily attenuates the intensity of 
the wish. Motives are activated by recognition 
of a discrepancy between a goal state the indi- 
vidual desires and his evaluation of the degree to 
which he possesses the desired goal. The major 
motives developing during the first dozen years in- 


“clude wishes for: physical affection; instrumental 
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aid; the belief that one is valued by someone im- 
portant; perception of pain, unhappiness, or anxi- 
ety in others; the domination of others; genital 
stimulation; similarity to a desirable model; the 
determination of one’s own actions and freedom 
from coercive control by others; the reduction of 
anxiety; instrumental competence at selected tasks; 
and the acquisition and maintenance of behaviors 
that are in maximal congruence with previously 
established standards [see Drives; MOTIVATION]. 

2. Standards. A standard combines a cogni- 
tive representation of a response, belief, or affect 
with an evaluation of its appropriateness. Individ- 
uals attempt to develop and maintain a belief sys- 
tem about the self that is maximally congruent 
with those attributes that the person has learned 
to regard as “good” and to renounce and avoid dis- 
play of attributes regarded as “bad.” The bases for 
the evaluation of good or bad are culturally derived; 
for Western society, they emphasize the belief that 
one is valued by others, the possession of instru- 
mental skills to deal with selected problems, the 
possession of attributes that the primary reference 
group judges to be appropriate for one’s biological 
sex, the inhibition of socially prohibited behaviors 
and obsessional preoccupation with prohibited 
thoughts, and the possession of rational and co- 
herent thought processes. The details of how such 
standards are learned are not clear, but it is likely 
that continued repetition of a standard by a presti- 
gious and/or nurturant person may produce belief 
in its validity. Parents’ continual repetition of the 
warning “Don’t cry” to their son may eventually 
have a permanent impact on the child’s evaluation 
of crying [see ATTITUDES; NoRMs; VALUES]. 

Recognition of discrepancies from a standard 
constitutes a motivational condition. Thus, there 
are two major classes of motives in the child: 
(a) desire for a specific class of external events, 
such as genital stimulation or physical affection, 
and (b) desire for increased congruence with a 
standard. 

3. Affects. The concept of affect, or emotion, 
has been difficult to define and nearly impossible to 
measure. A differentiation should be made, more- 
over, between the excited states of the infant who 
does not use symbols and the emotional states of 
the older child who possesses language. What are 
popularly conceived of as affects (e.g., depression, 
anxiety, grief, anger, joy, excitement) involve the 
labeling of a complex stimulus pattern composed 
of perceived physiological changes, cognitive ele- 
ments, and the external context in which the in- 
dividual finds himself. A one-year-old infant can- 
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not experience disgust or guilt, for he has not had 
the necessary experiences or acquired the appro- 
priate language labels. We can have as many 
emotions as there are words to label combinations 
of internal physiological states, cognitions, and 
external contexts. 

Internal states. The primary component of an 
affect is a perceived change in the quality and the 
intensity of internal stimulation from muscle and 
viscera. Specific affects are linked, in part, to dif- 
fering intensities of internal stimulation, as differ- 
ent sounds are tuned to different loci on the basilar 
membrane. The intensity dimension is usually 
called level of arousal or level of activation, and it 
is generally assumed that the reticular formation 
and related structures in the centra] nervous sys- 
tem play a major role in determining arousal level 
[see EMOTION]. 

Grief, sadness, depression, apathy, and lassitude 
are characterized by a level of internal stimulation 
that is lower than the individual’s usual level— 
sometimes called his adaptation level. Excitement, 
rage, joy, fear, and anxiety are characterized by a 
perception of increased intensity compared with 
the individual's adaptation level. Absolute level of 
reticular discharge or autonomic reactivity is not 
to be regarded as a faithful index of the degree to 
which a person is experiencing an affect. An affect 
is more likely to be experienced when the person 
recognizes a discrepancy in level of intensity from 
what he regards as “normal.” s 

Labeling. A second component of an affect is 
the language label that the individual applies to 
his internal state. The label chosen is determined 
by (a) the direction of the discrepancy noticed 
(i.e., higher or lower intensity), (b) the content 
of his images and thoughts at the time the change 
in internal stimulation is perceived, and (c) the 
immediate contextual situation. Thus, perception 
of a lower level of internal stimulation combine 
with thoughts about one’s absent sweetheart ar 
in a lonely railway station is likely to be ka 
“depression” or “loneliness.” The same pattern 0 
internal stimuli when combined with though 
about the hard day at the office while driving hom 
late at night is likely to be labeled “fatigue. an 
the behaviors the individual may initiate are 6° 
erned by the affect label he applies, rather E 
the level of internal stimulation [see LANGUAGE 
article on LANGUAGE DEVELOPMENT]. that 

Schachter and Singer (1962) demonstrated ade 
following an injection of epinephrine, W ich ane 
to perceived changes in arousal level, nee 3 
affect-related behaviors the individual display! 
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were dependent on his immediate situation and the 
thoughts elicited by that situation. Thus, a person 
may regard himself as feeling angry, euphoric, or 
sick, depending on the situational context and his 
related thoughts. It is presumed that the child 
learns specific affect labels as a result of informal 
tutoring by the social environment. The parent 
sees the child laughing and remarks, “You certainly 
are happy today.” Or she tells her six-year-old who 
is stamping his feet in protest, “Do not show your 
anger to your mother.” 

Pleasure and displeasure. From a developmen- 
tal perspective, it appears that the infant less than 
six months old experiences two major arousal 
levels that are the result of events adults regard 
as characteristic of displeasure or pleasure. Dis- 
pleasure is characterized either by increased levels 
of stimulation in situations marked by pain, hun- 
ger, cold, and other states of physical discomfort 
or by sudden changes in external stimulation. The 
child’s prepotent reactions to these situations are 
crying and motor discharge. 

Pleasure is normally characterized by a percep- 
tion of lowered intensity of internal stimulation 
and a context in which pain or discomfort is being 
reduced, hunger or thirst is being gratified, or cer- 
tain forms of tactile stimulation (i.e., stroking) are 
being applied, The behavioral reactions to these 
situations are smiling and motor relaxation. 

Happiness, excitement, and fear. During the 
second half of the first year the simple relation be- 
tween pleasure and displeasure and level of arousal 
disappears, Experiences that adults regard as pleas- 
ant can be associated with increased arousal. The 
child laughs and thrashes when he is tickled, spoken 
to, or surprised. Adults say the child is happy or 
excited. Further, the behavioral signs resembling 
fear occur when the child anticipates a painful ex- 
perience, a perceptual expectancy is violated (e.g., 
expects to see his mother and a stranger enters), 
or he is separated from his primary caretaker. Each 
of these reactions requires some prior learning. 

h Rage. A third development during the latter 
ee He first year is the emergence of behavior 
ens ts call rage.” Rage is characterized by an 
eii e in level of internal stimulation and situa- 
5 ap Toa the infant loses a goal object, desires 
thas object he can perceive but cannot reach, or 
a aay restrained. The behavioral reactions 
tien T and motor discharge, but the crying is 
in ifferent in form and intensity from the cry- 

8 of pain or fear. 
age to cues. During the second year se- 

ousal states and motor patterns become 


609 


firmly conditioned to specific cues. If the child 
watches his mother leave the room, he may cry, 
especially if the setting is not familiar to him. 
Facial and vocal games with adults (e.g., peek-a- 
boo) often elicit laughing and thrashing. Standard 
forms of parental displeasure (e.g., shouting at the 
child) may have become capable of provoking cry- 
ing and signs of fear. Thus, behaviors displayed 
during the second year resemble the affect-related 
responses the child will later learn to call pleasure, 
fear, anxiety, anger, and excitement. The emer- 
gence of language at two years of age marks an 
important change in the development of the af- 
fects. Now the child becomes capable of interpret- 
ing contexts with greater accuracy and refinement, 
and he learns to apply affect words to his perceived 
arousal states. During the next three to four years 
he will learn to apply such words as mad, happy, 
sad, ashamed, bad, good, afraid, and sick to combi- 
nations of changes in arousal and situational con- 
texts. His overt behaviors will be a function, in 
large measure, of the specific affect words he em- 
ploys. 

Anxiety. The capacity to experience the affect 
of anxiety is acquired as a reaction to cues that 
combine internal stimulation, cognitions, and spe- 
cific environmental events. Anxiety can be defined 
as a state of unpleasant feeling characterized usu- 
ally, but not always, by a perception of change in 
afferent feedback from the stomach, heart, skin 
surface, or muscles as a result of increased motility 
of gastrointestinal tract, increased heart rate, pal- 
mar sweating, shivering, or flushing. The cognitive 
component of anxiety involves the anticipation of 
an unwanted or unpleasant event. In some cases, 
but not all, there is a clear cognitive representation 
of the undesired event. The sources of anxiety 
derive their names from the nature of the specific 
unpleasant events that are anticipated. During the 
first two years, prior to the dominance of language, 
the child shows behavioral signs of anxiety to (a) 
sudden changes in stimulation, (b) violations of 
a perceptual expectancy (Hebb 1946), and (c) sep- 
aration from a nurturant caretaker (Bowlby 1960). 
During the preschool and early school years the 
major sources of anxiety are (a) anticipation of 
the loss of nurturance or affection, (b) anticipa- 
tion of some physical harm, and (c) recognition 
of a lack of congruence between a previously ac- 
quired standard and evaluation of a current belief 
or behavior. It is possible that different combina- 
tions of these three basic anxieties make up the 
unpleasant affects that appear later in development, 
such as helplessness, guilt, shame, and conflict- 
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induced anxiety. Despite more than half a century 
of intensive empirical effort and thoughtful con- 
cern, there is no general agreement on a valid 
method of measuring the universal human experi- 
ence of anxiety [see ANXIETY]. 

4. Instrumental behaviors. The constructs of 
motive, standard, and affect are to be differentiated 
from behaviors that are instrumental in gratifying 
motives, maintaining standards, or reducing anxi- 
ety. Two major classes of behaviors are relevant. 

Goals and standards. Responses related to 
achievement of goals and maintenance of stand- 
ards are the means by which an individual attempts 
to gratify his wishes and increase or maintain 
congruence with his standards. Pleas for affection, 
stealing, and studying until midnight for high 
grades are instrumental responses aimed at ob- 
taining desired goals or maintaining standards. 
The same act can be the result of distinctly dif- 
ferent antecedent conditions. A request for help 
can gratify a desire for instrumental aid from a 
specific agent, the desire to display behavior that 
is congruent with a feminine sex-role standard, or 
the wish to elicit social attention. 

Defenses. Sources of anxiety generate their own 
reactions, and the term “defense” refers to those 
responses whose intent is to reduce and control 
anxiety. Anna Freud’s classic essay on defense 
mechanisms (1936) emphasized the defenses of 
repression, denial, projection, and reaction forma- 
tion, It is useful, however, to consider a broader 
domain of defensive responses. The overt reaction 
of withdrawal from an anxiety-arousing situation 
is a common defense in the three-year-old. The 
four-year-old who hides his face in his mother’s lap 
when a stranger enters provides a graphic illustra- 
tion of the defense of withdrawal. Refusal to go to 
school the morning of a test or running from the 
nursery room after an embarrassing act are other 
common instances of withdrawal during the pre- 
school and early school years [see DEFENSE MECHA- 
NISMS]. 

The cognitive analogue of behavioral withdrawal 
is the attempt to remove the anxiety-arousing 
thought by the mechanisms of repression and de- 
nial. When guilt or shame is the basis for anxiety, 
projection is often the defense of choice, The as- 
sertions “I didn’t do it; Johnny did” and “I failed 
because the teacher doesn’t like me” are popular 
representatives of this class of defense. 

Substitution of goal objects, a common defense 
among children as well as adults, is elicited when 
a desirable goal is unattainable because of practical 
exigencies or anxiety-based inhibitions. For ex- 
ample, the boy who is incompetent at athletics and 


feels alienated from his peers is likely to select a 
substitute activity to prove his competence and 
maintain congruence with his standards of mascu- 
linity. Finally, the defense of intellectual analysis, 
the attempt to mitigate anxiety through intellectual 
understanding of the anxiety, begins to occur in 
the child of school age but does not ordinarily be- 
come a strong habit until adolescence. 


Critical periods 

The term “critical period” is descriptive rather 
than explanatory and refers to the fact that certain 
environmental events have maximal influence on 
developing physiological processes or behaviors dur- 
ing a specific time period but an attenuated or 
negligible effect at some earlier or later time. The 
critical period is the era during which the effect of 
the environmental experience is maximal, For ex- 
ample, translocation of primordial ectoderm, meso- 
derm, or endoderm tissue from a fresh zygote will 
have one effect during the first few hours following 
fertilization and a different effect 48 hours later. 
Castration of a male rat during the first five post- 
natal days is associated with frequent occurrence 
of lordosis and related female postural mating pat- 
terns at adulthood, Castration after the fifth day 
has a minimal influence on this class of mating 
reactions (Young et al. 1964). When the term 
“critical period” is applied to psychological proces- 
ses, the outcome or dependent variable is a partic: 
ular response system, such as a class of overt acts, 
affects, standards, motives, or defenses [see SEX- 
UAL BEHAVIOR, articles on ANIMAL SEXUAL BEHAV- 
IOR and SEXUAL DEVIATION: PSYCHOLOGICAL AS- 
PECTS]. 

The concept of critical period implies that ea 
of the major response processes normally begins 
its growth during a different time period and that 
the response process is most vulnerable to mod- 
cation by environmental events during 4 partion 
period of development. A graphic illusi 
psychological development might consist of a serie 
of ogives, with each curve representing the grow 
of a separate response system. The ordinate ie 
reflect the strength of the response; the abaci i 
the age of the child; and the slope of the RE 
rate at which the response was being establis! E 
It is possible, although still unproved, that A 
critical period corresponds to the time wae 
slope of the response acquisition curve 1 grea! A 
Bloom has suggested that available data supp! 
this assertion (1964, p. 194). If onee K ie 
assumption, then to postulate that a critica ne? 
for the development of guilt is the period baie ihe 
ages three and eight is to state that this 1 


at each 
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period during which the rate of acquisition of the 
capacity for guilt is maximal, and modifications in 
the environment during this five-year period should 
have the greatest effect (facilitating or debilitat- 
ing) on the mature form of guilt responses. 

The remainder of this discussion attempts to 
delineate the major critical periods in the first ten 
years, the salient responses associated with each 
period, and the environmental events that appear 
to be most influential in shaping the final form 
of the response. 

1. Infancy—birth to eighteen months. During 
the first 18 months most infants learn the reward 
value of a human caretaker and learn to anticipate 
nurturance from a social agent in time of pain or 
distress. The primitive bases for the experience of 
anxiety, including separation from a caretaker, 
sudden change in stimulation, and confrontation 
with a surprise stimulus, are also acquired during 
this period. Some of the objective behavioral signs 
that signal the rate of acquisition of these response 
systems are the social smile during the first half 
year of life, the display of crying and signs of 
anxiety to strangers during the second half year, 
the manifestation of separation anxiety during the 
first half of the second year, and a general interest 
in people as evidenced by laughing, visual follow- 
ing, reciprocal play, and vocalizations. The envi- 
ronmental conditions that appear to be most rele- 
vant for the acquisition of these responses are the 
degree of regularity of the administration of nur- 
turance to the child, amount and frequency of 
physical contact between the child and caretaker, 
reciprocal vocalizations, and reciprocal play be- 
tween the child and caretaker. According to con- 
temporary theory, the mother who is nurturant ac- 
quires reward value for the child (Sears et al. 
1957). The infant begins to approach her when he 
requires gratification. Moreover, the learned ap- 
Proach responses to the mother will generalize to 
other social agents. When the mother is not nur- 
tee frustrates the child, or causes him pain, 
be As associates her with displeasure and may 
“i is respond to her with withdrawal and avoid- 
nae A bservations of the smiling response indicate 
Aunt infant who experiences a nurturant rela- 
ae m with his mother smiles more frequently 
child n ; infant who has a non-nurturant mother— 
ae din ationship (Provence & Lipton 1962). Re- 
ia g the infant’s smile by picking him up or 
ae him leads to more frequent smiling and 
1956. S responsiveness to other people (Rheingold 
oe aren 1958). Anxiety evoked by stran- 

ana to be less frequently observed among 

institutions than among infants raised 
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in families with nurturant adult caretakers (Pro- 
vence & Lipton 1962). Infants reared in impersonal 
or unstimulating environments are reported to be 
more quiet and passive and show less positive af- 
fect than children living in a normal family milieu 
(Provence & Lipton 1962). Many institutionally 
reared children improve in mental alertness and 
emotional display when they are moved to envi- 
ronments where they receive more nurturance and 
reciprocal stimulation. It should be noted that the 
striking pathological effects noted above are evi- 
denced in children who are markedly deprived of 
maternal warmth during the first year. The conse- 
quences of lesser degrees of deprivation are not 
yet known. Moreover, the specific aspects of ma- 
ternal deprivation that are most effective in pro- 
ducing these pathological behaviors have not been 
isolated. 

Biologically based differences. Neonates and 
young infants vary widely in many response tend- 
encies that appear to be determined either by 
heredity or by prenatal or perinatal conditions (Es- 
calona & Heider 1959). There are obvious and 
consistent neonatal differences in attentiveness to 
stimuli, in vigor and frequency of spontaneous 
activity, and in autonomic nervous system func- 
tioning both at rest and under emotionally arous- 
ing conditions. These characteristics may influence 
the parents’ perceptions of the infant and, conse- 
quently, how the parents respond to, and handle, 
the infant. At present, it is not possible to deter- 
mine how stable these response tendencies are, 
and our understanding of their role in personality 
development is far from complete. 

2. Eighteen months to three years. The onset 
of the second critical period is heralded by three 
major developments: marked improvement in loco- 
motor ability, improvement in comprehension and 
communication of language, and the first imposi- 
tion of socialization demands by the parents. So- 
cialization is the process by which the individual 
acquires those behavior patterns, standards, and 
wishes that are viewed as appropriate for his own 
sex, family, socioeconomic, religious, ethnic, and 
social groups. With improved motor coordination 
and the emergence of the new skills of walking and 
talking, the two-year-old is motivated to explore his 
environment and to test his new abilities. He ac- 
quires several critical responses during this period 
of unrestrained locomotion. He is learning that he 
is capable of attaining desirable goals and over- 
coming frustrations, and he is practicing the re- 
sponses that accomplish these ends. In effect, he is 
learning that he can be instrumentally effective 
with both words and actions. During this period 
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most mothers begin to impose constraints upon the 
child. The two areas that are first subjected to 
socialization are bowel and bladder control and 
unlimited exploration. In both instances, the child 
is required to inhibit a complex, strongly motivated 
response that has previously had considerable re- 
inforcement. 

Toilet training. Since intense anxiety may ac- 
company toilet training, severe training proce- 
dures during the 6-month to 12-month period that 
the training is practiced with the greatest vigor 
may have adverse psychological consequences, in- 
cluding (a) production of hostility toward and fear 
of the training agent, usually the mother; (b) 
anxiety about sexual thoughts and behavior, as 
well as about the genitals; (c) anxiety about dirt 
and disarray; (d) acquisition of the self-labels 
“dirty” and “bad”; and (e) inhibition of sponta- 
neous expression of novel behaviors. Much of the 
research on the effects of varied socialization prac- 
tices is methodologically deficient, but in general 
the data suggest that mothers who start bowel 
training later require less time to train the child 
than those who start earlier (Sears et al. 1957). 
Moreover, maternal attitudes toward toilet training 
are not independent of their attitudes toward other 
behaviors to be socialized. Mothers who press for 
early toilet training tend to be strict in their de- 
mands in the areas of table manners, orderliness 
and control of noise, school performance, obedi- 
ence, and inhibition of aggression (Sears et al. 
1957 ). Clinical studies indicate that children whose 
bowel training is initiated prematurely, completed 
too early, or accomplished with coercive methods 
tend to react with such symptoms as enuresis, de- 
fiance, and overconcern with personal cleanliness 
(Huschka 1942; 1943), 

3. Preschool years—ages three to six. The 
preschool years are marked by increased differen- 
tiation of behavior systems, attenuation of the 
mother’s influence, and increased significance of 
interactions with the father and siblings. The ma- 
jor responses established during this period are 
identification with the parents, recognition of 
standards for sex-typed behaviors and an emergent 
awareness of one’s sex role, development of the 
motivation to master motor and conceptual tasks, 
capacity for guilt, appearance of defenses against 
anxiety and guilt, establishment of early forms of 
dependent and aggressive behavior, and habitual 
manifestations of an active or passive orientation 
toward peers, 

Identification and sex role. Many of the child's 
behavior patterns can be understood as generaliza- 
tions of responses that were directly rewarded and 
learned in the home, But other characteristics are 
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apparently acquired, without direct teaching or 
reward, through identification with others. The 
child is said to be identified with the parent when 
he feels he is similar to that parent (sometimes 
called the model). Moreover, he usually attempts 
to increase his feelings of similarity by adopting 
new characteristics (i.e., attitudes and behaviors) 
of the model (Kagan 1958). Since the parents are 
generally faithful representatives of their cultural 
group, the child may acquire the attitudes and be- 
haviors appropriate to his cultural and social class 
groups by identifying with them. The child’s identi- 
fication with the parent of the same sex leads to 
appropriate sex typing. Research findings suggest 
that appropriate sex typing is facilitated by warm 
relationships with the parent of the same sex 
(Mussen & Distler 1959). For example, boys of 
kindergarten age who are more masculine portray 
their fathers in a doll-play situation as nurturant. 
[See IMITATION.] 

The development of the standards that are re- 
garded as constituting the child’s conscience also 
appears to be facilitated by nurturant parent-child 
relations. Maternal warmth is positively correlated 
with strength of the preschool child's conscience, 
and girls with accepting mothers develop stronger 
consciences than the girls with rejecting mothers 
(Sears et al. 1957). Moreover, the use of love- 
oriented disciplinary techniques (e.g, giving or 
withholding of love as reward or punishment) is 
effective in fostering a strong conscience, but only 
in children whose mothers had been warm and 
affectionate during the opening years of life (Sen 
et al. 1957). The development of standards is ap- 
parently the consequent, in part, of identification 
and the child’s anxiety over possible loss of love 
from a nurturant parent. 

Guilt, shame, fear, and conscience. The early 
emergence of guilt can be seen during hee 
school years. The appearance of signs of guilt mM d 
cates that standards have been established an 
that the child is able to recognize a discrepancy 
between some aspect of himself and those EA 
ards. Recent research suggests that the timing y 
punishment, parental explanation of the pany 
the violation and the reason for the punish 
and the reward value of the social agent ae ds 
punishing are all critical in establishing Gey 
in the child (Sears et al. 1957; Aronfreed Ih. 
The order in which various standards are a ae 
lished during the preschool years iS not saad 
in any detail, but it is likely that the first He 
ards are related to toilet habits. Which sa cif 
are established next will depend on the ee per- 
experiences the child encounters. Standar > 
taining to anger, interpersonal aggression, 
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tion, excessive dependence, crying, masturbation, 
and sexual exhibitionism are often established in 
close temporal contiguity during the preschool 
years. Standards dealing with instrumental compe- 
tence at tasks emerge a little later in most children, 
usually by five to six years of age. At that age the 
child may cry if he loses a game or scatter parts 
of a puzzle he is unable to solve. 

Symptomatology in the preschool child. The 
major symptoms that occur during these years in- 
clude phobias, temper tantrums, mutism, soiling 
or enuresis, extreme aggression to peers, head- 
aches, vomiting, and constipation. 

4, Early school years—ages six to ten. Chil- 
dren in Western cultures change dramatically in 
their overt behavior and in the form and quality 
of their cognitive products between six and ten 
years of age. A popular explanation attributes this 
change to the varied experiences associated with 
school attendance. However, many cultures insti- 
tute formal changes in responsibilities and expecta- 
tions between the sixth and eighth year of life. 
Perhaps this means that there is a general recogni- 
tion that fundamental changes transpire during 
these years which make this an appropriate period 
to institutionalize mastery of instrumental skills. 
For the child in Western culture, the critical re- 
sponses that are established during this time in- 
clude (a) learning attitudes surrounding intellec- 
tual mastery (i.e., anticipation of success or failure, 
Standards of performance, desire to master intel- 
lectual skills, relationships to adult teachers); (b) 
Practicing an active or passive social orientation 
with peers; (c) crystallization of attitudes toward 
the self, with the immediate peer group as the 
primary reference; (d) establishment of anxiety 
and guilt in association with aggression, sexuality, 
and dependency; (e) the establishment of prefer- 
ences for particular defenses against anxiety; (f) 
Practicing the behaviors and attitudes that define 
Sex-role standards; and (g) the development of 
Standards regarding rational thought and behavior. 
ae of the peer group. The peer group 
thou A many responses that were acquired 
tie is ; oe reinforcement or through identifica- 
E p home, and it implements the child’s 
alec a into the broader social world. The group 
child n a therapeutic function by reassuring the 
ana eich share his conflicts about anger 
e. ity. The child’s peers also reinforce ap- 
with a Sex typing by accepting boys and girls 
ing e sex-typed interests and Pe 

chaviors o display inappropriate sex-type 
E E of the school. The school situation 

the young child with a new adult to whom 
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he must conform and whose acceptance he is en- 
couraged to court. Middle-class boys whose mothers 
are protective and affectionate and urge school 
achievement tend to obtain the best grades among 
boys in school. The girls who perform well in 
school, however, have mothers who are not overly 
protective and encourage their daughters toward 
independent behavior (Kagan & Moss 1962). It 
appears that the child’s desire to master academic 
skills arises largely from his desire to maintain a 
nurturant tie to parents who encourage this form 
of behavior and identification with parents who are 
effective models for intellectual mastery. 

Dominance versus submission. The tendency 
to adopt initially a passive or active dominant 
approach to age mates is remarkably stable from 
age ten through young adulthood. especially for 
females (Kagan & Moss 1962). Some children 
characteristically initiate communications with 
peers, suggest activities for the group, and resist 
pressure to conform to the demands of others. The 
passive child is quiet with peers and typically fol- 
lows the suggestions of others. There is some evi- 
dence to support the notion that restrictive mothers 
promote the learning of passivity with peers, 
whereas permissive parents promote a more active, 
dominant interpersonal attitude. 

The effect of siblings. In general, the sex of 
the sibling is an important determinant of the 
degree of adoption of sex-typed behaviors. Girls 
with older brothers are more masculine in their 
behavior than girls with older sisters. Similarly, 
boys with older sisters are less masculine than boys 
with older brothers (Koch 1956). 

Symptoms of the school-age child. During this 
period the complete range of adultlike symptoms 
can appear—psychosomatic symptoms (such as 
ulcerative colitis and skin disorders), hysteria, 
nightmares, inhibition of effort to master instru- 
mental skills, undisguised aggression, excessive 
social anxiety, tics, obsessions, rituals, and halluci- 
nations [see MENTAL DISORDERS, article on CHILD- 
HOOD MENTAL DISORDERS]. 


Maternal dimensions 

Investigators have been continually concerned 
with discovering the particular features of parental 
behavior that have the most potent influence on 
the child, The choice of the maternal variables 
investigated is based not only on the age of the 
children studied but also on the investigator's the- 
oretical orientation. The basic data on parental 
behavior are derived from parental interviews and 
observations of parent-child interactions by trained 
observers. Recent reviews of the vast body of re- 
search on parental behavior suggest that psychol- 
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ogists have used two major bipolar reference di- 
mensions to describe parental behavior: love versus 
hostility and autonomy versus control (Schaefer 
1959). The love—hostility dimension appears to be 
relatively stable during the first ten years of life, 
whereas the autonomy-—control dimension is con- 
siderably less stable. 


This article has considered the time of emer- 
gence and mode of establishment of four primary 
response systems: motives, standards, affects and 
sources of anxiety, and instrumental behaviors and 
defenses. We have the beginning of a picture of 
the psychological growth of the typical child, but 
there is considerably less information on the mech- 
anisms of that growth and how environmental 
events influence response organization. This essay 
is more descriptive than theoretical and closely 
resembles the descriptive schemes for intellectual 
development proposed by Piaget and his colleagues. 
Theories of cognitive and behavioral development 
lack the tight net of interlocking theoretical propo- 
sitions that permit satisfying explanations. The 
constructs of identification, anxiety over loss of 
love, motives, and standards assume a heavy bur- 
den of explanation, but psychology does not possess 
sensitive methods of assessing these variables. 

Two conclusions seem reasonably correct. First, 
the behavioral profile of the ten-year-old is a mod- 
erately good predictor of the child’s behavior a 
decade hence. Second, the timing of environmental 
rewards and punishments and the actions and 
attributes of the child’s models during the first 
decade are critical influences on the behavior he 
will manifest during adolescence and adulthood. 


JEROME KAGAN AND PAUL H. MussEN 


[Directly related are the entries DEVELOPMENTAL PSY- 
CHOLOGY; IDENTITY, PSYCHOSOCIAL; SELF CONCEPT. 
Other relevant material may be found in ADOLES- 
CENCE; INFANCY; INTELLECTUAL DEVELOPMENT; 
MORAL DEVELOPMENT; PSYCHOANALYSIS; SOCIALI- 
ZATION. ] 
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PERSONALITY: CONTEMPORARY 
VIEWPOINTS 


1, A UNIQUE AND OPEN SYSTEM Gordon W. Allport 


1. COMPONENTS OF AN EVOLVING 
PERSONOLOGICAL SYSTEM 


Henry A. Murray 
I 
A UNIQUE AND OPEN SYSTEM 


My approach to personality theory has been 
called both humanistic and personalistic. It is hu- 
manistic in the sense that it demands full recogni- 
tion of all aspects of man’s being, including his 
Potential for becoming more than he is. It is per- 
Sonalistic in the sense that its goal is to understand 
and predict the development of the concrete, indi- 
vidual person. The approach has. likewise been 
labeled individual or trait psychology and some- 
ie a psychology of becoming. I prefer to regard 

as an eclecticism based upon the conception of 
pea as a unique and open system (Allport 

An open system is one in which constant intake 
E output of energy occur, as do homeostatic 

ension-reduction ) processes. While all theories of 
a allow for these two criteria of open 
di the present view proposes two additional 
fis ae Progressive internal organization ereD 
eet SA creative transaction with the environ- 
ae. x 60a). [See SysTEMS ANALYSIS, articles 

ERAL SYSTEMS THEORY and PSYCHOLOGICAL 
SYSTEMS, ] 
a most major theories, my view is derived 
Ei clinical experience with disordered per- 
Silvan (as are the views of Freud, Jung, Fromm, 
ey » etc.) but from the traditions of academic 
ology. There is an affinity between my posi- 


tion and the theories of Wilhelm Stern, Alfred 
Adler, Abraham Maslow, Carl R. Rogers, Henry A. 
Murray, and the contemporary existentialists. 
Sometimes these views as a group have been 
labeled the “third force” in psychology (the first 
two being psychoanalysis and stimulus—response 
behaviorism ). 

Internationalism. Influenced by historical de- 
velopments in psychology in continental Europe 
(especially the schools of act, gestalt, and person- 
alistic psychology), I have tried to relate these 
movements to Anglo-American empiricism, whose 
methods I prefer; but I insist that the European 
view of the person as a self-active (rather than 
merely reactive) being is sounder than the exagger- 
ated environmentalism and associationism that 
mark American theories of personality. 

Some American critics complain that I lean 
toward German romanticism. Some Europeans, on 
the other hand, resent my “Americanization” of 
their thought. Broadly speaking, however, this work 
is welcomed as a bridge between psychological cul- 
tures, and honorary memberships in psychological 
societies of Great Britain, Germany, France, Aus- 
tria, and Italy attest to this fact. 

Presuppositions. While my approach is aca- 
demic, historical, international, and eclectic, it 
rests also upon the conviction that science as yet 
lacks an adequate image of the nature of man. For 
this reason no doors of method or theory should be 
closed. It is premature and unseemly for psychol- 
ogists to present their partial and incomplete the- 
ories as if these theories were the final word. 
Because of this conviction my writing is often 
polemic in tone, for example in my attacks upon 
reinforcement theory of learning, projective tech- 
niques, animal models, and common actuarial 
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assumptions—insofar as such particularistic doc- 
trines seem to be overextended in current the- 
orizing. 

Psychoanalysis, with its heavy emphasis on the 
unconscious and on ego defenses, contributes much 
to our understanding of certain human qualities 
and occasionally provides the best theoretical fit 
for a given personality; but in general its biologism 
and reductionism are unsafe guides to follow. Most 
human conflicts are well configurated in conscious- 
ness and for this reason phenomenological report 
is important. “If you want to know what a person 
is like, ask him.” 

In place of particularism and reductionism, | 
advocate a systematic pluralism or systematic eclec- 
ticism, which regards human personality as an 
open system whose potentiality limits are not 
known. Because any given personality is always in 
the process of “becoming” (1955), the science of 
personality should leave doors open for expansion. 
Since, as a rule, it is purposivists who leave doors 
open and mechanists who close them, I am aligned 
with the former, although I do not exclude any 
valid evidence from any school of thought. I would 
allow for both the automatic and for the autono- 
mous in human nature. 

All theories of personality are nothing more than 
empirically based philosophies of man, Philosophies 
of materialism, egotism, hedonism, and positivism 
are too narrow to accommodate all the evidence. 
More adequate are philosophies of process, growth, 
and personhood. 

The unique and the general. This background 
accounts for the emphasis upon the study of the 
individual case in preference to the contrived aver- 
age case. While most writers pay lip service to 
uniqueness, they do not in fact allow for it in their 
dimensional methods or in their assertions con- 
cerning the laws (or tendencies) of mind in 
general. Because of the manifest uniqueness of 
genotype, environment, and experience, the prob- 
lem is not to justify the study of individual pattern 
but rather to justify our abstract and approximate 
nomothesis. We can do so by observing that people 
have similar (although never identical) physiologi- 
cal needs, comparable learning capacities, and cul- 
turally similar environments. Hence they may 
develop comparable interests and traits. On this 
basis we are justified in measuring common traits 
within a population. But while we engage in this 
familiar practice of differential psychology we 
should remember that the primary challenge is to 
invent morphogenic methods for the reliable and 
valid study of personal dispositions and individual 
life styles. Just as morphogenic biology lags behind 
molecular biology, so too does morphogenic psy- 


chology lag behind molecular ( dimensional) 
psychology (1961, chapter 1). 

The morphogenic (sometimes called idiographic) 
point of view holds that the single life itself should 
serve as the base line for studies of a person's 
course of development and also that an individual _ 
life is the proper matrix in which to search for the 
structure and functioning of personal motives and 
dispositions. I do not deny that the usual dimen- 
sional (nomothetic) approaches have their uses, 
but I argue for increased study of individual pat ` 
terns of growth. 

Research. Morphogenic methods are not easy to 
invent. Some possibilities lie in personal docu- 
ments, such as diaries, autobiographies, and let 
ters (Allport 1942; Masterson 1946). The widely — 
used Allport-Vernon-Lindzey test (see A Study of 
Values 1931), is partly morphogenic, partly di- 
mensional. One of my papers (Allport 1962) lists — 
a wide variety of techniques that I consider wholly 
or partly morphogenic, among them methods such J 
as matching, self-anchoring scales, and personal 
structure analysis, Eventually such methods should 
prove to be scientifically more acceptable than the 
prevalent dimensional (actuarial) methods, since 
they should enhance powers of prediction, under A 
standing, and control, the three goals of science 
(1940). 

As important as such methods are, they are not 
to be used exclusively. The bulk of my empirical 
investigations follow the more traditional method 
of seeking uniformities in such areas as children's 
imagery, expressive movement, social attitudes, 
common traits, social perception, rumor, prejudice, 
and the religious sentiment. These and other te 
search contributions are listed in the extensive 
bibliography accompanying Personality and Social 
Encounter (1960b). a 

An early and wholly characteristic field of te 
search is reported in Studies in Expressive xo 
ment (Allport & Vernon 1933), in which quanti- 
tative measures are made of handwriting, drawing, 
gait, gesture, and posture. Correlations of these 
measures disclose certain common traits of move 
ment, such as expansiveness (versus constricte 
ness), centrifugality (versus centripetality), i 
forcefulness (versus light pressure). In oe 
individuals are consistent in the degree to ae 
their movement manifests such characteristics 4m 
even when an individual does not follow the 
mon statistical trend, one can usually detect @ ie 
gruence in his personal movement pattem: they 
while most people are consistent in the force - 
exert in holding a pen in their fingers and in ye 
ing it to paper, an artist may grasp the pen pro- 
but apply it lightly (a reflection of his own 
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fessional style). In this example we note the dual 
emphasis upon both dimensional and morphogenic 
interpretations. 

Social psychology. My investigations bridge the 
topic of personality with the field of social psy- 
chology. I define the latter discipline as “the at- 
tempt to understand and explain how the thought, 
feeling, and behavior of individuals are influenced 
by the actual, imagined, or implied presence of other 
human beings (1954a, p. 5). In this definition the 
focus is upon the individual rather than upon soci- 
etal structures or social processes. The contention 
that the concept of “role” adequately covers the 
functioning of personality in the social system is 
rejected. Nor is there acceptance of a definition of 
personality stated merely in terms of its effect on 
other people. For me personality “is the dynamic 
organization within the individual of those psycho- 
physical systems that determine his characteristic 
behavior and thought” (1961, p. 28). Thus the field 
of personality study is related to, but not identical 
with, the field of social psychology. 

While The Nature of Prejudice (1954b) endeav- 
ors to give due weight to societal factors, it is the 
attitudes, traits, and character structure of persons 
that receive major attention. Cultural forces are 
ineffective unless they are embodied in the habits, 
attitudes, and motives of individual men. 


Structure of personality 
My main interest is in the organization of per- 
sonality. This fact is responsible for the somewhat 
misleading statement that I advocate a trait psy- 
chology. I define a trait as “a neuropsychic struc- 
ture having the capacity to render many stimuli 
functionally equivalent, and to initiate and guide 
equivalent (meaningfully consistent) forms of 
adaptive and expressive behavior” (1961, p. 347). 
An attitude is regarded as a special class of traits, 
pang an approach or avoidance tendency 
oward a specifiable object or class of objects. 
Common traits and attitudes, therefore, are those 
aspects of personality in respect to which most peo- 
ple within a given culture can be profitably com- 
rae Although I have done a considerable amount 
of work in this conventional area of differential 
Eeychiology, this should not obscure my special and 
re original emphasis on “personal dispositions,” 
or individual traits” (1937; 1961), conceived as 
oe and unique neuropsychic units that guide, 
rect, and motivate specific acts of adjustment. 
ae common traits are useful as coarse and ap- 
of ieee dimensions for convenient comparison 
eon sane the ultimate units we seek are per- 
ie ‘spositions—the actual, organized foci of 
individuaľ’s life. While objective methods are 


preferable in determining these dispositions, sub- 
jective experience and self-report are not to be 
denied their place. 

I am inclined to believe that a relatively few 
(perhaps six to ten) central dispositions can nor- 
mally account for the congruence and stability 
found in personal conduct (1960b, chapter 7). In 
a few lives one “cardinal” disposition seems to 
dominate (Hitler's mania for power, the compas- 
sion of St. Francis, Albert Schweitzer’s reverence 
for life). Most lives also have “secondary” disposi- 
tions, which reflect minor foci of interest or stylis- 
tic qualities of behavior (1961, chapter 15). 

Since even a cardinal disposition never accounts 
for all of behavior, the full unification of person- 
ality is never attained. Somewhat after the manner 
of Carl G. Jung, I assume that the nearest ap- 
proach to unity consists in the never realized striv- 
ing for unity. 

Proactivity. Most empirical psychology relies 
on fragmentary and short-run experiments; and 
clinical psychology and psychiatry rely for their 
data chiefly on anamnesis and backward tracing of 
lives. Hence, central motives, thrusting toward the 
future, are often lost from view. To trace motives 
into the past (to discover Oedipal conflicts or the 
history of rewards and punishments) is contrary 
to the structure of most lives, which strain toward 
the future. Hence much current theorizing allows 
only scantily for goals, plans, intentions, ideals, 
and hopes that mark the course of becoming. Val- 
ues (defined as “meanings perceived as related to 
the self”) should be considered a chief ingredient 
in personality structure. Here I am clearly aligned 
with the third force in psychology. 

Motivation. Traditional theories of motivation 
postulate instincts, impulses, and needs, all of 
which press toward gratification. Equilibrium, ten- 
sion reduction, or homeostasis, is said to be the 
goal of all activity. This is the position of most 
theories of personality, systems which I consider 
“quasi-closed.” Open-system theories insist that men 
are normally not content with restful equilibrium. 
Homeostasis, when achieved, merely guarantees, 
as Walter B. Cannon has said, that man is then 
free for the “priceless nonessentials” of life. 

Even the most basic biological drives are indi- 
vidually patterned; food, drink, sex, and sleep in- 
volve countless individual tastes. And beyond drives 
there are innumerable psychogenic interests that 
defy uniform classification. One should not, there- 
fore, postulate a basic list of motives, as many 
theories do. 

Instead I propose that all structural dispositions 
are in some degree motivational, in that they all 
“cause” behavior. Those dispositions involving 
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greater momentary tension (including drives with 
their respective canalized tastes) have a prepotent 
saliency. Dispositions that we call interests, ambi- 
tions, and values—all involving long-range inten- 
tion—are also persistently dynamic but ordinarily 
nonsalient. In addition there are stylistic disposi- 
tions (e.g., the tendency to act politely) that are 
directive in conduct rather than dynamic or driv- 
ing. Stylistic dispositions that reflect qualities of 
temperament are best studied through expressive 
movements (vide supra). 

Functional autonomy. To allow for the con- 
tinual growth of motives, I have introduced the 
concept of “functional autonomy.” It refers to “any 
acquired system of motivation in which the ten- 
sions involved are not of the same kind as the 
antecedent tensions from which the acquired sys- 
tem developed” (1961, p. 229). The existence of 
functional autonomy has been acknowledged by 
various authors in various ways, e.g., Franz Bren- 
tano, Edward C. Tolman, Wilhelm Stern, and 
Robert S. Woodworth. 

The principle of functional autonomy comprises 
the following points: (1) The character of motives 
alters so radically from infancy to maturity that 
we may regard adult motives as supplanting the 
motives of infancy; this view allows for the variety 
of goals and interests in adult personality better 
than any doctrine of fixed instinct or conditioned 
drive. (2) Even biological drives (breathing, hun- 
ger, sex) take on an individuality of patterning. 
(3) The “go” of a motive is always contemporary, 
and therefore tracing back to Oedipal conflicts, 
identifications, or childhood conditioning (except 
in rare neurotic instances) does not provide an 
explanation for current motivation. (4) Ontoge- 
netic emergence (here postulated) does not con- 
tradict continuity in personality development. 

One should distinguish perseverative from pro- 
priate functional autonomy. The former is due to 
reverberatory (feedback) mechanisms in the nerv- 
ous system. The latter is based on the dynamic 
properties of the self-image and on propriate 
striving. 

Except in certain neurotic conditions, a person 
is not tied to earlier motives, whether these are 
regarded as conditioned, channeled, or sublimated. 
Rather, the essence of his nature is to master and 
extract meaning from his environment and to ori- 
ent himself toward the future. Propriate functional 
autonomy is possible because the energy potential 
possessed by an individual is in excess of his sur- 
vival needs, drives, and the demands of immediate 
adjustment. 

In short, functional autonomy is “a way of stat- 


ing that men’s motives change and grow in the 
course of life because it is the nature of man that 
they should do so. Only theorists wedded to a re- 
active, homeostatic, quasi-closed model of man find 
difficulty in agreeing” (1961, pp. 252-253), It 
should be noted that post-Freudian ego psychology 
has modified psychoanalytic theory in the direction 
of functional autonomy (Heinz Hartmann, Erich 
Fromm, David Rapaport, Karen Horney). Ego psy- 
chology no longer looks for motivation exclusively 
in the unconscious, nor does it regard adult pur- 
poses as mere cathexes or sublimations of the in- 
stinctive impulses of sex and aggression. 

Functional autonomy is a declaration of inde- 
pendence for the normal personality. As a model 
it allows for authentic maturity and growth, Cri- 
teria for the mature personality are extension of 
the sense of self, warm relating of self to others, 
emotional security, realistic perception, self-objecti- 
fication (insight and humor), and a unifying 
philosophy of life (1961, chapter 12). 

The proprium. The term “proprium” is used to 
avoid the historic ambiguities of self or ego. Pro- 
prium is defined as the self-as-known—that which 
is experienced as warm and central, as of impor- 
tance. Ego involved (propriate) states are measur- 
ably different from states that are merely task 
involved, i.e., peripheral to the proprium (1943). 
Propriate functions include body sense, self-iden- 
tity, self-esteem, self-extension, rational coping, 
self-image, and long-range propriate striving. 

These propriate functions (which must not be 
attributed to a separate self or agent) develop 
gradually, beginning at the end of the first year of 
life. Early sensorimotor behavior is nonpropriate, 
as is much of the opportunistic functioning of later 
life. Most of the investigations conducted in gen 
eral and experimental psychology deal with non- 
propriate behavior. For this reason many gi 
logical principles of learning, motivation, ani 
cognitive operation are not fully adequate tga 
psychology of the propriate aspects of pereo 
For example, theories of learning place too aa 
emphasis upon the concept of reward and punis 3 
ment and neglect the effect on learning of 
congruence of an act with the self-image. al 

In the normal course of development, Ger 
sanctions give way to internal directives. A a 
who first seeks the teacher's approval may ae 
by searching for knowledge for its own sake ( E 
tional autonomy). To take a further a. ; 
childhood conscience is at first a disposition 
on prohibition, fear, and parental ean ae janes 
time, however, this primitive “must cone 
gives way to an adult “ought” conscience, whic! 
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based not on fear of sanctions but upon a generic 
image of one’s preferred life style. Thus the mature 
sense of moral obligation reaches far beyond the 
childhood superego. And so too with all complex 
sentiments. They are not mere channelings of early 
formations but are selective, critical, and often 
adventurous ways of handling the onrush of 
existence. 

The interweaving of the propriate functions per- 
forms the task of unifying personality (insofar as 
it is unified) in much the same way as William 
McDougall’s master sentiment of “self-regard.” 


As an academician and eclectic I respect and 
utilize contributions from the many particularistic 
approaches. It is, however, a persistent demand for 
adequacy of outlook that leads me to emphasize 
what I regard as neglected hypotheses and interpre- 
tations. My historical orientation is reflected in my 
sympathetic biographical interpretations of the 
work of William James, John Dewey, Wilhelm 
Stern, and Kurt Lewin—writers with whom I feel 
intellectual kinship. While I do not claim to have 
achieved a final eclecticism, I believe that the open- 
system view of personality holds hope for such a 
development (Allport 1964). An open system, as 
Opposed to a quasi-closed system, allows for an 
increase of internal organization over time and for 
transactional commerce with the environment be- 
yond the level of mere reactivity (1960a). 


Gorpon W. ALLPORT 


[Other relevant material may be found in FIELD 
THEORY; GESTALT THEORY; PERSONALITY; Traits.} 
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Il 
COMPONENTS OF AN EVOLVING 
PERSONOLOGICAL SYSTEM 


Without doubt, the ground for my inclusion 
among contemporary “personality theorists” in this 
encyclopedia as well as in four recent surveys of 
the field (Hall & Lindzey 1957; Wepman & Heine 
1963: Bischof 1964; Sahakian 1965) is the array 
of concepts and conceptual relations that were pre- 
sented in Explorations in Personality (1938), the 
product of a collaboration for which I have received 
in print more credit than I can reasonably claim. 
Since that time, the conceptual scheme that we had 
designed (for a restricted purpose) has undergone, 
needless to say, countless expansions, revisions, 
and reconstructions on its way to the still embry- 
onic and evolving personological system (PS) to 
which I currently subscribe. I welcome the oppor- 
tunity to report this much—a bare minimum—in 
order to accent my belief that today is no time for 
theoretical fixations. On the contrary, in my case 
an itch for comprehensiveness has resulted in such 
a multiplication of idenes, or basic gene-like ideas 
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—stolen goods for the most part—that at this junc- 
ture I find myself utterly incapable of packing them 
coherently into an essay of this length. The present 
assignment, then, came as a Procrustean bed which 
called for major surgery, regardless of the prospect 
that a successful series of amputations, or theori- 
ectomies, would be pretty sure to kill the patient 
(this PS), The most radical theoriectomy per- 
formed for this occasion involved the excision of 
that large, widely propagated portion of the Freud- 
ian system which is concerned with psychopatho- 
logical phenomena and their determinants. What 
were left to represent and understand after this 
huge excision were the more venturesome, progres- 
sive, ambitious, proudful, affectionate, joyful, self- 
actualizing, and creative potentialities of human 
nature, as well as of some of its higher-level, social 
and cultural, adult activities. In short, the task here 
is to set forth something which might serve as a 
health-oriented extension of, and complement to, 
the illness-oriented Freudian system, Further reduc- 
tions of my corpus of ideas have been accomplished 
by extracting a number of biological, physiological, 
sociological, evolutionary, and general systems con- 
cepts and theories, by deleting expositions and jus- 
tifications of ideas that are already in the literature, 
and by condensing nearly everything that needed to 
be said into the briefest sentences, without the nu- 
merous qualifications that accuracy demands. And 
SO, as a consequence of so much surgery, I am 
afraid that the standing of this present assemblage 
of mentations must depend on whatever imagina- 
tive supports the reader is prepared to proffer. 


A theoretical conception 


Definition. A personality at any designated 
moment of its history (in middle life, for example) 
is the then-existing brain-located, imperceptible and 
problematical hierarchical constitution of an indi- 
vidual’s entire complex stock of interrelated sub- 
stance-dependent and structure-dependent psycho- 
logical properties (elementary, associational, and 
organizational). Each elementary property is a dif- 
ferentiated (selectively focused), situationally ori- 
ented disposition (readiness) and capacity (power) 
to participate as a process in conjunction with other 
Processes (each in its own way) in a variety of 
functional exercises or endeavors which will pre- 
sumptively enhance that individual’s feeling of 
well-being in his world. The vast majority of these 
selectively focused dispositions and capacities— 
for example, a multiplicity of more or less energiz- 
ing interests, valuations, Passions, wants, and 
needs; of more or less adequate imaginal or con- 
ceptual representations of objects, persons, events, 
or abstract ideas; of beliefs, judgments, decisions, 


or plans of action; and of more or less competent 
organizations of subsidiary mental, verbal, and 
motor processes, that is, special skills—will be 
dormant (unactivated) at any given time. But a 
few of them will be operating momentarily in one 
of the internal or external situational proceedings 
(temporal units of activity), the ongoing proces. 
sions of which constitute the daily waking life of 
that particular individual. 

Psychological properties. The properties and 
processes of personality are those that are capable 
of becoming conscious (experientially discrimi- 
nated) in the natural course of events or when 
existing resistances are overcome. In short, this PS 
conforms to the principle that a scientist should 
adhere to a thoroughly self-consistent terminology, 
one that is suitable to the conceptual analysis and 
reconstruction of the class of phenomena (in this 
case psychological phenomena) he is investigating. 
This principle prohibits neither his identification of 
determinants or of analogous events at other levels 
of discourse, nor his adoption of the language of 
general systems theory in addition to his own terms 
[see SYSTEMS ANALYSIS, article on PSYCHOLOGICAL 
SYSTEMS]. 

Brain-located properties, Brain-located sub- 
stance-and-structure dependence simply means that 
there would be no psychological properties or proc- 
esses without the operations of the properties of 
material substances and structures on lower (elec- 
tronic, chemical, physiological) levels of reality, 
including the circulating blood and its various 
essential constituents, such as oxygen, water, en- 
ergy-bearing food particles, salts, vitamins, and 
hormones (if no animate brain, then no personal- 
ity); and consequently that alterations of ence- 
phalic functioning beyond a certain critical limit— 
for example, as a result of anoxemia, lobotomy, 
arteriosclerosis, drugs, etc.—will change the pet 
sonality temporarily or permanently in some i 
spects. Furthermore, this dependence on the e 
means that increasing knowledge of enceph 4 
structures and their properties—for example, wa 
cending reticular activating system—and variou 
centers of hedonic, emotional, and erotic eoi 
in subcortical areas—should provide ground x 
hypothetical conceptions of the hierarehica M 
tems of a personality that not only will oa 
more and more closely to the operations o E 
organic structures on which these systems 2° 3i 
pendent but will advance our understanding 
their situational activities. [See NERVOUS pce 
article on STRUCTURE AND FUNCTION OF THE nis PS, 

Comprehensiveness of properties. In thi ae 
« ity” ive term 
personality,” the most comprehensiv! eaning 
have in psychology, is given a functional m 
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embracing everything from basic temperamental 
variables—for example, energy level, hedonic level, 
affective state of being—to such higher mental 
processes as may be devoted to superpersonal 
(cultural) endeavors—for example, artistic, histor- 
ical, scientific, philosophical. Consequently, even 
by restricting one’s attention (as one inevitably 
must do and should do) to the most important 
properties, a personality cannot yet be adequately 
represented as a functional and temporal whole in 
less than 5,000 words, let us say; certainly not by 
a short list of traits. 

Hierarchical constitution. Hierarchical consti- 
tution refers to the conception of three “vertical” 
divisions (differentiated establishments) of a per- 
sonality, the states of which in childhood are quite 
comparable to the Freudian id, ego, and superego. 
The number, nature, object attachments, relative 
potency, and interrelationships of the components 
of each of these establishments change progres- 
sively with maturation and learning, stratum upon 
stratum, in opposition to an underlying tendency 
(augmented by frustration, etc.) to regress to lower 
strata, as manifested in sleep, reaction to extreme 
stress, neurosis, and psychosis. In these regressions 
the mind is overrun by mythological (archetypal) 
images from the stratum in which the narcissistic 
infant’s entire world consisted of the providing and 
depriving mother (matriarchal superego) and the 
anlage of the ego was nothing but a little actuator 
of its vocal and its oral muscles. 

In this PS the adult id embraces the entire strati- 
fied population of instinctual (genetically given, 
Viscerogenic, subcortical, unbidden and involun- 
tary) affective dispositions (hedonic, wishful, emo- 
tional, evaluative) with their engendered orienting 
fantasies, some forms of which have been repressed 
for years and may or may not be operating influen- 
tially “below” the boundary of the ego, but others 
ts Which have access to consciousness. The adult 
ee the more or less fact-perceptive, knowledge- 
s 3 rational, articulate, future-oriented regnant 
(ae em of the personality with energies of its own 

luring the waking hours) to fulfill its role as both 
rea and servant of the id. As the self-conscious 
Lee o leaves most things to habit (dynamic mech- 
a a the adult ego will, on signal occasions, try 
Gace as possible to function as the autonomous 
een and self-sufficient (unaided) deter- 
nah his destiny—especially as “I” the decision 
eee X the identifier, interpreter, evaluator, rejec- 
RA = worse” and chooser of the “better” press- 
for the tay and as “I” the plan-composer 
the ples etter” ones, the executor and adjuster of 
ate ae and, finally, the achiever of effects which 

ciently satisfying to self-respect and to the 


destructive and constructive id, greedy for pleasure, 
companionship, love, possessions, parenthood, 
power, and prestige. The adult superego (superreg- 
nant system of the personality) consists of strati- 
fied imaginal representors of the rulers (parent, 
king, god) and verbal representors of the ruling 
culture (moral precepts, laws, beliefs, sentiments, 
principles) of the world (family, group, nation, 
mankind) of which each ego is a member. Its func- 
tion is to commit the individual to the support and 
service of these values. [See PSYCHOANALYSIS, arti- 
cle on EGO PSYCHOLOGY.] 

Imperceptible, problematical properties. Imper- 
ceptible and problematical properties are encoun- 
tered especially in highly speculative, theoretical 
constructs, all too complicated to be properly ex- 
pounded in this article. 

Personality operations. The operations of a per- 
sonality during any designated period in its career 
are the history of the brain-located participations 
(with more or less frequency, intensity, persistence, 
and effectiveness) of many, but not al], of its man- 
ifold, currently existing properties, a few of these 
at a time—as manifested objectively and/or sub- 
jectively during the waking hours—in each of the 
succession of different (unique or recurrent) ex- 
ternal (e.g., overt interactional) and internal (e.g., 
covert contemplative) situational proceedings, 
which, with their varying hedonic accompaniments 
and effects (negative or positive reinforcements ), 
compose the more or less self-conscious life of an 
individual as it is carried on in relation to an 
influential environment of natural, artifactual, 
human, and cultural entities and events. To be 
included among the manifest operations and rela- 
tions of a person’s constitutional properties are 
prolonged emotional states (e.g., melancholy, anx- 
iety, internal conflict, passionate love, creative 
zest); long spans of proactive, progressive serial 
proceedings, that is, the continuing, though inter- 
rupted, operation of complex systems of interest 
and enterprise with subsidiary processes (e.g., 
working for a degree, a long courtship, building a 
house, a voyage of discovery, writing a novel, etc. ); 
and verbal disclosures of prevalent enduring dispo- 
sitions (e.g, sentiments, beliefs, ambitions), as 
well as manifestations of thematic imagination, of 
knowledge, and of skill in various testing situations. 

Not all possible operations are included, because 
nowhere near all classes of situations (e.g., oppor- 
tunities, tasks, perils) will be encountered by any 
one person in his lifetime: numerous possible 
shames and triumphs will be buried with him. 
Almost as revealing as the classes of situations that 
a person proactively chooses to encounter may be 
those he consistently avoids. 
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Succession of proceedings. A person’s waking 
life is characterized by a continuous procession of 
varying psychic states, processes, and representors. 
Some of these processions will occur when he is 
privately absorbed in a random, undirected (invol- 
untary) stream of covert feelings, images, and 
ideas (e.g., daydreaming) and others, when he is 
privately absorbed in a directed (voluntary) en- 
deavor to “make up his mind” about something. 
Temporal strips of this nature are termed undirect- 
ed or directed internal proceedings. In contrast to 
these are external proceedings, during each of 
which the person is proactively or reactively en- 
gaged in an environmental transaction, marked by 
actuations of overt, situationally oriented, motor 
and/or verbal processes, actuations which may be 
more or less spontaneous, random, impulsively ex- 
pressive, aimless, or deliberately directed, inten- 
tional, and purposeful. External proceedings have 
two aspects: an imperceptible experiential aspect 
and a perceptible and/or audible behavioral aspect. 
Internal progeedings, however, have only a scarcely 
appreciable behavioral aspect. There is no space 
here for an account of how these major (some- 
times overlapping) classes of situational proceed- 
ings have been classified into families, genera, 
species, varieties, etc. What needs to be stressed 
here is that this PS embraces both conventional 
behavioral psychology and (more particularly) 
experiential psychology—that is, it uses detailed 
reports, expertly obtained, of internal proceedings 
and of the subjective aspect of external proceed- 
ings. [See PHENOMENOLOGY. ] 

History. Chronological records of the more im- 
portant (revealing, consequential), successful or 
unsuccessful, satisfying or dissatisfying, single and 
serial proceedings (e.g., overt social interactions, 
sexual experiences of all sorts, covert dreams, fan- 
tasies, conflicts, feelings, self-evaluations, appercep- 
tions of the world, choices of long-range purposes, 
etc.) in a person’s life during the designated time- 
span, taken in conjunction with his responses to 
technical procedures, constitute the bulk of the 
data, the facts, to be analyzed into psychological 
variables, which, whenever possible, will be incor- 
porated into the systemic components of a total 
formulation of the existing constitution of a per- 
son’s more important properties. But, of course, the 
constitution will be changing, and strictly speaking, 
this great task of assessment and formulation 
should be repeated at each of the (Shakespearean 
seven, let us say) ages of the subject, since person- 
ality, as defined by this PS, has not only a compre- 
hensive scope but a comprehensive span: the his- 
tory of the personality is the personality. I shall now 


turn to a much abbreviated conception of the his- 
toric changes of the constitution of a personality, 
paying special attention to their genetical and ex- 
periential (learning) determinants. 


The developmental process 


A personality constitutionally and operationally 
considered as a temporal whole from birth to death 
is the brain-located history of the successive and 
cumulative constitutional products (say, a few mol- 
ecules and structural alterations at a time, in cells, 
axons and dendrites) of two interdependent on- 
going, psychometabolic processes: the genetical 
and experiential systems. 

The nature and order of the products of the more 
fundamental of these two interdependent systems 
are largely determined, in a basic and a general 
way, by the inbuilt, genetical program of consec- 
utive events (DNA code of instructions), a pro- 
gram which is roughly divisible into three succes- 
sive but overlapping temporal eras. 

The first era is marked by the emergence and 
multiplication of potentialities for a high propor- 
tion of new, developmental (associational and or- 
ganizational), structural compositions, each with 
its consequential psychological properties. The sec 
ond era (middle age) is marked by a relatively high 
proportion of conservative recompositions of the 
already developed structures and functions. And, 
finally, the third era (senescence) is marked by 4 
decrease of potentialities for new compositions and 
recompositions (e.g., less learning and retention) 
and an increase of decompositions (atrophy) of 
some previously existing forms and functions. [See 
ADOLESCENCE; AGING; DEVELOPMENTAL PSYCHOL- 
OGY; GENETICS, article on GENETICS AND BEHAVIOR; 
INFANCY; INTELLECTUAL DEVELOPMENT; LIFE 
CYCLE; PERSONALITY, article on PERSONALITY DE- 
VELOPMENT; SENSORY AND MOTOR DEVELOPMENT| 

Presumably, the progress of the genetical tee 
gram of events determines the earliest age at W ‘i i 
a number of nascent dispositions and a, 
will successively emerge and be capable of deve ae 
ment under favorable conditions (e-g., now 8 a 
time to learn to crawl, now to start babbling, net 
comes the onset of puberty, etc.); the limits 
excellence to which a number of special skills ea 
athletic, musical, mathematical, poetical, a 
be perfected under the most facilitating Ee 
stances; a number of temperamental pees: ane 
and susceptibilities; and finally, besides muc at 
perhaps, the onset of senescence and the ag 
death under the most fortunate conditions. and 

The nature and order of the construct de 
conservative products of the experiential ses 
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pendent system of psychometabolic processes (the 
system involved in learning) are largely and spe- 
cifically determined by the succession and recur- 
rence of diverse concrete environmental encounters, 
instinctive outbursts, functional endeavors, and 
their hedonic effects (negative and positive rein- 
forcements, including punishments and rewards 
for good behavior). Thus, within limits set by the 
existing potentialities, the operations and fortunes 
—or misfortunes—of a few, interdependent prop- 
erties of the personality in each of the critical pro- 
ceedings of its ongoing life largely determine what 
will be learned (internally composed, retained, and 
replicated); then, in a reciprocal manner, what a 
person learns on one occasion will determine or 
modify his performance on a subsequent occasion 
of the same class. 

Metabolic model. The term “ongoing” points to 
the fact that the brain-located nature of a human 
being is marked by a procession of vital processes, 
from the cessation of which (for example, as a 
result of even a short span of oxygen deprivation) 
there is no recovery. Ontogenetically considered, 
these given vital processes date from the fertiliza- 
tion of the ovum, but phylogenetically considered, 
they have a presumptive molecular (DNA or RNA) 
and cytoplasmic thread of historic continuity of 
some two or three billion years, which started, we 
may suppose, with the emergence of the genetical 
systems of a cluster of primordial units of living 
Matter, To represent the sine qua non of life within 
the boundary of an animal organism, including its 
brain, and to account for the free energy that sus- 
tains its multivarious operations, there are no bet- 
ter general terms than those of biochemical metab- 
olism, the incessant operation in each cell and 
tissue of catabolic (structure-decomposing, energy- 
teleasing) and anabolic (structure-composing, en- 
ergy-binding) processes. 

The succession of intracellular catabolic proc- 
esses (designated here by De) may be likened to a 
tiny fire in which energy-rich particles (carbohy- 
drates, fats, and proteins) are decomposed with the 
aid of enzymes in a watery medium of mineral 
ree maine, and other substances. To keep these 
ana aoe burning brightly in the cells of the brain 
ae pan organs, there must be an almost continu- 
a a ilar ingestion (symbolized by Ca) of the 
en entloned necessities—especially oxygen— 
leas an almost continuous cellular egestion 

sie ized by Ru) of waste products—carbon 
ia » Water, and incompletely oxidized nitrogen 
™mpounds, 
a a0 Opposite nature to the analytic processes 
‘atabolism (De) are the synthetic, molecule- 


binding, structure-building processes of anabolism 
(symbolized by Co); the energy for these, as for 
every other variety of activity, is derived from ca- 
tabolism. Basically attributable to synthetic proc- 
esses at the molecular level (e.g., millions of multi- 
plications of giant DNA and RNA molecules, etc. ), 
according to this theory, are all the above-men- 
tioned maturational formations, and, in conjunc- 
tion with concrete environmental transactions 
(perceptual and actuational), all the structural 
compositions and recompositions, the properties of 
which constitute the specific and general products 
of experience; what a person has actually learned 
to attend to selectively, to recognize, to understand, 
to represent conceptually, to evaluate positively or 
negatively, to love or to hate, to anticipate with 
pleasure or with dread, to accomplish with his 
muscles, to express with words, and to choose as 
realizable aims—all of which are important focused 
variables of personality. On the extent and fitness 
of such acquisitions and also very largely on favor- 
able circumstances and good fortune will depend 
the possibility of actualizing whatever genetically- 
given, latent resources there may be for love, joy, 
achievement, and service to mankind. 

As to the three, above-defined overlapping tem- 
poral eras of the life cycle, each of these can be 
most simply, though roughly, conceptualized in 
terms of a ratio of the two metabolic terms Co and 
De: during the era of growth and development 
of character and powers, Co > De; during the era 
of maintenance of character and the fruitful use 
of powers, Co = De; during the phase of induration 
and decay, Co < De. These are some of the chief 
reasons why the concurrent, complementary proc- 
esses of biological metabolism Co and De have been 
taken as the main components of this PS's basic 
paradigm, or model, a model that conforms with 
a conception of reality that is not expressible in 
terms of spatial structures of matter as such but 
in terms of the interdependent, operating properties 
of matter—that is, in terms of process, time, and 
energy. It is a fundamental animate model that 
accounts for the vital, vegetative processes (e.g., 
maturation, growth, differentiation, repair, etc.) 
that inanimate models do not fittingly represent. 
It is also applicable, with suitable modifications, 
to higher levels of mental activity, especially— 
when taken in conjunction with the widespread 
need for novelty (change, exploration, experiment) 
—to those analytic and synthetic imaginations 
which lead to cultural (scientific, artistic, ideolog- 
ical, ethical, etc.) innovations of all types [see 
AESTHETICS and ETHICS]. 

Finally, this model (with its emphasis on the 
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energy-building, antientropic processes of progres- 
sive development and creativity) provides the bio- 
logical ground for what is so conspicuously absent 
in the purely psychoanalytic system (with its em- 
phasis on energy-reducing, entropic processes, as 
well as on fixation, repetition compulsion, regres- 
sion, disintegration) [see CREATIVITY and STIMU- 
LATION DRIVES]. 

Learning and hedonic effects. It is provisionally 
proposed that among the genetical determinants 
of the development of personality, none is more 
influential than the morphology of the primitive 
brain (subcortex), with its increasing differentia- 
tions and integrations of centers (clusters of cellu- 
lar containers of specialized chemical compounds 
called micronic structures), each of which will 
sooner or later constitute an inbuilt readiness to be 
converted into a more or less distinctive energizing 
(feeling) state. The most basic centers, related to 
all others, are those that yield hedonic feelings 
(pleasure, delight, satisfaction, joy) and those that 
yield anhedonic feelings (displeasure, distress, dis- 
satisfaction, misery), with expressive movements 
in each case. From infancy on, hedonic learning, 
on which so much depends, will consist in the dis- 
covery (represented in the brain by more or less 
enduring spatial structures reconvertible into tem- 
poral associations and organizations) of what gen- 
erates hedonic feelings and what generates anhe- 
donic feelings (of this or that intensity or grade) 
within the self and then within numerous other 
selves whose feelings are important to the self. 

There are many kinds of hedonic and anhedonic 
determinants (generators), most of which, though 
analytically distinguishable, operate in relations 
with a few others, interdependently or consequen- 
tially. Some may be experienced together (as com- 
pounded sources of joy or misery) or some sequen- 
tially, the first being either a necessary forerunner 
of the second (e.g., tedious rehearsals of a skill 
before exhibiting it successfully in public) or the 
“cause” of the second (e.g., the pleasure of bully- 
ing a younger sister, followed by a humiliating 
punishment, a negative social repercussion ), Grow- 
ing up involves, among other things, gradual in- 
creases in the prospective time-span; the capacity 
to foresee the future; the power to postpone the 
gratification of certain wants, for one reason or 
another; and the ability to discriminate between 
pleasures and displeasures which have satisfying 
or beneficial consequences and those which have 
dissatisfying or harmful consequences and. even- 
tually to regulate one’s life so that neither the pres- 
ent nor the future is sacrificed for the other. 

Of course the operation of a generator, hedonic 


or anhedonic, will usually depend on a number of 
conditions or factors in addition to the past history 
and developmental age of the subject, factors such 
as the place, time, duration, and mode of its occur. 
rence, the absence or presence of one or more par- 
ticular persons, etc. But taking these and other 
qualifications into account, it might be said that 
each properly differentiated class of satisfiers (posi- 
tive reinforcements) and of dissatisfiers (negative 
reinforcements), taken separately or in combina- 
tion, deserves some place among the interdepend- 
ent ends or counterends of individual or collective 
human living and endeavor. 

Hedonic and anhedonic generators may be suit- 
ably classified in several ways, one being according 
to their temporal reference: they may be retrospec- 
tive, involving memories of past experiences which 
generate present pleasure or displeasure; spective, 
involving awareness of current delighters or dis- 
tressors; or prospective, involving anticipations of 
future experiences which generate present pleasure 
or displeasure. Current generators may be classified 
according to their predominant location in space: 
in the subject, in the environment, or, more often, 
in subject-environment (especially interpersonal) 
transactions. Each of these major orders of deter- 
minants is divisible into several families, genera, 
etc. For example, generators in the subject may be 
located in the body (somatic determinants), in 
some emotional (e.g., love, hate, fear) center of the 
subcortex (central determinants), in some type of 
sensory, imaginal, conceptual, verbal, or motor 
process (processional determinants), or in the judg- 
ments of conscience (superregnant determinants). 

Environmental determinants. Most of these de- 
terminants are dynamically interrelated with en- 
vironmental determinants. These are dominant in 
the early months of life when the child's capabili- 
ties are pretty much limited to piteous pe 
and to sucking, The hedonic generators in early life 
consist of whatever provisions (delighters) at 
gratuitously transmitted by the mother and ee 
tively enjoyed by the child: chiefly bodily ear 
sions (contact, firm support, stimulation, nipp? i 
milk) and affectional provisions (presence, ee 
attention, expressions of love, and later of ae 
tion and approval, etc.). The anhedonic gener: ee 
are constituted either by the absence 1n the vee 
imal environment of an urgently wanted br i 
(e.g., food, mother), whose arrival starts a r Pa 
process pleasure, or by the presence of EE 
wanted distressor (e.g., pain, loud noise), e 
removal restores the status quo. Somewhat ine 
however, the child may be confronted by pr ing 
movable distressor, in the form of an interlop 
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newborn sibling and/or (for a boy) of a father, 
who is perceived as a successful contender for the 
mother’s love. 

Central determinants. This brings us to a dis- 
cussion of central determinants. It is assumed that, 
by necessity, the child is wholly egocentric (self- 
centeredly oblivious of other selves and selfishly 
demanding a complete monopoly of his mother’s 
attention when he wants it); thus, he bellows furi- 
ously when he is kept waiting, and the sight of 
a displacing rival is not unlikely to arouse in him 
a fiendish jealousy coupled with a profound resent- 
ment which engenders murderous fantasies direct- 
ed toward the rival or revengefully toward the 
mother as betrayer of his trust. It seems that these 
fantasies and dreams are promoted, in part, by an 
inbuilt propensity to compose a bewildering proces- 
sion of extravagant mythological images, many of 
which are related to the parents or more generally 
to the “myth of the hero,” who, after performing 
extraordinary, superhuman exploits (e.g., flying, 
magic, etc.), kills a king and marries his queen. 
Complexes resulting from the vicissitudes and dis- 
astrous outcomes of these imagined heroic deeds 
and crimes may reside in the lowest strata of the 
id for a long time. Among other things, growing 
up calls for an increasing ability to distinguish be- 
tween imagined events and actual events “out 
there,” as well as a gradual progression from an 
utterly dependent state (“What will my parents do 
for me?”) toward a relatively self-sufficient state 
(“What can I do for myself?”). 

Achievement determinants. An achievement 
determinant includes references to the (immedi- 
ate or distant) past, present, and future and has 
its location in the subject’s impression of whether 
he is moving toward his goal—whatever this may 
be—and, if so, whether he is moving faster or 
slower than he expected; with more ease or diffi- 
culty, from moment to moment, day to day, or year 
to year; or, more generally, whether he is getting 
better, no better, or worse in some respect. One can 
observe in the child the beginnings of experiences 
of this nature: the progressive organization of proc- 
€sses—of elementary sensory processes to arrive at 
te Perception and recognition of an object; of 
aes. processes (with feedback loops) to 
elas the moment when an object canbe con- 
= ve ly reached, seized, and variously dealt with; 
on cal processes to form words and sentences, 
mea se Constitute, in each case, a stepwise 
tee a volitional power, which is experienced by 
ia ild as a pleasure-enhancing minor achieve- 
& ae as such, is often applauded by a parent 

or recognition), The central determinant 


here is ambition in the child, first to achieve greater 
and greater control of his own processes by concen- 
tration and persistence (i.e., to develop a variety of 
afferent and efferent skills in dealing with small 
portions of the encountered environment) and thus 
to attain a measure of autonomy and independence 
(which inaugurates the development of the ego, 
the executive of the regnant system of the person- 
ality) and later to use these skills (athletic, social, 
and intellectual) in competing with his peers for 
acknowledged superiority (victory or some form of 
higher order recognition, such as prizes, election to 
membership or office, or academic marks and hon- 
ors), as well as to use these skills in satisfying 
other (material, social, or cultural) wants. The 
ways-means-end learning cannot begin until a 
certain amount of functional learning (walking 
and moving objects) has occurred, and then, what 
has to be learned year after year to gain positive 
social feedback (repercussions) from parents, 
teachers, or peers becomes progressively more dif- 
ficult. Under competitive conditions, the prizes and, 
later, the best jobs and much else are reserved for 
those who have pushed some relevant types of func- 
tional learning to a sufficient degree of competence. 
In short, positive social reinforcements depend 
on individual differences in ambition (want to 
achieve), work enjoyment (process pleasure), per- 
sistence, innate aptitude, etc. [See ACHIEVEMENT 
MOTIVATION and SENSORY AND MOTOR DEVELOP- 
MENT.] 

Transactional determinants. Transactional de- 
terminants are delighters and distressors that come 
from dealing (successfully or unsuccessfully ) with 
the environment of things, people, or ideas; from 
enjoyed or not enjoyed dyadic reciprocations with 
another person—that is, transmissions and recep- 
tions of erotic stimulation (mutual orgasm), of 
affection or disaffection, of interesting or banal in- 
formation, self-revelations, or opinions, of comic 
or pointless stories, etc.; from the processes or out- 
comes of cooperative endeavors; from the reper- 
cussions of socialized (ethical, considerate, con- 
ventional) or unsocialized (criminal, offensive, 
obnoxious) forms or styles of behavior, especially 
the miseries of ostracism, disgrace, imprisonment, 
etc.; from the processes and effects of transmitting 
delighters to those who need them or deserve them 
(a very common, genuine form of enjoyment not 
covered by current psychological theories), and 
much else besides [see SYMPATHY AND EMPATHY]. 
Let these suffice as illustrations of a few elementary 
varieties of hedonic and anhedonic determinants. 
In this PS, those that emanate from others (alters ) 
and are directed toward the subject are classified 
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as press (plural, press); positive press include ac- 
ceptance, inclusion, promotion, respect, affection, 
lust—love, and negative press include rejection, ex- 
clusion, demotion, contempt, hate. 

As a personality develops, associations and organ- 
izations will constitute the ground for a large num- 
ber of general and particular (unique), enduring 
or unenduring positive and negative interests, eval- 
uations (sentiments and tastes), personal affec- 
tions, beliefs, wants, modes of behavior, and aims 
(a negative aim being an orientation toward the 
riddance in the present or the prevention in the 
future of the operation of a distressor ). 

Learning. The term “learning” commonly im- 
plies that its results are “valuable” (in some sense), 
regardless of the self-evident fact that a person may 
learn a good deal that “ain’t so” or “ain’t good”: 
superstitions, prejudices, repulsive tastes, the craft 
of forgery, etc.—all of which had better be un- 
learned. Also, in psychology, the term generally 
implies that its results determine a young person's 
subsequent behavior, that he or she very soon be- 
comes a creature of habits, with emotions and aims 
fixated by experienced distressors or delighters and 
tactics fixated by experienced successes, etc. This 
might be pretty nearly the whole truth if the genet- 
ical program, with its potentialities for self-actual- 
ization, ceased to operate at puberty; if the subject 
were not easily bored and not eager for new sights 
and new ventures; if the subject were commonly 
rewarded for frequent repetitions of the same in- 
formation (old news), the same jokes, etc.; if the 
human environment, parents, teachers, and peers 
were unanimous in their support of the same be- 
liefs, codes, manners, political sentiments, and 
tastes; if the person were not ambitious to emulate 
successively the more impressive performances and 
deeds of others; if for the subject the very meaning 
of achievement (something to be proud of) did not 
consist in the accomplishment of something new, 
extraordinary, more difficult or hazardous; if the 
person were not enticed by future-oriented imagents 
(fantasies ) of unexperienced delights or of untried 
ways and means; if no person were ever radically 
transformed by a “second birth,” “great emancipa- 
tion,” or religious conversion; and finally, if no 
person were ever to discover that the creation of an 
unprecedented, propitious form of living or of cul- 
ture (scientific, literary, etc.) could be more pro- 
foundly joyous than any experience he had had. 
If it were not for these and other self-realizing, 
novelty-seeking, ambitious, proudful, imaginative, 
and creative dispositions in human beings, all of us 
would stagnate with learned incapacities and a few 
enthralling memories of infantile attachments. This 


is not to deny the essential truth of one of Freud's 
greatest discoveries: the lasting underlying influ. 
ence (either beneficial or harmful) of early terrors, 
conditionings, loves and hates, erotized fantasies, 
and much else. What is being stressed at this point 
is the amount of unlearning (hedonic, cognitive, 
and tactical), experimentation, courage, endurance, 
and constructiveness that is required for a full life, 
[See CREATIVITY; FANTASY; STIMULATION DRIVES,] 


Variables, assessments, and representations 


Among the criteria in terms of which the worth 
of a personological system may be suitably eval- 
uated are; (1) the comprehensiveness (coverage, 
scope), structure (interrelatedness), adequacy 
(congruence with reality, significance), and opera- 
tional definiteness (clarity, precision) of its assem- 
blage of concepts and propositions; (2) the effi- 
ciency (in corresponding terms) of its assessment 
system (data-collecting, data-processing, data-eval- 
uating, and data-integrating processes); and 
(3) the over-all satisfactoriness (in corresponding 
terms)—the plausibility and verifiability—of the 
representations of different personalities (explan- 
atory conceptual biographies) that are composed 
out of the harvest of facts and interpretations 
yielded by the assessment process. i 

In this PS, the central set of concepts are classi- 
fied into abstract elementary orders (e.g, pro 
pelling energizers and evaluators; representors; 
movers; organizers; etc.) and these into families, 
genera, etc., of decreasing generality. The simplest, 
recurrent associations of these properties with ea 
other or with the properties of environmental en- 
tities (e.g., an interest in Z, a positive evaluation 
of X, a want to gain Y, images of W, the elect 
ness of V processes, etc. ) are the kinds of wei 
personality variables which are usually obtain 
with ratings from questionnaires and tests, and 
terms of which an individual is ordinarily deser! 
in a social conversation. The aim in this PS, how: 
ever, is to discover how these structural compo- 
nents, converted into their operating prore e 
interdependently participate in this or that m 
gized, oriented, organized, temporal unit (orga H 
zational variable). This synthetic mode of ner a 
eventually results in the conceptualization r 
hierarchy of purposive organizations, the m 
and longest of which are systems of cone 
planning, activity, enjoyment, and age 
It is chiefly of these that a personality-in-P™ 
is composed. 


z been 
Of necessity, large portions of this PS re indi- 


omitted; but enough may have been said 


cate that it is basically a psychometabolic system, 
with energy released for drives, emotions, and 
movements (electrical and muscular) and for the 
composition of new convertible structures of all 
sorts. From this it follows that its root concepts 
are dynamic, genetical, developmental, organismal 
(systemic, semiholistic), and hierarchical. It 
stresses the experiential (existential, subjective), 
hedonistic, and voluntaristic aspects of a person’s 
life (partly because of their contemporary neglect); 
but heretofore in practice it has done more justice 
to unconscious psychological processes, as well as 
to overt behavior with its situational and socio- 
logical (membership and role) determinants. 


Henry A. MURRAY 


[Directly related is the entry PERSONALITY. Other rele- 
vant material may be found in Motivation, article 
on HUMAN MOTIVATION; NERVOUS SYSTEM; PSYCHO- 
ANALYSIS; SYSTEMS ANALYSIS; and in the biogra- 
phies of FREUD; HUXLEY; JUNG; LEWIN; MORGAN, 
C. LLOYD; WHITEHEAD.] 
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1, THE STUDY oF POLITICAL 
PERSONALITY 


1. CONSERVATISM AND RaDICALISM 


Robert E. Lane 
Daniel J. Levinson 


I 
THE STUDY OF POLITICAL PERSONALITY 


Because one cannot write about, or even think 
about, human affairs without some implicit con- 
cept of human nature, the idea of “political per- 
sonality” is a very old one; indeed, the established 
classics of political philosophy from Plato through 
Mill almost always give it explicit attention ( Wallas 
1908; Lane 1953). Yet at the same time, the devel- 
opment of sophisticated psychology and psychiatry, 
especially psychoanalysis, is so recent that the 
modern concept of political personality is qualita- 
tively very different and must here serve as the 
substance of the discussion. This article will ex- 
plore the meaning of the concept, examine the ways 
in which the various psychological schools have left 
their mark on the idea of a political personality, and 
then deal more extensively with the ways in which 
theory and research in this area have helped in our 
interpretation of political outcomes. 


The concept of political personality 

“Political personality” may be defined as the 
enduring, organized, dynamic response sets habit- 
ually aroused by political stimuli. It embraces 
(a) motivation, often analyzed as a combination of 
needs and values (the push-pull theory); (b) cog- 
nitions, perceptions, and habitual modes of learn- 
ing; and (c) behavioral tendencies, that is, the act- 
ing out of needs and other aspects of manifest 
behavior. Each of these has obvious political impli- 
cations: (a) people who are motivated by needs 
for power may employ political leverage to satisfy 
these needs rather than (or in the course of) a 
pursuit of some explicit policy goal; (b) cognitively, 
people who handle information in the defense of 
their partisanship, rather than as an instrument of 
broader learning, become dogmatic and obstruct 
social adaptation to new situations; (c) behavior- 
ally, political life is vitally affected by the tenden- 
cies of leaders to act out (externalize) their psy- 
chic conflicts, projecting them onto other people 
and situations or, alternatively, to withdraw into 
inaction when threatened or, again, to make public 
demands to assuage their sense of worthlessness. 
Most simply stated, then, the habitual patterns of 
feeling, learning and knowing, and behaving in 
political situations constitute political personality. 

The definition above states that the elements of 
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political personality are “enduring”; this means that 
they are in some sense central to the personality, 
not merely the response to somewhat ephemeral 
situations or the product of a certain occupation or, 
more generally, a role that a person occupies for 
the time being. This means that in speaking of 
political personality, we are dealing with patterns 
of thought, emotion, and acting out that may be 
seen in operation in many different situations over 
a relatively long period of time, perhaps youth, 
young adulthood, and maturity. Again, this implies 
that these patterns are laid down relatively early, 
though their expression and style may reveal dif- 
ferences over time; indeed, there may be funda- 
mental changes in personality at a relatively 
mature age. Parenthetically, it may be noted that 
these changes are often the product of a slight 
change in some set of balancing internal forces 
(say, strong aggressive impulses held in check by 
fear of one’s own aggression) which may produce 
a relatively great change in outward personality 
manifestations (e.g., from ingratiating to domi- 
neering behavior). 

The definition of political personality includes 
the attribute “organized,” implying some interrela- 
tionship among the constituent elements such that 
a change in one, say, a growing need for social 
approval, would modify other elements, perhaps 
leading to a decreased willingness to defy author- 
ity. Inquiry into this organization implies some- 
thing like the following paradigm of questions: 
What patterns of needs, expressed through what 
need-coping mechanisms (repression, sublimation, 
ego-striving, etc. ), modified by what perceptions of 
reality and habits of learning, screened through 
what ideological constellations, produce what be- 
havioral tendencies? The organization of a polit- 
ical personality, then, implies a patterned relation- 
ship and interaction among these elements. And, 
among these elements, the manner of dealing with 
conflicting needs or motives is probably the most 
important, followed by the pattern of response to 
and internalization of authority. 

There was a third adjective in the definition: 
“dynamic.” Here, this overworked term refers to a 
capacity to produce change in something else. Op- 
erationally this means that if two ideas or emotions 
are brought into some kind of relationship, the 
more dynamic element changes the less dynamic 
element. For instance, the attitude toward authority 
is generally considered more potent than feelings 
toward particular leaders. Therefore, when a wor- 
shipful attitude toward authority is forced to con- 
front a dislike of a particular political leader, more 
change will be effected in the attitude toward the 


leader than in the posture toward authority in gen- 
eral (“he isn’t so bad, after all . . . I must have mis- 
judged him... at least he looks like a president, 
etc.”). We reserve for the term “political personal- 
ity” those elements of a person's total psychic pat- 
tern which tend to shape attitudes, beliefs, and 
actions on new issues as they arise. Just as some 
authorities talk of “reference groups” and “refer- 
ence persons,” so we might here refer to political 
personality as a constellation of “reference ideas” 
and “reference emotions”—ideas and emotions to 
which new problems are referred for guidance and 
instruction. But, of course, this reference is usually 
quite unconscious. 

The concept of political personality borders on 
other concepts from which it must be distinguished. 
There is, in the first place, the concept of “attitude,” 
which has been classically defined as a kind of 
“mental and neural response set.” We would distin- 
guish political personality from a single attitude on 
the obvious ground that the latter is too narrow, and 
from any conceivable complex of attitudes, how- 
ever broad, on the ground that such a complex 
lacks the organization and dynamic potential of a 
political personality. Personality, as has so often 
been remarked, is not a bundle of traits. Thus we 
conceive of a personality as shaping attitudes and 
not vice versa. [See ATTITUDES.] 

In the second place, there is the concept of “role, 
usually defined as a pattern of expected behavior 
associated with a given position in society. In prac- 
tice it is not easy to distinguish role-determined 
behavior from personality-determined behavior 
when a person is acting out his concept of appro- 
priate role behavior or, worse, when he has accept: 
ed the values and beliefs associated with a given 
role and performs accordingly. Sometimes the oe 
way to distinguish between personality and role 
to observe the person in a set of different roles, say, 
father and bureaucrat. Conceptually, political m 
sonality has an earlier genesis, has a different e 
ganizational principle, transcends the AES: E 
social position, is more internally motivate! ae 
autonomous of the environment, responds to ' 
ferent crises and conflicts, and is more pei 
cratic or individualized than is any (political) 7° 
behavior. [See ROLE.] hs 

Finally, there is the distinction between pei 
ality and culture, a difficult one because P ai 
ity must be learned somehow from availal “ality 
tural elements. It is for this reason that perso ¢ cul 
is sometimes said to be the subjective side aoe 
ture. Does individual anxiety reflect an mA ality 
laden culture? Does the authoritarian pers 
reflect an authoritarian culture? When Wi 


ing with individuals, the distinction is relatively 
easy, because no two individuals bring together an 
identical genetic pattern and an identical sequence 
of experiences; hence, each is in some way unique. 
When we deal with “modal personality” or “social 
character,” that is, the features of personality which 
are commonly shared in a group, we have greater 
trouble distinguishing between these shared per- 
sonality elements and the dominant themes of a 
culture, Obviously the carriers of the culture are 
people; they exemplify it, as well as conveying its 
themes to others through their norm-setting and 
norm-enforcing behavior. Perhaps the best way to 
distinguish between these concepts of modal po- 
litical personality and political culture is through 
the different questions each concept poses, the dif- 
ferent theoretical structures employed to answer 
these questions, and the differential focuses upon 
people and ideas in each instance. Questions deal- 
ing with modal political personality elicit answers 
employing psychological theories of individual de- 
velopment, learning, imitation, conflict resolution, 
and the like. They are designed to tell us about 
people, in this case individuals who happen to be 
in groups. Questions dealing with political culture 
employ theories of social change, cultural diffusion, 
group adaptation to ecological factors, functional 
requirements of a given social structure, the rein- 
forcement of social patterns, social (rather than 
individual) pattern maintenance, and so forth. 
Both concepts contribute to an understanding of 
political phenomena: modal political personality, 
through its contribution to an understanding of 
group psychology; political culture, through its con- 
tribution to an understanding of the prevailing 
myths, beliefs, and adaptive responses of the under- 
lying society. Yet it must be said that often these 
amount to the same thing. [See POLITICAL CUL- 
TURE, ] 


Political personality and environment 


_ It is now a truism to say that all social explana- 
tions employ some variation of what the learning 
theorists term a “stimulus($)-organism(O)-re- 
Sponse(R)” model, though sometimes it is referred 
P as “environment(E )—predisposition(P )—response 
m ): Social and political explanations place 
i emphasis now on one term (S or E), now on 
i ohen (O or P). Institutional theories, such as 
nee which claim that a separation of powers is a 
eee condition for the rule of law or those 
fee assert that the development of a middle class 
cane for the survival of representative gov- 
fea nt, emphasize the environmental part of this 

el. Yet while they are silent on the personalities 
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of the actors involved, they always quietly impute 
to them a set of personal qualities, e.g., the univer- 
sal love of power, which can be restrained only by 
others with power, or the reluctance of an economic 
elite to share power with the masses unless coerced 
by a balancing middle class and/or the incapacity 
of the masses to govern themselves. Each environ- 
mental theory, then, as mentioned in the first sen- 
tence of this article, implies a theory of personal 
motivation and some distribution of motives, 
values, and capacities in a relevant population. The 
study of political personality represents, in the one 
sense, an effort to fill in these never empty but 
often unexamined cells in the great macro theories 
which have guided the study of nations. 

On the other hand, there is a temptation to em- 
ploy the new concepts of political personality in an 
exaggerated way, so as to imply that an individual 
has a set of motives or needs which are evoked in 
the same way in all situations or, worse, that a 
given public, say the German people, chose a cer- 
tain leader, adopted a certain ideology, became 
bellicose and domineering or whatever because of 
some national character constellation, as though 
this constellation operated quite independently of 
the history and institutions of that particular pub- 
lic. One has only to glance at the early theories of 
political motivation to see the temptation at work: 
politicians choose their careers because they (all of 
them) love power; the Germans chose Hitler and 
Nazism because they were authoritarian; Ameri- 
cans offered economic aid to other countries be- 
cause they were other-directed and wanted the 
world to love them—and partially turned against 
aid giving when they found that they could not buy 
love. 

The lesson is clear, indeed obvious—but often 
neglected: it is in a combination of circumstances 
and attitudes, environment and political personal- 
ity, that the answers to the important political 
questions will be found. 


Theoretical interpretations 

Since, as was said at the beginning, the modern 
exploration of political personality is encouraged 
by and feeds on the developments of personality 
theory and research among psychologists and psy- 
chiatrists, it is not surprising that variations in 
these more molar fields are reflected in the inter- 
pretations of the role of personality in political life. 
For this reason it is useful to touch briefly upon the 
way in which various psychological theories affect 
these interpretations. 

The first approach is that of Pavlov (1927), 
Watson (1914), Hull (1943), and others in Russia 
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and the United States. Originally called learning 
theory, it is now more generally called behavior 
theory. The central doctrine of this school is that 
an understanding of all behavioral responses may 
be acquired through a grasp of the concepts of 
drive, cue, response, and reward and their deriva- 
tives; rewarded responses become habits through 
a process of conditioning, and unrewarded respon- 
ses tend to be extinguished. Personality, then, is the 
pattern of learned responses, not all of them adap- 
tive in long-range terms but all of them, at least 
once, reinforced. It has been argued that the be- 
havior of the Soviet elite has been heavily influ- 
enced by this concept of personality, leading them, 
so it is said, to believe that through a fairly gross 
manipulation of rewards and punishments they 
can shape the personalities and behaviors of the 
populations under their control (Tucker 1963, 
pp. 91-121). Although it is possible to translate 
behavior theory into the terms and constructs of 
other more clinically oriented theories, as Dollard 
and Miller (1950) have shown, the product is 
somewhat inelegant. Because of its mechanistic 
emphasis and insistence upon observable (opera- 
tionalized) ingredients, the theory leaves out the 
rich, speculative, and often fruitful concepts of 
internal dynamics, conflict resolution, fantasy and 
free-associational styles of thought, dream life, ana- 
logical thinking, and the secondary elaborations 
which these theories produce. 

A second approach to political personality fo- 
cuses upon the complex of vectors or forces which 
influence a person in his “life space.” Kurt Lewin 
(1939-1947) was the originator of this theory 
whose sources lie in gestalt psychology. His associ- 
ates and followers have gone on into the field of 
group dynamics exemplified in their small group 
experiments (Cartwright & Zander 1953). The 
central work of this school consists of accounting 
for the impact of the social world, the world of 
interaction and group life, upon an individual 
whose goals are constantly shaped and modified by 
these influential people—the other members of his 
group. This approach, reflected in the voting 
studies produced both by the Lazarsfeld—Berelson 
group (Lazarsfeld et al. 1944; Berelson et al. 1954) 
and the Michigan Survey Research Center group 
headed by Angus Campbell (Campbell et al. 1954; 
Michigan, University of 1960) has led to some 
important formulations of the way in which ref- 
erence groups, interpersonal influence, cross pres- 
sures or conflicting identifications, family, school, 
and work socialization, and group pressures all 
combine to modify electoral decisions. The find- 
ings suggest that in this area of American electoral 


decision making, individual differences are not so 
important as group differences, that is, national 
modal character is more important than individual 
personality. Another way to say this is that for any 
one country, more of the variance is accounted for 
by environmental factors than by personality 
factors. 

The more strictly Freudian approach, now only 
one of several competing schools of psychoanalysis, 
focuses, as is well known, upon the channeling and 
blocking of the libido, the conflict among id and 
ego and superego (impulse, conscious mind, and 
conscience), unconscious processes, early determi- 
nation of central personality characteristics, and 
the consequent need to return retrospectively to 
early experience in order to achieve fundamental 
personality change (Freud 1932), The view of 
political man which emerges from this perspective 
on personality and its formation is well symbolized, 
if not well summarized, by Lasswell’s early provoc- 
ative formulation : 


The most general formula which expresses the devel- 
opmental facts about the fully developed political man 
reads thus: 

p)d)r=P, 


where p equals private motives; d equals displacement 
onto a public object; r equals rationalization in terms 
of public interest; P equals the political man; and ) 
equals transformed into. (({1930] 1951, pp. 75-76) 


This makes of political man the rationalized ver- 
sion of inadmissible private motives and reflects 
the early emphasis of Freudians and neo-Freudians 
upon those aspects of personality which could be 
traced to a wild and assertive id. Anna Freud 
(1936), Heinz Hartmann (1927-1959), and many 
others have somewhat restored the balance by giv- 
ing more weight to ego psychology. But the polit- 
ical personality which emerges from this theoret- 
ical framework tends to underemphasize what ra 
been overemphasized by the rest of the world, the 
plain appeal and obvious infiuence of economic 
advantage. 

More recently, psychoanalytic theory has ae 
re-examined and modified by a group, aie | 
called the interpersonal school of payee a 
has been led by three important figures: Ee 
Stack Sullivan, Erich Fromm, and Karen eae 
The label “interpersonal” comes from ithe pa 
view of personality expressed by Sullivan in 
following terms: “Personality I now define in Ab 
particularist sense as the relatively enduring He 
tern of recurrent interpersonal situations 1053, 
characterize a human life” ({1940-1945] jitic 
p. xi). His work has had less bearing on P° 


problems than has that of his two associates, so we 
now turn to their formulations. 

Horney (1937), in emphasizing the reflection of 
the strains of society in the contemporary neurotic 
trends of her patients (as a sample of some larger 
group), reveals what social life is doing to people: 
making them competitive, decreasing their capacity 
to give of themselves and thereby reducing their 
capacity to love; at the same time it is increasing 
their “neurotic need for affection” and generally, 
and most importantly, making them more anxious. 
Unfortunately, because she has no base line, Hor- 
ney cannot substantiate her claim that modern 
man has more of these problems than did his pred- 
ecessors—but she can make a strong case that 
these facets of personality damage are real and 
important for our time [see HorNEy]. The impor- 
tance of these neurotic trends for political person- 
ality, that is, the personality faced with political 
decisions, is substantial, though their expression 
is uncertain: the anxious man may, depending 
upon the organization of his personality, withdraw, 
become assertive, cling desperately to some dog- 
matic belief, or yield utterly to some tendency 
toward “other-directedness.” What is certain is that 
his anxiety will impair his rational functioning and 
prevent him from using politics to his maximum 
long-term advantage. 

: Erich Fromm, the most politically oriented figure 
in this group, presents three main themes with 
special relevance to the study of political person- 
ality. The first of these has to do with the idea of 
“social character,” which on the one hand is the 
root” of ideology and culture and on the other “is 
molded by the mode of existence of a given society” 
(1941, p. 296), For Fromm, “the social character 
internalizes external necessities and thus harnesses 
human energy for the task of a given economic and 
Social system” (1941, p. 284). In the same sense, 
one might think of a modal political personality, in 
a world where personality and political system are 
in harmony, as providing the motives, values, and 
capacities for performing the required political acts. 
s Fromm’s second theme has to do with the rela- 
ee of man to society and to the government 
area It was he who in modern times first de- 
ee an accounting of the costs of “freedom, 
ties A AY, won at the expense of weakened 
re amily, neighborhood, community, tradition, 
ing tS dats and, in some cases, religion. In develop- 
the isn accounting and in elaborating on some of 
ees oe responses which it facilitated, Fromm 
ie ar » OF at least sketched, the atavistic polit- 
eee sonality unable to stand alone, searching for 
e synthetic ties to replace the ones lost in the 
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process of modernization. This theme assumed in- 
creased importance with the development of the 
theory of The Authoritarian Personality (1950) by 
Adorno, Frenkel-Brunswik, Levinson, and Sanford. 
Another sprout from the soil of this theory, nur- 
tured by that very original gardener David Riesman 
(1950), is a theory of modern man as other-direct- 
ed, that is, decreasingly reliant on his own con- 
science and increasingly reliant on cues from other 
people for his ideas and actions. 

A third theme in Fromm’s more recent work 
deals with the concept of alienation; it is important 
because it stands for, although it did not originate, 
an entire school of criticism of modern society. 
With roots in the work of Feuerbach and Marx 
(Fromm 1961) but now more leveled at “moder- 
nity” than at capitalism, the idea of the alienated 
man claims that industrialism has alienated man 
from his work; that commercialism and the long 
process of change from status to contract have cre- 
ated an alienated “marketing personality” where 
people and the self are regarded as “things”; that 
mass society divorces people from meaningful 
group life; and that mass politics creates automa- 
ton-like, meaningless political responses (Fromm 
1955). The alienated personality is said to be 
“available” for charismatic leadership and anomic 
destructive social movements. Unfortunately for 
the theory, most available evidence shows that the 
attitudes characteristic of alienation are more preva- 
lent in rural societies and are especially frequent in 
village life relatively untouched by industrialism, 
commercialism, mass media, and mass politics 
(Banfield & Banfield 1958 ). 

While the “interpersonal school” represents one 
variation of the original Freudian interpretation of 
political man, another variation is represented by 
Erik Erikson (1956), whose public hallmark is the 
concept of identity. Here we have a shift in focus, 
from instinctual libido and interpersonal relations 
to the concept of the self, or the self image, but it 
is a self image in a group context. The term is 
somewhat elastic: “At one time... it will appear” 
to refer to a conscious sense of individual identity; 
at another, to an unconscious striving for a conti- 
nuity of personal character; at a third, as a crite- 
rion for the silent doings of ego synthesis; and 
finally, as a maintenance of an inner solidarity with 
a group's ideals and identity” ({1956] 1960, p. 38). 
It is important now because of the growing evi- 
dence that psychic breakdowns have their origins 
decreasingly in the repression of impulses, as was 
the case in the nineteenth century, and more often 
in “identity-diffusion,” that is, the uncertainties and 
anxieties arising from ill-defined goals, ambiguous 
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group identifications, conflicting self images, and 
vague life patterns. In its political aspects, identity- 
diffusion leads to low political cathexis and ideolog- 
ical caution (Lane 1962, pp. 381-399). 


Political personality and political analysis 


The various concepts and interpretations of 
political personality are reflected in the applied 
work of political analysis—but often segmentally, 
or eclectically, by men seeking personality concepts 
by means of which they can grapple with the intri- 
cate workings of political institutions. The prob- 
lems of such analysis are many, and the amount of 
research in this area is small. 

Legislative behavior. Legislative behavior, the 
output of which is a set of laws, institutions, and 
appropriations, is inevitably modified by the per- 
sonality constellations of the legislators. An early 
study by McConaughy (1950) suggests that in one 
American state, South Carolina, a (somewhat un- 
systematic) sample of legislators were, compared 
with the average American adult, “less neurotic... 
far less introverted, more self-sufficient and slightly 
more dominant.” McConaughy does not relate 
these results to legislative behavior. A more com- 
prehensive study of Connecticut legislators by 
James Barber (1965) finds a large proportion of 
the legislators lacking in self-esteem and employing 
their positions in the legislature to compensate for 
their doubts about themselves; it is the few who 
are genuinely self-confident who get the legislative 
business of the legislature done. On a closely re- 
lated psychological syndrome, the sense of efficacy, 
evidence from a study of four American state legis- 
latures reveals how important this sense is in af- 
fecting the manner in which a legislator sees and 
handles his role: those with a higher sense of effi- 
cacy (ego strength?) look beyond their immediate 
districts and accept responsibility for the larger 
unit, in this case the state; more than others, they 
accept pressure groups and even welcome them 
as part of the play of forces necessary for develop- 
ing acceptable policy; and they tend to regard their 
role as brokers in a cockpit of conflict with 
equanimity, without becoming personally upset 
(Wahike et al. 1962, especially pp. 474-475). But 
in these four states, as in Connecticut, it appears 
that more than half of the legislators have grave 
doubts about their efficacy or even adequacy in the 
legislative situation. 

Another theme out of the nosology of personality 
syndromes is the matter of identity. It seems from 
Barber's Connecticut study, and from other work 
on the United States House of Representatives, that 
the more clearly a legislator perceives his role and 


assimilates this role concept to some vigorous and 
viable sense of identity, the more effective he 
is—i.e., persuasive, work-oriented, and hard to 
manipulate. [See LEGISLATION, article on LEGIS- 
LATIVE BEHAVIOR.] 

Although it does not explicitly deal with person- 
ality, Nathan Leites’ work on “operational codes,” 
first of the Politburo (1951) and then of the 
French parliament (1959), embodies in different 
language many of the themes which are normally 
conceived to be the stuff of personality. Thus, 
Leites asserts that two themes of the operational 
code of the Politburo are a fear of dependency, 
leading to strong measures to avoid any situation 
where people or nations are mutually interdepend- 
ent, and particularly any situation where they 
might be “used” by others (1951, pp. 40-43), and 
an unusually strong insistence on “the control of 
feelings,” leading to attacks on sentimentality and 
emotional responses and to an observable “hard- 
ness”—at least during the Stalinist period (1951, 
pp. 20-24). Speaking of the French legislator, 
Leites (1959) describes a series of behaviors, espe- 
cially avoidance of responsibility, efforts so to ar- 
range things that others are “to blame” for unpop- 
ular or unsuccessful decisions, inability to form 
permanent coalitions because of distrust of others 
(following from and causing this pattern of irre- 
sponsibility), and a search for a force majeure to 
get the French parlement or the nation out of its 
difficulties. Leites would argue that these are 
French, as well as legislative, characteristics; 
hence they also fall under the category of “national 
character,” though of course he would also argue 
that they are traits to some extent widely shared 
throughout the world. At the bottom of many ° 
these traits, there seems to be a concept of individ- 
ualism and a tendency to distrust others, a quality 
which Almond and Verba found central to the 
working of a successful civic culture (1963). é 

Judicial behavior. Many years ago Jerom 
Frank (1930) pointed to the very great Mere 
in the treatment given to defendants in si s 
situations by judges of different dispositions. at 
though at first this seemed to reflect ber 
“philosophies,” it later became evident that this Wes 
another name for the motivations, values, pum Jla- 
ness, and other elements of a personality contra 
tion, Harold Lasswell (1948) illuminated HAA 
problems in his discussion of the eee 
judicial decisions of narcissistic, paranoid, Be 
homosexual, and other tendencies in & ane a 
judges whose life histories were made ave dè 
him, Pritchett’s study of patterns of dec ne 
by Supreme Court justices (1948) reveals 


thing about the influence of personal characteristics 
upon judicial decision making, but the personality 
themes here are only partiaily developed. [See 
JUDICIARY, article on JUDICIAL BEHAVIOR.] 

Electoral behavior. As mentioned earlier, the 
main themes of the very substantial studies of 
electoral behavior in the United States, England, 
France, and Norway deal not with personality fea- 
tures but with group life, media influence, occupa- 
tional and class differences, electoral laws, and the 
like. This is for the very good reason that, at least 
in the United States, and generally elsewhere, 
modal behavior is situationally determined to such 
an extent that people with many different person- 
ality syndromes tend to behave similarly. The 
places to look for the influence of personality are 
in the interstices where social pressures are con- 
flicting or ambiguous. Lane (1959, p. 100) has 
listed some of these as follows: 


Selection of the grounds for rationalizing a political 
act. 

Selecting topics for political discussion. 

Selecting types of political behavior over and above 
voting. 

Expression of the probable consequences of participa- 
tion, 

Holding particular images of other participants. 

Styles of personal interaction in political groups. 


In general this means that, at least in the United 
States, there are no important personality differ- 
ences between Democrats and Republicans, though 
radicals of the right and left have been shown to 
have deviant personality syndromes (Almond 1954; 
Adorno et al, 1950), On the other hand, there are 
substantial differences between participants and 
Nonparticipants: the nonparticipants tend to be 
More neurotic, more anxious, less self-confident, 
more autistic, less trusting of others, lower in ego 
Strength, and more alienated from themselves and 
society (Lane 1959, pp. 97-181). In a cross-cul- 
tural study by Almond and Verba (1963), in which 
Personality and cultural differences are, as always, 
Somewhat fused, it appears that those societies in 
ve there is greater interpersonal trust; in which, 
tom the beginning, people have a sense that each 
re is himself important and influential in the 
‘amily, school, and place of work; and in which it 
1s possible to work easily and cooperatively with 
fee such societies develop healthier (more ef- 
dee and less destructive) patterns of participa- 
iis and more conciliatory and workable patterns 
y partisanship. [See POLITICAL PARTICIPATION; 
OTING.] 
Ideology. Electoral participation inevitably leads 


into a discussion of ideology, but, as Campbell 
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and his colleagues (Michigan, University of 
1960) make clear, the relationship between voting 
and ideology is often tenuous. Ideology here means 
not merely the attitudes and values on topical con- 
cerns (foreign aid, civil rights, welfare state) but 
also the fundamental views which form the idea- 
tional counterpart to a constitution: ideas on fair 
play and due process, rights of others, sharing of 
power, the proper distribution of the goods of so- 
ciety (equality), uses and abuses of authority, etc. 
It is in this sense that Adorno and his associates 
(1950) develop their concept of an authoritarian 
personality and belief system, and in this sense 
Lane (1962) explored the ideologies of a group of 
working-class and lower middle-class men. The 
problem here is to sort out the conventional beliefs 
which almost everyone in the society holds, not 
because these ideas have a special congeniality but 
because they are, so to speak, “given,” from those 
ideas which are selected from among alternatives 
because these ideas have a special “resonance.” The 
conventional ideas may be conceived as related to 
national character, though there can be a lack of 
congruence here, too; the “resonant” ideas, the 
more or less individual ones, may more properly be 
related to and explained by the concept of indi- 
vidual political personality. 

In the general discussion of ideology, it appears 
that one central aspect is that of alienation versus 
allegiance, a rejection of “the system” or some sub- 
stantial part thereof, compared with a fundamental 
loyalty to and acceptance of it, Cantril (1958) 
found this to be a main theme among the French 
and Italian communists he interviewed; Almond 
and Verba (1963) suggest that one of the main dif- 
ficulties of the Italian political system lies in the 
high incidence of alienation and contrast this with 
the sense of allegiance and political pride of the 
Americans and British. Lane found in his Eastport, 
U.S.A., sample that a failure to support democratic 
norms and a tendency to see decisions made by 
conspiratorial groups (“cabalistic thinking”) were 
related generally to just such political alienation 
but that this did not extend to a more general feel- 
ing of social alienation, a rejection of the society 
and its values (1962, pp. 161-186). On the whole, 
it seems wise to think of alienation in terms of 
specific targets of disaffection, a series of topically 
specific continua, rather than in terms of a dichot- 
omous and total classification, although the tend- 
ency of alienation in one area to infect another 
should be observed. [See IDEOLOGY; Loyatty.] 

National character. The study of widespread 
ideologies or belief systems leads to the study of the 
broad distribution of political personality types or 
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characteristics in a society. At this point we begin 
to study national or social character—a field which 
fell into disrepute because of early unsupported 
generalizations by Le Bon (1894) and others. 

Following World War 1, a series of studies on 
the German people, sometimes based on interview 
data and sometimes more speculative (Dicks 1950; 
Levy 1948; Schaffner 1948), sought to discover 
what led them down the path to the Nazi revolu- 
tion. A few comparative studies have indeed sug- 
gested that the Germans, more than others, tend 
to revere both paternal and state authority and 
have, by a very slight margin, a higher incidence 
of “authoritarianism” (McGranahan 1946). Sub- 
sequent political history has indicated that these 
elements of political personality are not incompat- 
ible with the functioning of certain kinds of repub- 
lican institutions, and, moreover, they are not fixed 
in the character of a people forever. 

Similar studies of the Soviet Union (Dicks 1952; 
Mead 1951), necessarily more limited because of 
the inaccessibility of most of the population to such 
study, have suggested the importance of certain 
other emotional themes: expressiveness and the 
felt need for external control (combined, it is true, 
with a strain between the controlling, overly bu- 
reaucratic elite and the mass of people still in tran- 
sition from traditional to modern behavior and 
norms); suspicion of “outsiders,” implying sharp 
differentiation between in-groups and out-groups, 
with some paranoid symptoms; and “identity crisis” 
posed by the long-term, but recently exacerbated, 
conflict between Russia and the West—and yet, 
withal, a lack of “tenderness taboo” (or sadomas- 
ochism) which characterized the Nazi mentality. 
[See NATIONAL CHARACTER.] 


We are only beginning to study the close inter- 
relationship of personality qualities and political 
life, particularly the way in which similar institu- 
tions function when manned by persons of different 
personality constellations. This article has not 
touched upon the important research done on the 
personality problems of the officials and publics 
in modernizing nations, as illuminated by Doob 
(1960), Pye (1962), and Leighton (Cornell... 
1963), among others, but this is surely an area 
where further work is needed. Problems of per- 
sonality and bureaucracy in a world which inevi- 
tably is becoming bureaucratized deserve further 
attention, following the seminal article by Merton 
(1940). Elite studies have, until now, tended to 
focus on the more easily accessible data, the ex- 
ternal circumstantial forces affecting career choice 
and selection of the world’s elites, but we need to 


know more about the internal dynamics and inter- 
personal characteristics of coopted, appointed, and 
elected leaders. By now, however, scholars are 
aware that there is no simple distribution of traits, 
syndromes, or personality types which is good or 
necessary (or at least sufficient) for the operation 
of an efficient and humane political system, and 
that hence we must direct our research toward dis- 
covering relatively subtle patterns of personality 
characteristics, with varying distributions, meshed 
into roles and institutions in complementary ways, 
each “way” modified by the ecology and history of a 
particular political system. 

ROBERT E. LANE 


[Directly related are the entries IDENTITY, PSYCHOSO- 
CIAL; NATIONAL CHARACTER; POLITICAL BEHAVIOR; 
and the biographies of Freup; Horney; HULL; 
Lewin; PavLov; Watson. See also the guide fol- 
lowing the next article.] 
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ie 
CONSERVATISM AND RADICA! 


The preceding article deals with the concept of 
political personality and with the relevance of per- 
sonality in various areas of political life. This 
article deals with the specific problem of personality 
and political ideology: the questions of what ways, 
if any, “conservatives” and “radicals” differ in per- 
sonality and what part personality plays in influ- 
encing the individual's preference for a conservative 
or a radical political outlook. 

Some of the major problems involved in answer- 
ing these questions are found in the questions’ 
terms. Conservatism and radicalism are not dis- 
crete entities, These terms are, so to say, genotypes 
rather than phenotypes, analytic categories rather 
than concrete doctrines. They refer to basic political 
tendencies or orientations that may be manifested 
in a variety of explicit ideological forms. The forms 
vary widely in different classes, countries, and his- 
torical periods. There is considerable disagreement 
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regarding the primary defining characteristics of 
each viewpoint and regarding the validity and use- 
fulness of employing these analytic categories in 
political analysis. 

It would be helpful, for purposes of psychological 
analysis, if the purely political meaning of the 
terms “conservative” and “radical” were clearly de- 
fined and widely agreed upon. This is, unfortu- 
nately, not the case. We must therefore begin by 
considering briefly the alternative usages of these 
terms and the bearing of this problem on studies 
of personality and political ideology. [See CoNsERV- 
ATISM; RADICALISM.] 

In the broadest sense—a sense implied by their 
pairing in the title of this article—the terms “con- 
servatism” and “radicalism” may be used to divide 
the vast array of political ideologies into two gross 
categories of “right” and “left.” Each term then 
becomes a relatively complex category under which 
a variety of more concrete viewpoints are sub- 
sumed, In general, ideologies of the right support 
the (present or imagined past) status quo and em- 
phasize the importance of tradition, stability, and 
hierarchical social order. Ideologies of the left are 
critical of the prevailing system and seek major 
institutional change toward increased social and 
economic equality. 

The simple right—left dichotomy, while useful 
for certain purposes, is in need of further differen- 
tiation. Within the right, for example, there are 
various ideological positions that differ markedly 
in their social and psychological bases and in their 
political aims. These include the moderate conserv- 
atives (who, especially in the more industrialized 
countries, accept many “liberal” reforms and the 
political rules of parliamentary democracy ), as well 
as ultraconservative, fascist, and “lunatic fringe” 
rightist viewpoints. For certain analytic purposes 
the similarities among these ideologies are of 
primary interest, while for other purposes the dif- 
ferences require sharper focus. [See CAUDILLISMO; 
FALANGISM; FASCISM; NATIONAL SOCIALISM. ] 

Within the left, too, are ordinarily included a 
wide range of ideologies. There are the modern 
liberals who espouse continual but gradual reform 
by democratic methods, seeking to increase the 
Scope of governmental welfare functions while 
maintaining individual freedom from state control. 
There are also various types of socialist and com- 
munist viewpoints, which differ considerably in 
their programmatic goals and in the methods they 
advocate for achieving and maintaining political 
power. [See COMMUNISM; EQUALITY; LIBERALISM; 
SocIALIsM.] 

As a first step toward taking these variations 


into account, the right—left dichotomy is often di- 
vided further into a four-part spectrum: extreme 
right, moderate conservative (right-of-center), lib- 
eral and moderate left, and extreme left. Within 
this scheme, radicalism has in the past been equat- 
ed with the extreme left. More recently, however, a 
new usage has developed: radicalism is sometimes 
equated with “extremism” and includes both the 
extreme left and the extreme right. Witness the 
growing use of the term “radical right” (Bell 1955) 
in the United States to refer to antidemocratic 
racist, militarist, and neofascist movements. This 
usage involves the assumption that the far left and 
the far right, despite their differences in stated 
values and ultimate goals, have important similar- 
ities in their methods and psychological character- 
istics. [See MILITARISM.] 

An example of this approach is Lipset's six-cate- 
gory schema (1960). He distinguishes the right, 
center, and left (advancing the interests of the 
upper, middle, and lower classes, respectively) as 
basic political orientations; and he suggests that 
within each there are contrasting moderate-dem- 
ocratic and extremist-antidemocratic positions. 
Thus, in the case of the center, the opposing views 
are liberalism and fascism. He asserts that one’s 
political orientation is largely a function of class 
membership, whereas preference for a democratic 
or an extremist orientation is strongly influenced 
by personality. Those who hold an antidemocratic 
political orientation, whether of right, center, or 
left, will show greater personal authoritarianism 
than will their democratic counterparts. 

These categories have been used in a multiplic- 
ity of ways as bases for ideological classification 
and measurement. Some studies deal with wel 
defined political parties or groupings; others eve 
guish only between broad categories such as left- 
right, liberal—conservative, or moderate-extreme. 
We are still far from an understanding of the a 
gree to which particular ideologies are associate 
with distinctive personality syndromes. _ adi- 

Thus far we have spoken of conservatism, eS 
calism, and other viewpoints as inclusive, oe s 
atic ideologies. A political ideology is an or 
ing conception of society, a stance that is re eit 
in numerous sectors of social life On AER 
1929-1931). It deals not only with politica sae 
in the narrow sense but also with economic Pe Wi 
social stratification, methods of social ohana a 
liberties and civil rights, international relai o 
religion and the relation of religious ina a 
political institutions, the societal functions na in- 
ernment, and so on. Political ideologies in aspects 
clusive sense have been studied primarily aS 
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of political parties, movements, or other collective 
units. We characterize “the” ideology of a given 
party or group through the analysis of various doc- 
uments written by its spokesmen, singly or jointly. 
This characterization is an analytic construction 
derived from multiple sources. [See Levinson 1964; 
see also CONTENT ANALYSIS; IDEOLOGY.] 

The problem is complicated, however, when we 
seek to study the ideology of the individual citizen. 
Not everyone has a political ideology in any devel- 
oped sense. Some persons have fragmentary opin- 
ions on a few personally relevant issues. Others 
have a vaguely defined “leaning” without differen- 
tiated views. Still others, more politically involved, 
have a political outlook that cuts across the estab- 
lished party lines. It is therefore often difficult to 
identify an individual's political opinions as “con- 
servative” or “radical” in a systematic ideological 
sense. 

One way of dealing with this problem in empir- 
ical research is to obtain only a general index of 
the individual's political tendency or leaning, with- 
out examining his views in detail. For example, 
large-scale surveys may simply ask their subjects 
to categorize themselves politically or to indicate 
their preference for a party, movement, or political 
leader. Party affiliation or preference is a relatively 
useful indicator of ideological orientation in coun- 
tries having diverse and well-differentiated parties 
covering a wide range of the political spectrum. 
Even in this case, however, there are often major 
ideological variations within a single party; for 
example, right-wing versus left-wing segments of 
many European parties, or the French and Italian 
workers who vote communist without adhering to 
the general party ideology. In countries such as the 
United States, where the two major parties are 
ideologically more amorphous and internally di- 
vided, party preference as such tells little about an 
individual's political orientation. 

Another way of dealing with the problem is to 
shift from very general ideological categories to 
the study of more delimited areas and dimensions 
of political opinion. Numerous areas and dimen- 
sions have been singled out for investigation. For 
example, in the domain of foreign policy, the di- 
mension of nationalism—internationalism; in reli- 
pie outlook, the liberalism—conservatism dimen- 
Feige similarly in other realms, such as civil 
ie ee the rights of ethnic and minority groups, 

onomic and welfare functions of government, 

and so on. 
Vee ses are sometimes conceived of 
fina e reflections of the more generalized and 
ental ideological vectors noted earlier. To 


the extent that this conception is a valid one, 
measures of internationalism, religious liberalism, 
civil libertarianism, and advocacy of governmental 
welfare functions will be positively intercorrelated 
to form a more inclusive syndrome of over-all socio- 
political liberalism. It is beyond the scope of this 
article to review the empirical evidence bearing 
upon hypotheses of this kind. In general, there is 
evidence of moderate consistency in the individual 
across ideological domains, so that we may speak 
of relatively generalized liberal, conservative, and 
other orientations. At the same time, the correla- 
tions among measures of the various dimensions 
vary considerably in magnitude. For example, there 
is a moderate positive correlation between civil 
libertarianism and economic liberalism; on the 
average, advocates of the former viewpoint tend 
also to support the latter, and opponents of one 
tend also to oppose the other. However, the con- 
sistency is far from perfect. We must have room 
in our thinking for ideological patterns that cut 
across the simple liberal-conservative polarity. 

The complexities in the analysis of individual 
political ideology have important implications for 
the study of relationships between ideology and 
personality. How can we define ideological dimen- 
sions or patterns in a way that lends itself to psy- 
chological analysis? If we group our subjects only 
in terms of general ideological leaning, such as 
over-all liberalism and conservatism, a given cate- 
gory will contain persons with diverse concrete 
ideologies and personalities. Overly gross ideolog- 
ical categories may subsume very different view- 
points and their associated varieties of personality. 
On the other hand, there are disadvantages in 
restricting the focus to a specific issue or narrow 
domain. Personality is most likely to exert influ- 
ence on the individual's orientation toward a rela- 
tively broad sector of political concern. His stand 
on any particular issue may be strongly affected by 
immediate situational factors. 

There is a “forest and trees” dilemma here. We 
must find intermediate levels of analysis between 
the forest of global, undifferentiated ideological 
categories and the trees of specific, segmented 
opinions and attitudes. In general, work in the tra- 
dition of the sociology of knowledge errs mainly in 
the former direction, while survey research on pub- 
lic opinion and social attitudes suffers from over- 
specificity and the neglect of ideological patterning. 

Work in this field is further complicated by dis- 
ciplinary conflicts. It would be convenient if there 
could be a simple division of labor between the 
relevant disciplines. One might hope, for example, 
that political scientists and sociologists would estab- 
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lish a standard, widely applicable set of ideological 
categories; psychologists could then determine 
whether the different categories have distinctive 
personality correlates, This disciplinary division of 
labor has not been successful and probably cannot 
succeed. The study of relationships between person- 
ality and political ideology in principle requires 
some degree of theoretical synthesis across disci- 
plinary lines. The political categories and dimen- 
sions must have psychological relevance; the study 
of personality in this context requires a grasp of 
political issues and realities. Progress will come 
through the joint efforts of political analysts who 
have some sophistication about personality theory 
and of psychologists who can think in political and 
sociological terms. This approach was advocated 
long ago (e.g., Mannheim 1929-1931; Fromm 
1936), Within the last few decades some progress 
has been made in this direction by a number of 
investigators taking a multidisciplinary point of 
view. We turn now to a consideration of their work. 


Authoritarian personality and political ideology 


The concept of personal authoritarianism has 
probably received more empirical study, and more 
critical attention, than any other personality con- 
cept that has been utilized in the analysis of socio- 
political ideology. Its history extends over several 
areas of modern social science and involves several 
disciplines. Reviewing the development of work on 
authoritarianism provides an opportunity to clarify 
some of the theoretical and empirical problems in 
the general field of personality and ideology. 

The first major systematic attempts to formu- 
late a conception of authoritarianism as a person- 
ality syndrome were made in the 1930s by Wilhelm 
Reich and Erich Fromm. The two worked inde- 
pendently but had much in common. Both were 
clinical psychoanalysts in the generation after 
Freud, and they took a leading part in the effort 
to develop a psychoanalytic social psychology. Both 
had a “democratic Marxist” conception of society, 
and each attempted his own synthesis of Marxist 
and Freudian theory. They were primarily clinical 
and historical in mode of analysis, Reich’s major 
work on this problem was The Mass Psychology of 
Fascism (1933). [See REIcH.] Fromm (1936) pub- 
lished a long theoretical essay in the volume Stu- 
dien über Autorität und Familie. This book epito- 
mizes the newly emerging multidisciplinary spirit 
of the 1930s—a spirit that originated in Europe 
and was subsequently carried forward in the 
United States both by Americans and by European 
émigrés. Soon after, Fromm (1941) published his 
influential work on authoritarianism and historical 
change, Escape From Freedom. 


Reich and Fromm took as a central problem the 
ways in which personality is implicated (as cause 
and as effect) in social ideologies, movements, and 
structures. They saw the family as a primary 
agency of social control, not only transmitting ex- 
plicit values and beliefs but, even more important, 
inducing types of character structure and modes 
of authority-subordinate relationships that pro- 
vide a psychological underpinning for adult socio- 
political functioning. The authoritarian character 
is hypothesized as the psychological structure most 
receptive to, and most required by, rigidly hier- 
archical antidemocratic social structures. The rise 
of fascism offered a prototypic example for the 
social-psychological analysis of authoritarianism. 
[See SOCIALIZATION.] 

As a parallel development in the 1930s, Ameri- 
can social psychologists were beginning the sys- 
tematic study of political opinion. Their concepts, 
methods, and approach reflected the then prevail- 
ing Zeitgeist of American psychology. Conceptually, 
their focus was not on broad ideological patterns; 
it was, rather, on more segmental, quantitatively 
measured social attitudes. The concept of attitude 
was systematically formulated by Allport (1935) in 
a widely influential article. This concept had great 
appeal, both because of its analytic usefulness and 
because it readily lent itself to measurement and 
to statistical analysis. Scale methods of attitude 
measurement were developed by Thurstone and 
Chave (1929), Murphy and Likert (1938), and 
others, and in the ensuing years a tremendous 
number of scales were developed. In the area of 
conservatism-—liberalism—radicalism, attitude scales 
were correlated with all manner of other variables, 
ranging from group membership to school achieve- 
ment to the readiness to tolerate pain. [An extensive 
review of the early literature is given in Murphy, 
Murphy, & Newcomb 1931; see also ATTITUDES: 
ScaLiNG; and the biography of THURSTONE.] Bs 

By the early 1940s attitude research had eee i 
a major field in academic social psychology: oe 
ever, the studies typically had a concrete, a 
retical, “shotgun” character. Broader mem i 
perspectives were needed. On the one a af 
early investigators failed to regard attitu ei a 
aspects of more inclusive ideological patte Hs 
consider their meaning for the individual, ca 
relate them to other, motivational-affective © ail 
tive components of personality. On the pecs pec- 
they had only a rudimentary sociological va al 
tive and were largely unable to place ey 
ideology within a larger social and cultural i 

A seemingly unbridgeable gulf separa 
European and the American lines © 


f investigati 
idge 
Perhaps the first large-scale effort to bride 
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gap was the collaborative research of Adorno, 
Frenkel-Brunswik, Levinson, and Sanford; the 
study began in 1943 and was published in 1950 
under the title The Authoritarian Personality. San- 
ford (1956) provides a fuller review of the intel- 
lectual origins of this work. [See also the biography 
of FRENKEL-BRUNSWIK.] 

The research on authoritarianism drew both 
from the European tradition—with its psychoana- 
lytic-sociological orientation and clinical form of 
analysis—and from the more measurement-ori- 
ented, attitude survey approach of American social 
psychology (in particular the previous work of 
Stagner 1936; Murphy & Likert 1938; Newcomb 
1943; and more clinical study by Maslow 1943). 
It used scale methods, but its analytic emphasis 
was on ideological patterns rather than on single 
attitudes. In conceiving of ideology as an aspect 
of personality, it had roots in psychoanalysis and 
related personality theories, notably that of Murray 
(see Explorations in Personality 1938). Personality 
variables were measured by a variety of techniques, 
including scales, projective tests, semistructured 
interview, and intensive case analysis (again fol- 
lowing Murray in his approach to personality as- 
sessment), 

The research dealt primarily with the relation 
of ideology to personality—with authoritarianism 
as an ideology—personality syndrome—and not 
with the sociology of ideology. Sociological con- 
siderations were, however, an intrinsic part of the 
theoretical approach, and the work has stimulated 
a good deal of discussion and research along socio- 
logical lines (e.g., Christie & Jahoda 1954; Lipset 
1960; Hagen 1962; McClosky 1958; McClosky & 
Schaar 1965; Levinson 1964; DiRenzo 1963). 

The program of research on the authoritarian 
Personality began as a study of anti-Semitism 
(Frenkel-Brunswik et al. 1947). Its ideological 
focus gradually broadened to encompass ethno- 
centric ideology, conceived of broadly as a system 
Of beliefs, attitudes, and values—a way of thinking 
and feeling—about “{n-groups” and “out-groups” 
at every level of social organization from the local 
to the international scene. The overriding concern 
of the research was the nature of, and the rela- 
coe between, ethnocentric ideology and au- 
Horitarian personality. To a lesser though still 
Significant degree, attention was given to other 
ideologica] domains, notably politics and religion. 
(See AntI-SemitisM.] 

Political ideology was studied primarily with re- 
ae the liberalism—conservatism dimension. 
ian Ease of the Politico-Economic Conserva- 
oni C) Scale emphasized that the scale yielded 

Y a rough right-of-center versus left-of-center 


distinction and did not distinguish qualitatively 
important variations within each of these. The PEC 
Scale showed statistically significant but moderate 
correlations, averaging .4 to .5, with the various 
measures of ethnocentrism and authoritarianism. 
Two main conclusions were drawn. First, political 
conservatives are, by and large, more ethnocentric 
and authoritarian than are political liberals. Sec- 
ond, there are wide variations in authoritarianism 
within both the right-of-center and the left-of- 
center groupings, and these variations may be re- 
lated to specific patterns and styles of political 
orientation. In The Authoritarian Personality a 
number of specific concepts were put forward in 
Levinson’s chapter on the PEC Scale and in 
Adorno’s chapter on the interview material. For 
example, Levinson proposed a distinction between 
the “genuine” conservative and the “pseudo” (au- 
thoritarian) conservative, whose ideologies, while 
alike in many respects, are fundamentally different 
in others. Adorno, in his discussion of left-of-center 
viewpoints, posits several syndromes, such as the 
“ticket liberal,” in which authoritarian features play 
an important part. 

The Authoritarian Personality thus gave evi- 
dence that authoritarianism has diverse modes of 
political expression. This point merits emphasis, 
since the book has been alleged to equate authori- 
tarianism with the political right, equalitarianism 
with the left (Shils 1954). The research must be 
seen in the context of the times—World War 11 
and its aftermath—during which the most destruc- 
tive forms of ethnocentrism (mass genocide) and 
fascism were the paramount issues. The survey 
measure of authoritarianism was called the F scale, 
F referring to “potential for acceptance of fascist 
ideology.” It might better have been called the A 
scale, Authoritarianism is, indeed, a primary psy- 
chological source of receptivity to fascist ideology, 
but it may take other ideological forms as well. 

Several studies conducted during and after 
World War 1 also deal with the relationship of 
authoritarian personality to fascist ideology. Dicks 
(1950) conducted intensive clinical interviews 
with German prisoners of war, investigating per- 
sonality differences between the strongly pro-Nazi 
soldiers and those who opposed or seriously ques- 
tioned the Nazi ideology. He derived an authori- 
tarian syndrome strikingly similar to that of the 
American study. Erikson’s classic analysis, “Hitler's 
Imagery and German Youth” (1942), is another 
example of this genre. 

In the decade following publication of The Au- 
thoritarian Personality there were literally hun- 
dreds of studies derived from its concepts and 
measures. Most of them were relatively narrow in 
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focus, theoretically limited, and concerned with 
measurement or technique rather than with sub- 
stantive issues. The 1950s in the United States 
were dominated by McCarthyism and a spirit of 
intellectual retrenchment, and few new directions 
were established in the social-psychological anal- 
ysis of authoritarianism and political ideology. 
There were, however, some significant contribu- 
tions, of which the following are major examples. 

Several large-scale surveys of U.S. national 
samples confirmed the earlier findings that au- 
thoritarianism is associated with a nationalist- 
militaristic stand in world affairs, with opposition 
to a civil-libertarian position, with racial and 
ethnic prejudice, and the like (see, for example, 
McClosky 1958). Inquiries into authoritarianism 
and social outlook within the working class were 
carried out by Kornhauser, Sheppard, and Mayer 
(1956) in the United States, by Spinley (1953) in 
England, and by Lipset (1960) in several coun- 
tries. These studies suggest that the incidence of 
authoritarianism is relatively high in the working 
class and is a function of the psychosocial condi- 
tions of working-class life: material, cultural, and 
emotional deprivation; familial disorganization; 
coercive, stressful, anxiety-inducing pressures. In 
their political views, the more authoritarian mem- 
bers of the working class tend, more often than 
the others, either to be highly conservative sup- 
porters of the status quo or to hold “extremist” po- 
sitions. In an extensive analysis of the bases for 
economic growth in countries at different stages 
of industrialization, Hagen (1962) attributes a 
major causal role to the influence of “innovative” 
(as against authoritarian) personalities. Christie 
and Jahoda (1954) brought together a series of 
critical discussions of the theory, methodology, and 
implications of The Authoritarian Personality. The 
research literature on authoritarianism until 1956 
was reviewed by Christie and Cook (1958). Efforts 
to relate authoritarianism and political ideology 
within a broader sociopsychological framework 
were made by Frenkel-Brunswik (1952) and 
Levinson (1957; 1958). 

Emerging from the authoritarian personality 
tradition, but differing from it in several important 
respects, is Rokeach’s The Open and Closed Mind 
(1960). His shift in theoretical orientation is im- 
Plied by the use of the term “mind” rather than 
“personality” in the title. Rokeach takes a strongly 
cognitive approach deriving from Kurt Lewin and 
gestalt psychology, although he also acknowledges 
a debt to Fromm and to Hoffer (1951). He offers 
the concept of dogmatism as an alternative to that 
of authoritarianism. His formulation deals primar- 
ily with the structure of belief systems and modes 


of cognitive functioning, rather than with motiva- 
tion, ego defense, and psychodynamics, Corre- 
spondingly, his research involves experimental, 
test, and survey methods rather than clinical and 
observational ones. 

Rokeach states the hypothesis that dogmatism 
(and authoritarianism) is uncorrelated with the 
right—left dimension of political ideology. He sug- 
gests that there are dogmatic and nondogmatic 
viewpoints within every grouping—left, center, 
and right—along the political spectrum. He singles 
out the communists as a major dogmatic subgroup 
within the grouping on the left. He obtains some 
support for this hypothesis in an English sample, 
where a small group of communists exhibited a 
high mean score on his Dogmatism Scale. How- 
ever, limitations of sampling and other difficulties 
raise doubts about the generalizability of this find- 
ing. Contrasting results were obtained by DiRenzo 
(1963) in an extensive study of members of par- 
liament in Italy. He obtained a mean score on 
Rokeach’s Dogmatism Scale for each political party 
in the Chamber of Deputies. The rank order of 
these mean scores was almost identical with the 
order of the parties from right to left on the po- 
litical spectrum. The neofascists exhibited the 
highest mean score (most dogmatic); communists, 
the lowest. Eysenck (1954) interprets the data of 
his research in England as indicating that com- 
munists and fascists have in common a high de- 
gree of tough-mindedness (a syndrome closely 
related to dogmatism and authoritarianism). His 
work has been strongly criticized by Rokeach and 
Hanley (1956) and by Christie (e.g., 1956), who 
point to serious deficiencies in sampling, data 
analysis, and data interpretation. Eysenck has, 
however, provided a defense (e.g., 1956). 

Although a great deal has been written about 
the psychological bases for Communist party we 
bership and ideology, remarkably little Gpr 
evidence has been gathered. As Almond (195 ) 
has observed, there are wide variations in the “ap- 
peals of communism” between countries, between 
social classes, between transient and on 
members, and the like. More adequate saa 
the multiple relationships between personaly na 
communist ideology would have to take into a 
sideration the significant variations in sa is 
text and in the character of the ideology. cole 
true of other political parties and ideologie: i 
well. In general, it would appear that epee: 
is more closely related to concrete political ee oe 
than to party affiliation or preference. Th afd 
appreciable variations in individual ideology Sig. 
personality within all parties. It is ther ae 
sary to supplement gross surveys of large $ 
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with the more intensive study of individuals, giv- 
ing equal attention to ideology, personality, and 
engagement in the societal matrix. 

What, then, can be concluded regarding the re- 
lation of personal authoritarianism to political 
ideology? On the average, there is significantly 
greater authoritarianism to the right of center than 
to the left. The relationship is, however, far from 
simple. More systematic analysis of qualitatively 
different patterns of ideology along the right- 
center-left continuum is urgently needed, both for 
purposes of political analysis and for the study of 
ideology as an aspect of personality. Authoritarian 
personality theory calls attention first of all to the 
democratic-antidemocratic dimension of ideology. 
One of its central postulates is that authoritarian 
individuals have a special affinity for antidemo- 
cratic viewpoints in all spheres of social life— 
political, religious, familial, organizational (Levin- 
son 1964). In the case of political ideology, it 
predicts that authoritarianism will be associated 
with antidemocratic ideologies, whether of the 
right, center, or left. The choice of general political 
direction is related to class membership, as Lipset 
has shown, and to a variety of other social and 
psychological variables yet to be elucidated. 

The ideological relevance of authoritarianism is 
clearest, and most fully documented, when we 
shift analytic focus from the broad categories of 
right, center, and left to more specific ideological 
dimensions. There is massive evidence that au- 
thoritarianism is significantly associated with the 
following: emphasis upon rigid hierarchy and 
Stratification in political and other structures; re- 
jection of democratic political processes; reliance 
upon the “great leader” in solving social problems 
and upon coercive social controls in maintaining 
Social order; chauvinistic nationalism as a stance 
toward one’s own nation and toward interna- 
tional affairs; an ethnocentric view of relationships 
among various groups within the nation; readiness 
to Place severe restrictions upon civil rights and 
civil liberties; religious fundamentalism and its 
extension into political and other spheres; puni- 
tiveness as a basic emotional-moral response to 
Mna ( expressed ideologically, for example, in 
foe upon military solutions to interna- 
ihe oy ibe or upon the use of punishment in 
faa errence of legal crime and nonconformity ); 
fee: ae of innovation, experimentation, and open- 

n political and other systems. 
een research to date clearly establishes authori- 
ha ae as a personality constellation having 
Dara ant implications for political ideology and 
a Scale Much more work remains to be done, 
, with regard to the components and vari- 


ant forms of this syndrome and with regard to 
its political manifestations under different social 
conditions. 


Other ideology—personality constellations 


Since the late 1950s interest in the social psy- 
chology of ideology has expanded beyond concern 
with authoritarianism, and new personality—ideol- 
ogy constellations are being investigated. Although 
the primary focus of these studies is not on the 
right-left dimension as such, they have a bearing 
on this dimension and, what is perhaps more im- 
portant, they reflect the emergence of new dimen- 
sions in the analysis of political ideology and 
personality. An extensive review of this work is 
beyond the scope of the present article; it must 
suffice to indicate briefly some of the relevant con- 
cepts and directions of inquiry. 

The “conspiratorial” view of politics has been 
subjected to psychological analysis by Hofstadter 
(1965). Although he uses the term paranoid style 
to convey the emotional quality of this view, his 
interest is not in psychopathology or psychiatric 
diagnosis but in the fantasies, ego defenses, and 
modes of thought that give coherence, meaning, 
and emotional appeal to the conspiratorial inter- 
pretation of political affairs. Exploring ideology— 
personality relationships through a series of in- 
tensive case studies, Lane (1962) uses the term 
cabalist to identify a similar political orientation. 
Bittner (1963) takes a historical and phenomeno- 
logical approach in seeking to delineate a geno- 
typic conception of radicalism. He attributes a 
number of distinctive psychological properties and 
themes to the radical orientation. One may then 
ask whether an orientation having these psycho- 
logical properties would have special appeal to 
individuals having corresponding personality char- 
acteristics. Although Bittner does not take this last 
step, his analysis suggests interesting leads for per- 
sonality study. Similarly, Talmon (1962) describes 
various psychological properties of millenaristic 
ideology. She is concerned primarily with the social 
antecedents and consequences of millenaristic 
movements, but her discussion is also richly sug- 
gestive of hypotheses regarding their psychological 
bases. [See MILLENARISM; PARANOID REACTIONS. ] 

The 1961 translation of Scheler’s classic essays 
on ressentiment (1912) brings to wider attention 
an ideology—personality constellation that merits 
further study. Scheler conceives of ressentiment 
as a complex syndrome involving conscious atti- 
tudes, feelings, and moral judgments as well as 
unconscious defenses and wishes. The ressenti- 
ment-laden person tends to devaluate authoritative 
persons and groups, not on the basis of genuine 
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commitment to his espoused values but, rather, 
on the basis of secret envy, vindictiveness, and im- 
potent rage. As Lewis Coser points out in his In- 
troduction to the volume, Scheler’s formulations 
are important in their own right and as forerunners 
of more recent analyses of alienation and anomie. 
[See the biography of ScHELER.] Since the latter 
concepts have received greater attention in con- 
temporary work, we shall consider them more 
fully here. 

The concepts of alienation and anomie have a 
long history in sociology but have only recently 
been singled out for extensive theoretical discus- 
sion and empirical investigation. The concept of 
alienation has its modern origins in the psycho- 
logical writings of Marx (see Fromm 1961), who 
postulated that the individual in capitalist society 
is estranged both from his labor and from himself. 
Without necessarily accepting Marx’s assumptions 
about the societal roots of alienation, many con- 
temporary social scientists have inquired into its 
nature, its social and psychological bases, and its 
manifestations in various sectors of social life 
(e.g.. Keniston 1965; Lane 1962). Alienation has 
been variously defined to include component dimen- 
sions such as sense of powerlessness, isolation, and 
inefficacy; lack of social rootedness; self-estrange- 
ment; noncommitment, cynicism, and failure of 
identity. It may be expressed in apathy, in empty 
conformity, or in “rebellion without a cause.” Po- 
litically, it may be found at every point along the 
right—left spectrum and may be manifested in lack 
of political participation or in zealous dedication to 
a party or movement. The analytic usefulness of 
this concept at present is limited by the multiplicity 
and ambiguity of its definitions. At the same time, 
it has been extremely useful in generating new 
lines of research and new perspectives on the social 
psychology of politics, [See ALIENATION. ] 

The concept of anomie was initially formulated 
by Durkheim (1897). He identified the anomic 
social system as one characterized by a relative 
failure of normative order, a lack of moral regula- 
tion over human strivings and passions. He gave 
evidence that anomic societies are characterized by 
relatively high rates of deviant and self-destructive 
behavior, including suicide, In seeking to explain 
these phenomena, he postulated that anomic soci- 
eties produce certain psychological states in many 
of their individual members: insatiable craving, 
sense of futility, lack of responsibility and moral 
purpose, emotional emptiness and despair. He did 
not, however, deal with the fact that there are 
wide individual differences in psychological anomie 
among the members of a given society and that 
the development of this state of mind may also be 


a function of individual personality. [See the biog- 
raphy of DURKHEIM; see also Inkeles 1959.] 

Recent investigations, given impetus by the work 
of Srole (e.g., 1956), have extended Durkheim's 
formulation to include the concept of the anomic 
personality. McClosky and Schaar (1965) present 
a systematic review of this theoretical develop- 
ment, as well as a new conceptualization in which 
anomie is regarded as a function of both person- 
ality and social conditions and some research 
data derived from large-scale surveys. They find 
significant correlations between anomie and au- 
thoritarianism, alienation, ethnocentrism, politi- 
cal extremism, sense of political futility, misan- 
thropy, mistrust, and punitiveness. While this 
Study requires further validation and refinement, 
it strongly suggests that anomie is a significant 
concept in the analysis of personality and political 
ideology. It also indicates the need for further 
clarification of the concepts of anomie, alienation, 
and authoritarianism. 


In conclusion, we note that the study of per- 
sonality and political ideology is entering a new 
and more multidisciplinary period. It is being in- 
creasingly recognized that work on this problem 
extends beyond the traditional province of “indi- 
vidual psychology”; the psychologist who would 
contribute to it must have some appreciation of 
the political, social, and historical context within 
which individuals form and act upon their ideolo- 
gies (Erikson 1958). At the same time, historians 
and social scientists working in this field must 
develop a more systematic and profound concep- 
tion of the individual as an active agent in m 
historical process—an agent capable of bine 
conformity, of irrational destructiveness, and 0 
rational, responsible choice in accord with his 
interests, values, and motives. Our further progress 
will thus require the conjoint perspectives of er 
sonality theory, political theory, and general soc 


ological theory. Danze J. LEVINGON 


[Directly related are the entries CONSERVATISM; Ie 
ALISM; RADICALISM. Other relevant material "a 
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1 
OVERVIEW 


The field of personality measurement, viewed in 
the perspective of its short but busy history, has 
progressed in a fumbling, digressive manner. It 
may now, because of hard-won recognitions, be 
ready for significant and cumulative advances. 


The problem and strategy 


The reason for the disappointing disparity be- 
tween the energies already expended and the ac- 
complishments which may be certified lies in the 
uncertainty which has surrounded the concept of 
personality and the naiveté in psychology about the 
logic and justification of measurement. 

In the physical sciences and in some subareas of 
psychology where intuitions are strong and widely 
held, one need not be acutely concerned with the 
distinction between a concept and the way in which 
that concept happens to be measured. By and large, 
measuring length by a ruler and weight by a scale 
does not generate controversy; the relation of these 
concepts to their respective methods of measure- 
ment seems obvious and beyond dispute. In the 
study of personality, however, there is no escaping 
an immediate, insistent, incessant preoccupation 
with the problem of linking concepts and empirical 
Operations. Personality measurements, if they dis- 
criminate at all, always express, in a pure or im- 
pure form, explicitly or implicitly, a personality 
concept in terms of which the differential behavior 
may be understood. Personality concepts, if they are 
not hopelessly vague or inconsistent, always imply 


specifiable behavioral differences in people. These 
differences may be impractical to study or they 
may be judged as trivial, but the concept requires 
that they exist. 

The distinction between, yet interdependence of, 
concept and measurement in the study of person- 
ality requires a spiraling interplay of these levels 
of analysis—concepts should suggest approaches 
to measurement, and measurement should refine 
conceptual formulations. This reciprocal improve- 
ment of theory and method has come to be called 
the process of “construct validation” (Cronbach & 
Meehl 1955). Unhappily, in the adolescent and 
polyglot field of personality, progress has been more 
often circular than helical. And yet, if a science of 
personality is to be formed, the responsibilities of 
coupling concept and measure must be met, 

The domain of personality psychology is sprawl 
ing and fuzzily delineated. With so many defini- 
tions of the field and with a plethora of vaguely 
redundant but not readily integrated concepts, it 
is difficult to know where to try to begin measuring 
the nebulous notion of personality and toward 
what ends, [See PERSONALITY. ] 

In order not to be immobilized by the endless 
conceptual possibilities, two tactics have been 
adopted. The first of these requires the courage to 
be arbitrary—to propose, essentially by fiat, that a 
small set of concepts comprehends the important, 
necessary ways of differentiating people. These 
concepts need not constitute—and thus far have 
never been—a formal model of personality. Rather, 
they represent simply a set of dimensions or ideas 
in terms of which their expounder finds it conven- 
ient and congenial to conceptualize his view 0 
“personality.” Selection of these concepts may be 
supported by observational, introspectional, clin- 
ical, test, or experimental data, of varying degrees 
of quality and persuasiveness, but often is not 
These concepts serve—or should serve—a heuristic 
purpose; they provide a way to begin and, subse 
quently, a way of indicating and integrating sy he 
phenomena that may fall under the rubric of Ti 
conceptual assertion. Presumably, the wae 
of the concepts will be tested, and they will be e 
shaped under the impact of the empirical ed 
quences they entail. The reader may wish to ge 
sider, as an instance of this general approach, fin: 
attempt of Lazarus (1966) to construct and deli 
eate a concept of stress [see STRESS]. Joyed. 

The second tactic, now increasingly emp ee T 
eschews the seeming arrogance of arbitrary a 
ceptualization. It begins with, and even Bea 
diversity but rests its fate in analytical me and 
believed capable of identifying the lawfulness 
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conceptual structure of the personality phenomena 
which have been collected for study. Patternings 
that are found to be stable when these means are 
applied to variegated data are presumptive evidence 
of functional entities. These functional entities may 
be named and employed thereafter as integrative 
and consequential concepts. By and large, this ap- 
proach has employed the method of factor analysis, 
with all of its strengths and weaknesses. A good 
illustration of the approach is the work of R. B. 
Cattell, recently presented in an integrating, sum- 
mary volume (1965). [See FACTOR ANALYSIS and 
Traits. | 

However an investigator chooses to proceed in 
articulating and delimiting the sense in which he 
will define “personality,” he will confront the dis- 
maying problem of “measurement.” What is meas- 
ured or recorded is behavior, and behavior has 
many determinants and many meanings. Identical 
behaviors may be the expression of fundamentally 
different mediating sequences or, conversely, dif- 
ferent behaviors may be essentially equivalent in 
the functions they serve or the processes from 
which they eventuate. Behavior is susceptible to 
many transient and otherwise conceptually irrele- 
vant conditions; it is susceptible to being organized 
in many ways and to many levels of interpretation. 
In all this flux, how are anchor points to be estab- 
lished? 

By way of illustration, consider the concept of 
anxiety and its measurement. Perhaps all concep- 
tualizations of personality leave room for or try to 
€ncompass the notion of anxiety; by focusing on 
the measurement of anxiety, most of the problems 
and logic of personality measurement may be 
illustrated. 

The term “anxiety” takes on scientific meaning 
only by its empirical concomitants and conse- 
quences. But, for the present, the sense with which 
the term should be understood here is perhaps suffi- 
ciently conveyed by resort to the tautology of a dic- 
tionary definition (Webster's New International 
Dictionary, second edition). Thus, anxiety means 
Painful uneasiness of mind respecting an impend- 
ing or anticipated ill; a state of restlessness and 
oo with a distressing sense of oppression 
ree heart; expectancy of evil or danger with- 
ton ground ...; concern, dread, fear, 
ness Oding, misgiving, worry, solicitude, uneasi- 

» apprehension.” [See ANXIETY.] 
mea the notion of anxiety refers to subjective 
en nce. And because anxiety is important re- 
less of whether it is theoretically conceptual- 
aes a prime mover, as derivative, or as epiphe- 
on, psychologists wish to measure it. 


Four basic procedures 


There are four classes of indicators that may be 
used: (1) a subject may be observed in his every- 
day life and actions (presumably without his ac- 
tions’ being affected by these observations) and 
from his behaviors a judgment is made as to 
whether he is anxious; (2) a subject may be asked, 
by means of a questionnaire, to state directly (or 
in ways he may not completely understand) 
whether he is anxious; (3) a subject may be placed 
in a controlled or test situation designed to elicit 
special behaviors or products relevant to anxiety; 
(4) a subject’s physiological reactions may be 
assessed by various instruments, to determine 
whether he shows certain responses or changes 
presumed to be indicants of anxiety. 

Observation. We all form impressions of other 
people in the course of observing them. If these 
impressions can be objectified and systematized, 
then there is no reason why observation cannot be 
employed as one kind of measurement of person- 
ality. Observation has an appeal because of its 
closeness to experience and because of its apparent 
simplicity. It has been criticized for the other side 
of these virtues—the data generated are subject 
to observer idiosyncrasies and often do not meet 
scientific standards of reproducibility. The method 
depends heavily on the quality of the observers, 
the range of contexts in which judges observe the 
subject, and the way in which the observations are 
codified so as to formulate a dimensional index. 
With close attention to these concerns this ap- 
proach becomes cumbersome, but it still appears 
to be the method of choice when complex con- 
cepts are being considered or when a problem area 
is being approached for the first time. There are 
not many instances of appropriate usage of this 
approach. For consideration of the ways and 
means of employing it effectively, see, for example, 
Guilford (1954, chapter 11), Peak (1953), Block 
(1961), [See OBSERVATION. ] 

When this method of measuring is used, anxiety 
might be indexed as some composite of the judg- 
ments of several psychologists or psychiatrists, 
each judge having observed the subject in a differ- 
ent but evocative situation and each judge referring 
the concept of anxiety to his professional (and 
personal) understanding of the term. Each judge 
having placed the subject at a (numerical) point 
along an anxiety continuum, some average value 
of these numbers can serve as a score or measure- 
ment of the subject with respect to anxiety level. 

Questionnaires, or inyentories. By far the most 
frequent technique invoked to measure personality 
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has been the questionnaire. Typically, the question- 
naire, or personality inventory, will contain a wide 
variety of questions on highly personal matters, 
which the subject is asked to answer. Such de- 
vices are extremely convenient for the investigator; 
explicit answers are provided by the subject re- 
garding a host of matters, and these responses lend 
themselves immediately to a variety of analytical 
techniques. However, there are a number of haz- 
ards in the use of inventories, and research has 
for a long time been concerned with identifying 
and coping with these problems. 

In the initial usages of personality inventories, 
it was presumed that the responses of subjects 
could be believed and that the interpretative or 
conceptual significance of a response was clearly 
apparent. This presumption is by no means wholly 
untrue, but its applicability is conditioned by the 
motivation of the respondent vis-a-vis the nature 
and context of the queries and by the complexity 
of the relations through which an inventory item 
relates to a personality dimension, 

People are astonishingly open and cooperative 
when approached in an interested, supportive, and 
nondemeaning way. Too often, however, psycholo- 
gists have impersonally handed out highly personal 
questionnaires for reasons not appreciated or be- 
lieved by the recipients. It is not surprising, under 
such circumstances, that deliberate innocuity of 
response has been and often still is the order of 
the day, Indeed, failure to cover up and appear 
blandly conventional in such settings appears to 
be a sign of psychopathology. But in the context 
of seeking psychiatric help, it is far less likely that 
someone will dissimulate in his responses to an 
inventory. In any event, many questionnaires in- 
corporate items which, separately or in conjunc- 
tion, provide an indication of the frankness of the 
respondent and hence of the validity of his re- 
sponses. There have been increasingly successful 
attempts to construct questions or combinations of 
questions the full implications of which almost no 
respondent could be expected to understand. 

A more interesting problem is the establishment 
of the psychological import of questionnaire re- 
sponses. The statements to which a subject is asked 
to respond may or may not be ambiguous. How- 
ever, the implications of a Single response, offered 
without a context, almost invariably are obscure. 
Just what is meant by an affirmative answer to 
the statement, “I know who is responsible for my 
troubles”? Is the respondent acknowledging that 
he is his own worst enemy or is he Projecting an 
accusation of persecution upon someone else? The 
item “I like tall women” is a Consistent empirical 


indicator of impulsivity in American males, but 
why? Seemingly equivocal items and items only 
deviously related to personality dimensions abound 
in contemporary inventories; and indeed, herein 
lies their strength, since even a sophisticated sub- 
ject finds it difficult to anticipate or control the 
dimensional implications of his response, 


It was the great conceptual contribution of the 
Minnesota Multiphasic Personality Inventory, still 
the dominant questionnaire used in the United 


States, to turn away explicitly from the apparent 
cogency of items and turn instead toward a re- 
quirement that items (and the scales subsequently 
developed from these items) discriminate between 
behaviorally or socially different criterion groups. 
This attitude toward test construction is not so 
blandly empirical as has been believed—there are 
still conjectures regarding fruitful questionnaire 
statements, and there is recognition that much can 
be learned from thoughtful reflection on the proc- 
esses that might underly the emergence of an un- 
anticipated discriminator. But first there is the 
insistence that beyond question the items separate 
groups. With this approach, well applied, the prob- 
lem of dissimulation largely vanishes, since re- 
spondents cannot foretell the implications of their 
answers, To date, because of the formidable labors 
involved in developing a widely ranging inventory, 
applying it to large and variegated samples, and 
then exploring for discriminating items, the full 
potential of this approach has not been realized. 

In particular it has often been the case that cri- 
terion groups are selected on the basis of poor of 
uninteresting or confused criteria, Thus, subse 
quent empirical analyses can provide only weak or 
trivial or ambiguous response to the particular a 
search inquiry. There has been little eae 
consideration given to the problem of which ¢ i 
terion groups might, should, or must be pe 
nated. However, with the advent of the one 
which can perform the vast clerical labors a, mie 
parison, it may be expected that the multiple at 
trasts required to build more than a narrowly abe 
fied validity will be forthcoming. For a on ied 
of some of these problems, see Block (196 ) 
Campbell (1957). sori ROH 

With regard to the concept of anxiety, i Ba 
be necessary first to contrast the inven i 
sponses of a group of individuals conni by 
experts to be anxious with responses eae rile 
a comparable group of nonanxious iN aifferen- 
Those statements affirmed or denied with s group 
tial frequency by the members of the pie consti- 
and corroborated on other samples w abli 
tute an anxiety scale. Once refined and es 
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as a valid discriminator, this scale could be em- 
ployed to identify individuals responding to items 
in a way characteristic of the anxious individuals 
initially studied. Although the significance of a 
subject’s response to any one item relating to 
anxiety can readily be questioned, it is reasoned 
that the closer the correspondence between an in- 
dividual’s response pattern and the response pat- 
tern typifying the anxious group, the more likely 
it is that he is anxious. 

Interviewing. It may be noted that the fre- 
quently used technique of the interview falls be- 
tween the two categories of personality measure- 
ment so far discussed. If the interview employs 
standardized questions, systematically asked of the 
subject, then the interview may be regarded as a 
verbally administered questionnaire. If the inter- 
view is unstructured and simply provides the inter- 
viewer with an opportunity to observe subjects in 
a social (and tense) situation so that the inter- 
viewer can formulate some impressions, then the 
interview falls into the classification of observa- 
tions as a basis for personality measurement. [See 
INTERVIEWING, article on PERSONALITY APPRAISAL.] 

Test situations. Observations are heavily and 
cumbersomely dependent on the quality of the 
judges used, their number, and the significance of 
the subject's behaviors sampled; questionnaires are 
convenient enough but often appear too distant 
from actual behaviors to earn a trust in their rele- 
vance. Because of the deficiencies or constraints 
in these approaches, psychologists often turn to 
Specially devised test situations as a basis for elicit- 
ing behaviors to serve as indicants of personality 
variables. This orientation has special promise, as 
yet only partially achieved because the special 
Premises on which it depends are not sufficiently 
understood. 

Test situations are constructed by the researcher 
to present to the participating subject a psychologi- 
cal context or influence or choice formulated in 
a way that makes it relevant to the personality 
Variable under study, The response of the subject 
to the environmental demand upon him can, pro- 
vided the subject appropriately interprets the test 
situation, serve as an indicator of the selected per- 
Sonality characteristic the test situation was de- 
Signed to elicit. The crucial requirement of this 
pace upon which all of its usefulness depends, 
ia gnified by the italicized phrase in the preced- 
A eene gil his response is to have the de- 
of ai eee significance and if the responses 

erent subjects are to be considered as com- 
ain the subject must apprehend the test situa- 
the way the experimenter prefers. 


More often than psychologists have wished to 
acknowledge, subjects react to experimental situa- 
tions in ways that foil interpretation or that are 
idiosyncratic. The usefulness of the test situation 
may be voided in two ways. First, unrealized by 
the experimenter, different subjects may enter the 
situation with different attitudes or sets or expecta- 
tions, which because of their heterogeneity cause 
behavior within the test situation to have equivocal 
implications. Thus, an experimental situation con- 
cerned with competition and cooperation, employ- 
ing actual money as a reward, might fail to recog- 
nize the different premises with which, say, a 
desperately poor subject, a wealthy subject, and a 
devout Quaker might conceive of their participa- 
tion in the experiment. Similar responses could 
well have different bases; different responses could 
be open to multiple interpretations. Of course, 
these different premises can themselves become 
the focus of study, but if they go unrecognized 
and if subjects are unwarrantedly presumed to be 
homogeneous, then the results derived from the 
experiment can have little significance. 

Second, also without realizing it, the experi- 
menter may have in fact defined the test situation 
in ways different from those he had intended, so 
that the behavior of subjects, although they may 
be homogeneous enough, responds to quite another 
(and unrecognized) issue or influence than the 
experimenter has conceptualized. Thus, an experi- 
menter may seek to convey in his test situation 
the false bit of information that the subject’s per- 
formance on some task is unusually poor. The ex- 
perimenter’s goal in communicating this “fact” 
may be to motivate the subject intensely. However, 
the subject's own view of his performance and 
general competence, amplified perhaps by ineffec- 
tive acting by the experimenter, may cause him 
(and all other subjects) to disbelieve the experi- 
menter and to behave on quite different grounds. 
For reasons of tact and tacitness the experimenter 
perhaps may not be confronted with the knowledge 
of this state of affairs, and he may therefore pre- 
sume that the subjects responded in the way his 
situation directed them. Failure to check appro- 
priately on the congruence between experimenters’ 
and subjects’ conceptions of the test situation is a 
frequent error. 

To exploit situational tests maximally for the 
purposes of measuring personality, a firmer recog- 
nition is required of the interpersonal nature of 
psychological experimentation and of the mani- 
fold structurings of a test situation which can be 
developed by the facile minds of human beings. 
An experimenters presumption about the proc- 
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esses operating within his experimental situation 
is insufficient alone—it must be supported by other 
views congruent with it. A consensus derived from 
the judgments of a group of psychologists is help- 
ful, but the most supportable basis for evaluating 
a test situation is the consensus perception devel- 
oped by actual subjects or individuals equivalent 
to the subjects being studied. Chein (1954), Gibson 
(1960), and Jessor (1956), each from a different 
viewpoint, have written thoughtfully on the matters 
involved in environmental specification. 

Further, the presence and the potency of the ex- 
perimenter’s effects upon his results must be as- 
sessed if generalizations beyond the immediately 
obtained results are to be made, In direct and in- 
direct ways experimenters or examiners influence 
the behaviors of the subjects they encounter—a 
friendly, warm experimenter will elicit higher 
levels of aspiration from subjects than will a stern, 
aloof experimenter; subjects are often exquisitely 
sensitive to (and cooperative with) the hypotheses 
of an experimenter. The review by Kintz and his 
associates (1965) summarizes to date much of 
what has been discovered regarding unacknowl- 
edged and unrecognized experimenter influences 
on subject performance. 

Yet, if experimenters prove to be essentially in- 
terchangeable and if the psychological definition 
of the test situation is personally involving and 
essentially the same for all subjects and if, there- 
fore, a subject is faced with the necessity of a 
behavioral choice, his decision must be formed on 
the basis of intrapsychic considerations. If the en- 
vironmental directives are not overwhelming, and 
thereby compelling of a uniformity of response, 
then differential behavior will result, and this dif- 
ferential behavior will reflect differences in per- 
sonality. The trick is to be incisive in the behaviors 
Studied, so that the manifestations of personality 
thus developed are nontrivial, 

To exemplify a test situation relevant to the 
measuring of anxiety, consider a rote-learning 
task involving occasional unforewarned, extremely 
painful electric shocks, Some subjects will learn 
the designated material quickly, and others will 
learn slowly, Provided there is convincing informa- 
tion, obtained separately, that the subjects are of 
equivalent intelligence and background vis-à-vis the 
learning task and that they have equal thresholds 


ing task without the threat of shock and its anxi- 
ety-evoking potential. In this second context the 
subjects should, according to theory, certainly 
manifest a much smaller range of efficiency in 
the learning situation and, again according to the 
concept of anxiety held, perhaps an effectiveness 
that on the average is superior to the average they 
achieved when threatened by shock. When an in- 
dividual’s performance under both conditions is 
compared, it becomes possible to infer his level of 
anxiety. 

Projective methods. Projective methods, such 
as the Rorschach ink blots or the Thematic Apper- 
ception Test, may be viewed either as test situa- 
tions or as providing a basis in observation for 
personality formulations by judges. If responses 
are scored and these scores are taken as indicants 
of personality, then the projective technique is 
being employed as a test situation, If projective 
stimuli are conceived of simply as the occasion 
for a complex, semistandardized interview of a 
Kind, eliciting many behaviors, from which an ex- 
aminer, by virtue of his background and experi- 
ence can, without scoring responses, evaluate the 
responding subject, then the projective method 
Serves to provide behavioral grist for the integrative 
mill of the observer. [See PRoJECTIVE METHODS. 

Physiological responses. A great deficiency in 
the three approaches to personality measurement 
just described is that the behavior of the subject 
is to a greater or lesser extent under his volitional 
control—the subject can, more than a little, decide 
how he will respond when under observation or 
when faced with an inventory or when function- 
ing in a test situation. He may be caught unawares, 
and he may not know how to direct his behavior, 
but he can, often appreciably, shape his manifest 
behavior. Since so many dimensions of personality 
relate to behaviors regarding which a subject may 
be motivated to be—deliberately or without pe 
ness—misleading, it is important to escape thi 
limitation. t 

The autonomic nervous system, however, sae 
except in a most unusual and sophisticated in i 
vidual, under the control of intentions. An ne 
in heart rate, a hypermotility of the stoma eal 
rise in circulating epinephrine, cannot in ge i 
be volitionally prevented. Accordingly, the us sy- 
measures of such functions is attractive to P 
chologists. 

The psychophysiological method has enjoyed 
cyclic popularity in personality pyc A 
spite the appreciable investments of E i 
thought put into the formidable problems py 
strumentation, the contributions of psychoP 
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ology to personality measurement have thus far 
been few and often without reliable usefulness. 
The reasons for this are several. 

One problem is that of recording in sufficient 
detail and without artifact or the intrusion of irrele- 
vancies the physiological phenomena going on 
underneath the subject’s skin. Although such tech- 
nical matters have inhibited growth of the field, 
recently great strides have been made and impres- 
sive sophistication has developed with regard to 
transducing and recording various physiological 
changes in the individual. While heretofore all 
psychophysiological measurements required care- 
fully prepared rooms and carefully wired subjects, 
now the time is near when many of these measures 
will be telemetrically recorded from unhampered 
subjects carrying miniature transmitters as they go 
about their everyday lives. It may be expected that 
More representative and hence more cogent sam- 
plings of psychophysiological changes will be 
gathered as these technical advances reach ma- 
turity and acceptance. 

However, even with this kind of technological 
progress, the psychophysiological field (and the 
possibilities of the field) cannot be advanced unless 
there is further insight regarding such matters as 
what physiological channels to record and what 
facets of the multifaceted physiological process to 
fix upon as informative and dependable. A second 
problem of the psychophysiological approach, then, 
is attributable to the slowness with which truly 
incisive measures are being found. Quite properly 
there has been a reluctance to standardize any 
Single set of physiological indexes, because of in- 
Sufficient orderliness of the measures thus far 
Studied and dissatisfaction regarding their ultimate 
behavioral relevance. Different investigators con- 
tinue to employ different measures, and there have 

een few multivariate studies oriented toward es- 
tablishing a reduced number of dependable meas- 
ures. Much cumbersome, trying, and undramatic 
Work must still be done before the necessary sys- 
‘mization and convergence on fundamental meas- 
ures is accomplished. 

$ e third problem affecting the psychophysiologi- 
al field stems, not from technical lags or defi- 
SE of discovery, but rather from an attitude 
a enag much of the research being done. The 
$ ie of many investigators who have chosen 
ace in the psychophysiological realm is more 
Piil the physiological than the psychological. 
oue gical reactions or changes, although oc- 
often G within the individuals life-span, have 
ee een looked upon simply as end products of 

Ysical stimuli or as cause and effect of other 


physiological changes. This kind of focus has its 
merits and its successes. A more psychological 
approach, however, would use psychophysiological 
reactions, given meaning by the context of their 
occurrence, as signs or tokens of the existence of 
a subject’s motivational state or characterological 
tendencies, Polygraph recordings of sexual arousal 
and of sympathy may well look alike; it is the en- 
vironmental context in which the recordings were 
made that defines the significance of the interior 
changes that have been identified. The psycho- 
physiological approach perhaps may realize its po- 
tential contribution to personality measurement 
when it is conjoined with a psychology of context. 
[See LEARNING, article on NEUROPHYSIOLOGICAL 
ASPECTS.] 

In applying the psychophysiological method to 
the measuring of anxiety, we might choose to 
place the subject in a stressful situation and then 
measure the level and changes in the electrical 
conductance of his skin. Despite various problems 
of artifact and technique, changes in skin conduc- 
tivity appear to mirror changes in the emotionality 
of the subject. As he relaxes, his skin conductance 
drops; as he becomes tense or is placed under ex- 
plicit stress, his level of skin conductance rises, 
usually by a series of rapid, superimposed changes. 
We may judge a subject as anxious if his con- 
ductance level has risen greatly from its normal 
or basal conductance level. Alternatively, we may 
measure anxiety as a function of the time required 
for the subjects conductance level to revert from 
its peak to its basal level. Or we may define anx- 
iety in terms of the number of abrupt conductance 
changes (galvanic skin responses) a subject emits 
in a unit of time during which there is no clear 
external or situational stimulus. In this last defini- 
tion of an anxiety measure, it is presumed that 
“nonspecific” or nonattributable galvanic skin re- 
sponses are tripped by internal, private, even un- 
conscious anxiety stimuli. Additional conductance 
measures of anxiety of course can be proposed, but 
it is not yet clear which, if any, may be used with 
assurance. 

States and traits. Concern with physiological 
indexes, such as conductance, brings forward with 
particular clarity a distinction we have thus far 
neglected but which in fact exists, however we 
propose to measure personality, and which con- 
founds many interpretations of personality rele- 
vant behavior. This is the “state—trait” separation, 
a conceptual attempt to distinguish between rela- 
tively transient, actually existing internal condi- 
tions that an individual experiences (states) and 
the general tendency of an individual to find him- 
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self enveloped by certain classes of circumstances 
(traits), Are these conductance rises or accommo- 
dation differences reflective of subject differences 
in anxiety level, or do they reflect instead subject 
differences in susceptibility to stress? Furthermore, 
when we conceptualize anxiety in general, are we 
talking of a particular, existential condition at the 
moment affecting a given subject (i.e., the sub- 
ject’s state) or are we out to measure the likelihood 
that the subject will experience often the phe- 
nomenal state of anxiety, i.e., that the subject 
has the trait, however achieved, of being anxiety 
prone? What we measure and how it will be meas- 
ured depend upon the conceptual position explicitly 
reached in regard to the state-trait distinction. 
Much unrewarding research and discussion in the 
psychological literature has been due to a failure 
to make this separation or to accept its conse- 
quences, In particular, state notions have been 
measured in ways appropriate only to traits, with- 
out recognition of the error—the correlates of 
anxiety proneness have been presumed to be the 
correlates of anxiety states (e.g., Taylor 1956). 
As has been shown by Mandler (1959) and others, 
there is no logical necessity for this presumption 
to be true and there are abundant empirical in- 
stances of its falsity. 

Short of special subject selection, prolonged 
measurement, and perhaps even longitudinal study, 
the empirical separation of the concepts of state 
and trait is a difficult task indeed. The recent re- 
view by Martin (1961) points up well both the 
difficulties and the attractiveness of the physio- 
logical approach to personality measurement. 


The necessity of convergence 


Having assessed a personality concept—in the 
present instance, anxiety—in multiple ways, it is 
necessary to evaluate the extent to which these 
operationally diverse but conceptually equivalent 
indexes are related to each other. It is embarrass- 
ing if the several measures of a concept do not 
constitute a congruent ensemble, It is also bother- 
some when a measure proves to relate too closely 
to an index or behavior conceptually irrelevant to 
the dimension of immediate interest. Thus, a small 
or moderate correjation between a measure of anx- 
iety and scores on an established intelligence test 
is conceptually supportable from a variety of theo- 
retical viewpoints. But an extremely high connec- 
tion between anxiety and intelligence might cause 
us to question whether our proposed anxiety index 


is not, instead, simply a camouflaged measure of 
intellect. 


By and large, personality measures to date 


have not performed well when evaluated against 
the criteria of convergence and discrimination, 
Campbell and Fiske (1959), after an elegant pres- 
entation of the logic which should underlie the 
development of a personality measure, evaluate a 
variety of commonly used indexes. They find that 
rather few of the measures psychologists employ 
meet the criteria that would justify the interpretive 
labels these measures have been awarded. Their 
analysis and survey is a significant one, which 
those interested in personality measurement will 
wish to read. 

Humphreys (1960) and Cattell (1961) have 
offered some useful psychometric tactics for deal- 
ing with the problem of excessively low corre- 
lations of a measure with conceptually similar 
measures and excessively high correlations of a 
measure with conceptually dissimilar indexes. As 
these and related suggestions gain currency and 
take hold, we may expect that personality measures 
will develop a substantiality they do not yet enjoy. 
The general principle for progress in personality 
measurement appears to involve aggregating a 
number of conceptually equivalent but operation- 
ally diverse measures and, through the use of 
standard scoring and factor analysis, constructing 
an average or expression of the dimension common 
to the phenotypically heterogeneous indexes being 
brought together. 


In sum, the goal of a scientific understanding of 
personality and its development requires recogni- 
tion of the inevitable and conducive role of person 
ality measurement. There has been extensive and 
increasingly sophisticated research oriented toward 
the development of conceptually univocal and psy- 
chometrically sound measures of personality. The 
basic approaches to personality measurement in- 
clude the use of observation, questionnaires, P 
cially constructed test or social situations, 5 
physiological indicators. A significant continu! He 
problem in the field is the poor agreement bela 
ostensibly equivalent measures. This pee: 5 
being solved by greater care and refinemen ity 
the construction and identification of personā 


measures, 
Jack BLOCK 


[See also PsycuomEtRics.] 
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PERSONALITY INVENTORIES 
E of the most popular ways of studying per- 
= ity is to construct a questionnaire, or inven- 
ry, Containing short statements about a person’s 
oy and feelings that he can check as true 
cae This deceptively simple idea of a self- 
ria inventory for personality assessment was 
feats on a large scale in World War 1, for 
ae ee army recruits. Containing items 
Wing to questions a psychiatrist would ask, the 
os po Personal Data Sheet served as the proto- 
ties or numerous personality inventories which 

flourished since. [See WoopworTH.] 


The modern self-report paper-and-pencil person- 
ality inventory bears only a slight resemblance to 
the straightforward, simple approach of Wood- 
worth. Numerous, well-developed scales may be 
embedded in a single instrument, including some 
especially designed to detect the individual who is 
faking or who is failing to read the items carefully. 
Objectively scored answer sheets, standardized 
scales with normative reference groups, and actu- 
arial predictions from statistical tables are now 
commonplace. And yet, most critics would agree 
that even the more widely used personality inven- 
tories today are still limited in validity and general 
application. 

The self-inventory approach to personality is ap- 
pealing on several grounds, Most individuals are 
more candid and objective in their self-appraisal 
when responding to statements in an impersonal, 
printed inventory than they would be in a direct 
interview or written autobiography. The method is 
objective, repeatable, economical, and efficient. The 
results are amenable to examination by multi- 
variate statistical methods and psychometric theory 
and to rapid analysis by computers. [See Factor 
ANALYSIS and PSYCHOMETRICS.] 


Major approaches 

Three major approaches have been employed in 
the development of personality inventories. The 
most obvious starts with a definition of the trait 
being measured, such as extroversion-introversion 
or neuroticism. A large number of statements are 
written and gradually refined until the manifest 
content “looks right,” on the face of it, with respect 
to the trait in question, Woodworth’s Personal Data 
Sheet and many of the personal adjustment inven- 
tories that have followed it, such as the Bell Adjust- 
ment Inventory, the Bernreuter Personality Inven- 
tory, and the Cornell Medical Index, have relied 
chiefly upon face validity for evidence of their 
value. Unfortunately, the use of these a priori, 
direct methods has been almost uniformly disap- 
pointing. Such information as they provide is super- 
ficial and misleading at best. 

The second approach, involving strictly empirical 
methods, arose partly in reaction to the failure of 
the earlier “common-sense” approach, Starting with 
a large pool of a few hundred statements, culled 
from a variety of sources, experimental forms of 
the inventory are given to samples of individuals 
drawn from known criterion groups. Those items 
which discriminate the criterion groups are placed 
in a provisional scale, while the remainder are dis- 
carded or employed for other purposes, The pro- 
visional scales are cross-validated by administering 
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a refined version of the inventory to fresh criterion 
samples, The most widely used instrument of this 
type is the Minnesota Multiphasic Personality In- 
ventory (MMPI). Ten clinical and four validity 
scales were originally developed for the MMPI, by 
using criterion groups drawn from clinical popu- 
lations. The MMPI pool of 550 items has proved 
convenient for the construction of many other em- 
Pirical scales employing similar methods. The pos- 
sible number of such empirical scales in a large 
pool of items is almost infinite, being bound only 
by the number of different specific criterion groups 
to be discriminated. 

A third major approach to the problem starts 
with a large pool of items, as in the MMPI, but sets 
as a goal the development of homogeneous scales 
that are logically independent and internally con- 
sistent. Ideally, one would first like to know the 
major dimensions underlying variations in response 
to every conceivable statement that can be imagined 
about an individual's personality. Then scales could 
be constructed to represent these dimensions by 
appropriate combinations of items. The dimensions 
of personality emerge only after a long process of 
internal item analysis, scale construction and re- 
vision, and the use of factor analysis to clarify the 
existing structure, The Guilford~Zimmerman Tem- 
perament Survey (Guilford & Zimmerman 1955) 
and Cattell’s Sixteen Personality Factor Question- 
naire (16 PF) (Cattell et al. 1957) are two well- 
known examples of personality inventories con- 
structed by this method. 

The Guilford-Zimmerman Temperament Survey 
is appropriate for adolescents and adults, The in- 
ventory grew out of work by Guilford and his col- 
leagues (Guilford & Zimmerman 1956), in which 
13 primary dimensions of personality were found 
by factor analysis of 69 item clusters, where three 
or more clusters per factor had been hypothesized 
in advance. With some exceptions, intercorrelations 
of the 13 factor scores are generally close to zero, 
Ten of these dimensions are included in the tem- 
perament survey. 

Cattell’s 16 PF was the result of an ambitious 
plan to sample the entire “personality sphere” (Cat- 
tell 1946) more completely than ever before. A total 
of 171 personality variables was compiled, and 
large numbers of items were written, Experimental 
test forms were given to college students, and a 
series of factor analyses was carried out, extract- 
ing more factors than most other investigators 
would judge to be meaningful and using oblique 
rather than orthogonal rotations, The 16 factors 
believed by Cattell to be most Significant form the 
basis for the 16 PF. While the major scales repre- 


sented in the questionnaire closely resemble di- 
mensions found by other investigators using similar 
approaches, the short scales have very low relia- 
bility and, if they are to be used at all, are recom. 
mended only for research purposes, The overlap 
of the scales permits the extraction of second-order 
factors (Cattell 1956). 

Cattell and his associates have also tried to ex- 
tend the use of paper-and-pencil personality inven- 
tories down the age scale to six-year-olds (Cattell 
& Coan 1958). For the most part, however, these 
attempts have not been very successful below 
young adolescence. 

Still a different point of view among factor ana- 
lysts working with personality inventories is that 
of Eysenck (1953), Where Cattell and Guilford 
postulate many primary dimensions, Eysenck in- 
sists upon a higher level of abstraction, with only 
three basic dimensions—neuroticism, introversion- 
extroversion, and psychotic behavior. The first 
two of these are represented in the Maudsley Per- 
sonality Inventory, developed by Eysenck and used 
widely in England, 

Disagreements as to the number of personality 
dimensions which can be measured reliably by 
questionnaire methods arise largely because of dis- 
putes over the best way to discover and define such 
dimensions. Employing different methods of factor 
analysis, Eysenck conservatively sticks to only very 
general factors. Moreover, he fails to sample sys- 
tematically throughout the universe of all person- 
ality statements that can possibly be put in ques- 
tionnaire form. In reviewing this dilemma Coan 
(1964) has pointed out that behavioral be 
tion or personality structure can be conn 
on four different levels—specific response, habitu 
Tesponse, trait, and type. The selection of subjek 
number of factors extracted, kind of rotation E 
ployed, and density of personality-variable san = 
affect the degree of referent generality of Sat 
within the hierarchy at which inferences al R 
dimensions are made, When viewed in a hee 
ner, Eysenck, preferring types, has a mut ae 
referent generality than have Cattell and G oe, 
who focus on traits and even habitual Be 
It is interesting to note that most second-order aly 
tor analyses of primary dimensions in perso three 
inventories yield general factors similar to the 
proposed by Eysenck, pointing to a basi! 
ing the dispute. F 

The California Psychological Invent) C 
developed by Gough (1957), is a hybrid of E ata 
empirical approach, exemplified by the M i Cattell. 
the dimensional approach of Guilford an for de 
Instead of using factor analysis as a basis 


s for resolv- 
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fining dimensions, Gough emphasizes concepts 
drawn from the folk culture or common language 
as criteria for scale construction. The CPI covers 
15 traits, plus three additional scales for detecting 
invalid or deviant records, The scales were devel- 
oped by item analysis in terms of behavioral criteria 
(usually ratings), without regard for their internal 
homogeneity. However, Gough did take into account 
dimensional studies of normal personality before 
selecting the behavioral ratings to be predicted from 
questionnaire scales. 

If major dimensions in personality inventories 
are really stable, they should appear consistently in 
different inventories, provided the same aspects of 
the personality sphere are sampled. A number of 
studies have been conducted comparing two or 
more personality questionnaires given to the same 
subjects. Typical of such studies are the one by 
Mitchell (1963) comparing the CPI with the 16 PF 
and the one by LaForge (1962) comparing the 
MMPI with the 16 PF, Although Mitchell found 
little relationship between the primary factors in 
the CPI and the 16 PF, the congruence of second- 
order factors in the two inventories was quite strik- 
ing, Six second-order factors were found among 
the 34 scales, the largest two of which were easily 
identified as neuroticism and extroversion-intro- 
version, verifying Eysenck’s contention that these 
two general factors run through most personality 
questionnaires and are of greater importance than 
the lower-order traits, LaForge also found neuroti- 
cism to be the first factor running through both the 
16 PF and the MMPI, although his other factors 
are complex. Because the MMPI comprises clini- 
cally derived scales of an overlapping, empirical 
nature, dimensional analysis using factorial meth- 
ods is not very satisfactory unless one works di- 
rectly with the individual item responses rather 
than the derived scales, 


Spurious factors in test taking 
A major problem in personality inventories is 
the Control of response set, stylistic variables, and 
Spurious factors that may be confounded with the 
chan Scores on personality scales. Self-report 
cae are subject to falsification by the individual 
A anes to make a good impression. Quite aside 
mee any intentional faking, people do not know 
hoe well enough to give factual answers to 
Personality or attitude questions. Since some 
eed inventories allow the individual a range 
He T ARER some subjects give many extreme 
EE While others are characteristically cau- 
check Some will tend to agree with a statement or 
true” rather than “false,” regardless of the 


item content. While these confounding variables 
have been recognized for many years, only recently 
has a great deal of interest been generated in their 
psychological meaning and in methods for their 
measurement and control. 

In his critical survey of personality assessment, 
Vernon (1964) reviews the history of this problem. 
An exhaustive review has also been made by Rorer 
(1963), who concludes that response style has been 
greatly overrated as a factor in personality measure- 
ment. Other recent reviews of interest have been 
published by Jackson and Messick (1958), McGee 
(1962a; 1962b), Christie and Lindauer (1963), 
and Holtzman (1965). [See RESPONSE SETS.] 

Social desirability and acquiescence. The two 
response styles upon which the greatest amount 
of research has been done are social desirability and 
acquiescence. Socially desirable responding is the 
tendency to give answers that are independently 
judged by social consensus to be desirable. Acqui- 
escence is the tendency to agree with a statement, 
regardless of content. Many writers have taken 
strong positions concerning the significance of these 
two response styles, positions from which they 
retreat only with great reluctance. 

Using the method of successive intervals for 
scaling items from the MMPI, Edwards (1957a) 
developed his Social Desirability Scale as a measure 
of the tendency to respond in the socially desirable 
fashion. The 39 items in the scale are keyed 
so that the higher the score, the more the individual 
endorses the socially desirable alternative. The cor- 
relations between this scale and the first general 
factor on the MMPI are so high that some investi- 
gators have argued that most of the variance in the 
MMPI can be accounted for in terms of this single 
dimension of social desirability-undesirability (Ed- 
wards 1962). It is clear to most observers, however, 
that the Social Desirability Scale alone is no match 
for the full MMPI profile (Marlowe & Gottesman 
1964; Wiggins 1962; Elvekrog & Vestre 1963; 
Jackson & Messick 1962), The standard scales in 
the MMPI cover much more than social desirability. 

As in the case of social desirability, considerable 
debate continues over the importance of acqui- 
escence in personality inventories. Jackson and 
Messick (1962) computed separate scores for true- 
or-false keyed items for a large number of MMPI 
scales, as well as for five social-desirability scales, 
and performed a factor analysis of the intercorrela- 
tions of the 40 variables separately for three large, 
diverse samples of individuals. The three samples 
yielded identical results. More than half of the total 
reliable variance could be attributed to the two 
stylistic dimensions—acquiescence and social de- 
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sirability. On the other hand, Dahlstrom (1962) 
takes Edwards and Jackson and Messick to task for 
interpreting social desirability or tendency to agree 
primarily as response style and for not considering 
the possibility that true-or-false answers may none- 
theless be veridical, may be attempts at various 
impression formations, or may be acquiescences to 
a particular pressure conveyed by the instructions, 
the examiner, or the test setting. Block (1962) 
presents data in support of the idea that social 
desirability and adjustment are basically different 
concepts even though they obviously are related. 

With these many conflicting points of view, it is 
difficult to reach a firm conclusion concerning the 
relative importance of response styles as confound- 
ing variables in personality inventories. Remaining 
largely unsettled and for the most part ignored is 
the more crucial question of external validity. Do 
these response styles measure anything important 
beyond their interesting correlates with other vari- 
ables from paper-and-pencil questionnaires? 

In general, most attempts to demonstrate the 
personality correlates of stylistic variables by com- 
paring them with independently derived personality 
measures have yielded negative results (McGee 
1962a; 1962b; Foster & Grigg 1963; Appley & Moel- 
ler 1963). The behavioral correlates of response 
styles are few and of questionable significance. 
While their existence cannot be denied in at least 
some personality inventories, their interpretation as 
important variables in their own right is debatable. 


Variations among personality inventories 


Most contemporary personality inventories em- 
ploy a dichotomous response format of true—false 
or agree—disagree and are designed as omnibus in- 
struments, with multidimensional scales covering 
different traits. A few have departed from this 
standard form in one way or another. 

Edwards ( 1957b) developed the Edwards Per- 
sonal Preference Schedule by Pairing statements 
having equal social desirability values, requiring the 
subject to choose the member of each pair which 
is most like him. This forced-choice method makes 
it more difficult for an individual to fake a socially 
desirable picture, although response styles are not 
really eliminated (Borislow 1958; Corah et al. 
1958). The 225 paired comparisons lead to scores 
on 15 scales, patterned after Henry Murray’s list 
of needs. Low intercorrelations between the scales 
and moderate reliabilities (.60-.87) suggest that 
the inventory has promise as a Tesearch instrument, 
although its external correlates have been disap- 
pointing. [See PERSONALITY: CONTEMPORARY VIEW- 


POINTS, article on COMPONENTS OF AN EVOLVING 
PERSONOLOGICAL SYSTEM, ] 

The forced-choice format for responding to per- 
sonality items usually involves pairs or triplets of 
statements, from which the subject chooses the 
alternative most like or least like himself, 4 more 
extensive variation of this approach is the Q-sort 
method, introduced by Stephenson (1953) and 
popularized by Rogers and his associates (Rogers 
& Dymond 1954), Typically, a number of state 
ments describing traits are printed on small cards, 
which can then be sorted by the subject into a 
forced normal distribution using a limited set of 
between 7 and 11 Categories arranged on a con- 
tinuum from “most like me” to “least like me.” The 
same statements can be reshuffled and sorted again, 
using different instructional sets, such as “how I 
want to become” or “how others view me,” instead 
of “how I really am.” Special statistical problems 
arise in the treatment of such data, because of the 
ipsative nature of the distributions, The method has 
been reviewed thoroughly by Wittenborn (1961). 

Another variation involves changing the response 
format from a simple dichotomy of yes-no or true- 
false to a continuum represented by five to nine 
gradations of response. Comrey (1964) found that 
he could improve item reliability by using a nine- 
point response format ranging from “always” to 
“never” for some items and from “definitely” to 
“absolutely not” for others. Still in experimental 
form, Comrey’s personality inventory consists of 
three validity scales plus 30 short scales of six items 
each, derived by repeated factor analysis of item 
intercorrelations and construction of new items: 
The nine-point response format yields more infor- 
mation per item than the usual three-point scale, 
making it possible to achieve split-half pee 
ranging from .62 to .93 (median .84) for the 
short scales. 

A five-point response format was developed 2 
Moore and Holtzman (1965) by combining i 
usual true—false dichotomy with an intensity ae 
for measuring the degree of concern or worry m 
ported by the subject. In addition to the a 
Point, “false, or does not apply to me,” four A af 
of personal concern were linked with a response 0% 

3 rn to me 

true, ranging from “true but of no concent e 

to “true and of great concern to me.” As in se 
iple-respon! 

study by Comrey, the use of a multipl t 
z A dichotomo 

continuum yielded better results than a inven- 

true-false, this time in a personal-problem ' mi 

tory containing eight well-defined scales, off 
study of thirteen thousand high school you ae a 
The adjective check list, or trait list, rep 
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an inventory technique at the other extreme. In- 
stead of a short sentence or statement, each item 
consists of a single trait adjective, such as “friendly” 
or “timid.” Occasionally, bipolar traits are employed 
rather than single words. A long list of such adjec- 
tives is given to the subject, who is asked to mark 
quickly the items most like him (and perhaps those 
least like him). An example of a well-developed 
adjective check list for use as a personality inven- 
tory is the one constructed by Gough and Heilbrun 
(1965). 

A rather novel approach to the format of an in- 
ventory has been successfully developed by Endler 
and his associates (1962) in their S—R (stimulus— 
response) Inventory of Anxiousness. Fourteen dif- 
ferent five-point scales are rated separately for each 
of 11 different situations, making a total of 154 
items. The scales contain such content as “My heart 
beats faster,” and “I get an uneasy feeling,” while 
the situations define common events: “You are just 
Starting off on a long automobile trip.” Designed as 
a research instrument, the S—R Inventory permits 
systematic study of responses, situations, and indi- 
vidual differences with respect to self-reported 
anxiety, as well as interactions between them. 

Of all the dimensions revealed in personality in- 
ventories, self-reported anxiety, or “neuroticism,” is 
one of the best defined and most widely used, Con- 
Sequently, a number of attempts have been made to 
devise questionnaires to measure anxiety. The Tay- 
lor Manifest Anxiety Scale was developed, using 
items from the MMPI (Taylor 1953), The IPAT 
(Institute for Personality and Ability Testing) Anx- 
lty Scale was constructed by Cattell from the 40 
items (20 subtle and 20 obvious) most-highly cor- 
related with the second-order anxiety factor in the 
16 PF. And Eysenck’s neuroticism scale in the 
Maudsley Personality Inventory is basically a meas- 
ure of self-reported anxiety, Correlations between 
these similar measures of anxiety are sufficiently 
high to indicate a large common core, although 
each has its own unique variance as well. [See 
ANXIETY and NEvRosIs.] 

Variations of anxiety questionnaires have also 
el ee for use with children. Patterned 
ne is Test Anxiety Questionnaire, Sarason’s Test 
riba Scale for Children (Sarason et al. 1960) 
the oy with anxiety aroused by events in 
ae particularly being tested or chal- 
chem he complete children’s inventory also in- 
defensiy, general anxiety scale, a lie scale, and a 
mea €ness scale, which can be used as control 
oped E A somewhat similar inventory, devel- 

ependently by a different approach, is the 


Children’s Manifest Anxiety Scale (Castaneda et al, 
1956). 

Other variations in format, content, and purpose 
of inventories are described in A Study of Values 
(Allport et al. 1960); the Mooney Problem Check 
List (Mooney & Gordon 1950); and the Minnesota 
Counseling Inventory (Berdie & Layton 1957). 
Mental Measurements Yearbook, edited by O. K. 
Buros, is a standard compendium of most published 
personality inventories, as well as of other kinds of 
psychological tests (e.g., Buros 1959). 


Validity 
A major problem with personality inventories 
has been the establishment of their validity. All too 
frequently an inventory has been published with- 
out sufficient evidence that it either has predictive 
validity with respect to some future behavioral cri- 
terion or has concurrent validity in terms of exter- 
nal correlates. In an exhaustive review of earlier 
work on this problem, Ellis (1946) concluded that 
personality questionnaires are of dubious value be- 
cause they fail to correlate with either practical 
criteria or independent measures of presumably the 
same personality traits. While some of the same 
criticisms can still be leveled at many current per- 
sonality inventories, the thrust of the argument has 
been blunted somewhat by the successful efforts of 
more recent investigators to demonstrate the valid- 
ity, as well as limitations, of contemporary methods. 
The most extensive research has been done on 
various empirically derived scales of the MMPI. 
This work demonstrates the usefulness of the 
MMPI for certain kinds of actuarial prediction, 
screening, and group discrimination, although its 
value for individual clinical diagnosis is still debat- 
able and depends partly on the skill of the clinician. 
Since most personality inventories combine both 
the rational and empirical approaches and involve 
explicitly derived dimensions with trait names to 
describe them, validation attempts are usually con- 
current rather than predictive. Properly conducted 
concurrent validation must involve two or more 
independent methods of measuring the same traits 
in the same individuals, Comparing the Guilford- 
Zimmerman Temperament Survey with Cattell’s 
16 PF, for example, does not validate either inven- 
tory, since both sets of traits are derived by the 
same method, the self-report personality question- 
naire. Personality scores on the inventory must be 
compared with ratings of behavior, with clinical 
judgments of the traits, or with trait scores from 
some other independent method of assessment. 
The most extensive cross-method correlational 
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studies have been undertaken by Cattell and his 
associates (Cattell 1950; 1957; Cattell & Scheier 
1961). The sheer magnitude of Cattell’s work rep- 
resents a tour de force unequaled by previous in- 
vestigators, even though he is vulnerable to serious 
criticism (Becker 1960). Generally speaking, the 
highest concurrent validities are obtained when in- 
ventory scores are compared with peer-group ratings 
for the same traits, regardless of the particular in- 
ventory in question. Ratings by trained observers 
also correlate to some extent with inventory scores, 
Validity coefficients of this type on the CPI, for 
example, range from .21 to .57 (Gough 1957). 
The lowest validity coefficients arise when cross- 
media comparisons involving inventory scores and 
laboratory-type measures or objective performance 
tests of personality are made. More often than not, 
such correlations are close to zero, 

A crucial design for examining validity by the 
cross-method technique has been proposed by 
Campbell and Fiske (1959). Usually an investiga- 
tor is content to show that the correlation for 
measures of a given trait across two methods is 
significantly better than chance. A much more rig- 
orous test of validity requires that the cross-method 
correlation for a given trait be higher than the cor- 
relation between any two different traits, whether 
within the same method or across the two methods. 
Insisting that both discriminant and convergent 
validity must be considered, Campbell and Fiske 
recommend that all the within-method and cross- 
method intercorrelations be arranged in a multi- 
trait, multimethod matrix. This matrix can be ex- 
amined in a number of ways, to determine both 
the convergent and discriminant validity. An exam- 
ple of this method applied to five traits purportedly 
measured by the CPI is reported by Dicken (1963). 

A more successful application of the Campbell- 
Fiske method in the validation of a personality in- 
ventory has been carried out by Jackson (1965), 
working with his newly developed Personality Re- 
search Form (PRF). Using four different methods 
of assessing the same 20 traits in a sample of 51 
college students, Jackson demonstrated high con- 
vergent and discriminant validity for most of the 
scales in the PRF, Individual validity coefficients 
for the PRF, using peer ratings of behavior, ranged 
from .17 to .71 (median .52). The PRF appears to 
be a most promising instrument, largely because 
it is one of the first in a new generation of person- 
ality inventories that capitalize heavily upon mod- 
ern test-construction theory and upon high-speed 
computers for item analysis. Two parallel forms of 
the PRF, with precise equivalence, are available. 
Derived originally from Murray's system of needs, 


the 20 scales contain 20 items each and are aug 
mented by two control scales, one for social desir. 
ability and one for acquiescence, 


The next few years should see considerable im. 
provement in the power of the inventory approach 
to personality assessment. While much of the re- 
search of the past ten years has been methodologi- 
cal in nature, sufficiently advanced technology is 
now at hand to overcome some of the more glaring 
weaknesses in existing personality inventories, It is 
unlikely, however, that paper-and-pencil self-report 
methods of assessing personality will ever be much 
more than just that, namely, a systematic way of 
measuring how a person judges himself. 


Wayne H. HOLTZMAN 


[Other relevant material may be found in INTERVIEW- 
ING, article on PERSONALITY APPRAISAL; OBSERVA- 
TION; PROJECTIVE METHODS; PSYCHOMETRICS.] 
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THE MINNESOTA MULTIPHASIC 
PERSONALITY INVENTORY 


The Minnesota Multiphasic Personality Inven- 
tory (MMPI) is a standardized, American test of 
emotional status and adjustment. It consists of 
550 verbal items to which the subject answers true 
or false about himself. Scores are obtained on 14 
scales: 10 that relate to clinical categories and 
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4 that help provide an estimate of the validity of 
the test for individual subjects. Scoring involves 
applying stencils to an answer sheet, either by 
hand or by various automatic machine methods. 
A psychograph is made for each subject that pro- 
vides a score profile, appropriate weights to cor- 
rect for defensiveness, and separate normative 
transformations for each sex. In addition, a num- 
ber of ratios and indexes may be computed for 
special applications, 

The initial derivational steps were undertaken 
in the late 1930s by Starke R. Hathaway, a psy- 
chologist, and J. Charnley McKinley, a neuropsy- 
chiatrist. All the basic work was carried out in the 
Department of Psychiatry and Neurology at the 
University of Minnesota Hospitals in Minneapolis. 
Inpatients and outpatients served as subjects in 
the carefully selected criterion groups, while hos- 
pital visitors, friends and relatives of patients on 
the various wards, made up the normal control 
group. Each component scale was derived empiri- 
cally, containing only items that separated known 
patients from normals; each scale usually passed 
through several preliminary forms. Most of these 
scales are described in a series of articles in Welsh 
and Dahlstrom (1956). The clinical categories 
that were the original referents of the scales are 
hypochondriasis, depression, conversion hysteria, 
psychopathic deviate, masculinity—femininity, par- 
anoia, psychasthenia, schizophrenia, hypomania, 
and social introversion. In Table 1, these scales 
are listed in the order in which they appear in 
the profile (Hathaway & McKinley 1943), 

Since its original Publication, a large number of 
special scales and indexes have been developed 
from the MMPI items, Most of these special pro- 
cedures, together with a great deal of the data 


Table 1 — Basic Minnesota Multiphasic Personality 
Inventory scales 


SCALE NAME 
Validity scales: 


CODE 
ABBREVIATION NUMBER 


“Cannot say” score X 
lie |á 
Confusion F 
Correction is 
Clinical scales: 
Hypochondriasis Hs 1 
Depression D 2 
Conversion hysteria Hy 3 
Psychopathic deviate Pd 4 
Masculinity-femininity ME 5 
Paranoia Pa é 
Psychasthenia Pt 7 
Schizophrenia Sc 8 
Hypomania Ma 9 
Social introversion Si o 


from which the original scales were derived, have 
been brought together in An MMPI Handbook 
(Dahlstrom & Welsh 1960). The MMPI items and 
methods have also served as points of departure 
for several other widely used personality instru- 
ments, most notably Gough’s California Psychologi- 
cal Inventory (1957), developed with greater em- 
phasis on social-psychological criteria, and Berdie 
and Layton’s Minnesota Counseling Inventory 
(1957), devised for high school guidance purposes, 
In addition, translations of the MMPI have been 
made into at least 15 different languages (see 
Dahlstrom & Welsh 1960, appendix N). Most of 
these versions can be considered to be only at the 
experimental stage at this time, since linguistic 
and cultural differences preclude simple transla- 
tion of the items and routine application of the 
standard scoring keys. However, in this regard, 
Horiuchi (1963) showed that the D scale worked 
surprisingly well in separating clinical cases of 
depressive disorder from normal controls in her 
Japanese-language version developed at Doshisha 
University in Kyoto. French, Italian, and Spanish 
(Spanish American) versions have been restand- 
ardized with sufficient care to warrant their use. 

Nature of the scales. As indicated in Table 1, 
the basic scales are usually grouped into categories 
of validity and clinical scales. 

Validity scales. The main function of the va 
lidity scales is to check on the possibility that the 
test subject may have been unable or unwilling to 
comply with the test instructions and procedures: 
Perceptive and conscientious use of these indexes 
serves to offset many of the widespread objections 
to personality inventories, which occur because © 
their reliance upon a subject's language oon 
hension, contact with reality, cooperativeness, pa 
personal insight. These validity scales also are 
some of the same variance that is measured by is 
clinical scales. Thus, the L scale reflects ee 
Stability as well as pervasive test defen ae 
the F scale mirrors severity of emotional ae 
ances and contact with reality as well con oA 
comprehension of English, and the Ks ent as 
varies with prognosis in psychiatric ee self 
well as the more subtle tendencies towar ne 
enhancement or self-excoriation that Pee of 
sponses to the inventory. A detailed score if 
the interpretation of validity scales is av 
Dahlstrom and Welsh (1960, chapter 5). plished, 

Clinical scales. When they were first ree 
the basic clinical scales were routinely ae Lae 
by the names of traditional psychiatric bprevition 
together with a one-letter or two-letter ab ew, in- 
(see Table 1). As the research literature 8" 
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creasing evidence of nonpsychiatric interpretive 
implications was accumulated. Usage has shifted 
from the restrictive semantic referents of these 
scale names to the letter symbols and, more re- 
cently, to the numerical designations. Thus, the 
hysteria scale which was derived by comparing 
the responses of conversion hysteria cases with 
those of normal adults came to be designated 
simply by the symbol Hy when it became clear that 
this scale was useful in evaluating anxiety hysteria 
and a variety of psychosomatic syndromes as well. 
More recently, as its fuller significance in evaluat- 
ing impulse control, emotional immaturity, level 
of insight, susceptibility to hypnosis, and repressive 
defenses is recognized, it is deemed less biasing 
to refer to the hysteria scale as simply scale 3. [See 
HYSTERIA.] 

In a similar manner the implications of the 
elevation of the component scores have changed 
over the years. A score falling two standard devia- 
tions above the average for normal subjects of the 
same sex was rather rarely achieved by normal sub- 
jects on any particular scale but typically fell close 
to the central tendency of patients showing the 
psychiatric characteristics under consideration; 
early clinical usage employed an elevation of two 
standard deviations as a critical level in score inter- 
Ptetations. These test inferences usually were part 
of a diagnostic decision process and the elevations 
Were judged to be either “clinically significant” or 
not. This argument was a probabilistic one, where- 
as the decision was a categorical one involving 
Placement within a normal or pathological group. 
Subsequent research has clarified the interpretive 
implications of scores and of score combinations 
at Various absolute elevations in the profile. A de- 
tailed description of the interpretation of the clin- 
teal scales is available in Dahlstrom and Welsh 
(1960, chapter 6). 

Profiles. More abstract designations of the 
Scales and greater reliance upon scale patterns at 
So Sa levels of score elevation have culminated 
‘ a System of profile coding and increasing reliance 
ane St Soon interpretive summaries (Hathaway 
oe 1951; Drake & Oetting 1959; Hathaway & 
Redan ts 1961; Good & Brantner 1961; Marks & 
ale eas 963; Gilberstadt & Duker 1965). In one 
Profile i eme, most of the information in the score 
elevation disregarded; only the sequence of relative 
abtolilie cen the scales and a rough index of 
Patterns e evation are preserved. In such a system, 
es ean designated by reference to the highest 
eae oe scales and for some purposes by 
ings fort, OW Points as well. Within the group- 

ormed by these criteria, additional distinc- 


tions about absolute elevation, validity scale pat- 
terns, or related scale sequences may be drawn. 

Extended validation. As a behavioral summary, 
the psychiatric syndrome seems to have been a 
felicitous choice for a test criterion. Although there 
are many problems inherent in using them (Meehl 
1959), these syndromes occupy a middle ground 
in degree of abstraction from the behavioral obser- 
vations upon which they are based. There is by now 
a large amount of loosely articulated research in 
psychiatry bearing upon etiology, therapeusis, and 
prognosis that is centered upon these personality 
constructs. By their efforts in deriving these scales, 
both in compiling item groups and in immediately 
and successfully cross-validating with new-patient 
groups, Hathaway and McKinley showed that these 
constructs were workable and at least minimally 
reliable. By now a long series of publications has 
substantiated the validational efforts, showing that 
these scales can similarly separate normals from 
patients for a range of ages, races, religions, regions 
of origin, and educational levels. This research has 
also made clear that there is no mutual exclusive- 
ness among these psychiatric syndromes. Scales 
separately constructed to distinguish clinical sub- 
groups from normals do not singly also distinguish 
among the subgroups themselves. Hither the pat- 
terns among the clinical scales must be used, or 
special scales must be constructed for these pur- 
poses (Rosen 1962). 

Useful as the syndrome may be as a summary of 
behavior, there is a legitimate interest in evalua- 
tions that are either more concrete and specific or 
more abstract and inclusive. MMPI scales derived 
from syndromes have proved useful in evaluating 
such specific behaviors as anxiety, hostility, halluci- 
nations, phobias, or suicidal impulses, Similarly, 
broader psychiatric conceptions such as differen- 
tiations of psychosis—neurosis, delinquency prone- 
ness (Hathaway & Monachesi 1953; 1963), or 
prognosis under various forms of psychiatric treat- 
ment have been studied. 

Broader applications. Perhaps the best index 
that the criteria initially chosen led to fruitful and 
productive scale variables has been the rapid ex- 
tension of this instrument beyond the special pre- 
occupations and concerns of psychiatry to general 
areas of personality, to psychology as a whole, and 
to sociology and the other social sciences, Examples 
of such areas are psychodynamic processes such 
as repression, projection, and perceptual defense 
and vigilance; emotional maturity and control; so- 
cial conformity, popularity, and leadership; political 
participation, religious affiliation, and occupational 


selection. 
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Research has also appraised the impact of a va- 
riety of personal characteristics and demographic 
variables on MMPI performance. Thus, while it has 
been known from the outset that the sex member- 
ship of the test subject influenced the component 
scales sufficiently to justify separate norms on six 
scales (1, 2, 3, 5, 7, and 8) in the profile, subse- 
quent work has also revealed that even with sep- 
arate normative tables systematic variations appear 
in the frequencies of code configurations (see Dahl- 
strom & Welsh 1960, appendix M ). The reported 
differences are quite consistent with the long-estab- 
lished sex differences in various psychiatric mor- 
bidity trends. 

Similarly, age trends have been studied, the most 
detailed analyses being made in older adult age 
ranges. Recently, a cross-sectional comparison has 
been made between adolescent and young adult 
levels and a start made upon some longitudinal fol- 
low-up studies ( Hathaway & Monachesi 1963). [See 
INDIVIDUAL DIFFERENCES, article on SEX DIFFER- 
ENCES. ] 

Two different approaches have been made to 
analyze the impact of social class status on MMPI 
responses. Most of the work has been based upon 
family origin, or status bestowed on the test sub- 
ject by his parents, such as father's education, 
father’s occupational level, or family income level. 
Thus, Gough (1948) and others have studied dif- 
ferences in test patterns among groups of subjects 
representing different family statuses. They have 
demonstrated that there is greater defensiveness in 
the test responses of subjects from higher status 
families. Gough developed a scale for this dimen- 
sion on the basis of test data from high school 
students; some of the early work indicated that the 


comes covarying with his socioeconomic classifica- 
tion. 

Critique. In some contrast to the generally suc- 
cessful and encouraging results from this wide 
range of application of the MMP] to practical prob- 
lems in personality appraisal, at least two major 


lines of criticism of the instrument and the method 
should be discussed. One of these patterns of crit. 
icism has grown out of the research of Edwards 
(1957), Jackson and Messick (1958), and Camp. 
bell and Fiske (1959) and their coworkers, Accord- 
ing to Campbell and Fiske, the total Variance of a 
set of test scores can logically be subdivided into 
methods variance (variability inherent in the test) 
and trait variance (variability in the personality 
attribute under consideration), Using a similar 


framework Messick and Jackson have advanced the 
formulation that methods variance can be concep- 
tualized in terms of various stylistic features of 
test performance, such as response biases, These 
include preferences for a particular response, e.g, 
true (or false) to an excessive number of items, 
acquiescence (or conformity) responses, person- 


ally or socially desirable responses (responding to 
items in terms of possible adverse implications 
rather than in terms of fact; Edwards 1957) and 
other test-oriented sets and predispositions, These 
authors contrast these response sets with valid, con- 
tent-related endorsements of inventory items, the 
trait variance in the Campbell—Fiske formulation. 
To the extent that test scores reflect style variance, 
they provide less valid information, presumably, 
about the actual content referents of the items in 
the various scales. Special scales constructed to 
evaluate these sources of “invalidity,” by both 
Edwards (1957) and Jackson and Messick (see 
Messick & Ross 1962), have shown high correla- 
tions with component MMPI scales. While the 
sues here are complex (see Dahlstrom 1962; Bloc! 
1965), one limitation of the approach that a 
workers employ can be readily discerned: En 
no reason to believe that MMPI-like persona r 
scales, empirically derived against external aA 
necessarily furnish their discriminations solely H 
the basis of content of the test items. A 
much of the information used in the work of a 
scales may indeed be stylistic (i.e., relying a 
identifiable features of a person’s approach to te 
situation) it may nonetheless be bine o, 
acterizing a particular personality attri = be 
when high correlations are obtained betw ges 
pirical scales and stylistic scales, the © E 
sources of variance reflected in the product-m fe 
correlations need not be interpreted as ue ai 
discriminating information bound to SD BERAY 
inventory or type of test. [See EXPRESSI 
IOR; RESPONSE SETS.] 

A sober re-examination of the 
MMPI usage is well represented by 
Little and Shneidman (1959). Erom 5 
enced in particular testing techniques, 


pragmatics of 
the work 0 
dges expeti 
lind clinical 
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interpretations were obtained of various test proto- 
cols from 12 different subjects. These interpreta- 
tions included clinical diagnoses, ratings of degree 
of maladjustment, Q sorts, and true—false state- 
ments about the behavioral and adjustment pat- 
terns of the subjects. Some of the results were 
probably biased conservatively by limiting features 
of the Q decks and the statements covering back- 
ground experiences; nevertheless, there was gener- 
ally a disappointing degree of correspondence be- 
tween these blind test interpretations and similar 
descriptions by judges who knew the case histories. 
While the results based on MMPI profiles were gen- 
erally not much better or worse than the interpre- 
tations based on other tests (Rorschach, Thematic 
Apperception’ Test, Make-a-Picture-Story ), the abso- 
lute level of success in these psychodiagnostic 
endeavors was by no means reassuring. Thus, it 
appears that even though various MMPI indexes 
are useful when applied to separate and discrete 
decisions about patients or clients, there remains 
unresolved the additional problem of integrating 
the multitude of possible implications of a partic- 
ular profile and producing a coherent and accurate 
Personological summary of that test subject. Some 
investigators have begun to work on this crucial 
problem; most of the relevant MMPI research has 
been devoted to combinatorial patterns of the 
MMPI scores and their interpretive implications 
(see Dahlstrom & Welsh 1960, appendix M). In 
addition to the problem of finding the best way to 
combine MMPI results to reach valid and discrim- 
mating psychodiagnostic summaries there also re- 
mains the broader problem of combining MMPI 
Score data with nontest information on a particular 
Person. Since the nonpsychometric information is 
generally less reliable and contains data of un- 
pown degrees of redundancy with the profile in- 
ation, it is even more difficult to collate it with 
MMPI variables for the common purpose of accu- 
rate depiction of personality. One promising ap- 
proach, exemplified by Kleinmuntz pioneering 
efforts (1963), is the utilization of high-speed com- 
i for the storage, retrieval, and most efficient 
ne of personological variables, once the 
a ais relationships have been discerned and 
fee T icit. In this way the limitations of the 
e clinician may be in part bypassed for this 
e activity (see Meehl 1963). 


W. GRANT DAHLSTROM 


[Other relevant material may be found in CLINICAL 
ITY ee INTERVIEWING, article on PERSONAL- 
Sure PRAISAL; PROJECTIVE METHODS; PSYCHO- 

ics; SCALING.] 
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Iv 
SITUATIONAL TESTS 


A situational test is “a measure of a person’s 
reaction to a situation that requires an actual 
adaptive response, rather than a mere ‘test’ Te- 
sponse. The situation may be contrived by the 
examiner but must be recognized as posing a real 
problem to be solved, independent of its status as 
a test” (English & English 1958, p. 504). In this 
definition the key phrases differentiating situa- 
tional tests from other personality tests are “posing 
a real problem” and “requiring an actual adaptive 
response.” Thus the ideal situational test would be 
one that simulates some significant aspect of an 
individual's environment (e.g., his superior repri- 
manding him for poor performance) and evokes 
typical coping behavior without the subject's being 
aware that he is undergoing testing. 

It is interesting that laymen seem to evaluate 
personality by observing and making inferences 
about everyday behavior, much as psychologists 
do with situational tests. Thus, if a friend seems 
always to organize the games and activities of a 
social event, we may conclude that he is authori- 
tative and perhaps exhibitionistic; or if we notice 


that a fellow office worker maintains a careful ar- 
rangement of objects on his desk, we may conclude 
that he is orderly and very controlled. 

Clinical psychologists, on the other hand, when 
confronted with the task of describing and predict- 
ing the behavior of a person, traditionally base 
their evaluations upon samples of verbal behavior 
elicited by questionnaires, pictures, or inkblots, 
However, psychologists are becoming more aware 
that a fairly large difference seems to exist be- 
tween the content and meaning of the verbal be- 
havior observed through use of traditional tests 
and the content and meaning of the behavior 
about which they must make judgments and pre- 
dictions. Because of this difference, inferences 
about real-life behavior, drawn only from samples 
of verbal-test behavior, are now being recognized 
as hazardous and difficult to make. Moreover, the 
clinician is reminded that fantasy behavior and 
self-description may not necessarily have a direct 
relation to overt behavior, and it is suggested that 
the clinician use tests that will duplicate the 
general factors underlying the day-to-day demands 
of the criterion situation (Cronbach 1956). In this 
connection it is both interesting and significant 
that expert clinicians are making more use than 
previously of so-called extratest behavior, for ex- 
ample, by noting that a patient forcefully throws 
together blocks he has been asked to assemble 
(suggesting aggressiveness and impulsiveness) 0r 
that a patient exclaims upon being shown an ink- 
blot, “Boy, you're really trying to put something 
over on me with this one!” (suggesting suspicious- 
ness). 

Since a major goal of personality assessment is 
to explain and predict overt behavior and art 
person's verbal responses to questionnaires Or i 
blots may spring from levels of personality ae 
different from those which are activated when 5 
is coping with a real-life situation, psycholg 
find themselves in a dilemma. One solution a 
problem may be for psychologists systema 
to observe actual coping behavior, as well a a 
tasy and self-report behavior. The method 5 6: 
ational testing is ideally suited for this ie 
Situational testing is intended to provide Er 
and tasks closely approximating those encoun jeld- 
in the normal course of daily living, therely ol 
ing samples of a person’s characteristic yer 
coping with real-life demands and of the P' ie 
ality traits and structures implicated in these 
ing efforts. ‘ vior 

Definition of coping behavior. Coping iem i 
is considered goal-directed, adaptive, an is the 
mental. Implicit in the concept of coping 
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assumption that when an individual is confronted 
with an object, person, or demand of significance 
to him, he experiences some disruption in his 
psychological equilibrium along with the need to 
readapt himself, that is, to re-establish his equilib- 
rium. In order to re-establish his steady state, the 
individual calls upon a variety of combined per- 
sonality and intellectual resources by means of 
which he physically acts upon, manipulates, or 
avoids the object confronting him in ways that 
bring about some adaptive resolution. Although 
coping behavior can be influenced by many factors, 
it is always assumed to be determined, in part at 
least, by an individual's needs, drives, goals, per- 
sonality structure, and history; the nature of these 
determinants influences the kinds of changes he 
effects in person—object relationships. Therefore, 
the goal of coping behavior is generally assumed to 
be need gratification, threat reduction, and mastery 
in achieving a readaptation. When coping, the in- 
dividual responds to actual physical stimuli of 
some potential significance to him, rather than to 
conceptually defined or artificial stimuli. 

As outlined here, then, so-called expressive as- 
pects of behavior (e.g., handwriting), psychomotor 
behavior, and physiological and reflex responses 
are not considered coping behavior. Verbal behav- 
ior elicited by inkblots or questionnaires is also 
excluded. Verbal responses, however, may qualify 
in certain instances as coping behavior. For exam- 
ple, if an individual actually fails a task and then 
blames his failure on the make-up of the test, the 
verbal blaming response is considered to be part 
of the need-determined coping behavior. 


History of situational testing 


Galton. One of the earliest significant state- 
ments concerning performance testing of person- 
ality was made by Francis Galton (1884). After 
ene that the character that shapes our con- 
ea is a durable entity and therefore measurable, 
a Bt Proposed ‘that it be assessed by observing 
‘a nite acts in response to particular situations. 
Saag he noted that such observation need not 
dia us chance happenings in real life that would 
ae Significant behavior; he pointedly stated, 
b mergencies need not be waited for, they can 
e extemporized; traps, as it were, can be laid” 
po as Substituting the term “test” for “trap,” 
pare, that Galton was advocating situational 
EA He discussed briefly one of his efforts at 
Aa up “traps.” After observing that friends who 
nes an ‘inclination’ to one another [seem to] . . - 
6 T or slope together when sitting side by side” 

. 184), Galton devised a pressure gauge to be 


placed on the legs of chairs in an effort to test 
this hypothesis. 

Galton’s proposal that personality be measured 
through carefully recorded acts representative of 
usual conduct received some attention in the years 
that followed. The work from 1900 to 1930 is re- 
viewed by Symonds (1931). Perhaps the most sig- 
nificant in this period is the work of P. E. Voelker 
(see pp. 302-304, 306, 308 in Symonds 1931), 
whose situational tests of “honesty” were later ap- 
plied with little modification by many workers, 
including Hugh Hartshorne and Mark A. May 
(1928) in their now classic Studies in Deceit. For 
example, one test measuring cheating consisted of 
a sheet of cardboard on which were printed ten 
circles of varying diameters. The subject was in- 
structed to close his eyes and try to write in each 
circle the appropriate number (one to ten). Suc- 
cessful subjects obviously had cheated by opening 
their eyes while performing, since the task is im- 
possible to accomplish otherwise. [See MORAL DE- 
VELOPMENT. ] 

Expressive behavior. The early efforts discussed 
above gave way in the 1930s to studies of expres- 
sive features of personality. Although research with 
expressive movements represented a change in 
focus, it nevertheless made a contribution to the 
history and development of situational testing by 
bringing systematic attention to the importance 
of nonverbal behavior in personality assessment. 
The suggestion by Allport and Vernon (1933) that 
a person’s forms of expression (e.g., body postures 
assumed, type of gait, manner of speaking) are 
determined by, and therefore reveal, personality 
was followed by many studies. Werner W. Wolff 
(1942), who made the largest contribution to this 
area, used a method in one study closely approach- 
ing that of situational testing. He proposed that 
by observing how a child punched a balloon, one 
could determine whether he was aggressive or 
insecure and by observing a child manipulating a 
jar of cold cream, one could determine whether he 
reacted cautiously, carelessly, or fearfully to his 
environment. [See EXPRESSIVE BEHAVIOR, | 

Objective tests. In the early 1940s there ap- 
peared a group of workers who contributed more 
directly to the technique and concept of situational 
testing and who distinguished themselves by spe- 
cifically challenging the adequacy of the inven- 
tories and projective devices rapidly growing in 
popularity as instruments of personality assess- 
ment. These investigators argued that the behavior 
measured in personality assessment should include 
not only introspective, self-assessing. verbal re- 
sponses but also responses to objective tests not 
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dependent upon the judgment of scorers, in which 
the individual would be unaware of the diagnostic 
meaning of his performance. R. B. Cattell (1957) 
and his colleagues have clearly made the largest 
contribution in this area. Their aim has been to 
establish a complete taxonomy of personality and 
to construct a battery of methods that will measure 
personality in terms of this taxonomy. Their strat- 
egy has been to collect observations from three 
types of sources, observations based on (1) ratings 
of life in situ and life records; (2) questionnaires 
(self-report); and (3) objective tests. Then by 
means of factor analysis, they have attempted to 
identify the major dimensions along which indi- 
viduals differ in each of these realms of behavior 
and to compare and relate dimensions found in 
one realm with those found in another. Only the 
objective tests developed by Cattell have relevance 
for the topic of situational testing. Some four 
hundred of these objective tests have been used 
with various clinical groups and age levels. Three 
methods are employed: pencil and paper tests; 
physiological and psychomotor tests; and minia- 
ture situational tests. As an example of the first, 
the subject notes whether he agrees or disagrees 
with a statement presented to him along with an 
indication of the percentage of persons who agree 
with it; the score reflects one’s tendency to agree 
with the majority. An example of a situational test 
is the “impairment of performance by fright test.” 
Here the individual performs the first of three 
equally difficult finger mazes under normal con- 
ditions, the second while he is shocked periodically, 
and the third while a cage containing a rat and 
several cockroaches stands adjacent to the maze. 
Reviews of factor-analytic studies using these 
objective measures are available (e.g., Cattell & 
Scheier 1958), In Great Britain, Eysenck and his 
colleagues ( 1947) have also made substantial 
contributions to the development of objective tests 
of personality, their research Strategy and tech- 
nique broadly resembling those employed by Cat- 
tell in the United States. In addition, estimation 
of the sizes of squares (Krathwohl & Cronbach 
1956) has been Suggested as a response that re- 
veals personality. This type of research can be 
credited with having brought to attention the limi- 
tations of the exclusive use of verbal responses, 
the need for objectivity in Scoring, and the useful- 
ness of factor analysis for Personality assessment. 


Studies are themselves limited, because the experi- 
ences or response processes activated in the sub- 
ject by many of the objective tests used may be 


unrelated to responses evoked by some real-life 
situation of potential interest to the student of 
personality. 

Simulation of life situations. Also during the 
1940s, a fourth phase developed in the history of 
situational testing. The subject was Presented with 
a situation constructed by the investigator to simu- 
late as much as possible some real-life criterion; 
judges rated the degree to which the subject mani- 
fested personality traits presumed to be critical for 
handling the situation. 

Max Simoneit is credited (see Kelly 1954) with 
the first attempt to use situational tests, which he 
developed to assess the suitability of individuals 
for various military assignments in the German 
army. Believing that military success depended on 
an individual's personality, Simoneit began by list- 
ing the traits that distinguished the personalities 
of German military heroes. He then devised situa- 
tions that were likely to evoke these traits and 
administered them to officer candidates. This pro- 
gram influenced the one set up by the British War 
Office Selection Board, which, in turn, influenced 
the program set up by the U.S. Office of Strategic 
Services (1948) in the 1940s. The mission of the 
OSS was to select and maintain a network of 
agents who would conduct destructive operations 
behind enemy lines during World War 1. The bat- 
tery of tests used to screen candidates included 
Situational tests for assessing such variables as 
emotional stability, leadership, and skill in observ- 
ing and reporting. For example, candidates were 
asked to direct two recalcitrant workmen (played 
by actors) in the construction of a giant cube and 
to supervise the movement of equipment across 
a brook. 

After the war, the OSS found that its test ie 
did not correspond with overseas staff ae 
of candidates but noted that some of the contri ie 
tions of the method could not be assessed ee 
cally because war conditions prevented the gal 
ing of adequate criterion data. s 

The Oss studies brought attention to, and a 
ulated interest in, the method of situational tes al 

Weislog 
during the first years after World War 11. bees. 
(1954) discussed the continued use of ngs!) 
testing by the military. Kelly and Fiske rai 
used situational tests to investigate the es cal 
of students for professional training n a 
psychology. For example, groups of stu a ; 
asked to place large cement blocks intent 
arrangements, or a student was asked to hose sex" 
a teacher (played by another student) W. E 
ual conduct had been called into question. 
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examining their data statistically, Kelly and Fiske 
reported situational tests contributed little, if any- 
thing, to the predictions made. 

The postwar years also saw the emergence of 
variants of situational tests. The leaderless-group 
discussion technique, also originated by a German 
psychologist, J. B. Riefert, in the 1920s, was ex- 
tensively investigated in the United States (e.g., 
Bass 1954) and in Britain (Higham 1952), pri- 
marily as a means of selecting personnel for indus- 
try. Leadership potential was measured by asking 
a group of subjects to carry on a discussion about 
a topic usually determined by the criterion situa- 
tion, and the amount of leadership each subject 
displayed was rated, Role playing, another variant, 
was also explored and developed (Mann 1956). 

After the postwar years, situational testing, in 
contrast to projective tests and questionnaires, 
failed to gain momentum and popularity as a 
method of personality assessment. This seemed to 
be due in part to the failure of the OSS and clinical- 
psychology programs to report statistical support 
for the predictive power of situational tests, in part 
to problems inherent in the method. These prob- 
lems are discussed below after a brief review of 
current applications of situational testing. 


Current applications 


Military applications. Perhaps the greatest at- 
tention currently being paid to situational testing 
is by the U.S. military. Tupes and his colleagues 
(1958), for example, employed situational tests 
designed to simulate the present-day job role of 
an air force officer. In one test the candidate is 
asked to play the role of a squadron commander 
confronted by three pilots (played by actors), all 
of whom wish a leave at a time when only two 
can be spared. In another, the candidate inter- 
views an actor playing the role of an irate mayor 
in whose town airmen on leave have recently been 
aggressive and abusive; and in another, small 
Stoups of candidates, placed in a prison compound, 
are asked to escape. Measures obtained from these 
Various situations related significantly to criterion 
measures obtained six months after candidates 
Ree graduated. Kipnis (1962) predicted job per- 
ormance of U.S. naval personnel with a “hand- 
Skills test” intended to measure an individual's 
Motivation to persist beyond minimum standards 
a a tiring task, The examinee, who views the test 
ae measure of hand and finger dexterity, pencils 

y marks, a task that rapidly promotes hand and 
a fatigue. The examiner tells the subject what 

Passing score is (established empirically) and 


notes whether the subject stops or slows down 
after reaching the announced passing score or 
continues to strive. 

General applications. In civilian psychology, 
situational testing is receiving isolated but concen- 
trated attention from several workers. Lois B. 
Murphy and her colleagues (1962) have con- 
ducted a series of studies designed to identify the 
“coping styles” children use to handle demands, 
new opportunities, and the problems they experi- 
ence throughout development. Murphy uses vari- 
ous naturalistic situations to elicit adaptive re- 
sponses; for example, children are observed during 
pediatric examinations, at the zoo, at parties, and 
while being transported to and from the clinic. 

Miniature situations test. Santostefano (1960) 
developed a Miniature Situations Test (MST) in 
an effort to make available a method that would 
overcome some of the disadvantages of the natu- 
ralistic assessment situation, while nevertheless 
providing samples of nearly real-life behavior for 
systematic study. Under the MST method, a child 
is presented with two objects and invited to act 
upon one of them as specified (e.g., “You may 
either water this plant or break this light bulb”; 
“You may either see how strong you are by squeez- 
ing this [a dynamometer] or raise this flag in honor 
of Abraham Lincoln”), The child is urged to make 
his choice quickly and is encouraged to make use 
of feelings he experiences when confronted with 
the objects. This method assumes that the child 
will selectively perceive and respond to the objects 
presented, endowing them with positive, negative, 
or neutral incentive values according to the domi- 
nant motives and personality attributes unique to 
him and that he will reveal some aspect of his 
personality through the objects he values or re- 
jects and through the changes he physically im- 
poses upon them. It is also assumed that the 
choices presented will activate genuine affective 
and motive states within the child and that al- 
though the nature of the instrumental acts re- 
quired by various situations may vary, the acts 
are nonetheless related to or derived from the 
same motive state. Thus, for example, if a child 
chooses, in several situations, to water a plant, to 
repair torn paper with Scotch tape, and to help 
the examiner put on a pair of gloves, it is assumed 
that his personality functioning is very much char- 
acterized by the disposition to bestow affection 
and to nurture. 

The MST, in comparison to the freer naturalis- 
tic method, seems to have several advantages for 
personality research. While allowing the free ex- 
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pression of personality dimensions in situations 
requiring the subject to cope with people and ob- 
jects, the MST (1) provides identical and restricted 
stimuli to all subjects, giving some assurance that 
each subject will become equally involved with the 
stimulus configuration designated as crucial; (2) 
produces unequivocally identifiable responses; (3) 
yields measures whose meanings are not com- 
pletely dependent on the judgments of raters; 
(4) elicits a wide sampling of personality disposi- 
tions represented in the choices performed; (5) 
makes available, in addition to specific acts, a 
wide range of behaviors, encapsuled, however, in 
brief time samples and available for scrutiny and 
systematic analysis; and (6) is feasible for use in 
ordinary research settings. 

Factor-analytic studies have contributed to the 
construct validity of the miniature situations, as 
have studies comparing the performance of vari- 
ous clinical groups (e.g., orphans, delinquents, 
brain-damaged children) with controls. The job 
effectiveness and conscientiousness of rehabilitated 
mentally retarded young adults have also been 
predicted with promising success. 

Leaderless-group discussion. The leaderless- 
group discussion technique is one variant of situa- 
tional testing that has continued, since the 1940s, 
to receive increasingly widespread use in the 
United States (in the military and industrial 
spheres), and.in England, Australia, and Germany. 
In one study (Kiessling & Kalish 1961 ), for exam- 
ple, groups of applicants to the Honolulu Police 
Academy were asked to assume that they were 
alone on foot, patrolling an area noted for “cop 
haters,” when they notice two delinquents stealing 
a tire from a car and suddenly find themselves 
surrounded by many other delinquents. The group 
is asked to discuss what one should do and why, 
while raters evaluate each candidate. 

Clinical applications. Recently other variations 
of the situational-testing technique have been pro- 
posed to study questions of interest to clinicians. 
For example, Lerner (1963) noted that recent 
social-psychological studies, whose results support 
the traditional notion that schizophrenics are un- 
responsive and autistic, have used artificial stimuli, 
such as asking patients to respond to stick-figure 
drawings to determine their capacity to perceive 
Cues in social roles. Lerner, instead, asked groups 
of schizophrenics to clean a ward and rated their 
behavior in terms of, for example, verbal interac- 
tion and dependency on the leader. He reported 
that schizophrenics exhibit appropriate social moti- 
vation and responsiveness when behaving in a 


meaningful situation. Similarly, Schulman and his 
colleagues (1962) have proposed Situational test. 
ing for investigating parent-child interaction, 

Non-English-speaking countries. The author 
finds little evidence that the technique of situa- 
tional testing is in use in non-English-speaking 
countries. 

Japan. A recent review (Kodama 1957) of 
personality tests used in Japan indicates a clear 
preference for projective and questionnaire devices 
borrowed from those most popular in the United 
States, One performance test of personality devel- 
oped by Japanese psychologists, and apparently 
widely used, makes inferences about “excitation, 
adaptation, and temperament” from an individual's 
continuous performance with addition problems; 
but it fails to qualify as a situational test because 
there seems to be no intent to approximate, in the 
test situation, behavior associated with some real- 
life criterion. 

The Soviet Union. Recent surveys of Soviet 
psychology indicate that techniques for assessing 
individual differences and personality are not of 
central interest to Russian psychologists, who are 
concerned typically with studying intellectual abil- 
ities or aptitudes. However, Brozek (1962) notes 
that one of the distinctive features of Soviet psy- 
chology is its emphasis on studying psychological 
processes under natural conditions. He points out 
that Russian investigators tend to see abilities 
(e.g. scientific, technical, literary) as integral 
components of personality and that one preferred 
method of studying these abilities is to observe the 
performance of individuals in “natural expeti- 
ments,” a method highly regarded by Soviet psy- 
chologists as an approach most true to life. m 
cording to this point of view, observation of the 
way in which a pupil solves tasks that are pre- 
sented to him enables the investigator to mike 
best the psychological characteristics implicated in 
various abilities. The resemblance between” al 
methodological philosophy and that of situation 
testing is noteworthy. 

Germany. In spite of the fact that sinador 
testing originated in Germany, the wilt 
found little evidence that the technique, ae 
the variant of group discussion (Rundgespr ie 
or round-table talk), is currently widely u: iret 
that country (Meili 1937). In a critique of se d 
logical tests Simoneit (1954) argues thane 
objectivity and quantification in personality anit 
ment has resulted in less psychological ne 
fulness and that the commonly used p true 
tests are inadequate in evaluating a person 
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feelings; he proposes situational testing and the 
study of expressive behavior as a remedy to this 
problem. 


Implications for theory and practice 


Considerations in devising and using tests. Three 
main procedural steps should be followed in de- 
vising and using situational tests. The first step is 
to select and describe the personality characteris- 
tics that are critically implicated in the criterion 
under investigation. The second is to construct tasks 
that will elicit these particular characteristics; and 
the third is to choose a method of observation and 
scoring. These steps are largely interrelated. The 
more detailed the analysis and understanding of 
the criterion situation and of the personality char- 
acteristics to be isolated and observed, the better 
one can construct situations to elicit these traits 
in ways which are identifiable and scorable. The 
past failure of situational tests to relate to criteria 
can be attributed, to a large extent, to the fact that 
workers have not specified in sufficient detail those 
aspects of the criterion selected as critical (e.g., 
Stern, Stein, & Bloom 1956). Thus, if situational 
tests are being used to assess effective parental 
performance, it should be made clear whether 
“effective” means setting limits, giving emotional 
comfort, or expressing an interest in the child's 
activities. The situations devised by Tupes and his 
colleagues (1958) are good examples of tests de- 
rived from a detailed understanding of the cri- 
terion. 

Several additional general considerations con- 
cerning the make-up and use of situational tests 
are worth noting. Situational tests are intended to 
be lifelike and therefore should evoke adaptive 
Tesponses as isomorphic as possible with the be- 
havior and feelings required by the criterion situa- 
tion. The tasks should be structured in such a way 
that each subject experiences the same need to 
respond and to become involved and each is un- 
meee of the specific psychological meaning of his 
ehavior and the intent of the situation. Ideally, 
Scoring should be confined to whether the person- 
re characteristic under question appears or not; 

grees of appearance of a trait are desired, the 
ee should be clearly defined in terms of 
Sai able forms of behavior. Situational testing 
ie ted a costly technique because of the elaborate 
hoe’ Material, and the use of actors and ob- 
z Di sometimes required. Lastly, it is important 
Hien pom that compromises are available be- 
are e degree of experimental control and of 

likeness built into the situational test. The more 


naturalistic the setting and observations, the closer 
the criterion is approximated; but at the same time, 
the more expensive the test, the greater the prob- 
lem of insuring that subjects will become involved 
with the same stimuli and the more difficult it is 
to identify critical responses. The less naturalistic 
the test, the greater the degree of experimental 
control over stimuli and responses. 

The concept of levels. In the investigation and 
clinical assessment of personality, one major goal 
is to learn about the psychological organization 
of an individual. For some time this organization 
has been viewed by psychologists as containing lev- 
els (e.g., conscious—unconscious, latent—manifest, 
depth—surface ). Recently much attention has been 
given to the relation between levels of personality 
functioning tapped by traditionally used projective 
and inventory devices and levels activated by situ- 
ations in everyday living. This interest seems to 
have grown out of the fact that psychologists con- 
tinue to experience failure and frustration in their 
efforts to assess and predict personality function- 
ing by using projective devices and inventories. 
Lindzey and Tejessy (1956), in discussing this 
issue, note that perhaps the most perplexing of the 
many problems facing the user of projective tech- 
niques is the difficulty in assigning an inference 
made about the patient to some specific level of 
his behavior; that is, will the attribute inferred 
from the test response be expressed freely in a 
public setting, or will it be revealed in rarely en- 
countered circumstances, or only in fantasy? Simi- 
larly, workers have wondered whether motive 
states are necessarily reflected in fantasy and 
whether fantasies are always mirrored in overt 
behavior; their research findings are equivocal. 
Questions and concerns such as these have led 
some workers to develop a theoretical system deal- 
ing with levels of personality (e.g., Leary 1957), 
defining and distinguishing between public, pre- 
conscious, and unconscious levels of functioning 
and their interrelationships, while others have been 
led to propose that the psychologist use as data 
only behavior that is as much like the criterion as 
possible (e.g., Cronbach 1956). The task facing 
personologists is to discover which test indices 
obtained at the fantasy or self-report levels are 
related to coping behavior and then to learn the 
systems and principles governing the interrelation- 
ships among these various links. Relating stand- 
ardized social-situational tests to projective and 
questionnaire devices could contribute to a solution 
of the problems raised in the consideration of 


levels of personality. 
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The criterion problem. Since the 1930s much 
effort has been devoted to improving the predic- 
tions that can be made from various test devices. 
Recently, however, workers have expressed dis- 
satisfaction over this one-sided enterprise and the 
primarily negative results it has produced. There 
is a growing awareness that solutions to the prob- 
lem of predicting behavior depend as much upon 
a definition of the criterion situation (the relevant 
types of behavior and personality traits it evokes) 
as upon the development of test instruments. The 
main goal of assessment is prediction relevant to 
situations of daily living. But naturalistic situations 
seldom contain easily defined, well-controlled stim- 
ulus configurations that are critically implicated in 
the evocation of behavior considered adaptive in 
terms of the demands of the situation. Discussion 
of the criterion problem (Stern, Stein, & Bloom 
1956), when combined with discussion of the 
philosophy and technique of situational testing, 
provides psychologists with an approach to con- 
structing adequate criterion stimuli against which 
personality instruments can be tested, an approach, 
interestingly enough, which closely parallels that 
used by Simoneit when first devising situational 
tests. If, for example, an investigator is interested 
in developing the power of a projective test to 
predict a child’s ability to handle the stresses of 
the first days of school, he could develop his cri- 
terion by observing children in the first days of 
school and by interviewing teachers, in an effort 
to isolate those aspects of a child’s personality that 
appear critical in handling the demands of this 
Situation, Situational tests could then be con- 
structed to evoke systematically and reliably those 
particular personality dimensions, providing ade- 
quate criteria with which the investigator could 
explore the predictive powers of his instrument. 

Current research concerned with the effects of 
television on the aggressive behavior of individuals 
nicely illustrates how failure to define and develop 
a criterion may lead to inconsistent findings. After 
showing subjects an aggressive film, one worker 
(Feshbach 1961) observed the associations they 
made to aggressive and neutral words, while others 
(Mussen & Rutherford 1961) observed answers to 
questions about the desire to aggress. Perhaps it is 
not surprising, then, that some studies report a 
decrease in “aggressive behavior” after participa- 
tion in fantasied aggression and other studies re- 
port an increase. It would seem that situational 
tests, which would provide the subject with the 
opportunity to aggress overtly and forcefully in 
terms of well-defined objects and circumstances, 
would provide a better test of the question whether 


fantasied aggression leads to an increase in be. 
havioral aggression. 


The technique and philosophy of situational 
testing have been traced historically and related 
to issues that concern theory, research, and clini- 
cal practice in personality assessment, What is 
the future for situational testing? Some workers 
have voiced the opinion that its future is dim, 
However, this writer feels that it is more than a bit 
paradoxical that psychologists are quick to reject 
the worthwhileness of test behavior closely resem- 
bling aspects of daily living, while at the same 
time they cling comfortably to their projective 
devices and inventories, which primarily evoke 
words as far removed from real-life conduct as the 
inkblots and pictures themselves. It would seem 
that the dominating influence of behaviorism and 
logical positivism has cultivated, at least among 
American psychologists, a preference for the sterile 
laboratory as a research setting and a fetish for 
test instruments, however far removed from the 
conditions of real life, an influence which seems 
to be spreading to personality-assessment practices 
in other countries. However, it is encouraging to 
note that dissenting voices are growing louder, as 
are pleas that behaviorism not frighten social sci- 
entists away from the study of naturalistic be 
havior. The survey given here suggests that much 
of the groundwork has been laid for the develop- 
ment of promising methods that could aid ee 
scientists in predicting the day-to-day behavior 0 
individuals. Use of the situational-test technique 
might even make the assessing of personality a8 
exciting as Galton’s first attempt. 
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PERSUASION 


The art of “winning men’s minds by words” has 
occupied the attention of philosophers since long 
before the time of Plato’s and Aristotle's commen- 
taries on rhetoric. But not until the twentieth cen- 
tury has there been any concerted effort of em- 
pirically oriented scholars to describe objectively 
the conditions under which persuasion succeeds or 
fails. 

Pioneering contributions. In a recent survey of 
current knowledge about persuasive communica- 
tion, four social scientists who made pioneering 
contributions during the first half of this century 
have been singled out as the founding fathers of 
the new field of research on persuasive communi- 
cation (Schramm 1963). One is the political sci- 
entist Harold D. Lasswell, who carried out the first 
detailed descriptive studies of major propaganda 
campaigns, focusing on the communications issued 
by national elites during World War 1 and by totali- 
tarian movements that tried to influence the masses 
during the period of the great depression. Lasswell 
formulated a set of theoretical categories for ana- 
lyzing the effects of persuasive communications 
and initiated the development of systematic tech- 
niques of content analysis ( Lasswell et al. 1949). 

A second major figure is the sociologist Paul 
Lazarsfeld, who worked out new methods for in- 
vestigating the impact of mass media on voting 
behavior and on the beliefs, judgments, and values 
of the mass audience. Using poll data from US. 
election campaigns and panel surveys of public 
reactions to a wide variety of radio programs, 
Lazarsfeld and his colleagues have described the 
complex communication networks and cross pres- 
sures that exist in modern communities. Their 
studies (Lazarsfeld et al. 1944; Katz & Lazarsfeld 
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1955) highlight the influential role of local opin- 
ion leaders, who function as “gatekeepers” by pro- 
moting or rejecting the evaluative judgments 
transmitted in the mass media by political parties, 
business organizations, public welfare authorities, 
and intellectuals. 

A third outstanding contributor to scientific re- 
search on social influence is the psychologist 
Kurt Lewin, whose studies emphasized the power- 
ful barriers to change that are created by the pri- 
mary and secondary groups with which the indi- 
vidual is affiliated. One of his major explanatory 
concepts to account for resistance to new sources 
of social pressure is the counterpressure arising 
from the existence of group norms, in which atti- 
tudes are anchored. When attempting to understand 
why a person accepts or rejects a persuasive mes- 
sage, according to Lewin (1947), the investigator 
should examine the person’s anticipations about 
whether or not he will be diverging from the norms 
of his reference groups, such as his family, his 
work group, and the social organizations with 
which he identifies. [See the biography of Lew1n.] 

Another psychologist, Carl I. Hovland, is the 
fourth contributor to have helped build up system- 
atic knowledge about communication effects and 
the processes of persuasion, Hovland initiated 
broad programs of experimental research designed 
to test general hypotheses concerning the factors 
that determine whether or not the recipients of a 
persuasive message will be influenced. Some of 
the studies by Hovland and his collaborators (see, 
e.g., Personality and Persuasibility 1959) bear di- 
rectly on the hypotheses put forth by Lasswell, 
Lazarsfeld, and Lewin, while others have led to 
unexpected discoveries and new theoretical analy- 
ses of the psychological processes underlying suc- 
cessful persuasion. [See the biography of HovLanp.] 

The sections that follow present some of the 
main generalizations drawn from these and other 
studies in order to indicate representative hypoth- 
eses and empirical findings. [For a fuller discussion 
of theoretical approaches, see ATTITUDES, article 
On ATTITUDE CHANGE.] 

Resistance to persuasion. During recent dec- 
ades many self-styled experts in propaganda, jour- 
nalism, advertising, and public relations have pro- 
moted an image of modern man as highly gullible. 
The new field of mass-communications research, 
which developed from the work of these four pio- 
neering social scientists, has shattered this image 
along with other popular preconceptions concerning 
the alleged power of the mass media to manipulate, 
exploit, or “brainwash” the public. A review of the 
evidence accumulated from relevant research stud- 


ies indicates that mass communications generally 
fail to produce any marked changes in social atti- 
tudes or actions (see Klapper 1960), The slight 
effects produced by the press, films, radio, and 
television are usually limited to a reinforcement 
of the pre-existing beliefs and values of the audi- 
ence. Campaigns designed to persuade people to 
change their values, to modify social stereotypes, 
or to foster a new political ideology generally mobi- 
lize powerful resistances in the public. So pervasive 
are these resistances, according to the documented 
accounts of numerous investigators, that one could 
characterize “successful persuasion” by the mass 
media as a relatively rare social phenomenon. 
Hovland, Janis, and Kelley (1953), in their anal- 
ysis of factors that influence attention, comprehen- 
sion, and acceptance of persuasive messages, call 
attention to essential differences between educa- 
tional instruction and persuasion. Most of the dif- 
ferences pertain to the audience's initial expecta- 
tions, which have a marked influence on motivation 
to accept or reject the communicator’s conclusions, 
In the case of instructional communications, where 
high acceptance is readily elicited, the educational 
setting is typically one in which the members of 
the audience anticipate that the communicator is 
trying to help them, that his conclusions are incon- 
trovertible, and that they will be socially rewarded 
rather than punished for adhering to his conclu- 
sions. In persuasive situations, on the other hand, 
interfering expectations are aroused, and these 
operate as resistances. The authors point out that 
the findings from experiments on communication 
effects seem to converge upon three types of inter 
fering expectations that operate to decrease the 
degree of acceptance: (1) expectations of being 
manipulated by the communicator (e.g. ei 
made a “sucker” by an untrustworthy source, wH0 
has ulterior economic or political motives for oe 
to persuade others to support his position); (2) 
ae king incorrect 
pectations of being “wrong” (e.g., mal i g ae 
judgments on a controversial political issue or (À ; 
looking antithetical evidence that would be groun 
for a more cautious or compromise position 
(3) expectations of social disapproval (¢8» im 
the local community or from a primary Le 
whose norms are not in accord with the comm 
cator’s position). j s 
This third type of resistance, which a 
Lewin’s “social anchorage” concept, has be n that 
extensively investigated. Many studies indic: ed to 
when the members of an audience are ES goes 
a communication advocating a position that b 
of their 
counter to the norms of one or more directly 
erence groups, their resistance will vary 


with the strength of the formal and informal sanc- 
tions put forth by the norm-setting group. Quite 
aside from any special sanctions applied to those 
who violate the group norms, the mere perception 
that the vast majority of other members accept a 
given norm operates as a powerful force on the 
individual to conform to that norm (Lewin 1947; 
Asch 1952; Kiesler & Corbin 1965). 


Determinants of successful persuasion 


Most of the substantiated propositions about 
successful persuasion designate factors that help 
to decrease psychological resistances when the re- 
cipients are exposed to a persuasive communica- 
tion (see Janis & Smith 1965), Exposure requires 
not only adequate physical transmission of the 
message but also audience attention, which will 
not be elicited if the communication is perceived 
as deviating markedly from pre-existing attitudes 
and values or as violating the norms of an impor- 
tant reference group. If a persuasive communica- 
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tion evokes sufficient attention to surmount the 
exposure hurdle, its success will then depend upon 
comprehension (i.e., the extent to which the audi- 
ence grasps the essential meanings the communi- 
cator intended to convey) and acceptance (i.e., 
the degree to which the audience is convinced by 
the arguments and/or is responsive to the motiva- 
tional appeals presented in the communication ). 

The main types of factors that have been investi- 
gated are those specified by Lasswell’s classic for- 
mula for communications analysis: Who says what 
to whom with what effect? Janis and Hovland 
(1959) present a paradigm of interacting factors 
that enter into successful persuasion (see Fig- 
ure 1). Communicator characteristics, the content 
of the message, the manner of presentation, and 
other crucial situational factors shown in column 1 
are considered to be stimulus variables capable of 
touching off the key mediating processes (repre- 
sented in column 3)—attention, comprehension, 
and acceptance—that give rise to the various 
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effects designated as attitude change (column 4). 
The magnitude of the influence exerted by the 
stimulus variables also depends on different types 
of predispositional variables (represented in col- 
umn 2). 

Communicator and content factors. The most 
thoroughly investigated propositions bearing on the 
processes of persuasion are those that specify how 
one or another of the stimulus attributes is related 
to successful persuasion. Sometimes communica- 
tions research has merely confirmed certain of the 
well-known prescriptions formulated by experts in 
the art of persuasion. But research occasionally 
leads to the discovery of unexpected limiting con- 
ditions or hitherto unknown relationships that call 
into question the commonly accepted assumptions 
about how people can be influenced to change 
their beliefs or attitudes. 

Prestige and “sleeper” effects. Studies of pres- 
tige effects have confirmed some “obvious” assump- 
tions and refuted other, equally “obvious” ones. 
Several communication experiments have shown, 
as might be expected, that there is an immediate 
gain in acceptance of persuasive communications 
when the message is given by someone who is ini- 
tially accorded relatively high prestige by the audi- 
ence or when the arguments are attributed to a 
relatively trustworthy source. But, contrary to ex- 
pectations, it has been found that in the long 
run, persuasive communications from low-prestige 
sources turn out to be just as effective as those 
from high-prestige sources. This phenomenon has 
been termed the “sleeper” effect (see Hovland et al. 
1953, pp. 254-259; Kelman & Hovland 1953), Both 
positive and negative prestige effects seem to wear 
off over a period of several weeks. When a com- 
munication comes from a nonprestigious or dis- 
trusted source, the audience tends at first to reject 
the message. But as time goes on, acceptance of 
the originally discounted statements has been found 
to increase, evidently because with the passage of 
time, the content of the message is no longer spon- 
taneously associated with the source. 

One-sided versus two-sided presentations. An- 
other issue that has been systematically investigated 
is whether persuasion is more effective when it 
concentrates exclusively on the arguments support- 
ing the propagandist’s position or when it includes 
discussion of the opposing arguments. Hitler and 
other Nazi propaganda strategists have claimed 
that in appealing for a specific line of action, no 
rival or Opposing ideas should ever be mentioned, 
because they invite comparisons, hesitation, and 
doubt. But the available evidence indicates that 
this principle holds true only under very limited 


conditions, such as when the audience is unaware 
of the arguments for the other side of the issue, 
When the audience is strongly opposed to the posi- 
tion being advocated, a persuasive message is gen- 
erally more effective if it includes the opposing 
arguments than if it presents only the arguments 
in favor of one side of the issue (see Hovland et al, 
1953, pp. 105-110; Hovland et al. 1949, pp, 201- 
227; Klapper 1960, pp. 113-116). Moreover, even 
when the audience is not initially opposed to the 
communicator’s position, a two-sided presentation 
will be more effective in creating sustained changes 
in attitudes if the communication is given under 
conditions where the audience will subsequently be 
exposed to countercommunications presenting the 
opposing arguments. 

Inoculation devices. When the members of an 
audience are pre-exposed to the opposing argu- 
ments along with some refutations, they are to 
some extent “inoculated” against subsequent coun- 
tercommunications, because the new arguments 
will be much less impressive and more readily 
discounted (see Lumsdaine & Janis 1953). This 
type of inoculation has been found to produce a 
“generalized immunization effect” under certain 
conditions, notably when a communication advo- 
cates recommendations that the audience already 
regards as being in line with commonly accepted 
norms, such as simple health rules. Thus, inclusion 
of a few arguments that momentarily shake the 
confidence of members of the audience in cultural 
truisms they had always taken for granted will 
reduce the chances of their being influenced by 
subsequent counterpropaganda, because they be- 
come more resistant both to the counterargumean 
specifically mentioned and refuted in the origin 
two-sided communication and to new comin 
ments which might otherwise shake their beliefs 
(McGuire 1961). 

Another simple inoculation device has e 
found to be effective in reducing the influence s 
unconventional communications that take T 
with cultural truisms of the type that people a 
seldom or never called upon to defend. This poen 
consists of stimulating the members of an ee 
to build up defenses by warning them in a an 
that their hitherto unchallenged beliefs wee 4; 
be exposed to strong attack (McGuire 1961; 
McGuire & Papageorgis 1962). à 

The Sal Eioaa Sroezing” effects of be Sa 
mitment to a newly adopted policy Or © T oE 
action form the basis for another type ° “py 
munication device that prevents backs A . 
perimental studies indicate that me: upi 
sequent countercommunications can be 


after presenting impressive arguments and ap- 
peals, the communicator uses his persuasive influ- 
ence to induce his listeners to endorse the position 
publicly—for example, by voting openly for it, 
signing a petition, or showing other overt signs of 
acceptance that will be seen by people in their 
community (see Lewin 1947; Attitude Organiza- 
tion and Change 1960). 

Other types of inoculation procedures have been 
studied to determine the conditions under which 
acceptance of a new attitude or policy recommen- 
dation will be sustained despite subsequent expo- 
sure to frustrations, threats, or setbacks that arouse 
strong negative effects. For example, after having 
been persuaded to adopt a communicator’s recom- 
mendations, the audience may subsequently be ex- 
posed to warnings or punishments that stimulate 
avoidance of the recommendations. The emotional 
impact of the subsequent setback will tend to be 
reduced if the audience has been given inoculating 
communications that predict the threatening event 
in advance, thus eliminating the element of sur- 
prise and, at the same time, stimulating appropri- 
ate defenses (see Janis 1962). Similarly, in the 
case of “bad news” events that generate pessimistic 
expectations about the future, preparatory com- 
munications that present grounds for maintain- 
ing optimistic expectations can help soften the 
blow and enable the audience to resist being un- 
duly influenced by the impact of the disturbance 
(Janis et al. 1951). In general, the eventual suc- 
cess of any attempt at persuading people to carry 
out a given course of action is likely to be attained 
if the communicator frankly discusses the possible 
subsequent difficulties and countercommunications, 
Presenting them in a way that helps to create a 
Cognitive frame of reference for discounting or 
minimizing them if they do, in fact, materialize. 

Effectiveness of “primacy.” Since most inocu- 
lation devices involve familiarizing the audience 
with counterarguments, two-sided communications 
might be more advantageous in the long run, even 
in circumstances where a one-sided communication 
could be expected to be more successful in pro- 
ducing immediate changes in a higher proportion 
of the audience. There are, of course, many differ- 
ent ways of arranging the opposing arguments in 
a two-sided communication, and some ways of 
Inserting them have been found to be more effec- 
tive than others. For example, when the audience 
iS not familiar with the opposing arguments, a 
Sadia. communication from an authoritative 

rce tends to be more effective if the opposing 
arguments are presented after, rather than before, 
the favorable arguments that support the communi- 
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cator’s conclusion. By giving strong favorable argu- 
ments first, the communication arouses the audi- 
ences motivation to accept the communicator’s 
conclusion, so that when the negative material 
subsequently occurs, it can be better tolerated. Fur- 
thermore, if a strong case is made for the com- 
municator’s position at the outset, there is a greater 
likelihood that the recipient will make an early de- 
cision to accept the communicator’s position and 
thereafter tend to minimize dissonance or conflict 
by ignoring the opposing ideas (see Brehm & 
Cohen 1962; Festinger 1957; 1964; Janis 1957; 
1959). This primacy effect, when tested with com- 
munications designed to induce opposing attitudes 
toward the same social objects or policies, proved 
to be extremely pronounced under conditions where 
the contradictory material was not spontaneously 
salient and there was no time interval between the 
first set of arguments and the second, contradic- 
tory set (Asch 1946; Luchins 1957a; 1957b; Janis 
& Feierabend 1957). Under other conditions, how- 
ever, such as where the audience is very familiar 
with the opposing arguments and has initial doubts 
about the communicator’s honesty, a recency effect 
might predominate, making it more advantageous 
to give the counterarguments first, with the main 
affirmative arguments saved for the end of the 
communication (see Hovland et al. 1957, pp. 
130-147). 

Emotional appeals. It is commonly recognized 
that when a person remains unmoved by repeated 
attempts to persuade him with rational arguments, 
he might nevertheless show a marked change as 
soon as emotional appeals are introduced. Probably 
the most widespread form of emotional appeal in 
modern Western culture involves the arousal of 
fear by emphasizing anticipated threats. Antiwar 
propaganda, public health campaigns, and other 
efforts at mass persuasion frequently rely upon 
emotional shock devices to motivate people to carry 
out preventive measures or to support policies 
designed to avert potential dangers (for example, 
promoting a ban against a nuclear weapons test 
by emphasizing the horrors of war). For maximal 
effectiveness, this device requires not only that the 
communications succeed in arousing fear but also 
that the recommendations function as effective 
reassurances. The latter term refers to verbal state- 
ments—plans, resolutions, judgments, evaluations 
—that are capable of alleviating or reducing emo- 
tional tension. Many communication experiments 
have been designed to test popular claims about 
the effectiveness of emotional appeals and to deter- 
mine objectively the conditions under which such 
appeals are successful. 
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Political leaders and public health authorities 
often assume that the protective actions or prac- 
tical solutions they advocate will be more readily 
accepted, the more they succeed in frightening the 
audience about the dangerous consequences of fail- 
ing to adhere to their recommendations. This as- 
sumption may occasionally be correct, as in the 
case of recommendations concerning immediate 
escape actions (e.g., evacuation of a danger area 
within a few minutes after an emergency warning 
is issued). But the assumption appears to be ques- 
tionable in many instances, especially when the 
goal is to induce delayed actions or sustained atti- 
tudes (e.g., evacuating at some future date, if the 
threat materializes; supporting a disarmament 
movement; favoring prodemocratic policies), The 
available evidence indicates that presenting fear- 
arousing material in a persuasive communication 
tends to stimulate the recipient's vigilance and his 
need for reassurance. But this does not necessarily 
increase his motivation to accept authoritative rec- 
ommendations about ways to avert or cope with 
the danger, since the person may find other ways 
to reduce his fear. Whenever fear is aroused to a 
very high level, resistances tend to be strongly 
mobilized. This will reduce the effectiveness of a 
persuasive communication, unless it is outweighed 
by certain other factors that could facilitate atti- 
tude change (Hovland et al. 1953, pp. 56-98; Lev- 
enthal 1965). Among the facilitating conditions 
that increase an audience's tolerance for a strong 
dosage of fear-arousing material in a persuasive 
message is the inclusion of one or more highly 
specific recommendations that offer an apparently 
good solution to the problems posed by the threat, 
with no obvious loopholes (Leventhal et al. 1965). 
When this condition is not met, as is often the case 
in “scare” propaganda, the use of a strong emo- 
tional appeal may produce much less acceptance 
of the communicator’s recommendations than a 
milder appeal, since the audience will then become 
motivated to attach little importance to the threat 
or develop some other form of defensive avoidance 
that enables them to alleviate their fear. Some- 
times strong emotional appeals attain spectacular 
persuasive effects, but it is difficult to predict accu- 
rately that a very high dosage of fear will not 
exceed the optimal level. Preliminary “program as- 
sessment” research with cross-sectional samples of 
the intended audience is usually needed to make 
sure that the version of the communication con- 
taining a strong appeal is more effective than a 
version containing a more moderate appeal. 

Implicit versus explicit conclusions. Many 
claims are made about effective Strategies for in- 


ducing people to change their attitudes and values, 
but some of these claims are difficult to assess 
empirically. One such notion is that a nondirective 
approach—similar to that used by many counselors 
and psychotherapists when dealing with people who 
seek help in making conflictful decisions of an up- 
setting nature—will generally be more effective in 
mass communications than a more directive ap- 
proach [see MENTAL DISORDERS, TREATMENT OF, 
article on CLIENT-CENTERED COUNSELING]. One 
testable implication of this notion is that a mass 
communication will sway more people if instead 
of stating an explicit conclusion, the communica- 
tor allows the audience to draw its own conclusions 
from the facts, arguments, and appeals that he 
presents. Undoubtedly, there are some circum- 
stances where direct suggestions are likely to meet 
with such insurmountable resistances that an in- 
direct approach is the only hope for exerting any 
influence whatsoever. But for informative com- 
munications dealing with relatively impersonal 
issues, the available research evidence indicates 
that it is generally more effective to state the con- 
clusions or recommendations explicitly, even when 
the propagandist is regarded as biased or untrust- 
worthy (see Hovland et al. 1953, pp. 100-105; 
Klapper 1960, pp. 84-91, 116-117). One of the 
main advantages of stating the conclusions ex- 
plicitly is that it helps to prevent the audience 
from missing or distorting the essential point of 
the arguments. 4 
Effects of role playing. One type of indirect 
persuasion that has been carefully investigated in- 
volves the use of a special role-playing technique. 
It has been repeatedly found that when a person is 
required to play a role entailing the presentation 
of a persuasive message to others in his own words, 
he will be more influenced than if he were passin 
exposed to the same message. This “saying 15 S 
lieving” tendency has been found to occur a 
when role playing is artificially induced by asking 
people to take part in a test of their public-speaking 
ability or to write essays (Janis & King 1954; that 
man 1953). Experimental evidence indicat 
mere repetition of a persuasive message has a 
effect as compared with an improvised = 
and elaboration of the arguments and cone 
(King & Janis 1956). The success of pe 
role playing might be attributed to several di n 
psychological processes. Festinger (1957) ae 
that the primary gain from role playing what 
from efforts to reduce dissonance ben te 
one is saying publicly and what one actu a a 
lieves, and a number of experiments offer 1962; 
supporting evidence (e.g., Brehm & Cohen 


Festinger & Carlsmith 1959; Festinger 1964). An 
alternative explanation is in terms of self-persua- 
sion: when attempting to convey the message to 
others, the role player is likely to think up new 
formulations of the arguments, new illustrations 
and appeals. These are likely to be convincing in- 
centives to himself, especially if he regards the 
improvised ideas as his “own” (see Hovland et al. 
1953, pp. 228-237; Janis & Gilmore 1965; Elms & 
Janis 1965 ). This theoretical issue has not yet been 
settled, and the differential predictions from the 
alternative explanations are currently under inves- 
tigation (see Carlsmith et al. 1966; Rosenberg 
1965). 

Personality factors. Numerous studies have 
shown that social attitudes are frequently resistant 
to persuasion because they satisfy deep-seated per- 
sonality needs. Such attitudes are likely to remain 
unchanged unless self-insight techniques or special 
types of persuasive appeals are used that take ac- 
count of the adjustive and ego-defensive functions 
of these attitudes (see Katz & Stotland 1959; Lass- 
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well 1930-1951; Smith et al. 1956). It is logical, 
therefore, that assessment of personality attributes 
should help in predicting whether a given person 
will be responsive to persuasive messages that deal 
with a particular topic or that employ one or an- 
other type of argument. Studies of authoritarian 
personalities, for example, indicate that any com- 
munication fostering rigid, antidemocratic controls 
over political dissenters and minority out-groups 
will tend to be more readily accepted by one par- 
ticular type of personality (see Adorno et al. 1950). 
An outstanding characteristic of this personality is 
a strong latent need to displace hostility away from 
in-groups toward out-groups—as manifested by 
symptoms of intense ambivalence toward parents, 
bosses, and other authority figures, combined with 
a high degree of inhibition of normal sexual and 
aggressive activities. 

Table 1 shows a set of hypothetical diagnostic 
categories, worked out by Katz (1960), that might 
prove to be useful for predicting individual differ- 
ences in responsiveness to persuasion on important 


Table 1 — Determinants of attitude formation, arousal, and change in relation to type of function 


FUNCTION 
Adjustment 


ORIGIN AND DYNAMICS 
The object of the attitude has 
proved to be useful for satisfying 
important needs; it increases one’s 
chances of gaining rewards or de- 
creases one’s chances of being 
punished. 


Ego defense The attitude helps to protect the 
person from internal conflicts or 
from becoming highly disturbed 


by external dangers. 


Valve expression The attitude helps the person to 
maintain his self-identity and his 
self-esteem; it allows him to ex- 
press himself or gives him a sense 


of independence. 


Knowledge The attitude satisfies the needs for 
understanding, for meaningful 
cognitive organization, and for 


consistency and clarity. 


1 


. Activation of needs. 
. Salience of cues associated 


« Posing of threats. 
2. Appeals to hatred and re- 


l. Rise in frustrations. 
|. Use of authoritarian sugges- 


CHANGE CONDITIONS 


. Failure to gain usual satisfac- 
tions from the attitude, 

. Creation of new needs and 
new levels of aspiration that 
are not satisfied by the attitude. 

. Shifting of rewards and pun- 
ishments so that the attitude is 
no longer reinforced, 

|. Impressive demonstration of 
new and better paths for need 
satisfaction, 


AROUSAL CONDITIONS 


with need satisfaction, 


~ Removal of threats. 

. Catharsis which reduces the 
need for defense against pent- 
up impulses. 

. Opportunity for development 
of self-insight. 


pressed impulses. 


tion. 
. Salience of cues associated 1. Some degree of dissatisfaction 
with values. with the self, 


|. Appeals to individual to reas- 


. Ambig 


. Impressive demonstration of 
the greater appropriateness of 
a new attitude for enhancing 
one’s self-image. 

. Loss of usual environmental 
supports to such an extent that 
old values are undermined, 


sert self-image. 
es which 
self concept. 


threaten 


. Ambiguity created by new in- 
formation or by a marked 
change in environment. 

. Meaningful information about 
a different way of analyzing 
the problem or about new 
ways of solving it. 


Re-encounters with the original 
problem that required a solu- 
tion or encounters with related 
problems. 


Source: From Katz 1960, p. 192. 
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social issues. The first step would be to determine 
which of the four basic types of functions (col- 
umn 1) is served by the person’s current attitude 
on the issue in question. The type of need fulfilled 
by each function is shown in the second column 
of the table. If one could assess the status of these 
needs accurately, one would presumably be able to 
predict the types of situations that would arouse 
the attitude (column 3) and the general conditions 
that would have to be met in order to change each 
individual's attitude (column 4). 

One of the main reasons why Katz's hypothetical 
schema is regarded as a promising functional ap- 
proach to the study of attitude change is that it 
helps to explain why the conditions required for 
changing certain attitudes, particularly those diag- 
nosed as serving an ego-defensive function, are not 
satisfied by the usual forms of persuasion to be 
found in the mass media. The material in this table 
carries the implication that no simple psychological 
formula can be expected to subsume all instances 
of attitude change. This implication confirms clin- 
ical observations, which indicate that unconscious 
dynamics as well as conscious processes are some- 
times involved when a person clings unyieldingly 
to his stand in the face of strong persuasive argu- 
ments or when he readily gives in, without any 
opposition, on an important social or political issue 
(see Lasswell 1930-1951). It also agrees with ex- 
perimental evidence indicating that even when 
dealing with conscious attitudes, we cannot expect 
to find only one type of cognitive process that will 
account for successful persuasion. For example, 
the striving for consistency among cognitions bear- 
ing on the same issue—which Heider (1958) has 
postulated as a fundamental human tendency in 
response to all meaningful communications— 
sometimes appears to be an important determinant 
of reactions to persuasion, but it does not account 
for certain instances where other motivational fac- 
tors, such as pleasurable anticipations of gain, may 
become the dominant determinants (see Rosen- 
berg & Abelson 1960). 

Individual differences are to be expected, not 
only in response to persuasive Pressures from argu- 
ments that create cognitive imbalances but also in 
response to emotional appeals (see Janis & Fesh- 
bach 1954) and group pressures induced by giving 
information about the consensus of judgments 
made by one’s peers (see Crutchfield 1955); these 
are called “content-bound” predispositions. Some 
research evidence also points to specific person- 
ality needs, preferences, and sensitivities that pre- 
dispose certain persons to be highly responsive to 
one or another limited type of communicator 


(Janis & Hovland 1959); these are termed “com- 
municator-bound” predispositions. In Figure 1, all 
such sources of individual differences are repre- 
sented in the second column by the box labeled 
“specific predispositions.” 

In addition to specific types of predispositional 
factors, there are also certain personality attributes 
that predispose people to be swayed by, or to be 
resistant to, any persuasive message, irrespective 
of what is said, how it is said, or who says it. This 
general persuasibility factor (represented by the 
box at the top of the second column in Figure 1) 
has been inferred from research on the consistency 
of individual differences. Such research indicates 
that when a large audience is exposed to many 
different types of persuasive communications on 
many different types of issues, some persons are 
consistently resistant, whereas others are moder- 
ately persuasible, and still others are highly per- 
suasible (see Abelson & Lesser 1959a; Janis & 
Field 1959a; 1959b). Among the personality fac- 
tors found to be predictive of low resistance to all 
forms of persuasive infiuence are (1) low self- 
esteem; (2) inhibition of overt aggressive behavior; 
(3) high fantasy imagery and strong empathic 
responses to symbolic representations; and (4) 
other-directed rather than inner-directed orienta- 
tion, that is, a value system stressing adaptation 
to the social environment rather than inner stand- 
ards for regulating one’s conduct. It is a puzzling 
fact, however, that these relationships have been 
found only in samples of men, since no such rela- 
tionships have been found as yet in samples of 
women. These findings have been attributed to dif- 
ferences in the social roles prescribed for women 
and men in our society, which may also account 
for the repeated finding that women are more pêr- 
suasible than men on social and political issues 
(see Hovland & Janis 1959). 


Current status of research 


The hypotheses summarized in the foregaing 
review of social-psychological studies of persuasi : 
do not constitute an exhaustive propositional 5 
ventory of all available findings but, rather, 
to highlight major relationships that have e 
from systematic research. Supporting evii e 
comes from carefully controlled experiment 
the studies usually have been carried Meee 
small subpopulation samples, most often ee 
to American high school or college students Er 
classroom situation. Consequently, the fae a 
of the hypotheses and the limiting conditions ise 
which they hold true have not yet been adeg me 
explored. There is some reason to expech 


ever, that the relationships initially observed in the 
limited experimental situations will have fairly 
wide applicability because: (a) they appear to be 
in line with observations from other, less well- 
controlled investigations of social influence (such 
as panel studies of opinion trends during political 
campaigns, market research surveys on widely ad- 
vertised products, and case studies of responsive- 
ness to psychological counseling or psychotherapy ); 
and (b) in a number of instances where replica- 
tions have been carried out with other subpopula- 
tions in other types of communication situations, 
confirmatory evidence has been obtained (see 
Janis & Smith 1965). 

In any case, the development of experimental 
techniques, attitude scales, and sophisticated meth- 
ods for analyzing the effects of many different 
causal factors and their interactions have now 
reached the point where we can obtain relevant 
and cumulative knowledge from systematic studies 
of the conditions under which persuasive com- 
munications are effective or ineffective (see Camp- 
bell 1963). As new techniques and methods are 
used in the rapidly expanding field of communi- 
cations research, we can expect a fuller account 
of the influence of the variables discussed, as well 
as new discoveries concerning the ways in which 
communication stimuli and predispositional fac- 
tors interact in the processes of persuasion. 


IRVING L. JANIS 
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relevant material may be found in BRAINWASHING; 
COMMUNICATION; COMMUNICATION, MASS; COMMU- 
NICATION, POLITICAL; Groups, articles on GROUP 
BEHAVIOR and GROUP FORMATION; HYPNOSIS; 
Norms; SAMPLE SURVEYS; SYSTEMS ANALYSIS, Arti- 
cle on PSYCHOLOGICAL SYSTEMS; THINKING, article 
ON COGNITIVE ORGANIZATION AND PROCESSES; and in 
the biographies of HovLanp and LEWIN.] 
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PETRIE, W. M. FLINDERS 


William Matthew Flinders Petrie (1853-1942) 
was the founder of Egyptology. An only child, he 
was born at Charlton, near London. His parents 
were cultured people, and he inherited an adven- 
turous streak from his grandfather, Matthew Flin- 
ders. Considered too delicate for normal schooling, 
young Petrie followed his mother’s interest in 
numismatics and collected coins for the British 
Museum. His own reading was extensive, and his 
education in mathematics, begun by his father, 
was reinforced by a university extension course, 
his only contact with orthodox instruction. 

Long walks in southern England sharpened his 
eye to the difference between natural contours and 
traces of man’s early constructions, and his first 
book, Inductive Metrology (1877), presented his 
idea that the units of measurement used in an- 
cient plans were indicative of date and culture. 
With his bent for mathematics, Petrie contem- 
plated astronomy as a career, but lack of a degree 
barred him from it, and he was not pressed to earn 
a living. 

Egypt. A casual interest in Charles Piazzi 
Smyth’s theory that Biblical prophecy was en- 
shrined in the measurements of the Great Pyramid 
Brew into a resolve to investigate its truth, and in 
1880 Petrie began his survey, which exploded the 
very theory he had hoped to prove. This work, 
The Pyramids and Temples of Gizeh (1883), re- 
mained the standard account until 1925, when it 
Was partially superseded by a new survey. 
ane Journey to Egypt had far-reaching effects, 
a etrie saw the destruction of antiquities going 

around him and realized that it was his life’s 
Work to salvage all he could. During the next fifty 
ee he published about a hundred books, mostly 
cavation reports. Funds were provided by friends 
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and by the Egypt Exploration Fund (now Society), 
and later he founded the Egyptian Research Ac- 
count and the British School of Archaeology in 
Egypt. After his marriage in 1897, he was ably 
seconded by his wife in the collection of funds. 
He became the first Edwards professor of Egyptol- 
ogy at University College, London, in 1892, a chair 
which he held until 1933. 

Stratigraphy in excavation. In 1890, under the 
auspices of the Palestine Exploration Fund, Petrie 
made a brief but revolutionary excursion into 
southern Palestine. Despite Heinrich Schliemann’s 
excavations of mounds at Troy, none of the people 
working in the Palestinian field had examined a 
mound, though Schliemann and Wilhelm Dörpfeld 
had recognized that these common features of the 
Asian landscape represented an accumulation of 
occupation. Using pottery as the main guide, Petrie’s 
new system of recording all finds, however small, 
was put to the test at Tell el Hesy in a six weeks’ 
campaign, and the results were published in a book 
of that name (1891). Through erosion, layers of 
the mound were visible in section, and Petrie noted 
exactly where each potsherd was found. He saw 
that every layer had its characteristic pottery, and 
he outlined absolute dates for the main types, 
which soon proved to be correct. 

Sequence dates. Petrie’s greatest work was ac- 
complished in the last decade of the nineteenth 
century, based on his excavation of about nine hun- 
dred tombs in Egypt. His book, Diospolis Parva 
(1901), established the sequence of prehistoric 
periods before the coming of the Dynastic Race to 
Egypt, estimated by him at that time as occurring 
in the fifth millennium B.c., although the date is 
reduced in current literature by a thousand years. 
His conclusions were reached by stages: first, he 
prepared a numbered corpus of pottery; then he 
listed the contents of each grave, arranging them 
in order to bring pots (represented by corpus num- 
bers) of like shape together in the series. New 
forms stood out in the arrangement, and each stage 
was necessarily linked to older and later phases. 
He assigned a relative date (“Sequence Date”; e.g., 
SD 30-80) to each phase, representing a fixed order 
of a given series of burials. Then he checked the 
sequence against that based on pottery alone by 
repeating the process on other grave goods such 
as stone vases and ivories. Petrie’s system is still 
the standard way of placing prehistoric tomb groups 
in Egyptian history, and it can be adapted for 
research in every field. 

With 25 years of experience behind him, Petrie 
wrote Methods and Aims in Archaeology (1904), 
the first book of its kind, and although in later 
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years he may not always have followed his own 
precepts, they remain sound in principle. 

Sinaitic inscriptions. In 1904—1905 Petrie went 
to Sinai; he discussed the physical and historic 
relations between Egypt and Palestine in Researches 
in Sinai (1906). Besides hieroglyphic records of 
Egyptian mining expeditions, Petrie found sculp- 
ture in foreign style, sometimes inscribed in a new 
script, perhaps an early attempt at alphabetic 
writing, 

Excavations in Palestine. After World War 1, 
Petrie decided to devote his remaining years to 
problems which could best be solved in Palestine. 
From 1926 onward, he or his staff dug at four sites 
south of Gaza, of which the most rewarding was 
Tell el Ajjal. The rich finds were described in a 
five-volume report, Ancient Gaza ( 1931-1952), 


Petrie died in Jerusalem in 1942 at the age of 89. 
Dedicated to his self-appointed task, he created 
Egyptology where before there was dilettantism or 
mere plunder; and his genius imposed discipline 
upon the previous disorder, His autobiography, 
Seventy Years in Archaeology (1931), records his 
triumphs and frustrations on the way, but nothing 
can detract from his achievements or from his 
wide view of what archeology should be—the study 
of how man has attained his present position and 
powers, 

OLGA TUFNELL 


[See also ARCHEOLOGY.] 
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William Petty ( 1623-1687), the English econ- 
omist and first systematic exponent of “the art of 
political arithmetic,” was a self-made man with a 


wide range of talents and immense mental energy, 
He was the son of a clothier, traditionally described 
by his biographers as “poor,” but if, as is likely, 
Petty’s father left him the good house and 8 acres 
of land that he owned in his birthplace, Romsey, 
in 1685, the family was comfortably above the 
poverty line. Certainly, when the young Petty went 
to sea as a cabin boy at the age of about 14, he 
was sufficiently literate in Latin and Greek to gain 
entry to the Jesuits’ college at Caen, where he was 
put ashore on breaking a leg: there he studied 
Latin, Greek, French, and mathematics. Then, after 
a short spell in the Royal Navy, he went on to study 
medicine at the universities of Utrecht, Amsterdam, 
Paris, and Oxford. By 1651 he had taken his Oxford 
degree of Doctor of Physic and entered the London 
College of Physicians. At the age of 28, he became 
vice-principal of Brasenose College and professor 
of anatomy at Oxford. 

Although one of the leading intellectuals of his 
day and a founding member of the Royal Society, 
Petty was a man of the world rather than a scholar. 
He wrote more than he read. He was a persistent 
inventor: he patented a double-writing device be- 
fore he got his medical degree, and he pursued his 
design for a twin-hulled ship through four proto- 
types. He worked out a scheme for the rebuilding 
of London after the Great Fire of 1666, In 1652 
he went to Ireland as physician-general to the 
army, which was no sinecure in a country ravaged 
by the plague and other lethal epidemic diseases. 
Within two years of his arrival in Ireland he had 
taken over the complex task of surveying the for- 
feited estates of the Irish rebels as a basis for thelr 
redistribution among the English conquerors. This 
task, too, he performed with his usual efficiency 
and drive and with the determination and pugnat- 
ity which earned him many enemies and er 
him into countless lawsuits. He spent much ee 
working life defending his Irish survey and han 
dling his own Irish estates, but he still mana 
to read a number of communications to the Roy 
Society on topics varying from dyeing e 
the clothing trade to the testing of mineral wa 
and to achieve a formidable and serious ee 
output on economic, demographic, naval, meaic® 
and scientific subjects. 

A good deal has been written about Petty a 
the published opinions of his contemporary he 
no doubt of the respect and esteem with ior as 
was regarded. It is unlikely that he was atta 
has been claimed, of the “Natural and P H 
Observations Upon the Bills of Mortality, oati 
was signed by his friend John Graunt a coul 
[1662] 1963, vol. 2, pp. 314-435), but 


well have been; and the title page of his own 
“Observations Upon the Dublin-Bills of Mortality” 
({1683] 1963, vol. 2, pp. 479-491) explicitly asso- 
ciates him with the earlier work, which was a path- 
breaker in demographic analysis, John Aubrey, his 
first biographer, the diarists John Evelyn and Sam- 
uel Pepys, and the economist and statesman, Dave- 
nant, all men of distinction in their own right, 
regarded Petty as one of the outstanding men of 
their time. 

Petty’s claim to fame as an economist lies not 
so much in his originality or his theoretical ability 
as in his analytical skill. His insistence on meas- 
urement and his clear schematic view of the econ- 
omy make him the first econometrician, and he 
was constantly evolving and using concepts and 
analytical methods that were in advance of his 
time. His evaluation of the gain from foreign trade 
in “Another Essay in Political Arithmetick Con- 
cerning the Growth of the City of London” ([1682] 
1963, vol. 2, pp. 451-478) is based on a statement 
of the benefits of the division of labor and special- 
ization and was written a century before Adam 
Smith’s famous account, Petty put so much stress on 
the role of labor in creating wealth that he has been 
regarded (for instance, by Marx) as an early ex- 
ponent of the labor theory of value. But, as is 
shown by a characteristic and frequently quoted 
passage from his “Treatise of Taxes and Contribu- 
tions” ([1662] 1963, vol. 1, pp. 1-97)—“Labour is 
the Father and active principle of Wealth as Lands 
are the Mother’—his theory of production and 
value is based on the two original factors of pro- 
duction of the early economists. He was the author 
of the first known national income estimates, in 
Verbum sapienti” ([1665] 1963, vol. 1, pp. 99- 
120), “Political Arithmetick” ({e. 1676] 1963, vol. 
1, pp. 233-313), and “Treatise of Ireland” ([1687] 
1963, vol. 2, pp. 545-621), although he did not 
trouble to define or develop his concepts and was 
Tough, even careless at times, in his use of figures. 
Some of the calculations in his “Treatise of Taxes 
and Contributions” and elsewhere are essentially 
exercises in what is now called “cost-benefit anal- 
ysis.” He was the first writer, so far as we know, 
to grasp the concept of the velocity of money, again 
in “Verbum sapienti,” althou gh in his “Quantulum- 
cunque Concerning Money” ([1695] 1963, vol. 2, 
F 437-448) there is no trace of it. He was not 
E ove manipulating his data in ways that would 
eae his polemical arguments, and it would be 
en T accept his statistics uncritically. But he was 
ihe ze to political prejudice, and his analysis of 
i, nomy of his time for England and Ireland 

or example, “The Political Anatomy of Ire- 
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land” ([1671—1676] 1963, vol. 1, pp. 121-231)— 
and his various essays on “political arithmetick” 
are both shrewd and penetrating. 

PHYLLIS DEANE 


[Other relevant material may be found in the biog- 
taphy of GRAUNT.] 
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PHENOMENOLOGY 


The word “phenomenology” is derived from the 
Greek phainein, “to show,” from which came phai- 
nemenon, meaning “that which appears.” Thus, 
phenomenology in a general sense could be liter- 
ally the orderly study of phenomena, or appear- 
ances, and could as such encompass much of tradi- 
tional philosophy and science. In its restricted 
sense, however, it refers to the study of phenomena 
as phenomena and more specifically to a twentieth- 
century movement in German philosophy centered 
loosely on Edmund Husserl. A good English in- 
troduction to Husserl’s Philosophy has been pro- 
vided by Marvin Farber (1943), and the whole 
phenomenological movement has been reviewed by 
Herbert Spiegelberg (1960). The various forms of 
existentialism, which have developed particularly 
in France since World War 1, may be regarded as 
offshoots of the phenomenological movement but 
should not be identified with it. Phenomenology 
and existentialism, while primarily philosophical, 
have had important repercussions in psychology, 
psychiatry, theology, literature, drama, and the fine 
arts. The present article will limit itself to a brief 
sketch of the philosophical background and a some- 
what fuller account of psychological phenomenol- 
ogy [see PsycHoLocy, article on EXISTENTIAL Psy- 
CHOLOGY, and the biography of Hussert]. 


Philosophical phenomenology 


Philosophical phenomenology is essentially a 
method of philosophical analysis rather than a 
school in the traditional sense of the term. The 
phenomenologist attempts to suspend or “place in 
brackets” (einklammern) all metaphysical and 
epistemological Presuppositions, to identify and de- 
scribe the essences of experience as they are in- 


tuitively apprehended (Anschauung, Wesensschau), 
and on this basis to provide a fresh approach to 
the classic problems of metaphysics, epistemology, 
psychology, logic, ethics, and aesthetics. Husserl 
believed that in this way philosophy could be res- 
cued from unproductive speculation and re-estab- 
lished as a rigorously empirical discipline (strenge 
Wissenschaft). Phenomenology is thus empirical 
in its insistence on a continuous and unbiased 
scrutiny of experience, but not empiricist as the 
word is commonly used to refer to explanation 
through past experience. It shares the empirical 
emphasis of Locke and his successors and of some 
of the positivists—e.g., Ernst Mach—differing from 
these primarily on grounds that their analyses of 
experience were neither rigorous nor complete. 
Phenomenology is not to be confused with phe- 
nomenalism, the doctrine that knowledge is limited 
to the data of experience and that the knower is 
consequently incapable of transcending the world 
of phenomena, nor with psychologism, the conten- 
tion that all philosophical problems can be reduced 
to terms of psychology. 

History of modern phenomenology. Anteced- 
ents of modern phenomenology are to be found in 
the many classic attempts—e.g., that of St. Augus- 
tine—to found a philosophy on the data of intui- 
tion. The modern movement begins, however, with 
Descartes, whose “method” involved the suspension 
of all beliefs and the acceptance as true of only 
those ideas which are presented “so clearly and 
distinctly as to exclude all ground of doubt.” Al- 
though Kant's distinction between the phenomenal 
and the noumenal world rests in part on phenome- 
nological analysis, the post-Kantian (e.g., Hegelian) 
use of the term “phenomenology” and its loose use 
in the sciences to denote anything observable are 
not regarded as relevant. Husserl’s phenomenology 
goes back to Descartes for its inspiration and a 
liberally from the psychological analyses of f 
liam James (1890) and Husserl’s teacher pete 
Brentano (1874), particularly from the lat i 
doctrine of intentionality. Husserl’s early Eer 
was in the phenomenological basis of mathem: E 
and logic, from which he moved to epist 
and eventually to a transcendental phenomen yee 
which is usually regarded as a form of pe, jm 
cal idealism, Among his publications the eee 
portant for psychology is his Logische ae 
ungen (1900-1901). The Jahrbuch für Phi ie edit: 
und phdnomenologische Forschung, which Ee 
ed from 1913 to 1930, also contains oo 
to psychology, and the Husserl archives at vol 
are yielding further material of pa how- 
terest. His more strictly philosophical wor 


ever, have had a profound influence on the ex- 
istentialist psychologies [see the biographies of 
James; KANT]. 

Other influential German philosophers who 
shared in the phenomenological movement were 
Alexander Pfänder, Max Scheler, Nicolai Hart- 
mann, and Moritz Geiger. Scheler in particular, al- 
though not a disciple of Husserl, became widely 
known for his extension of the phenomenological 
method into the fields of ethics and value theory. 
It was Martin Heidegger, however, Husserl’s pupil 
and successor at the University of Freiburg, who 
perhaps unintentionally popularized the term “ex- 
istentialism” and who, through his often baffling 
metaphors and neologisms, has been in some meas- 
ure responsible for phenomenology’s reputation as 
an obscure and almost esoteric metaphysical sys- 
tem. Since the publication of Sein und Zeit 
(1927) Heidegger has moved steadily away from 
a straightforward phenomenology and has con- 
cerned himseif more and more with the ontological 
problem—that of the fundamental meaning of be- 
ing (Sein). In spite of his early enthusiasm for 
the National Socialist ideology he has been an 
influential figure in recent continental European 
philosophy, particularly in French existentialism 
and in the existentialist movements in theology 
and psychiatry. Since World War 11 the phenome- 
nological movement in philosophy has been most 
active in France under the leadership of Jean-Paul 
Sartre and Maurice Merleau-Ponty. Both men have 
accepted in principle Husserl’s phenomenological 
method but have pushed further in the direction of 
a detailed examination of the phenomena of hu- 
man existence (the human situation) and thus to- 
ward a complete existentialist philosophy. Sartre’s 
more strictly phenomenological contributions are to 
be found in his studies of imagination and emo- 
tion; in his novels and plays and in his more sys- 
tematic philosophical works, e.g., Being and Noth- 
mgness (1943), he presents existentialism as a 
Philosophy of life. Merleau-Ponty is also known 
both as a phenomenologist and as an existentialist. 
pe most important phenomenological contribu- 
ions have been his analyses of perception (1945) 
and of language (1952). 


Psychological phenomenology 
Psychological phenomenology, too, is essentially 
Bt approach rather than a particular kind of the- 
Pella and it owes as much to Goethe, 
iS inje, and the physiologist Ewald Hering as it 
2 re to Husserl. Whereas Husserl’s phenomenology 
oad prior to all empirical science, concerned 
essences rather than with “matters of fact,” 
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psychological phenomenology is frankly and ex- 
plicitly descriptive. It represents what David Katz 
called an attitude of “disciplined naiveté,” the at- 
tempt to suspend all presuppositions (biases, im- 
plicit assumptions) and observe and describe the 
world of phenomena (consciousness, immediate 
experience, phenomenal world, psychological field) 
as it is naively apprehended. Husserl regarded this 
kind of psychology as an important empirical dis- 
cipline, coordinate with the other empirical sci- 
ences but not to be confused with “pure” phenome- 
nology. Psychological phenomenology is in the 
tradition of all the psychologies since Descartes 
which have accepted as their task the scientific 
study of consciousness, and it is thus to be distin- 
guished from the faculty psychologies, the depth 
psychologies, and the psychologies which limit their 
subject matter to externally observed or logically 
inferred behavior. 

Phenomenology and introspection. Phenomeno- 
logical description must not be equated, however, 
with the introspective analysis of the Wundtian 
school, best represented in the English literature 
by Edward B. Titchener. For Titchener (1929), 
“existential” experience consists of the pure, irre- 
ducible, conscious content (sensation, feeling, im- 
age) which is left after all object reference or 
meaning context has been deliberately brushed 
aside. To confuse the sensation itself with the 
stimulus which arouses it—e.g., the color with the 
light wave—is to commit the stimulus error. The 
phenomenologist would accept introspective anal- 
ysis as a legitimate technique for the establishment 
of correlations between the variables of physical 
stimulation and the dimensions of consciousness, 
as in psychophysics, but he would reject the asser- 
tion that the introspectively identified sensations 
are necessarily the elements of consciousness. He 
would also agree with the introspectionist that the 
physical, physiological, or other conditions which 
give rise to a phenomenon, or which can be cor- 
related with it, should not be confused with the 
phenomenon itself. Just as color is not an array 
of light waves, so the person we perceive is not 
the person who may “really” exist; one is a phe- 
nomenal datum, the other an independently defin- 
able process, entity, or condition which may or may 
not be causally related to the phenomenon. The 
phenomenologist differs radically from the intro- 
spectionist in his insistence on the acceptance as 
legitimate psychological data of the very phenom- 
ena which the introspectionist emphatically rejects, 
namely, such phenomena as organization, directed- 
ness, attractiveness, or requiredness—phenomena 
which for the introspectionist are secondary out- 
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comes of association or interpretation. If we sus- 
pend our presuppositions, the phenomenologist 
argues, and refuse to impose on the phenomena 
any theory as to their origin or their veridicality, 
we still find ourselves confronted with a meaning- 
ful world of things, events, and people, organized 
and interrelated in vastly complex ways. 

Elimination of biases and presuppositions. Our 
first phenomenological task is to observe, describe, 
and analyze the structures, properties, dimensions, 
and interrelations of phenomena as they are naively 
apprehended. This, the phenomenologist contends, 
is a highly disciplined activity, fully as rigorous as 
introspection, analogous to but not identical with 
the phenomenological reduction of Husserl [see the 
biographies of TITCHENER; WounptT]. 

The various biases or presuppositions which 
must be placed in brackets have been variously 
classified and need not be detailed here. Some of 
these stem from the reductive atomism of classical 
Newtonian physics and can be recognized as im- 
plicit in the associationist psychologies developed 
since the work of John Locke; others stem from 
the Darwinian emphasis on explanation by refer- 
ence to origins, evident both in behaviorist learn- 
ing theory and in Freudian psychoanalysis; and still 
others reflect the confusion of logical implication 
with psychological content so frequently encoun- 
tered in theories of motivation. Without denying 
the value of reductive analysis or of developmental 
studies, the phenomenological psychologist would 
argue that these and other biases involve a pre- 
judgment of the phenomena and the consequent 
risk that significant phenomena, such as the me- 
lodic property of a series of tones or the physi- 
ognomic properties of a face, will be dismissed as 
of no consequence because they disappear in re- 
ductive analysis or seem to be the products of past 
experience. He would argue further that in the 
recent history of psychology this deliberate bracket- 
ing of implicit assumptions and the acceptance of 
all data of experience, however subtle or evanes- 
cent, as intrinsically valid, has led to Significant 
extensions of psychological knowledge and ad- 
vances in psychological theory. 


Phenomenology and contemporary psychology 


The phenomenological method has been applied 
to many areas of psychology, most systematically 
perhaps in the experimental study of perception, 
particularly in connection with the traditional prob- 
lems of space, time, motion, color, sound, and 
touch. These are obviously the phenomena most 
readily accessible to observation under conditions 
of experimental control. The phenomenal world 
contains much more, however, than things and 


events with their properties and interrelationships, 
It also contains the phenomenal self and phenom- 
enal “other selves,” with their feelings, emotions, 
and desires; and it contains a whole welter of phe- 
nomenal structures, states, and processes which 
have been traditionally classified as memories, fan- 
tasies, choices, beliefs, and the like, These: phe- 
nomena also invite the interest of the phenome- 
nologist, and even though they are difficult ta bring 
into the laboratory, they are steadily yielding to 
other techniques of investigation. Some of the most 
promising of these techniques are being developed 


by the clinician, whose patient may be incapable 
of giving a full and free account of his experience 
and must consequently be induced through indirect 
devices to reveal what is there for him, The range 
of phenomenological investigation is thus steadily 
broadening to include what have been traditionally 


thought of as noncognitive processes. As facts are 
being accumulated and tec hniques refined, more 
and more attention is being devoted to the recon- 
struction of the phenomenal world of the “other 
person,” including that of the child, the deviant, 
the person who has grown up in a culture radically 
different from one’s own, even members of other 
species, Basic to the investigation is always the 
attempt to establish the “what” of experience, To 
explain the “why” one must transcend phenome- 
nology and become a systematic psychologist ot 
even a philosopher. Most of the phenomenologists 
have gone beyond their phenomenology, but me 
nomenology has produced no single psychologic: 
theory. In the available space only a few of the 
most significant contributors to psychological phe- 
nomenology can be mentioned. 3 Thie 

Experimental psychology of perception. a 
pioneer in the experimental phenomenology En 
ception is undoubtedly David Katz, whois ka 
World of Colour appeared in 1911. Before a 
Goethe (1810) and Purkinje (1819-1825) = 
published detailed descriptive analyses of the w the 
of color, and Hering in Outlines of a THRA 
Light Sense (1872-1875) had countered an 
holtz’ empiricist theory of color constancy Yer 
nativist theory based on the acceptance of Ma 
apprehended object color as psychologically 1890) 
Karl Stumpf in his Tonpsychologie (188 TA 
had also made use of the phenomenologit was 
od, but Stumpf's interest in phenom cal 
as much philosophical as it was Psy¢ e ean 
Helmholtz had explained the a ne 
stancy of object color as the interpre a py un- 
mary, physically bound sensation eleme 
conscious inference. Katz bracket to explore 
physicalistic assumptions and proceedet ain 
the world of color in all its modes of app? 


(Erscheinungsweisen), demonstrating that while 
film colors (Fliichenfarben) vary simply and di- 
rectly with changes in retinal stimulation, the same 
does not hold true for surface colors (Oberfléchen- 
farben), which are’ phenomenally inherent in the 
perceived object. Impressions of surface color 
and of illumination are phenomenally covariant, 
changes in total retinal stimulation being regis- 
tered, within very wide limits, as changes in im- 
pression of illumination, with the consequence that 
object color tends to remain phenomenally con- 
stant. As a result of Katz’s phenomenological stud- 
ies of color and of his similar studies of the world 
of touch in Der Aufbau der Tastwelt (1925), the 
facts of phenomenal constancy have become basic 
to a reconstructed psychophysics and psychophysi- 
ology of perception. [See PERCEPTION, article on 
PERCEPTUAL CONSTANCY; VISION, article on COLOR 
VISION AND COLOR BLINDNESS; and the biographies 
of HERING; Karz; STUMPF.] 

Gestalt theory. Perhaps even more rewarding 
has been the phenomenological approach to per- 
ceptual organization represented by the Berlin 
group of gestalt psychologists. The reality of ge- 
stalt qualities had been recognized by Christian 
von Erenfels in his article “Uber ‘Gestaltqualité- 
ten’” (1890) and by Stumpf, but it was Max 
Wertheimer’s experimental studies of apparent 
movement (1912) which set the stage for the ge- 
stalt movement. The older theories could not admit 
as psychologically valid an experience of move- 
ment when there is no physical movement in the 
stimuli; phenomenal movement had to be explained 
away as an illusion. Wertheimer, like Katz, simply 
accepted the phenomenal fact as valid, insist- 
ing that movement as such must have its direct 
neural correlate; hence the controversial principle 
of isomorphism. Wertheimer’s pioneer experiments 
led to a long series of studies of gestalt phenom- 
ena in perception, memory, thinking, and motiva- 
tion, many of which have been reviewed by Koffka 
(1935). While the gestalt theories which emerged, 
notably the physiological and the psychological 
field theories, go beyond phenomenology, the basic 
pian is in each case phenomenological. It 
i ould be noted that this approach, although prom- 
nently associated with the gestalt group, has been 
cd characteristic of experimental psychology 
F pem Europe. Examples of this approach are 
me ound in the work of Albert Michotte on phe- 
a cree Causality, Jean Piaget on developmental 
hee ology, F. J. J. Buytendijk on expressive move- 
ps t and Geza Révész and Albert Wellek on the 

t ology of music. [See GESTALT THEORY.] 
eee ende psychologie. Related to the experi- 

movement but not to be identified with it 


i 
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is the “understanding” psychology (verstehende 
Psychologie) of Wilhelm Dilthey, Max Scheler, 
Eduard Spranger, Karl Jaspers, and Ludwig Bins- 
wanger. Distinguishing broadly between the Natur- 
wissenschaften (natural sciences) and the Geistes- 
wissenschaften (mental or humanistic sciences), 
Dilthey (1883) placed in the former category the 
reductive or “explaining” psychology of the labora- 
tory and in the latter the psychology which seeks: 
an intuitive understanding of man’s value orienta- 
tions as revealed both in individual life and in 
history. This kind of psychology, systematized by 
Spranger as a sixfold value typology in his Types 
of Men (1914), has had considerable influence on 
the study of personality, both in Germany and else- 
where. Although the verstehende Psychologie is not 
strictly phenomenological, it represents the attempt 
at an intuitive, nonanalytic penetration of the in- 
ner life of the other person; a somewhat shaky 
phenomenological basis for it is to be found in 
Scheler’s The Nature of Sympathy (1913). Schel- 
ers descriptive analysis of the various forms of 
sympathy is part of his attempt to lay the phe- 
nomenological groundwork for a broadly inclusive 
philosophical anthropology. The relevance of the 

verstehende Psychologie to psychopathology is per- 

haps best demonstrated in Jaspers’ General Psy- 

chopathology (1913) and Binswanger’s Grundfor- 
men und Erkenntnis menschlichen Daseins (1942); 

both authors are to be ranked among the leading 
existentialists. [See SYMPATHY AND EMPATHY.] 


The phenomenological movement is still dis- 
tinctively European. In the United States, leading 
interpreters of philosophical phenomenology in the 
Husserl tradition are Marvin Farber and Herbert 
Spiegelberg. The phenomenological emphasis in 
experimental, developmental, and social psychology 
is recognizable in what has come to be known as 
“cognitive theory,” a useful summary and evalua- 
tion of which was made by Martin Scheerer (1954). 
In the psychology of personality and in psycho- 
therapy there is a vigorous and growing existen- 
tialist movement, influenced greatly by Binswan- 


ger’s Daseinsanalyse. 
Rosert B. MacLEop 


[Directly related are the entries PSYCHOLOGY, article 
on EXISTENTIAL PSYCHOLOGY; THINKING. Other rele- 
vant material may be found in MENTAL DISORDERS, 
TREATMENT OF, article on CLIENT-CENTERED COUN- 
SELING; PERCEPTION; and in the biographies of 
HUSSERL; Katz.] 
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PHILANTHROPY 


The terms “philanthropy” and “charity” have 
often been used interchangeably, but changes in 
attitude to the phenomenon of giving have meant 
that charity now has a somewhat derogatory con- 
notation, and so it is gradually giving way to the 
more acceptable concept of philanthropy. The 
Oxford English Dictionary’s definition of philan- 
thropy as being “the disposition or active effort to 
promote the happiness and well-being of one's fel- 
low men” introduces the modern conception of 
charitable effort which stresses the total well-being 
of the individual rather than merely relieving dis- 
tress. Philanthropy, too, seems a more appropriate 
term for the highly organized types of giving typical 
of modern industrialized societies. i 

Participation in philanthropic activity is now 
more characteristic of the individualistic laisse 
faire societies in which the ideology of Bast 
liberty and noninterference by the state is widely 
held than of those with communist political sys 
tems. In some of the individualistic societies it 
plays such an important role that its org 
agencies have become closely integrated with i 
whole social structure. Every year vast sums A 
money are collected for an infinite number g Rof 
anthropic purposes, and an increasing num E 
people participate in the work of collecting mo A 
through highly organized fund-raising campaign 
So well has this form of activity come to a 
certain needs that in many countries it has € 

F i icy in the form 
been incorporated in government policy 1 aritable 
of tax exemptions for contributions to go d 
agencies, It has become such an accepted fo eo 
behavior that few now escape the deman acl 
giving, and many important institutions at p 
or wholly dependent on it. e 

On sA oke hand, in countries in which ub 


rights of the community take precedence over the 
rights of the individual, there is less need for vol- 
untary philanthropy because the state takes re- 
sponsibility for most if not all of the needs of its 
people. 

No adequate data have been gathered to tell the 
history of philanthropy all over the world, Anthro- 
pologists have shed light on the practice and atti- 
tudes to giving in a number of preliterate societies. 
Historians have given us accounts, often with much 
detail, of a few Eastern societies, ancient Greece 
and Rome, and some of the medieval European 
countries. These studies show that the scope of 
charity has undergone many changes with regard 
to beneficiaries, the type of benefits, and those held 
responsible and that these have reflected the chang- 
ing social structures of the societies concerned. 
But there is very little accurate information on the 
philanthropic activity of many societies, even in 
modern times, and there are few scientific studies 
of the effect of philanthropic activity on the giver 
and on the recipient. 


History 


In early times, charity was usually motivated by 
religious faith, and so its history in different soci- 
eties can, in part, be understood by studying their 
religious ideologies. However, religious zeal is 
largely the product of a complex of forces; thus, 
even though the ideal of philanthropy was articu- 
lated through the religious functionaries, there 
were always other underlying reasons that prompted 
philanthropic giving. It seldom, if ever, occurred 
either when the giver did not receive some practical 
or psychological reward, or when there was no 
punishment for not giving. Active participation in 
Philanthropy has through the ages been much more 
characteristic of Christian, or Western, than of 
pagan, or Eastern, societies, and of those belong- 
ing to Protestant rather than to Roman Catholic or 
Orthodox religions (Grubb 1917, p. 837). 

In preliterate societies, the family, kin, caste, 
tribe, or clan looked after its own people as a nat- 
ural duty. The wealthier and those in high positions 
Were expected to bear the larger share of looking 
seed the destitute, and sometimes the village would 
ps after all its members. But almsgiving, in the 
tae the duty of everyone to give to those out- 
at : eir own close circle, was not necessary. For 
i F ong to a large family or clan was to be part 
ee System that supplied social and economic 
Male The continuing strength of the ties of 
Er RA ate tribe, or caste in Africa and Asia is 
oes e main reasons for the difference in the 

ation of philanthropy in the East and the 
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West. Another is the fact that almost all the new 
nations have begun their independence with a cer- 
tain amount of guaranteed social security in their 
constitutions. 

Certain current trends in philanthropic thinking 
and organization go back very far in history. The 
idea that giving would ensure a reward in heaven 
was found in Egypt many centuries before the 
Christian era, and giving was not limited to family 
or clan. In ancient Rome, the idea was first intro- 
duced that citizenship was the basis of the right 
of relief for every person, no matter whether he 
was destitute or not; this foreshadowed the prin- 
ciple of universality. Philanthropy in this sense had 
very little connection with poverty and was not nec- 
essarily motivated by pity, nor was it considered an 
important virtue. The state looked after its citi- 
zens, and the wealthy did not give large sums for 
charity but, rather, aided the state with such gifts 
as fitting out navies, Some organized charities did 
eventually develop around the first century A.D. 

Another philanthropic ideal that is still held by 
some Christians is the belief that one-tenth of a 
person’s income should go to charity. This idea, 
called the tithe, goes far back in the history of 
Hebrew giving and was a religious duty. The sys- 
tem of tithing was fairly common among many 
ancient peoples and was often collected in the form 
of a general tax rather than as a gift to God or the 
poor. Thus our present acceptance of state taxation 
for charity could be said to be an idea that was also 
suitable for the charitable needs of earlier societies. 
The teachings of Jesus with regard to giving, which 
have had great influence in determining philan- 
thropic attitudes down to the present day in the 
Western world, can also be traced to earlier reli- 
gious attitudes toward charity. In particular, Jesus’ 
teaching that the spirit of the giver is more im- 
portant than the size of the gift, and that it is more 
blessed to give than to receive, emphasized the 
virtues of unselfishness and giving as a personal 
sacrifice. 

In the East, religion has also usually been the 
main force to motivate giving. Many verses in the 
Koran exhort the believer to give alms. This is con- 
sidered a basic duty, and the destitute and poor 
can demand alms as a right. The Muslims thus 
look on almsgiving as a compulsory act, but one 
that enhances the prestige of the giver. 

In China, the teachings of Confucius and Men- 
cius were especially important in implanting the 
ideal of benevolence. The Chinese have always 
been generous givers, especially in the last two 
centuries. Up until the communist regime, large 
amounts of money were donated voluntarily, and 
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many local charities were coordinated through or- 
ganizations like the Hall of United Benevolence, 
which was established as early as 1805; this fore- 
shadowed the modern trends toward the Com- 
munity Chest. The idea of state philanthropy, 
however, did not exist before the communist regime. 

The Hindu religion also stresses that there will 
be a reward for generosity, after death as well as 
in the present world. But in other respects, the 
concept of Hindu charity is different from that 
found in the West. Traditionally almsgiving was 
not looked on as a matter of bestowing gifts to the 
poor and needy because of compassion and sym- 
pathy, at the whim of the individual giver. Rather, 
sharing possessions and relieving the wants of 
others was a habitual activity. This attitude was 
implanted at such an early age, and was so deeply 
accepted at all levels of Hindu society, that it ex- 
plains why ascetics have been able to wander far 
and wide in India, knowing that they would always 
be cared for. Manu’s teaching, however, did not 
include the conception of collective, disinterested 
responsibility, and so it was not until the nineteenth 
century that the idea of state responsibility and 
organized philanthropy developed in India. 

The trends which foreshadowed modern philan- 
thropy can be more easily traced in the Western 
Christian countries. For since they led the indus- 
trial revolution, they were forced to alter their pat- 
terns of giving sooner than the less developed 
countries of Africa and the East. In the early Chris- 
tian era the religious institutions taught that giving 
alms was an intrinsic and essential part of a Chris- 
tian’s duty. But there was no over-all control of 
giving in the sense that all the poor and destitute 
were equally served, Rather, a giver gave according 
to his conscience and the amount of social pressure 
to which he was exposed, directly to the recipient. 
This meant that the giver usually saw the problems 
of the recipient at first hand and thus made charity 
a personal affair, 

The great changes that took place in the whole 
conception and organization of charity were due, 
first of all, to the decline of medieval society in 
Europe and to the ensuing disorganization which 
loosed forces that gradually undermined the for- 
mer, tightly structured feudal way of life. The dis- 
solution of the monasteries, the steady increase in 
population from 1400 on, the beginning of a seri- 
ous rural-urban movement, Plagues, and wars all 
aggravated the problem of Poverty and destitution, 
The growing towns, too, gradually provided the pos- 
sibility of group concern over poverty, for destitu- 
tion had grown so extensive and evident that it 
could no longer be ignored. It was soon seen asa 


serious, continuing social problem, and this made 
its relief a social as well as a religious duty. It was 
soon evident, too, that the church, which had for- 
merly had a near monopoly over charity, could no 
longer cope with the situation; thus, private and 
secular charitable institutions began to arise to fill 
the gap. This movement to secular organizations 
was nourished by the mounting wealth of the in- 
dustrial middle classes. 

Thus historical circumstances, which shifted the 
power of the former feudal lords and the medi- 
eval church, redistributed wealth and power, and 
brought about the beginning of a new industrial 
society, also caused a redistribution of responsibil- 
ity for charity. For the dislocations and maladjust- 
ments caused by these momentous changes pro- 
duced a situation which was too heavy for the 
church and local piety to handle (Jordan 1959, pp, 
55-63). 

This new approach to philanthropy gradually be- 
came defined and supported by law. The earliest 
and most famous law was that passed by Queen 
Elizabeth 1 in 1601 to “create, control and protect 
the funds that had been allotted or donated to 
charity.” It made the local community responsible 
for providing for the destitute whose families were 
unable to look after them. But it went no further 
than to mark out those responsible for the needy. 
Laws created to deal with this new social phenom- 
enon, however, necessarily lagged behind the rap- 
idly growing needs. For no one could foresee the 
extent of the fundamental changes that were tak 
ing place or envisage the coming industrial revo- 
lution, which would produce new types of destitu- 
tion and poverty far beyond the capacity of m 
individual family or voluntary agencies to a 
after. Part of the gap was filled in by wealthy a 
viduals who gave large gifts of money o 
the misery of those caught in the new od Ei 
economic dislocations. These gifts resembl ae 
grants awarded by the numerous modern Amel pe 
foundations, but differed from them in that a 
were solely for relief, for in that era preoa A 
not recognized, nor was the future state is ) 
destitute of much interest (Andrews 1956, Ee 
They did, however, serve the function of ena 2 
people to test out new ways of coping ked 
$ x - F 1d be 100 
situation. In this sense the givers cou 
on as initiators of social change. 


Social legislation industrial 


As time went on, the agrarian and the twen 
revolutions gained momentum, until DN faced 2 
tieth century the industrialized countri aE, 


completely new way of life which ha 


about so much economic and social uncertainty 
that citizens could no longer be protected from the 
misery of destitution by haphazard relief. More- 
over, by that time it was recognized that there were 
problems to be alleviated in other areas of life. 
Thinkers, writers, statesmen, and politicians gradu- 
ally began to accept and then to define a new con- 
ception of social welfare. And the public slowly 
and grudgingly accepted the fact that the state 
must bear the major share of public relief. This 
new attitude was gradually incorporated into the 
state legislation of a number of Western nations. 

Western countries. The Scandinavian countries, 
some of the Commonwealth countries, and Eng- 
land, with their relatively stable political systems, 
were among the first countries to lead the way to 
the welfare or “social service” state, although part 
of the Scandinavian experiment was influenced by 
the first embracing movement to ameliorate the 
conditions of the workers, which had been intro- 
duced in Germany by Bismarck in the 1870s. This 
statesman had persuaded his government to adopt 
the most extensive program of social legislation 
that the world had ever seen, in order to cut the 
ground from under the feet of his socialistic op- 
ponents. The northern European countries and 
Australia served as the first social laboratories 
whose experiments could be watched and ap- 
praised by the rest of the world. 

In addition to voluntary or compulsory insur- 
ance, introduced in many of the western Euro- 
pean countries in the nineteenth century, other 
Social legislation was enacted after World War 1 
and the 1930 depression. Countries such as France 
and England led the way to further legislation after 
World War 1m, inspired in part by the Beveridge 
Report (Great Britain . . . 1942). This report great- 
ly influenced both public opinion and social legis- 
lation, for it was introduced at a time when the 
destruction and privations of the war had so stirred 
People that they were ready for a constructive ap- 
Proach to social problems. 

_ In England the real change in attitude and prac- 
tice came about during and after World War 11. 
Rising costs and the heavy taxation of the post- 
War years caused donations to private charities to 
decline sharply. This forced many of the former 
Voluntary agencies to depend on the government 
for at least part of their income in ‘such forms as 
grants in aid. By the 1960s the social services pro- 
vided by the welfare state, either directly or 
enough local authorities, impinged on the life of 

citizen from birth to death. These services are 
now regarded as one of the basic rights of all citi- 
7ens. Thus the principle of universality, the con- 
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ception of the desirability of universal coverage for 
all persons and all needs, gradually emerged. 

Some of the countries in central Europe, Czecho- 
slovakia in particular, were also early in establish- 
ing some governmental measures of social security. 
The countries of eastern Europe were a little later, 
and those of Latin America, on the whole, did not 
bring in state legislation until the 1920s. After that 
there was a rapid movement toward social assist- 
ance in many of these countries, although they 
were greatly hampered by distances, difficulties of 
communication, the slow rate of industrial develop- 
ment, and the great number of people living in 
primitive conditions. Chile, however, has had very 
advanced social legislation since 1924. 

In no country did the transition from private 
giving to state control go through without a strug- 
gle, and all countries enacted their social legisla- 
tion in a piecemeal way and in different sequences 
of coverages, with no coordinated effort for over- 
all security, until the communists came to power. 

The advance toward the universal provision of 
basic support has cut heavily into the former ac- 
tivity of voluntary organizations and private phi- 
lanthropy, although even in the United Kingdom, 
with its well-established welfare state, so many 
voluntary agencies raise money for charitable pur- 
poses that it is impossible to estimate their number 
or the amount of money they raise or spend. 

Developing countries. As the less advanced 
countries of Asia and Africa became politically in- 
dependent in the 1950s and 1960s, they attempted 
to establish a sound basis of social security for 
their people. Such attempts were hampered by the 
conditions of their economies, low standards of 
living, and overpopulation. Many of them still have 
to depend on voluntary effort and money to cover 
all their needs. The history of the extension of 
philanthropy in India is much like that of the West- 
ern countries. In the nineteenth century, changing 
conditions gradually made it apparent that indi- 
vidual effort alone could not provide for all the 
contingencies of Hindu life, so voluntary social 
agencies began to take over some of the burden. In 
the nineteenth century the Christian missionaries 
in particular led in setting up social services, and 
many Hindu reformers, such as Ghandi, were in- 
strumental in furthering this voluntary work. After 
India had gained its independence in 1947, the 
government embodied state responsibility for many 
measures of social security in the constitution of 
1950. 

However, the immensity of the problems faced 
by the new government, the lack of finances to 
train personnel to carry out the proposed reforms, 
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and the belief that only private volunteers could 
provide the “human touch” essential to counter- 
balance the impersonal administrative machinery 
of a welfare state made the planning commission 
of the first five-year plan determined to set up the 
principle that the major responsibility for organ- 
izing activities in such schemes as the welfare 
of women and children, education, and community 
development should fall to private voluntary agen- 
cies, The central Social Welfare Board was estab- 
lished in 1950 to coordinate and administer these 
agencies. It functions as an autonomous body and 
enables members to move more rapidly as new 
problems or social crises arise. A number of foun- 
dations have also arisen, and money-raising cam- 
paigns on the North American pattern are held in 
crises such as floods (Kulkarni 1961, p. 68). 

Most of the less advanced countries are still at 
the stage when their people are looked after to a 
large extent by the extended family system of tribal 
groups. One of their main problems is that sudden 
and rapid industrialization is making them alter 
their traditional patterns of duty and responsibility 
almost overnight. 

Japan, however, has been an exception. It had a 
modified pension scheme as early as the 1870s and 
was the first Far Eastern country to establish a 
modern social security system (Durand 1953, pp. 
85-157). Japan’s first Community Chest drive in 
1947 showed that the Japanese had accepted the 
conception of the responsibility of citizenship, 
which entails impersonal and regular giving. 

Communist countries. The social legislation 
and philanthropic outlook of communist countries 
are difficult to compare with those of capitalist 
countries, where private initiative and individual 
ownership of the means of production are domi- 
nant. The communist countries have gone further 
in legislating for the needs of their peoples than 
have other countries. From the time of the 1917 
Revolution in Russia the government has been de- 
termined to institute a complete system of social 
assistance. This ideal was embodied in the first 
constitution of the Union of Soviet Socialist Re- 
publics, which claimed that every citizen had the 
right to education and to many types of social as- 
sistance. Soviet citizens have been trained in this 
ideology for so long that they now take it for 
granted that the government will provide them 
with extensive social welfare benefits and universal 
education, as well as supply a society in which they 
will have cultural and even spiritual well-being. 
In the Soviet Union only religion is not covered by 
state protection, and must be supported by volun- 
tary effort. Any congregation that is willing to pay 


the salary of its clerics and the costs of maintain- 
ing the buildings may hold services in a church, 
mosque, or synagogue. The Soviet citizen is also 
able to contribute his time to different voluntary 
agencies, for the constitution permits Russians to 
form voluntary organizations to pursue scientific 
research, experiment in education, encourage 
sports, etc. There are also occasions when some 
disaster brings about the need for immediate action. 
In such cases voluntary contributions are solicited. 
But there is no formal philanthropic structure for 
the continued solicitation of funds for long-term 
projects (Maxwell 1962, pp. 652-655). 

In the countries with more laissez-faire phi- 
losophies and democratic governments the struggle 
to define the public and private areas of responsi- 
bility still continues. In its essence it is an underly- 
ing struggle between individualism and collectivism. 
The individualistic view holds that voluntary 
charitable agencies must be administered and fi- 
nanced by volunteers because they are one of the 
important bulwarks against complete state control. 
Moreover, it is felt that in a period of rapid social 
change, governments cannot act rapidly enough 
when new problems arise. So there is still the need 
for voluntary philanthropists who can experiment 
on a small scale even in controversial fields, with- 
out causing too much general disruption, Many 
voluntary agencies have gradually been forced to 
relinquish their control to the state when the 
financial burden has become too much for them 
to bear, or when a problem has become so wide- 
spread that it cannot be handled by private dona- 
tions of time or money. 


North American philanthropy 

The expansion of philanthropy and its new forms 
of organization have gone further on the i 
American continent, particularly in the ae 
States, than in any other part of the world. Ind ia 
voluntary giving has grown to such an oe F 
the past seventy years that some people spe 
a “philanthropic revolution.” 

Nolh ‘Aiterioan attitudes to private phianta 
have been very deeply influenced by the tra hy 
of the frontier, with its individualistic ae 
and suspicion of government control. Private aan 
tive is still thought of as the key to wi the 
progress and success, for it has led to most xperi- 
inventions, medical discoveries, educational ® the 
ments, and basic research which have enabl zed 
United States to become the leading indus aniza- 
nation. It has also led to the patterns of an 
tion necessary to run a very complex soci? ra e 
attitude, as well as the businessman’s desire 


tain control over philanthropic activity, has deeply 
influenced American business and philanthropic 
efforts. Indeed, many businessmen feel that corpo- 
rations should give freely to philanthropy, for in 
this way they will retain control over areas of life 
which are still free of government control. 

This philosophy, combined with the rich re- 
sources of the American continent, produced ac- 
companying attitudes that poverty and destitution 
are mainly the result of a lack of initiative and 
apathy and that therefore receiving public relief 
is a shameful thing. As in other countries, however, 
the twentieth century showed that the social dis- 
locations of rapid industrialization and urbaniza- 
tion were too vast and complex to be relieved by 
private contributions, even in a rich country. Up 
to the 1930s, health, welfare, research, and, to 
some extent, higher education were believed to be 
mainly the responsibility of private philanthropy. 
But the great depression of the 1930s forced Amer- 
icans to accept an enlargement of governmental 
functions. And so the state began to take over the 
financing and direction of one voluntary agency 
after another, assisted by such movements as the 
New Deal. This movement, however, did not di- 
minish the actual donations to private philanthropy. 
For they have continued to increase in size, largely 
because of the affluence of the postwar era and the 
Opening up of new fields for experimentation en- 
gendered by the rapidly changing social and eco- 
nomic conditions, 

Corporate donation. Part of the growth of 
donations in America is due to the increase in 
number and size of business enterprises. For as 
Corporations have grown in size, there has been an 
increasing demand on the part of the public that 
they take a leading role in financing the social 
services of the community. Corporations soon rec- 
°gnized that philanthropic activity could become a 
very important part of their public relations, for 
it is an excellent medium through which they can 
demonstrate their interest and participation in 
community welfare. Today it is one of the most 
fruitful advertising media for all corporations, big 
and small. The recognition of the part corporations 
could play in financing voluntary agencies was 
asst introduced at the end of the nineteenth cen- 
ury by the Y.M.C.A., an organization that had 
cen imported into America in 1851 from England. 
HERG A, had set a new pattern for raising 
ioe ponve drives for money over a short 
o an time, the use of sophisticated techniques 
dn $ money, and an emphasis on corporation 
this pale. Other voluntary agencies soon copied 

ern, and it is still the typical large North 
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American fund-raising practice. Soon definite 
quotas were set for the contributions of most large 
corporations. World War 1 was instrumental in 
spreading this new philanthropic pattern because 
it multiplied local problems and brought about 
many appeals for the relief of sufferers in Europe. 
At the same time, the patriotic wartime spirit, the 
large wartime earnings, and the legal right to de- 
duct charitable contributions from the taxable in- 
come of individuals and corporations all combined 
to establish habits of giving on the part of citizens 
from all income groups. 

Another trend in philanthropic activity that was 
established during the war was that of coordina- 
tion of effort. There was a growing realization that 
while much of the voluntary work was overlapping, 
many of the needs were being overlooked. This 
coordination took the form of Community Chests, 
which combined a number of charities under one 
appeal, and United Appeals. By 1929 it was esti- 
mated that there were 331 Community Chests in 
America. These chests had raised $73 million in 
that one year alone. Again, corporations played a 
large part in financing this expansion, 

In 1936 a revenue act was passed which al- 
lowed corporations an exemption of up to 5 per 
cent of their net income for charitable contribu- 
tions. As the result of this tax and the unprece- 
dented prosperity of the postwar years, corporation 
giving rose sharply after World War u. This is 
seen in figures which show that whereas corpo- 
ration support for 13 Community Chests (for which 
figures are available) in 1920 was 23.8 per cent 
of the total amount collected, in 1950 it was 39.5 
per cent for 69 Community Chests reporting re- 
turns (Andrews 1952, p. 158). The total philan- 
thropic donations of corporations has also risen 
steeply. The Bureau of Internal Revenue estimates 
that whereas in 1936 corporations donated $30 
million to charitable purposes, in 1951 the figure 
was over $300 million. Although this figure is 
large, a breakdown in terms of total giving to phi- 
lanthropy and type of giver shows that corporation 
contributions are only 5 per cent of the total an- 
nual receipts. The rest is made up of donations 
from foundations (3 per cent), individual gifts 
(74 per cent), and other sources (18 per cent) 
(Andrews 1952, p. 19). 

The donations do not come only from the corpo- 
ration budget but also directly from employees. 
This source was tapped when the growing number 
of campaigns caused strong competition between 
fund raisers during World War 1 and forced them 
to move further down the economic ladder to 
achieve their objectives. Employees were a well- 
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organized group from which to solicit. This move- 
ment was encouraged in so many corporations that 
by 1950 a large number permitted payroll deduc- 
tions for the major national and city-wide cam- 
paigns in North America (Committee . . . 1953, 
p. 19); 

The rise in the contributions of corporations has 
enabled them to take over the control of raising 
and allotting money to many philanthropic agen- 
cies, Management can determine the amount that 
will be given, and since the same men are often 
found on the boards of a number of the larger 
corporations, they have come to form an “inner 
circle” which can control both the gifts that are 
given and the selection of the top executives in the 
philanthropic agencies and in the more important 
and prestigious fund-raising campaigns, such as 
those for hospitals and universities. Once the mod- 
ern fund-raising pattern had been established, it 
did not take the business world long to realize that 
donating manpower to campaign activity would be 
as good an advertising device as donating money. 
Moreover, this was a means by which the organiz- 
ing and executive ability of promising young men 
could be tested. In fact, taking part in philan- 
thropic activity, and particularly fund-raising cam- 
paigns, has become an essential part of the success- 
ful businessman's career. His participation brings 
him into contact with men who are influential in 
business, gives him training, and enables him to 
demonstrate his ability publicly in the highly com- 
petitive field of raising money (Ross 1954). 


Social functions of philanthropy 


The type of charity has varied from country to 
country and from one historical period to another, 
but philanthropy has always been the reflection of 
a class society because it has depended on a di- 
vision between rich givers and poor recipients. Even 
when the poor have themselves been the givers, 
they have always made up the largest proportion of 
recipients. The wealthy have not only given because 
they have more but because, by alleviating distress, 
they have secured their own positions against those 
who might displace them and thus have avoided 
revolt, At the same time, giving has been a means 
of enhancing their status in the eyes of their fellow 
men. W. K. Jordan believes that many of our 
present attitudes toward philanthropy have come 
down from the early Elizabethan days, in which 
the maladjustments were so intense that they 
simply could not be ignored by anyone who was 
in any way sensitive to suffering (Jordan 1959, 
P. 146). The Protestant Reformation in England 


was partly a product of these conditions and partly 
in its turn the means of reinforcing the conception 
of the responsibility of the top classes for the “have. 
nots,” even when the structure of the classes 
changed and the industrialists began to replace the 
aristocrats. 

The association of financial responsibility with 
upper-class position continued to be as strong in 
the eighteenth and nineteenth centuries as it had 
been in the seventeenth. The chief charitable of- 
fices in` Western Protestant countries have tradi- 
tionally been the prerogative of the top social 
classes. The well-known term “Lady Bountiful” sug- 
gests the part that upper-class women played in 
philanthropic activity in the nineteenth century. 
At that time the aristocratic classes in England 
had leisure and were just becoming conscious of 
the necessity of justifying their position because 
of the growing political strength of the lower 
classes. They also retained from the past the sense 
of noblesse oblige, a feeling of responsibility for 
persons of inferior economic and social status. 
These factors led them to this voluntary work, al- 
though the agencies they established were at first 
so scattered that they either did not cover all needs 
or else suffered from overlapping. 

The idea that the top social classes should bear 
responsibility for the lower continued in the 
twentieth century, which ushered in new types of 
problems and necessitated an expansion of phil- 
anthropic activity. Today upper- and middle-class 
men and women hold many executive positions on 
boards of charitable agencies and fund-raising 
campaigns in all countries that engage in private 
philanthropy. In fact, philanthropy is now such an 
important part of the expected activity of vo 
as well as men that it is very difficult for them 

in § as f the important 
escape working in at least some oi 
fund-raising campaigns. 

For oe NSE tund raising has become such 
a time-consuming activity that they often ee 
it as a career. Training for it begins at home, W i 
children are subtly initiated into their paris in 
anthropic roles. This training may be me oo 
such associations as the Junior League, ana 
social club for young North American ading 
where actual practical training is given for 
roles in voluntary agencies. -o E 

One activity that supports the relationship 


is that 0 

tween philanthropy and the upper classes e 
the charity ball, which has become an 1 
in most 14 


adjunct to the life of the social elite i a 
North American cities. These balls are sre often 
by committees of society women an 


spectacular affairs. They combine the display of 
wealth and position with the worthy purpose of 
helping a good cause. 

Many studies have shown this dual relationship 
between philanthropic activity and social position. 
Warner's study of Yankee City (Warner & Lunt 
1941) was perhaps the first comprehensive study 
to demonstrate in detail the relationship between 
social position, wealth, and philanthropy. Although 
the philanthropic pattern has moved further down 
the social scale with the tremendous increase in 
formal organizations, upper-class men and women 
still hold far more of the most prestigious com- 
munity positions in the important voluntary agen- 
cies than do those of the lower classes. 


Modern trends 


Professionalization. Modern philanthropy serves 
many and varied functions for the givers, the col- 
lectors, and the business world, as well as for the 
recipients, The more obvious functions that phi- 
lanthropy has played through the centuries, such as 
relieving the lot of the poor and needy, is easy to 
see, but the more subtle aspects of its effects have 
seldom been analyzed. Nor has the way in which 
it has become integrated into many aspects of the 
social structure been fully realized. 

During this process a core of professionally 
trained administrators and fund raisers has arisen 
to handle the complex problems that the extension 
of philanthropy has brought about. And a number 
of professional organizations have grown up with 
the specific purpose of directing the larger fund- 
taising campaigns and training volunteer canvass- 
ers. These people were at first looked on as “pro- 
fessional beggars,” but by now the importance of 
their role is recognized and they have an accepted 
Professional status. 

The organization and coordination of philan- 
thropy have eliminated much of the early spon- 
taneity of giving. They have also brought about a 
more rational assessment of people’s ability to give 
and the introduction of scientific methods of sur- 
veying community and national needs and of rais- 
ing money. Thus philanthropy has entered the field 
of planning, This trend has meant that the sense 
of personal involvement with a problem which in 
the past led to many worthwhile reforms has given 
ee to impersonal donating to a charitable budget. 

Owever, emotional appeals are still effective, and 
certain charities like the Salvation Army and the 
March of Dimes draw money easily. 
cehe ideology of philanthropy. Parallel to the 

ntralization of many former charities under the 
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jurisdiction of the state there has arisen the cen- 
tralization of private giving under the control of 
businessmen. Business institutions have thus suc- 
ceeded the religious institutions on which early 
charity depended for inspiration and control. This 
change has brought about a change in the ideology 
of philanthropy. Implicit in the religious ideology 
of giving was the idea that the giver, as well as the 
recipient, would receive some benefit from the gift, 
either in this world or the next. This is illustrated 
by such well-accepted quotations from the Bible 
as “Cast thy bread upon the waters, for thou shalt 
find it after many days.” This implication of future 
reward is still present in the modern ideology of 
giving, but the individual reward is no longer 
thought of as coming in the next world. Rather, it 
is expected in this one, either in the emotional form 
of personal satisfaction received from sacrifice and 
doing one’s duty or as a more tangible reward in 
the form of direct or indirect benefit from better 
medical services, better homes, playgrounds, and 
the like. 

Attitudes of donors. The attitude of the donors, 
too, has tended to change. Much of the spontaneity 
and emotional satisfaction that were derived from 
personal giving to a need that the donor could ob- 
serve at first hand has disappeared, and people 
tend to give more from a rational appraisal of the 
situation or from strong, inescapable pressures ex- 
erted by friends and community leaders. These 
pressures, combined with the multiplicity of de- 
mands that most citizens face each year, have 
made many people apathetic to even the most dis- 
tressing situations, resentful of canvassers, and re- 
sistant to their appeals. Collectors thus face more 
and more problems in raising money each year. 

Up to the middle of the nineteenth century there 
was an almost complete disregard on the part of 
the public of the donation’s effect on the recipient. 
The recognition that one of the most important 
rights of the citizen is that of public assistance has 
changed this attitude and has helped to eliminate 
much of the humiliation and shame that taking 
charity had engendered in earlier times, Further- 
more, state control has meant a broadening of the 
base of givers. Not only does a larger proportion of 
the population give regularly and continuously 
through taxation but also few people, even in the 
lowest income brackets, escape the pressures that 
come through the payroll deductions for charitable 
gifts that are incorporated in the rules of many 
large corporations, or the systematic canvassing of 
people of moderate and low incomes. Even children 
do not escape, for they can be induced as captive 
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audiences in schools to give to as many charitable 
agencies as the school permits. This is rationalized 
in terms of training future adult subscribers. 


The basic problem of philanthropy has changed 
from that of caring for the physical needs of a 
relatively few destitute people living in simple so- 
cieties to attempting to meet the physical, social, 
and psychological needs of total populations living 
in highly complex societies. The emphasis is now 
being placed on securing a “better,” “happier,” or 
“healthier” world for all; and the focus has shifted 
from the relief of immediate want to long-term 
planning that will prevent future want. This trend 
toward prevention, whose development has been 
due to the growth of scientific knowledge, has elimi- 
nated the need for charity in some areas. It has 
also led to a change in the motivation to give; the 
emotional identification with someone else's prob- 
lems has been largely replaced by a rational ap- 
praisal of present and future needs. 

AILEEN D. Ross 


[See also COOPERATION; FOUNDATIONS; POVERTY; TECH- 
NICAL ASSISTANCE; VOLUNTARY ASSOCIATIONS, article 
On SOCIOLOGICAL ASPECTS; WELFARE STATE; other 
relevant material may be found under PLANNING, 
SOCIAL.] 
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PHOBIAS 


A phobia is an irrational, persistent fear of a 
particular object or place, class of objects or places, 
etc. Used as a noun or combining form, phobia 
denotes fear or dread, and in medicine and psychi- 
atry it is joined either with names of objects, sit- 
uations, etc., or with Greek-derived forms of such 
names; thus, street phobia, or agoraphobia (from 
the Greek agora, or “market place”). As a symptom 
complex the phobia also includes any avoidances 
and inhibitions employed to forestall impending 
attacks of anxiety. Thus, an agoraphobic person 
may be unable to leave his home without incurring 
anxiety and may therefore remain in it constantly. 
A person with rat phobia may stop his ears when 
rats are mentioned in conversation. The phobic 
person may elaborate and form compromises to 
keep the dreaded object, or the idea of it or allu- 
sion to it, at a distance (e.g., by restricting his 
activities, putting general or special taboos on cer- 
tain localities, etc.). 

Whether a state of mind is to be considered ra- 
tional or irrational eventually depends on common- 
sense judgment. This judgment is usually that of 
the phobic person himself, who knows that the 
feared object is harmless or in any case should not 
provoke such intense, unmasterable anxiety. Often 
ashamed or aware of the absurdity, the phobic 
patient may nevertheless try to rationalize that 
there is a possibility of “danger.” Accessory circum- 
stances often determine whether a given fear is 
rational or not, but judging the degree of ration- 
ality is still ultimately a matter of common sense. 
í In early childhood the matter of rationality is 
irrelevant, The fears of small children are rarely 
rational according to adult standards. Children’s 
Msecurity and fear of strangers are neither rational 
nor irrational. Animal phobias of small children 
resemble in many ways the fears of certain sav- 
ages, with whom they share a “totemistic” attitude, 
the differentiation between man and other animals 
not being well established. Gradual development of 
the Sense of reality through experience and educa- 
tion leads to such differentiation. 

: History and semantics. Modern interest and 
ideas about the phobias began with an article by 
Westphal, “Die Agoraphobie” (1871). The term 
“goraphobia was invented in this case to denote a 
ESA fear of open places and is usually extended 

ean a fear of streets or the outdoors generally. 
Sra words used were Platzangst and 
P t, which retain the idea of an open 
evn © one of Westphal's three patients, a 

n square in Berlin looked as if it were miles 
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wide. The patient complained that any attempt to 
cross the square or to walk on certain streets 
brought on a severe attack of anxiety accompanied 
by trembling, head sensations, palpitation, etc. The 
presence of a companion prevented or assuaged 
the anxiety and enabled the patient to skirt the 
square with somewhat lessened fear. He did not 
fear traffic: indeed, he felt no anxiety if he could 
stay near a carriage while crossing. To see houses 
with windows was also reassuring, but to walk 
along windowless walls, like those of certain bar- 
racks, would bring on an anxiety attack, and he 
could not go into the Tiergarten, where there were 
no houses. Absence of human beings or signs of 
life, an empty street or park, was more dangerous 
than traffic. 

Phobeio, the Greek word from which phobia is 
derived, means “I fear,” but in earliest times it 
meant both “I am afraid” and “I am put to flight”; 
for Homeric Greeks, emotion and tendency to 
action were “close together” (Onians 1951, p. 19). 
Greek words can be readily used in compounds, 
and phobo- and -phobia were used as combining 
forms. Hence, -phobia easily became one of a large 
class of suffixes (including -mania, -philia, -pathy, 
etc.) that were adopted by medical and psycho- 
logical writers. Thus we have hydrophobia, fear of 
water, at one time also called phobodipsia, fear of 
drinking; photophobia, fear and avoidance of light, 
a medical but not necessarily a psychiatric term; 
xenophobia, fear and avoidance of strangers or 
foreigners. An obsolete term of this sort is pano- 
phobia, the night terror of infants. (See Sauvages 
1763.) 

Name inventing has periodically flourished and 
waned in science and medicine. In the late eight- 
eenth and early nineteenth centuries, partly in 
emulation of Linnaeus, imaginative nomenclature 
was in vogue. Psychiatrists participated heartily in 
wordbuilding, particularly after Esquirol’s adven- 
tures with the suffix -mania (1838). They made 
such wide use of the suffix -phobia that Benjamin 
Rush (1825) jocularly suggested the name rum 
phobia for what he opined must be “a very rare 
distemper.” He also facetiously suggested doctor 
phobia and church phobia (cited in Menninger 
et al. 1963). Such near-slang usage emphasizes 
the avoidance aspect and ignores any anxiety. It 
has been resuscitated in current psychiatric writ- 
ings in such terms as school phobia, job phobia, 
and the like, where the anxiety may not be of the 
conscious phobic type at all. 

The modern popularity of the word and suffix 
began with Westphal’s coining of agoraphobia. The 
new term was widely accepted, and many analo- 
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gous terms were invented (see Janet & Raymond 
1903). For several decades after Westphal many 
types of phobias were described and named. Some 
names are still in use, some are obsolete, and 
others are met with only occasionally. Claustro- 
phobia still refers to the morbid fear of entering 
or staying in closed spaces, and ereuthophobia is 
a word generally understood to mean a fear of 
blushing. Aelurophobia and gephyrophobia are 
rather rare and are ordinarily superseded by cat 
phobia and bridge phobia, respectively. Zoophobia, 
or animal phobia, is inclusive, applying to dogs, 
cats, birds, etc. Besides the newly described pho- 
bias, others were retrieved from earlier literature 
and given new names, By 1914, G. Stanley Hall 
could list 136 words ending in -phobia; many of 
these were never subjects of clinical description 
but were apparently purely philological virtuoso 
inventions. The history of this period and pertinent 
references are to be found in Krafft-Ebing (1895), 
Janet (see in Janet & Raymond 1903), and Loew- 
enfeld (1904). 

The study of the phobias and other neuroses 
went well beyond philology. Obsessions and com- 
pulsions were described by Westphal (1877), and 
many studies dealt with nervousness, neurasthenia, 
and anxiety states other than phobias, such as 
hysteria and hypochondria. This work culminated 
in a general conception of the neuroses, or psycho- 
neuroses, a group of conditions with all sorts of 
nonpsychotic psychological symptoms not attrib- 
utable to recognizable pathological changes in the 
body. [See OBsEssIVE~COMPULSIVE DISORDERS. | 

Classification. The general tendency was to 
group phobias along with obsessions and compul- 
sions under one heading, on the grounds that they 
were “compulsive emotions” (Zwangsaffekte ). Even 
today German textbooks usually put quotation 
marks around the word phobia or refer to die 
sogenannten Phobien—the so-called phobias (Bum- 
ke 1924; Bleuler 1916). French writers, follow- 
ing Janet (see in Janet & Raymond 1903), group 
phobias with other anxiety states, compulsions, 
and obsessions under the general rubric psychas- 
thenia. Such classification has some Point, at least 
superficially, for many adults troubled by obses- 
sions and compulsions also may have one or more 
phobias. However, there are certain cases, adult 
and infantile, in which there are phobias but no 
obsessional symptoms. Moreover, Stekel ( 1908), 
Psychoanalyzing phobic patients of the “pure” type, 
found that they resembled the conversion hysterics 
he had treated, differing only in that where the 
phobic patient suffered an anxiety attack, the clas- 
sical hysterical patient would manifest a conversion 


symptom. For this nearly pure type of case Freud 
suggested to Stekel the designation anxiety hysteria 
and justified this term in a paper on a childhood 
phobia (1909). As psychoanalytic conceptions of 
anxiety developed, and particularly as analysis of 
phobias in small children progressed, it became evi- 
dent that phobias might or might not be found in 
obsessional persons and that they were to be found 
in patients of all types. Freud's final theory of 
anxiety (1926), which interpreted phobias in a 
novel and original way, lessened the interest in 
classification. 

Although in a general way earlier authors rec- 
ognized psychological influences as precipitating 
and predispositional factors, they did not distin- 
guish the phobia from other neurotic disorders in 
regard to etiology. From the mid-nineteenth cen- 
tury through the beginning of the twentieth, all 
neuroses were supposed to arise from constitu- 
tional weakness or deficiency (“degeneration” in 
some authors’ terminology) and essentially from a 
subtle hereditary or inherent defect of the nervous 
system. Except for Freud and those who accepted 
his views phobias were, in short, stigmata of the 
degénérés supérieurs or of psychasthenia. 


Freud and psychoanalysis 

Freud's early papers on anxiety and the neu- 
roses contained incidental discussion of certain 
phobias (1894; 1895a; 1895b), but he first clearly 
expressed his thoughts on this matter in the case 
history of “little Hans” (1909; see also pe 
1962). There he recognized a class of phobias 0 
psychic origin, resembling in many ways the Bese 
phobia previously described; he gave them ps 
name anxiety hysteria. Phobias were conceived Be 
as illnesses, nor yet precisely as signs of an illnes d 
but as a set of findings: anxiety plus certain pi 
cial avoidances and inhibitions. In such canen % 
time goes on, overt anxiety decreases part os 
as restrictive measures increase. An anxiety $ 
becomes more and more of a phobia. ada 

Infantile zoophobias. Freud characteris aa 
looked to the phobias of children for a oe 
that would furnish a clue, uncomplicated by ro 
elaborations, to their essential meaning, yea in 
he had found simple, direct wish sede A 
children’s dreams that threw light on a ee 
cated and distorted wish-fulfilling dreams oa ad 
The childhood zoophobias did indeed Pa toht 
atively simple picture of the elements n genic 
tion of phobias and ultimately of Lge waite 
symptoms in general. The two cases aie Hans” 
and most cited are the horse phobia of ia yoong 
(Freud 1909) and the wolf phobia of 


Russian referred to as “the wolf-man” (1918). In 
later discussions of anxiety, anxiety hysteria, and 
symptom formation Freud constantly used the ma- 
terial of these cases to illustrate and develop his 
theories (see 1915a; 1915b; 1926). 

In The Problem of Anxiety (1926) Freud ex- 
plained Hans’s phobia as originating in oedipal 
wishes. Hans was in love with his mother and was 
jealous of his father, toward whom he harbored 
hostile wishes; these wishes came into conflict with 
his love for his father. The aggression led to a fear 
of castration, the possibility of which he believed 
to be a reality. Repressing his love for his mother 
and his hostility toward his father from conscious- 
ness, he displaced his aggression and consequent 
fear from his father to horses, which he feared 
would bite off his penis. Thus, to form his phobic 
symptom, after his repression of the original im- 
pulses, Hans used three defense mechanisms: 
displacement, reversal, for actually his feelings 
were dangerous to his father rather than the other 
way round, and regression from the genital idea of 
castration to the oral idea of being bitten. 

The wolf-man’s phobia arose from a different 
component of the Oedipus complex, from an in- 
verted oedipal wish. Having observed parental 
coitus, he identified himself with his apparently 
castrated mother and feared castration if he per- 
mitted himself feminine longings for his father. 
His fear of castration was displaced to the picture 
of a wolf in a storybook. The fear that came to 
consciousness when he ran the risk of seeing the 
picture was of being devoured—an oral equivalent 
of the desired passive genital satisfaction. Mem- 
ories recovered during his analysis established the 
connection between the wolf and the idea of cas- 
tration, Avoidance here took the convenient form 
of not looking at the wolf's picture. 

From these and other cases of childhood zoo- 
Phobia Freud asserted that the motive for repres- 
sion was castration anxiety and that the repressed 
wishes were the erotic or aggressive components 
of the Oedipus complex, whether normal or in- 
verted. Repression was then followed by an appro- 
Priate displacement, determined by individual ex- 
Perience. Other infantile phobias are based on 
Other types of anxiety, particularly on a fear of 
losing love, which serve as motives for repression. 

Because of a child’s limited experience, the 
choice of animals is not large. The Angsttier 
LERA animal”) is apt to be a horse, cow, dog, 

» bird, chicken, beetle, or wasp. 
gectemiem and zoophobia. The study of the 
oe showed that a child could displace his 

ngs about his father to an animal and that this 
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displacement reflected ambivalence. Hans not only 
feared horses but admired them, and at one point 
in his therapy he began to jump around like a 
horse and even bit his father (Freud 1909). His 
ambivalent attitude and his identification with the 
animal gave Freud one argument for interpreting 
the totem animals of primitive peoples as substi- 
tutes for the father, and he spoke of the “return of 
totemism in childhood” (1913, chapter 4). 

Mechanisms in the phobia. The defense mech- 
anism of displacement, by which the repressed 
wishes return to consciousness in disguise, is basic 
for all phobias, including those of adults. Avoid- 
ance behavior or restriction of function follows 
upon this. In adult phobias, too, the wishes are 
aggressive or erotic—largely the latter; they are the 
revived wishes of childhood, usually in new garb. 
Freud compared the content of the phobia to the 
manifest facade of the dream (1916-1917), im- 
plying that many defense mechanisms may alter 
the original content and give a conscious picture 
different from the unconscious latent fantasy. Thus 
regression modified the wolf-man’s fantasy of being 
castrated into a conscious fear of being devoured. 
This further altered the picture, in which the father 
had already been changed to a wolf by displace- 
ment. The displacement mechanism is fundamen- 
tal, since it is the first and invariable means used 
by the ego in distorting material that comes back 
from repression. Other defense mechanisms, such 
as regression, reversal, and identification, are ac- 
cessory or secondary in altering the content of the 
manifest phobic picture. Additional defense mech- 
anisms are described by Sigmund Freud (1926) 
and some relevant to children by Anna Freud 
(1936). In the more complicated adult phobias 
several instinctual wishes may gain expression at 
the same time through the same content, as a 
dream may fulfill several wishes at the same time 
through what is called overdetermination (see 
Deutsch 1929; Weiss 1935; Fenichel 1944; Lewin 
1952). 

From the study of the simple phobias Freud con- 
structed a paradigm for the formation of neurotic 
symptoms in general: threatened eruption of an 
instinct from repression (or from the id); anxiety 
or threatened anxiety as the signal of such impend- 
ing danger; subsequent alterations in the ego by 
means of defense mechanisms that produce symp- 
toms, each type of which (conversion, phobia, 
obsession, etc.) appears because of the predomi- 
nance of one (or a few) defense mechanisms. 
Thus, the meaning of the word symptom is com- 
pletely altered. No longer an indication of some 
underlying illness, it is considered a “structure” 
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composed of alterations in ego functioning due to 
the mutual interaction of id, ego, and superego. 


Some individual phobias 


Like the dream, the phobia may show much 
variation in regard to its manifest façade. However, 
as certain frequent types of dreams have a nearly 
standard interpretation, so certain phobias, studied 
through psychoanalysis, show a relative constancy 
in unconscious meaning. For example, many well- 
studied cases of agoraphobia show that the feared 
streets are unconsciously conceived as places suit- 
able for sexual adventures (see Deutsch 1929; 
Katan 1937; Miller 1953; Weiss 1935). The com- 
panion chosen as protector by some agoraphobes 
represents a person to whom they are uncon- 
sciously hostile and whom they therefore put in 
the same jeopardy as themselves, Fear of crossing 
bridges is a feminine fear of childbirth. A fear of 
falling, on the street or elsewhere, is also feminine, 
one of its meanings referring to the figurative 
sexual fall. Claustrophobic anxiety is connected 
with unconscious ideas of being buried alive and 
with prenatal life (Lewin 1935). Generally speak- 
ing, these unconscious fantasies are either of 
temptation or punishment. Fear of being alone 
often indicates a temptation to masturbate under 
such circumstances. Fears of insanity may include 
this temptation, as well as a fear of symbolic cas- 
tration (head equals penis), Fears of dirt refer to 
sexuality thus conceived. Animal fears have been 
discussed above. An interesting phobia is the fear 
of spiders, representing the devouring and danger- 
ous female genitalia. Butterflies may represent 
beautiful and tempting women; beetles and crawl- 
ing insects may represent genitalia of small chil- 
dren. This list could be extended greatly; but many 
other interpretations depend on the individual pre- 
phobic history, which determines the relative im- 
portance of the fantasy and the suitability of the 
object chosen for its representation. 


BERTRAM D. LEWIN 


[For a conception of phobias that contrasts with the one 
in this article, see MENTAL DISORDERS, TREATMENT 
OF, article on BEHAVIOR THERAPY; see also ANXIETY 
and Emotion, Other relevant material may be found 
in DEFENSE MECHANISMS; HYSTERIA; MENTAL DIS- 
ORDERS, article on CHILDHOOD MENTAL DISORDERS; 
and in the biography of FREvD.] 
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PHYSICAL ANTHROPOLOGY 


The article under this heading provides a broad 
overview of the field of physical anthropology. 
Specific areas within the field are reviewed in 
EVOLUTION, articles on PRIMATE EVOLUTION and 
HUMAN EVOLUTION; GENETICS, article on RACE AND 
GENETICS; RACE; SOCIAL BEHAVIOR, ANIMAL, article 
on PRIMATE BEHAVIOR. Parallel areas within the 
field of psychology are reviewed in EUGENICS; INDI- 
VIDUAL DIFFERENCES; PSYCHOLOGY, article on CON- 
STITUTIONAL PSYCHOLOGY. Also relevant are the 
biographies of Boas; Broca; Darwin; Hooton; 
HRDLIČKA; KrETSCHMER; KROEBER; PEARSON; 
ROHEIM; WEIDENREICH; WISSLER. 


Physical anthropology, linked by tradition and by 
academic usage with the behavioral science of 
anthropology, is nevertheless a discipline that is 
basically biological. The long-established connec- 
tion between two such apparently disparate sub- 
jects might suggest a common origin obscured by 
increasing specialization and ultimate subdivision. 
This, however, is not the case. Physical anthropol- 
ogy had an origin quite independent of anthropol- 
ogy, or to put it another way, it had roots going 
back to developments that had little or nothing to 
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do with the beginnings of anthropology and its 
cultural preoccupation. 

Origins of physical anthropology. One of the 
classic themes of physical anthropology—the races 
of mankind—emerged from the zoological and tax- 
onomic studies of the seventeenth and eighteenth 
centuries. As early as 1684, Bernier published an 
article in the Journal des scavans on this subject; 
this article was one of the first attempts, if not the 
first, to do for the varieties of mankind what the 
zoologists and botanists had for some time been 
seeking to achieve for the bewildering and exciting 
array of newly discovered flora and fauna that the 
age of discovery had made known to European 
students. 

Following Bernier’s pioneer attempt at a classifi- 
cation of man, Linné in 1735 proposed another 
based on taxonomic principles developed from the 
experience of several generations of botanists and 
zoologists attempting to find some kind of order in 
the enormous variety of nature, By including man 
in the world of living organisms susceptible to a 
common systematic treatment, he did much to ad- 
vance the notion that it was proper to consider 
man as a biological organism and an object of 
scientific study. 

By the end of the eighteenth century, the ana- 
tomical and morphological variation of man as a 
field of serious investigation was well established 
by the contributions of anatomists, physicians, and 
representatives of other cognate fields. By the turn 
of the century the contributions of two anatomists, 
Blumenbach and Camper, stood out as particularly 
formative in this area, which had not yet developed 
into what we would now call a discipline. Both 
were notable for seeking new methods of analyzing 
racial differentiation and Camper in particular for 
introducing quantitative techniques in the search— 
particularly the measurement of the facial angle. 

The predominance of anatomy in the formative 
Stages of physical anthropology is also reflected in 
the contributions made by the studies on man and 
the primates that were initiated by Tyson’s Orang- 
outang, sive homo sylvestris (1699). The very na- 
ture of the inquiry into the racial differentiation of 
mankind required a comparative methodology, the 
basis of which was supplied by Tyson. The com- 
parative method has continued to be one of the 
distinctive features of physical anthropology. 

The intimate relationship between anatomy and 
physical anthropology was maintained during the 
nineteenth century, Some of the outstanding physi- 
cal anthropologists continued to be men trained in 
anatomy or medicine. The first names that come 
to mind are Retzius, Broca, and Virchow, but there 
are many others. This tradition has continued down 


to the present day in Europe, where medical train- 
ing remains a standard preparation for research in 
physical anthropology. And, indeed, this tradition 
is so strong that the association of physical anthro- 
pology with cultural and social anthropology is far 
less intimate there than it is in the United States 
and in England. 

Nineteenth-century developments. Despite its 
early connections with anatomy and other biolog- 
ical subjects, physical anthropology eventually 
came under the aegis of anthropology. One of the 
reasons for this association lies in the fact that 
among the dominant problems to which the emer- 
gent anthropology of the late eighteenth and early 
nineteenth centuries addressed itself were the cul- 
tural evolution of mankind and the relationships 
of various groups within such a frame. Every body 
of data that appeared to contribute to an eventual 
solution was appropriated. Thus, anthropologists 
not only used data on cultural phenomena but also 
drew evidence from racial and anatomical studies, 
as well as from linguistics. Later, when archeology 
became productive of appropriate information it 
was drawn into the association. Since at this stage 
of its development physical anthropology had 
neither a disciplinary identity nor much of an 
organizational life of its own, it was readily ab- 
sorbed, at least in the tradition followed in the 
United States. Indeed, its very name suggests this. 

The subject matter of physical anthropology re- 
mained largely confined to the races of may and 
whatever comparative anatomical studies He 
pertinent until the era inaugurated in 1859 i 
Darwin’s publication on the origin of species. ri 
terest in the fossil record of man and its signif 
cance for the unraveling of human evolution 
became evident almost at once. Human fossils a ; 
as the Gibraltar and Neanderthal skulls (both 3 
covered just before Darwin’s publication and v 
tually ignored at the time for the lack of an e 
lectual readiness to see their importance) edi 
on a new meaning and eventually beea gical 
nized as appropriate for physical on 
techniques which were concerned with ee ie 
human anatomy, Thus, although many of fe 
cialists in the area of human evolution were sdl 
from older and well-established acade a 
plines such as anatomy, medicine, and Pi da 
ogy, their special interest in the fossil ath 
man was generally regarded as falling be ed t0 
category of physical anthropology and s 
widen and enrich the subject. spe- 

Race, comparative human anatomy CoA 
cial emphasis on osteology), and human PET 
continued to constitute the major FE present 
physical anthropology until well into 


century. This does not mean that expansive tend- 
encies into related areas and problems were absent. 
But for the most part these tentative explorations 
were abortive either because the intellectual en- 
vironment was not favorable or for lack of a fruit- 
ful methodology. Broca, for example, examined in 
an interesting, but ultimately sterile, way the prob- 
lem of hybridism in man. Without the benefit of 
modern genetics, this inquiry lacked the founda- 
tions for a solid development and now has only 
historical interest. 

Methodology. During the nineteenth century, 
physical anthropology established its classic tech- 
niques for measuring the human form; they re- 
mained the primary instruments for its research 
until well into the twentieth century. Data were 
obtained by a series of standardized observations 
and measurements. The observations, mainly on 
qualitative characteristics or those not readily open 
to measurement, were evaluated by reference to 
graded scales and charts meant to cover the range 
of variation. The direct measurements were taken 
from fixed landmarks wherever these could be es- 
tablished or from areas that could be described 
with some degree of precision. As variations of 
technique and measurement proliferated and inter- 
fered with comparability of results, it became nec- 
essary to establish some uniformity of practice. 
Several international congresses dealt with this 
oie the last one being held at Monaco, in 

The techniques along these lines that were de- 
Signed for osteological material are collectively 
known as osteometry. Those adapted especially for 
living subjects are grouped under anthropometry. 

Physical anthropology was one of the first of the 
biological disciplines to make extensive use of 
mathematics, particularly statistics. The problem 
of dealing with populations, the necessity of using 
4 sampling process, and ultimately the need to 
devise methods of handling large bodies of data 
derived from an array of individuals required 
mathematical evaluation. The suitability of the 
data of physical anthropology for statistical treat- 
ment was especially recognized by Karl Pearson. 
Almost from the beginning the pages of Biometrika, 
a journal he established in 1900, were opened to 
Papers that exploited this potentiality, and he him- 
Self contributed notably to this development (see 
Pearson 1920; Wilks 1941), 


Twentieth-century trends 


ga the present century the field of physical an- 
m Topology has grown rapidly to become something 

oser to a biology of man than it was before. This 
Srowth has come about through the development 
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of traditional inquiries and the influence of dis- 
coveries in other disciplines. 

Postnatal growth, for example, has become a 
subject closely linked with physical anthropology, 
in part because it is a phenomenon that contributes 
to the understanding of human variation, in part 
because the technique for studying growth was de- 
rived originally from the anthropometric measures 
devised by physical anthropologists, and perhaps 
to some extent because it had not been already 
firmly attached to some established discipline. 
More recently, physical anthropologists have ex- 
plored the patterns of over-all growth and have in- 
vestigated a wide variety of growth problems re- 
lating to specific areas of the body, such as the 
dentition and face. They have established norms. 
They have examined the factors that affect growth 
and have sought to discover the dynamics in the 
growth process that throw light on various anom- 
alies and abnormalities. 

Discoveries in other areas of science, such as 
genetics, have also had profound effects upon the 
recent developments in physical anthropology. In 
the study of isohemagglutination, the observation 
of the Hirszfelds in 1916 that various populations 
exhibited distinct and different frequencies of the 
ABO blood groups led shortly afterwards to a very 
active interest of physical anthropologists in ex- 
ploring this approach as a means of clarifying ra- 
cial differentiation (see Hirszfeld & Hirszfeld 1919). 
Field studies in great number were undertaken to 
determine world-wide variations in these reactions 
and to seek some basis for their use in identify- 
ing racial differences. As other systems of blood 
groups, such as M-N and Rh, and various hema- 
tological characters were discovered, they were 
added to what has now become an impressive array 
of hematological variations arising from genetic 
mutations. And although blood typing and hema- 
tology have developed into highly specialized areas 
with their own problems and goals, they continue 
to be employed as tools by physical anthropologists, 
not only in studying racial problems but also in 
analyzing the dynamics of population genetics. A 
considerable portion of the physical anthropologi- 
cal literature is still devoted to these applications. 

It is perhaps worth noting that the earlier ex- 
pectation that blood groups would provide a genetic 
and quantitative basis for racial classification re- 
placing older methods has not yet been borne out. 
The use of blood groups alone has thus far proven 
to be less than adequate for this purpose, although 
they retain much value as adjunct criteria. 

Perhaps even more significant in the recent his- 
tory of this discipline has been the general impact 
of genetics upon it. Beginning as early as 1905, 
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when Farabee published a study on the inheritance 
of brachydactyly in man—the first on human he- 
redity based on Mendelian genetics—physical an- 
thropologists have applied genetic principles to 
their problems and have gained, thereby, consid- 
erable insight into processes that would otherwise 
have been inaccessible. Similarly, Eugen Fischer, 
a decade after the effective establishment of ge- 
netics as a branch of biology, applied this new 
method to the study of racial hybridism in man 
with his study of the Rehobother bastards (1913). 
This was followed by Shapiro's field study (1929) 
on the English-Tahitian descendants of the muti- 
neers of the Bounty, Rodenwaldt's investigation of 
the hybrids on Kisar (1927), and Davenport and 
Steggerda’s work on the Negro-white mixtures on 
Jamaica (1929). At present, population genetics 
is the aspect of genetics that is being applied most 
fruitfully to many areas traditionally cultivated by 
physical anthropology. As in other interdisciplinary 
contacts, the resulting interactions have created a 
flow of geneticists into physical anthropology and 
of physical anthropologists into genetics, 

Population genetics has been especially valuable 
in providing insights and leads in the study of the 
racial differentiation of mankind. The dynamic 
interaction between mutation and environmental 
selection and the processes of adaptation that re- 
sult from this are seen as basic to human racial 
differentiation, just as they are to animal differen- 
tiation. The effects of isolation and genetic drift 
have become as valid for human groups as they 
are for Drosophila. Even some of the older con- 
cepts of race have been swept away. One can no 
longer entertain the view, once widely held, that 
the races of man were originally uniform and that 
the present heterogeneity of human populations 
is entirely the product of miscegenation. Accord- 
ing to the conclusions drawn from studies based 
on population genetics, it is more likely that some 
degree of polymorphism has always been charac- 
teristic of human races. 

Illustrative of the tendency of physical anthro- 
pologists to make use of other specialties that 
promise to throw light on traditional problems is 
their use of physiological approaches to race. Al- 
though not much has as yet been done along this 
line, some work shows promise. In particular, the 
adaptative responses of man to various environ- 
ments is an area of much significance but one 
which is as yet little known, Enough, however, has 
been achieved to indicate that this represents an 
area fruitful of insights into the biological nature 
of man. 


Another interesting example of the interaction 


between physical anthropology and other disci- 
plines—in this case medicine and psychiatry—is 
the field of studies of human constitution. The 
notion that personality types are recognizable from 
physical stigmata has existed since Greek antiquity, 
From this ancient concept came the medieval ter- 
minology of “humors” of the body, from which the 
words “phlegmatic” and “sanguine” are derived, 
About a century ago, this rather naive and certainly 
primitive idea began to be re-examined in the effort 
to find, if possible, a more convincing demonstra- 
tion of a linkage between body build and variations 
in behavior, disease, and temperament. The pi- 
oneering work of Mac Auliffe in France and of 
Pende in Italy culminated in the system proposed 
by the psychiatrist Kretschmer (1921) in Germany 
and was later extended into clinical and diagnostic 
medicine by Draper in the United States (see 
Draper et al. 1944). This system envisaged a three- 
fold division of body types—asthenic, athletic, and 
pyknic—with associated and distinctive types of 
temperament and susceptibility to specific diseases. 
It was based upon the assumption that body varia- 
tions in build were correlated with psychological 
and physiological patterns of behavior. Although 
much skepticism existed then, as now, on the va- 
lidity of this thesis, it has continued to engage the 
attention of a number of researchers. Sheldon, in 
particular, has evolved a far more sophisticated 
and useful classification of body build based on 
three major variables, which he has termed ecto- 
morphy, mesomorphy, and endomorphy (1940). 
According to this system each individual can be 
rated for each component, whereas in earlier 
schemes most individuals could scarcely be aa 
into the extreme types. This is the method preferre 
by most investigators currently working in this 
field. sunset 
One of the more recent developments in p iE 
cal anthropology reflects a growing appret a 
the role that culture and social organizer 
played in human evolution. One form of ae ae 
is the study of primate behavior. It is no one 
after a period of development during whic HE 
cal anthropology had turned increasingly to a ae 
biological concerns, in turning to the sut fis 
behavior of primates it is, in a sense, renew! 
links with anthropology itself. 

The pioneer work in this are: and 
the investigations of Carpenter (1934; 19 car- 
Zuckerman (1932; 1933)—has receno ddae 
ried forward vigorously by a number o! penaior 
who have not only experimented with the re 
of primates in captivity but have made He Much 
and detailed observations of wild anima’s- 


a—beginning with 


of this work has been inspired both by the insights 
into human behavior that such studies may be 
able to furnish and by the belief that social and 
behavioral characteristics have played a greater role 
in human evolution than has been previously ap- 
preciated or, at least, identified. 

This development, which certainly in part arises 
from physical anthropology’s continuing interest 
in human evolution, has, I believe, been stimu- 
lated specifically by the great advances made in 
the last generation in recovering the fossil record 
of man’s emergence. The important discoveries 
made in China in the late 1920s and 1930s by 
Black (1930), Weidenreich (1935), and Pei (1934); 
the enrichment of the record through the work of 
von Koenigswald in Java in the 1930s (1950); and 
the opening of totally new geographic areas and 
systemic groups in south Africa by Dart and Broom 
in the 1920s and 1930s mark the beginning of a 
new era in the study of human fossils (see Dart 
1925; Broom & Schepers 1946). The south Afri- 
can fossils (the Australopithecines), in particular, 
forced a revolutionary re-examination of the older 
concepts, leading to their rigorous modification to 
fit the new facts, and in general provided a stim- 
ulus to the whole field, which has remained in 
active ferment ever since. More recently, Leakey’s 
discoveries relating to early hominids in the vi- 
cinity of Olduvai in east Africa have served to sus- 
tain this interest (1951; 1959). 

From these and other fossils and from the as- 
sistance now available from related fields, it is 
possible to place the appearance of manlike pri- 
mates at the beginning of the Pleistocene, perhaps 
as long as 24% million years ago. Contrary to the 
older conception of a hominid evolving both in 
brain and in his adaptation to erect posture pari 
passu, it is now clearly evident that it was the latter 
that came first and, in fact, triggered the former. 
The anatomical evidence that early hominids had 
Bone a long way toward an accommodation to erect 
Posture before much expansion had occurred in the 
brain is overwhelming. To many students it seems 
likely that erect posture, by freeing the hands, with 
their innate capacity for grasping and manipulat- 
ing objects, opened the way for the development 
of technology. And this, in turn, as it became part 
of man’s environment, exerted a selective pressure 
that was reflected in an expansion of the brain. 
ao an interplay, with its continuation of a se- 
; lon for intelligence, was reinforced by social 
evelopments and particularly by the perfection 
of language with its abstract symbolism. 


Harry L. SHAPIRO 
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Arthur Cecil Pigou (1877-1959), professor of 
political economy at Cambridge University from 
1908 to 1943, is today best known for his éontribu. 
tions to the theory of economic welfare. He suc- 
ceeded Alfred Marshall as professor at the age of 
31 (a remarkable fact in the mature society of Ed- 
wardian Cambridge), to the delight of Marshall but 
to the chagrin of the older generation, and particu- 
larly Foxwell, who had believed their claims supe- 
rior. He remained a professor at Cambridge until his 
retirement and indeed continued to live in Kings 
College, of which he had been a fellow from 1902, 
until his death in 1959. 

Pigou was born in 1877, the eldest son of Clar- 
ence Pigou and his wife, Nora, at Beachlands, 
Ryde, in the Isle of Wight, the family home of his 
mother, His father was a retired army officer, de- 
scended from a Huguenot line which had long had 
connections with India and China, first as traders, 
in later generations as civil servants. His mother’s 
family came from a line which had earlier won 
distinction and wealth in Irish administration. With 
such a family background it was rather a matter 
of course that Pigou should be sent, like his father, 
to Harrow, but it was his own very considerable 
abilities that won him an entrance scholarship. At 
Harrow he was a good enough athlete to win ap- 
proval in an age in which athletics carried more 
honor than did scholarship and serious enough 
scholar to win many of the prizes open to Harro- 
vians. He ended his schooldays as the first boy on 
the modern side to be head of the school. i 

He went to Cambridge as a history ee 
King’s College and spent his first two years jes 
history as a pupil of that remarkable oddity, “ae 
Browning. But to his Cambridge wee a 
Pigou was best known as a leading figure aah 
Union Debating Society, where from his first is 
he made his mark and was outstanding ae 
orator, in a generation of brilliant speakers. Pigou 
not until his third year at Cambridge that e the 
began to study economics, and only becaus it 38 
Cambridge syllabus of that time require Pigou's 
part of the moral sciences tripos. Thus, ai 
introduction to economics came first po a 
history and second by way of philosophy an College 
Indeed, until he became a fellow of Kin 5 carcelY 
and began to teach economics, Pigou coul 5 e ha 
be regarded as a specialist in that subject: 


submitted, in his first attempt to obtain a fellow- 
ship at King’s, a thesis entitled Robert Browning 
as a Religious Teacher, which later became his first 
book, published in 1901. He had won the chan- 
cellor’s medal for English verse, as well as the 
Adam Smith prize for an essay on the principles 
and methods of industrial peace. 

He began to lecture on economics in 1901, be- 
fore his election to the King’s fellowship (at a sec- 
ond submission) and was made Girdler’s lecturer 
in the summer of 1904. He was lecturing in 1903, 
surprisingly, on the history of labor in the nine- 
teenth century. But in 1901/1902 he had already 
begun to give to the second-year students the course 
on advanced economics that was the basis of the 
education of hundreds of Cambridge economists 
over the next thirty years. 

In that course Pigou acted as expositor of the 
theoretical ideas developed by Marshall. Inevitably, 
as presented by Pigou, those ideas took a form of 
their own. But it was primarily through Pigou that 
the Marshallian tradition was handed down and 
became the Cambridge school of economics. While 
Marshall’s lectures had become less and less sys- 
tematic as the years went on, Pigou (educated in 
the more systematic habits that the school of moral 
sciences had inherited from Sidgwick, whose lec- 
tures on ethics and political science he had attend- 
ed) gave to the exposition of Marshall’s theories a 
clarity and architecture that they had lacked in 
Marshall’s personal teaching. To the end, Pigou 
remained a devoted and almost uncritical pupil of 
Marshall's, indeed an almost idolatrous worshiper. 
It was Pigou, more than any other, who brought 
up a generation of Cambridge economists in the 
conviction that (in his often-repeated words) “it's 
all in Marshall” and the belief that if they were in 
error, it was because they had misunderstood 
Marshall or had overlooked some essential passage 
in the holy writ. It was not until the coming of 
Piero Sraffa in the middle 1920s and of such 
younger lecturers as Joan Robinson and John Hicks 
in the 1930s that Cambridge escaped from an un- 
critical acceptance of the Marshallian orthodoxy; 
but then, and even much later, through Robertson, 
Guillebaud, and others of us, Cambridge retained 
lts essentially Marshallian tradition. 

In these years it was as a lecturer that Pigou 
was at his best, The Cambridge economists of the 
1920s owed to Keynes their sense of the impor- 
tance of the economic issues and their sense of 
urgency to achieve essential reforms; but it was 
Primarily to Pigou (and also, in later years, to 

pms Robertson and others) that they owed their 
training in the disciplines of economic reasoning. 
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The same qualities that one finds in Pigou’s books 
were paramount in his lectures. Clarity of analysis, 
division and subdivision where it was appropriate, 
a willingness to follow an argument through to the 
end and to refine it as it needed to be refined, all 
these were the characteristics of his own presenta- 
tion and of what he demanded in others, Curiously, 
this eloquence and clarity in teaching did not ordi- 
narily carry over, as it did so abundantly in the 
case of Keynes, to an enjoyment of personal argu- 
ment and discussion with his pupils. We admired 
Pigou; after a lecture we would sometimes shyly 
ask him a question, and he would answer, either 
jocularly or even more shyly. But most of us as 
undergraduates hardly knew him outside a lecture 
room, while we knew Keynes quite intimately, 
through his Political Economy Club. 

Pigou’s first book on economics, Principles and 
Methods of Industrial Peace, an expansion of his 
Adam Smith prize essay, appeared in 1905. It is 
interesting to go back to that book, not only for its 
matter but also for its method, which remained 
characteristic of Pigou throughout his life and 
which deeply impressed his pupils. He applied to 
economic material the method of the philosopher, 
clarifying the issues, dissecting them and analyzing 
them, trying to see how different assumptions re- 
garding the material might modify conclusions— 
the analytical method applied with great precision 
to an essentially qualitative argument. He used 
statistics but rarely, and only to establish or to in- 
dicate the order of magnitude of some relevant 
fact. This is very far from the form of argument 
of the modern, statistically minded economist. 
While Pigou, as his working life continued, gradu- 
ally acquired the habits of mathematical economics, 
he was not by instinct and training a mathematical 
economist. His thinking remained primarily that 
of a philosopher trained in analysis. His first book 
eschews mathematics completely. 

Welfare economics. It was in 1912 that Pigou 
published the book which, in retrospect, is the most 
important of his works—Wealth and Welfare, as 
it was called in its first edition. It was expanded 
vastly over the next thirty years and, as The Eco- 
nomics of Welfare (1920a), played a major part 
in the education of the Cambridge economists of 
the 1920s and 1930s, and indeed of economists 
generally. Those who came to it when it had ac- 
quired middle-aged embonpoint would do well to 
restudy it in its slimmer, more youthful, and more 
rapidly moving form. Its characteristics are those 
of the 1905 book—beautiful clarity of reasoning 
and thought; the method of qualitative analysis 
rather than of statistical argument; the sparing 
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introduction of mathematical argument, and only 
where necessary to a precise formulation. Above 
all the early version possessed a beauty of archi- 
tectural design and construction that was lost as 
new extensions were added over the years. 

Pigou’s ultimate reputation will, one suspects, 
depend on the judgment by future generations of 
the work in the field of economic welfare that was 
principally enshrined in that book. He started from 
two existing ideas, to be found, in the Cambridge 
tradition, in the work of Marshall and of Sidgwick. 
Marshall had discussed (as had Bastiat and others 
before him) the concept of maximum satisfaction 
and the conditions under which an economy would 
achieve it. Sidgwick, in dealing at a less rigorous 
level with the same problem, had made use of the 
idea of divergence between utility to the individual 
and utility to society, Pigou’s book was more ambi- 
tious. He set out to examine, first, the full condi- 
tions of maximum satisfaction; second, the condi- 
tions under which private and social product (as 
he preferred to call them) might be different, so 
that the maximum would not be achieved under a 
system of private enterprise; and third, the meas- 
ures which might be taken to bring the two into 
equality. In one important respect Pigou’s work 
was fundamentally different from that of his suc- 
cessors in the field of welfare theory. He was pri- 
marily concerned (as his first chapter shows ) with 
“fruit” rather than “light”; with writing a theory 
of welfare that was applicable in practice, 

Pigou’s definition of maximum economic welfare 
made it dependent on the average volume, the dis- 
tribution, and the variability of the national divi- 
dend. Any course, he argued, which without com- 
pulsion increased efficiency and the volume of the 
dividend or which increased the Proportion of the 
dividend which was received by poor persons or 
which (without diminishing its volume or injuring 
its distribution) diminished the variability of the 
dividend would increase economic welfare. 

The greater part of Wealth and Welfare as orig- 
inally written, and an ever-increasing part of it as 
it grew through the various editions of Economics 
of Welfare, was devoted, in the first place, to 
the differences between social and private margi- 
nal net products arising from imperfect divisibil- 
ity, to elements of monopoly and similar factors, 
and to the methods of controlling monopoly and 
otherwise correcting these divergencies; second, it 
considered methods of remunerating labor and the 
practicability and desirability of interference to 
raise the wages of poorer workers. Wealth and 
Welfare treated rather scantily the whole question 
of taxation as it involves transfers from the rela- 


tively rich to the relatively poor, which later edi- 
tions of Economics of Welfare treated more fully 
and systematically. 

Much of Pigou’s work, as far as it relates to mo- 
nopoly, to pricing policies of public activities, and 
to wage bargaining, for example, has been so 
thoroughly absorbed into the present corpus of 
economic doctrine that it has long ceased to be 
identified consciously with his name, Antipathy to 
his conclusions has been much keener and much 
more continued with regard to his argument that 
measures to equalize incomes would increase eco- 
nomic welfare. Pigou himself in 1912 was a liberal 
—neither an extreme radical nor a socialist, He 
was almost certainly thinking in terms of minor, 
marginal changes within the framework of a lib- 
eral society. By definition he excluded redistribu- 
tions which would so affect incentives as to reduce 
the national dividend. But his conclusions were 
inherently so radical that they evoked criticism and 
resistance among economists who would not have 
felt equally critical of his more familiar and ac- 
ceptable arguments where they related to monopoly 
or public utility pricing. Once these criticisms had 
been voiced, other economists felt bound, from 
a purely academic standpoint, to examine their 
validity. 

In later editions Pigou slightly modified his 
actual presentation of the argument for increased 
welfare with less inequality of income, but the 
essentials remained the same. He began by assum- 
ing that individuals are of similar temperament and 
anticipating the potential criticism that at nai | 
rich and poor are temperamentally different, argu 
that there is evidence that with greater income 
equality and with time any differences of tempera- 
ment and taste would disappear. He satisfied him- 
self, by appeal to evidence based on social i 
that the assumption he proposed was a plausi 
one. But in a strict sense, he did not prove it. peony 

The subsequent discussions of welfare t a 
have started with the problem of the nee of 
measurability of utility and that of the pee 
interpersonal comparisons and have inevita a 
to the difficulties of measuring the national eee 
in welfare terms and defining the circums He 
in which it can be said to have increased. ae 
ular, ingenious attempts have been made A 
tify the cases in which, without any inte ni 
comparisons and thus without ambiguity, A 
can be said to have been increased. But eve 
the addition of the compensation pinak 
discussions have tended to leave untouche case 0 
number of real cases in which (as in the be sup- 
redistributional taxation) the poorer may 


posed to benefit at the expense of the richer who 
will suffer some loss; even the compensation prin- 
ciple does not provide a complete and satisfactory 
solution. Pigou himself took no active part in these 
discussions, but he was persuaded in 1951 (when 
he was in his 74th year) to contribute to the 
American Economic Review a short article which 
shows his final thinking. 

In this final summing up of his thinking, Pigou 
argued, first, that (in Bertrand Russell’s words) 
satisfactions are not in principle incomparable, even 
if they are not directly measurable. One can thus 
attach meaning to marginal changes. But he con- 
ceded that one cannot hope to establish absolute 
magnitudes of total utility or to be able to answer 
the question (which in his Study in Public Finance 
[1928] he had attempted to answer) whether a tax 
proportioned to total income will inflict equal sac- 
rifice upon everyone, whatever the size of his in- 
come. Second, he turned to interpersonal com- 
parisons and argued that “the utilities enjoyed by 
different people are not in their nature incompara- 
ble. . . . The question whether they are comparable 
in fact is a more difficult one” (1951, pp. 291-292). 
His final stand so well states not only his own 
attitude and that of Robertson (who in this, as in 
many other respects, remained his pupil and ad- 
herent) but also the attitude of many working 
€conomists, other than the specialists in this field, 
that it deserves quotation in full: 


Now, if we take random groups of people of the same 
race and brought up in the same country, we find 
that in many features that are comparable by objective 
tests they are on the average pretty much alike; and, 
indeed, for fundamental characters we need not limit 
ourselves to people of the same race and country. On 
this basis we are entitled, I submit, to infer by analogy 
that they are probably pretty much alike in other 
Tespects also. In all practical affairs we act on that 
supposition, We cannot prove that it is true. But we 
do not need to do so. Nobody can prove that anybody 
besides himself exists, but, nevertheless, everybody is 
en Sure of it. We do not, in short, and there is no 
in, Son why we should, start from a tabula rasa, bind- 
B ourselves to hold every opinion which the natural 
Tee to be guilty until it is proved innocent. 
not urden is the other way. To deny this is to wreck, 
tatus see Welfare Economics, but the whole appa- 
Shad _ Practical thought. On the basis of analogy. 
sons eaten and intercourse, inter-personal compari- 
unles: sty as I think, properly be made; and, moreover, 
trary, we have a special reason to believe the con- 
yield ‘a. given amount of stuff may be presumed to 
etwee} similar amount of satisfaction, not indeed as 
iephesen any one man and any other, but as between 
as th ntative members of groups of individuals, such 
© citizens of Birmingham and the citizens of 
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Leeds. This is all that we need to allow this branch of 
Welfare Economics to function. Of course, in working 
it out, positive conclusions can only be reached sub- 
ject to very important qualifications. (ibid., pp. 292- 
293) 


Pigou, that is to say, remained to the end a 
believer in the validity of the welfare economics 
that he had done so much to create. He admitted 
that certain conclusions rested on assumptions 
that could not be proved, but he believed that these 
assumptions accorded best with the available evi- 
dence. He believed that the economics of policy 
making was a serious academic study and not 
merely a politically loaded exercise. In some re- 
pects Pigou, even if imprecise, at least was on the 
side of the angels: If he believed, rather than 
proved, that a transfer from richer to poorer was 
likely to increase welfare, it was, one suspects, 
partly because he was the sort of person he was. 

Public involvement and retreat. Throughout 
his life he was a passionate believer in justice; this 
was apparent in the way he handled all the prob- 
lems of running the faculty of economics at Cam- 
bridge. If you were working with him, you had to 
satisfy him that what you proposed was a com- 
pletely just solution of the problem in hand. To 
him it was just and proper, whatever could or could 
not be proved about capacity to absorb satisfac- 
tions, that the poor should be treated as if they 
were equal in value and capacity to the rich. If 
one could invent exceptions, the exceptions would 
seem, to anyone of his Victorian uprightness, a 
case of special pleading. He was always a protector 
and defender of the underdog. At moments we 
could visualize him as the severe but just head of 
the school at Harrow. 

Pigou’s early years, before he was made profes- 
sor, were the years of the great tariff disputes. 
Fresh from his triumphs in the Union debates, 
Pigou threw himself into the conflict and toured the 
country making free-trade speeches. The probable 
line of his argument can be deduced from political 
pamphlets, including The Riddle of the Tariff 
(1903), and from the much more sophisticated and 
academic Protective and Preferential Import Duties 
(1906). But to those who knew him only: later in 
life, what is interesting is that at this stage he was 
a speaker widely in demand and responsive to 
those demands, anxious to influence public opinion 
and policy. With Alfred Marshall he was a signa- 
tory in 1903 of the professors’ letter to The Times, 
which gained some notoriety in the tariff disputes 
of the day. In this phase of his life he was very far 
from being the recluse he became in later years. 

His powerful conscience created problems for 
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him when war came in 1914, as was the case with 
many of his Cambridge contemporaries. Although 
still young (he was then 36), he was not prepared 
to undertake military service that entailed an obli- 
gation to destroy human life. But he devoted all 
his university vacations to driving an ambulance 
for a Friends’ ambulance unit, and he insisted, no 
doubt at the instigation of the same conscience, 
on undertaking jobs of particular danger. Toward 
the end of the war he was persuaded by Clapham, 
in peacetime one of his colleagues at King’s, to 
accept a post in the Board of Trade. He achieved, 
however, no distinction as a civil servant. It could 
not be claimed for him that he ever possessed the 
practical administrative abilities that carried 
Keynes, Walter Layton, Hubert Henderson, and 
others of the Cambridge economists of that genera- 
tion rapidly into positions of authority and respon- 
sibility. He was never, as an economist, quick to 
see intuitively the order of magnitude and the po- 
tential dangers of economic forces, and he was 
never a person to whom colleagues turned instinc- 
tively for advice in the sphere of economic policy 
making. This limitation betrayed him when, first 
as a member of the Cunliffe Committee of 1918- 
1919 and later as a member of the Chamberlain 
Committee of 1924-1925 on currency and Bank of 
England note issues, he was one of those who 
recommended an early return to the gold standard 
at the pre-1914 exchange rate—a policy that was 
to be flagellated by Keynes in his Economic Conse- 
quences of Mr. Churchill (1925). From the early 
1920s onward, Pigou withdrew, save for brief ex- 
ceptions and an occasional letter to The Times, 
from taking part in national affairs and devoted 
himself more and more completely to Cambridge. 

Increasingly, over these years, Pigou retreated 
into the ordered life of a recluse. He lectured a 
couple of hours a week in a course which, excellent 
as it was, remained essentially unchanged for 
twenty years and cost him little effort. He worked 
incessantly and regularly at his books. He con- 
sumed all that was written in economics, seeking 
always realistic illustrations for quotation in his 
own work, and in addition he devoured every de- 
tective story available. He would march out on an 
afternoon walk to Coton or thereabouts. Within this 
private world were a few privileged friends. They 
were usually chosen because they shared his love 
of mountains, and they climbed with him at But- 
termere, where he had built a house, or in the 
Alps. No account of Pigou would be complete with- 
out mention of the mountains. These he loved, with 
these he illustrated problems in his lectures, and 
to these he hurried away when term ended. He was 


a keen and enthusiastic, but not a Supremely great, 
climber; he introduced to climbing many who, like 
Wilfrid Noyce, became greater climbers, Around 
Buttermere he invented and pioneered new rock 
climbs. In the Alps he achieved most of what is 
possible to the best. Into this private world few 
women were admitted: the exceptions in later life 
were the wives of his climbing friends and other 
women who knew him in that life. His clothes pro- 
gressively became more and more those of a her- 
mit, and one never knew in what sartorial disarray 
or in what superannuated castoffs he would next 
appear. 

It was through exposure while climbing that he 
acquired an illness affecting the heart, which from 
the beginning of the 1930s increasingly curtailed 
both his climbing and his other activities. This 
affliction greatly impaired his vigor and energy and 
left him, through the rest of his life, with inter- 
mittent phases of debility. It affected his lecturing: 
in the 1920s he had been vigorous and immensely 
stimulating; in the 1930s the liveliness and stimu- 
lus somehow slipped out of the lectures; and in his 
later years, although essentially unchanged, they 
had lost something of their power to grip an audi- 
ence. And in the same way, something of the vigor 
and domination departed from his writing, distin- 
guished as it still remained. 

Other writings. Over these years he had writ 
ten much. Unlike the present generation, he tended 
to write books rather than articles, although on 
occasion he contributed the latter also. In 1914 
appeared Unemployment, a small, popular book in 
which Pigou, following the thinking of that time, 
attributed unemployment principally to lack of I 
ibility of wage rates. In the 1920s there appeare 
two books reflecting his interest in war pees 
A Capital Levy and a Levy on War Wealth (192 A 
and The Political Economy of War (1921), boo 
which had great influence on current thought. e 
1923 a collection of his journal articles was P 
lished, entitled Essays in Applied Economics; ay 
volume was included an important ariani 
had first appeared in 1917 in the Quarterly H 
of Economics, “The Value of Money.” In a penoy 
which economists at Cambridge were E oe 
antly waiting for Marshall to publish his S cae 
tion of monetary theory (Money, es article 
merce appeared only in 1923), this 19 ce other 
had for long been the only printed sot ee 
than Marshall’s evidence before the Gold an 
Commission. 

Industrial Fluctuations (1927), 
lic Finance (1928), and The Theory oi 
ment (1933) continued Pigou’s stream 


A Study in Pub 
f Unemploy- 
of output. 


They represent the most distinguished work of the 
pre-Keynesian epoch and played a significant part 
in the rapid development of economics during that 
period. Inevitably these books, in retrospect, 
have become dated more than his work in other 
fields. But one wonders whether some of this work, 
with its emphasis on variations of harvests, on 
variations of the rates of invention and of con- 
sumer stock-building of newly invented durables, 
and on other nonmacroeconomic factors, may not 
be overdue for rediscovery and adaption to newer 
macroeconomic theories. 

In 1935 there followed The Economics of Sta- 
tionary States. This book, which appeared shortly 
after the publication of the ideas developed by 
Chamberlin and Joan Robinson and makes occa- 
sional use of concepts worked out by them, is in 
another sense very remote from their thinking, 
Pigou set himself the question of the way in which 
equilibrium is reached in a stationary state, defined 
rigorously as one in which the population, its qual- 
ity and age and sex distribution, its stock of equip- 
ment, technology, and tastes, are all fixed. He 
approached his results through a long series of 
simplifying assumptions, beginning with Robinson 
Crusoe economics and progressing only by stages 
into the problems of specialization and exchange 
anda monetary economy with markets, many com- 
modities, elements of monopoly, of transport cost, 
and the rest. He rejected the idea of a progression 
from perfect competition to monopoly through a 
chain of closer and less close substitutes. He as- 
sumed that “there can only be one single type of 
(homogeneous ) commodity or service. Commodi- 
ties that are good substitutes for one another are 
not, as is sometimes suggested, in ‘an imperfectly 
competitive market.’ My use does not allow this. 
In respect of any number of different commodities 
or services, however good substitutes they may be, 
there are at least that number of different markets” 
(1935, pp. 77-78). 

Pigou was, in fact, not directly concerned in this 
book with the same problems as Chamberlin and 
Joan Robinson, As “The Passage to Real Life,” its 
last chapter, makes clear, this work on stationary 
States presented itself to his mind as a path-clear- 
ing operation in a longer progress toward a more 
dynamic economics. But he conceived of the devel- 
opment of an economy, not as more or less steady, 
dictated by continuous technical progress and cap- 
ital accumulation, but rather as a set of steps— 
Stationary states that are disturbed by spasmodic 
changes in technology, in accumulation, or in other 
actors. A new set of environmental conditions 
Would imply a transition (which, he emphasized, 
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lay outside the scope of the book) toward a new 
stationary state. This work represents Pigou’s ana- 
lytical skill at its highest. But it made curiously 
little impression on contemporary economics, either 
in Cambridge, elsewhere in England, or in America, 
and it now seldom receives detailed study. A re- 
reading discloses some of the difficulties of relating 
Pigou’s thinking to that of others. He seldom ac- 
knowledged an idea; his acknowledgments are 
more numerous than a very unworthy index would 
suggest, but they are, as always, almost entirely 
acknowledgments of facts or terminology. That he 
had read and absorbed a great deal is evident. But, 
like Keynes, he was never a writer who began with 
a systematic reading of the literature of his prede- 
cessors and contemporaries. 

Throughout this period there were numerous re- 
visions of The Economics of Welfare; there were, 
in fact, four main editions and six additional re- 
printings between 1920, when Wealth and Welfare 
first acquired its new name, and 1952. These oc- 
cupied most of Pigou’s time. There were also a 
variety of smaller books of essays and lectures; 
as well as some elementary and pedagogic writings 
of lesser importance. These latter include Income 
(1946), The Veil of Money (1949),"and Essays in 
Economics (1952). For a period they had their 
place in the teaching of economics. 

Relationship with Keynes. Keynes's General 
Theory appeared in 1936. It affected Pigou doubly. 
First, Keynes had dared to attack Marshall, for 
whom Pigou had more than a pupil's reverence. 
The contemporary Cambridge belief that “it’s all in 
Marshall” was certainly Pigou’s belief. But it was 
enshrined in the more fundamental belief that if 
you could not find it in Marshall, it was because 
Marshall had had to write for the general reader 
and not because the idea would have been any- 
thing new to Marshall. That apart, Keynes had 
made his chief attack on the classical economics 
as embodied in Pigou’s Theory of Unemployment 
(1933). Thus, Pigou was on the defensive—prin- 
cipally, I think, because of Marshall; while he had 
a great sense of authority as professor, he was not 
a vain man, and as subsequent events showed, he 
was not unprepared to admit that he had been in 
the wrong. His defense took the form of a severe 
review of the General Theory in Economica. He 
condemned Keynes’s “patronage extended to his 
old master Marshall,” and his iconoclastic treat- 
ment of classical economists as a group. Thus, 
indignantly, he imagined Keynes making the fol- 
lowing attack: “Professor Pigou, in a book on Un- 
employment, ... has committed a variety of sins. 
Professor Pigou is a classical economist; therefore 
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the classical economists have committed these 
sins!” (Pigou 1936, pp. 115-116). In defense of 
Marshall, Pigou was prepared, if necessary, to 
sacrifice himself. 

Whether Keynes could have achieved a radical 
rethinking of economics without wounding, as he 
did, Pigou and Robertson, among others, is a ques- 
tion that will be eternally debated. I have always 
believed that this was the sad but necessary cost 
of the revolution of thought. With Robertson, the 
wounds went so deep that they never could be 
healed. With Pigou, time brought a healing. And 
indeed the wounds were never as severe: Keynes 
and Pigou were both fellows of King’s; they dined 
at the same high table; they immensely respected 
each other. And Keynes, anxious as he was to see 
his theories accepted, was almost equally anxious 
to see Pigou’s reputation preserved; he was very 
conscious that in attacking Pigou, he was attacking 
ideas which he himself had held only a few years 
earlier. When Pigou asked for something to be 
published in the Economic Journal, which at that 
time Keynes and I edited, Keynes would accept it 
without demur but would seek to persuade Pigou 
to agree to minor changes on points where he be- 
lieved that subsequent discussion might show 
Pigou to be wrong, When Keynes was ill for a 
period and I accepted from Pigou, and printed as 
Pigou wrote it, the article “Real and Money Wage 
Rates in Relation to Unemployment” (1937), 
Keynes was greatly distressed lest I should have 
done something that might discredit Pigou. It was 
an oddly tolerant and mutually admiring relation- 
ship, which never came near to reaching the 
misery of the cleft between Robertson and Keynes. 

Pigou continued his work on the problems of the 
General Theory, publishing in 1941 Employment 
and Equilibrium, which remains a vigorous defense 
of the classical position, with new refinements but 
with essentially the traditional approach. He em- 
phasized the multiplicity of the causes of unem- 
ployment and argued that it results from a compli- 
cated system of interrelated factors, He brought 
to the surface the real-balance effect that has since 
come to be associated with his name. This so- 
called Pigou effect identifies one of the possible 
ways in which rising prices may tend to reduce 
excessive demand and thus restore equilibrium: 
they decrease the net indebtedness of the public 
sector to the private sector, and this reduction in the 
real assets of the private sector may serve to lower 
aggregate demand. But Pigou’s thinking in this 
field continued to change and develop; he had, 
indeed, a remarkable fertility of mind until well 
into his seventies. And in 1949 Pigou, eight years 


after retirement, asked some of us who were 
then running the economics faculty at Cambridge 
whether he might have a couple of lectures in 
which to say certain things about the General 
Theory. In those two lectures, even more moving 
to hear than to read, he said, in effect, that he 
had come to feel, with the passage of time, that 
he had failed to appreciate some of the important 
things which Keynes had been trying to argue in 
the General Theory and that he had been over- 
severe in his Economica review. It was the noble 
act of a scrupulously honest man, who put truth 
beyond vanity and another's reputation beyond his 
own. 

Assessment. It is not easy to place Pigou in the 
theogony of economists. In the reactions of the 
1960s he has probably been more underrated than 
any other economist of first distinction. Outside 
the field of welfare economics, one does not in- 
stantaneously associate him with any of the major 
instruments of economic analysis that one uses 
daily. He is not, in that sense, to be ranked with 
Smith, Ricardo, Marshall, or Keynes, if one thinks 
only of the British school. But modern economics 
would have been very different without him. Mar- 
shall was almost certainly right in thinking that 
Pigou was the proper successor to create a new 
disciplined and professional school of analytical 
economics in Cambridge. As teacher and creator 
of a tradition within Marshall’s economics tripos, 
he set a pattern for a generation of Cambridge 
economists, and Pigou’s pupils, teaching a Pigou- 
vian version of Marshall’s economics, are widely 
distributed around the Anglo-Saxon world. But 
Pigou’s innate and notorious shyness, ever increas- 
ing as the years went by, cut him off from all in- 
formal discussion of economics, both with Cam: 
bridge colleagues and foreign visitors. Pigouvian 
economics came from the brain of Pigou and re 
his reading of published material; it was not i 
fined by the cut and thrust of personal ae 
Occasionally he would ask privileged pupils re 
friends (David Champernowne for ne 
help him with mathematical or other analysis a 
accept from them criticism and suggestions s 
to the ordinary run of his Cambridge par ma 
he could not have been more different from a 
and Robertson in these respects. Even w: ihe 
us who were privileged to stay with him 


the problems of ascents, the merits 
of the authors of detective stories, pans f 
ally, the flaws in a book on economics, to 
others the labor of reading it. ant 
Pe me: 
Pigou’s isolation from the world inevitably 


jsolation from foreign economists. He read con- 
scientiously, but he was not sufficiently intimate 
with any non-Marshallians to project himself into 
their minds and thinking; and he was too loyal an 
admirer of Marshall to want very much to do so. 
Pigou’s great qualities were those of the classical 
economists. He was, it might truly be said, the last 
of the classical school and the last of the great 
Victorians, who had the misfortune to survive to 
an age which had lost its reverence for them. He 
himself showed to the end unflagging interest in 
the work of the younger generation and no trace 
of embitterment at its development of economics 
along lines that he had not foreseen. 


AUSTIN ROBINSON 


[See also CONSUMPTION FUNCTION; Money, article on 
QUANTITY THEORY; WELFARE ECONOMICS; and the 
biographies of KEYNES, JOHN MAYNARD; MARSHALL; 
ROBERTSON; SIDGWICK.] 
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Modern psychiatry was born of the efforts of a 
few great pioneers. Among them, Philippe Pinel 
(1745-1826) stands out: his was the first major 
achievement, at once scientific and humanitarian, 
in the treatment of insanity. 

The oldest of seven children, Pinel was born to a 
pious Roman Catholic family in a small village in 
the Tarn in southern France. His grandfather and 
father were physicians; his mother, a very strong 
character, dominated his youth. Her death, when 
Pinel was only 15, moved him to take orders. He 
joined the Oratorians but soon left monastic life, 
becoming instead deeply imbued with the new 
philosophical outlook of Locke, Condillac, Voltaire, 
and Rousseau. 

Pinel was a brilliant student and a great admirer 
of the classics, of Hippocrates, and of Plutarch; he 
read Greek, Latin, and also English, from which he 
was to translate some of the works of his con- 
temporary, William Cullen. 

Initially Pinel studied in the Faculty of Letters 
at Montpellier, but he was above all interested in 
natural science and mathematics. Personal ac- 
quaintance with Paul Joseph Barthez, a renowned 
physician of the faculty at Montpellier, led him to 
study medicine. Studious, reserved, and simple in 
tastes, Pinel had few friends, but those he had 
were close ones, especially Jean Antoine Chaptal, 
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the future scientist. Pinel’s erudition made a variety 
of careers possible, but once he arrived at Paris he 
decided to devote himself to psychiatry. 

Pinel’s career is inseparable from its political 
context: The French Revolution made it a duty to 
protect all individuals, even the insane, against 
enslavement by others. Pinel first became profes- 
sor of medical physics, but he was next appointed 
physician of the hospital of Bicétre, the ancient 
grange aux gueux (“beggars’ farm”) at the gates 
of Paris, where “the insane . . . are treated as crimi- 
nals, by mercenaries, contrary to either reason or 
humanity.” Pinel denounced this state of affairs, 
asserting, “I am convinced that the only reason 
these insane people are so intractable is that they 
are deprived of air and liberty” (Semelaigne 1888, 
p. 35). At that time they were, in fact, kept 
chained. 

Thanks to his persistent efforts, reason tri- 
umphed over prejudice, and soon the insane were 
no longer regarded as possessed by demons, but 
as mentally ill. This was certainly Pinel’s greatest 
achievement. However, he refused to accept the ex- 
cesses of the revolutionary movement: churches 
were being robbed, and Pinel from within his hos- 
pital coolly condemned the raging iconoclasts and 
the powerful authorities who supported them. Dur- 
ing one uprising the mob was about to hang him; 
he was saved by the vigilance of a recently “un- 
chained” patient whom he had taken into his 
personal service. 

Beginning in 1795, Pinel held a position at the 
Salpétriére hospital in Paris. There too he had to 
start from the ground up, replacing the orderlies, 
refurbishing the sordid rooms, and eliminating the 
use of chains. His psychiatric researches led to two 
fundamental works, La nosographie philosophique 
(1798), awarded a prize by the Directory, and Le 
traité médico-philosophique sur Valiénation mentale 
(1801), his major work. He had a wide audience 
among medical students; although not eloquent 
and sometimes absent-minded, Pinel was greatly 
appreciated in scientific quarters, 

The fundamental innovations to be found in 
Pinel’s works are of several kinds. First, he insisted 
on the humanitarian treatment of patients. He had 
a humane, moral attitude toward the mentally ill, 
having long talks with them in which he “skillfully 
gave them a drive that they thought was due to 
their own initiative” (Semelaigne 1888, p. 118). 
He substituted understanding and gentleness for 
violence, strait jackets for chains, and insisted on 
the temporary nature of even this restraint. 

Second, he made administrative changes. He 


distributed patients in several pavilions according 
to their condition, thus foreshadowing the dy- 
namic organization of the psychiatric hospital of 
today. He also urged that asylums be governed by 
physicians. 

Finally, he made innovations in therapy. He 
tried to diversify the treatment of the “raving” pa- 
tients according to the etiology of their disease, He 
saw to it that they had games and music to amuse 
them. “Skill,” he said, “consists less in the use of 
remedies than in the deeply judicious art of using 
or not using them at the right times” (ibid., p, 102), 
He completely rejected unnecessary bleedings and 
sudden cold baths but made good use of warm, 
sleep-inducing baths. In addition to such physical 
measures, he had workshops set up for therapeutic 
purposes, paying the patients wages. He advocated 
having farms—to be worked and managed by the 
convalescent patients—attached to asylums. Far in 
advance of his time, he proposed the establishment 
of institutions for convalescents in transition be- 
tween the hospital and the outside world; this inno- 
vation was developed further by his students. 

The doctrine that Pinel had worked out was not 
universally accepted after his death, Although the 
basic elements of his approach were applied, they 
were applied incompletely and without the effort 
and commitment he had had. We do not know if 
this failure was due to postrevolutionary political 
reaction, to a lack of medical men capable of carry- 
ing on his work, or to the absence of a sufficiently 
elaborated theoretical structure. In any case, only 
today are the foundations laid by Pinel being prop- 


erly developed. 
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Henri Pirenne (1863-1935) was a Belgian his- 
torian. After studying under Godefroid Kurth and 
Paul Fredericq, he received his doctorate of letters 
at the University of Liége in 1883. In 1885 he was 
appointed to a teaching position at that university 
but the following year moved to the University of 
Ghent. There he taught until 1930 and between 
1919 and 1921 acted as rector of the university. 
He taught medieval history and from 1893 on held 
the first chair of economic history in Belgium. 

Several circumstances contributed to Pirenne’s 
interest in the social aspects of history. He had 
personal experience with the Walloon woolen in- 
dustry, since his father managed a factory in the 
town of Verviers, and he became aware of the 
acuteness of social distress in Belgian industry at 
the time that he completed his university studies. 
His interest in social and economic history was en- 
hanced by postgraduate studies in Germany with 
Gustav Schmoller and through his acquaintance 
with Karl Lamprecht; his contact with Arthur Giry 
in Paris in 1884-1885 strengthened his interest in 
urban history. 

Pirenne worked out a synthesis of social, eco- 
nomic, and political factors in the study of the 
history of his own country (1900-1932), especially 
of certain mass movements. Thus, he ascribed the 
Process of political unification of the areas on 
Opposite sides of the language frontier to their 
Similar economic and social structure. This does 
not mean that he was a historical materialist: 
important individuals play a significant part in his 
explanation of events, as does chance. 

Throughout his career, Pirenne studied the his- 
tory of towns, In his first important contribution 
to this field ( 1893-1895), he rejected previous 
explanations of the origin of towns. His own theory 
was that towns developed as the result of the re- 
vival of commerce in the tenth and eleventh cen- 
turies and that their original nuclei were settle- 
ments of merchants. When he later, in 1927, wrote 
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Les villes du moyen âge, characteristically sub- 
titled Essai @histoire économique et sociale, he 
remained loyal to his original theories but devel- 
oped certain aspects of the subject more fully. He 
expanded especially the discussion of Italian and 
Russian towns. 

In order to place in proper perspective the effect 
of the development of the cloth industry on the 
growth of Flemish towns, Pirenne decided to write 
a history of that industry. A preliminary step was 
the collection and publication of the available 
source materials, and this was accomplished by 
Pirenne in collaboration with Georges Espinas 
(1906-1924), Pirenne himself, however, was never 
to write the history of the industry; others did so, 
piecemeal, according to his guidelines. 

Pirenne pioneered in historical demography. His 
study of the population of Ypres in the fifteenth 
century set the pattern for such studies in Belgium 
(1903). Another of his important intellectual con- 
tributions was contained in his article “The Stages 
in the Social History of Capitalism” (1914), which 
aroused great interest in international historical 
circles at the beginning of this century; it pre- 
sented the theory that the evolution of capitalism 
did not follow a straight line but rather a cyclical 
pattern, in which each phase of development to- 
ward freedom and progress was followed by one 
characterized by regulation and conservatism, which 
was in turn succeeded by a phase in which a gen- 
eration of homines novi reintroduced progressive 
tendencies. 

Pirenne’s comprehensive interpretation of medi- 
eval social and economic history in his Economic 
and Social History of Medieval Europe (1933) was 
to influence economic historians for at least a 
quarter of a century after his death, Two aspects 
of this work deserve particular mention: the im- 
portance of the role ascribed to long-distance trade 
as the primary stimulus to economic evolution and 
the characterization of the end of the Middle Ages 
as a period of contraction. 

The scope of Pirenne’s influence may be inferred 
from his close connection with the two most im- 
portant Continental periodicals devoted to social 
and economic history, the Vierteljahrschrift für 
Sozial- und Wirtschaftsgeschichte and the Annales 
d'histoire économique et sociale: both these jour- 
nals published important articles by Pirenne in 
their earliest issues. 

Even after his death, Pirenne attracted consid- 
erable attention through his posthumously published 
book, Mohammed and Charlemagne (1935), in 
which he suggested that the crucial break between 
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antiquity and the Middle Ages did not occur in the 
fifth century but rather about the year 700, when 
the Muslims had conquered the southern shores 
of the Mediterranean. The theory was based chiefly 
on economic considerations and gave rise to heated 


controversy. = W. 
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Augustus Henry Lane Fox (1827-1900), the 
man who now holds a very high place in the his- 
tory of archeology and anthropology as General 
Pitt-Rivers, was born into an ordinary English and 
Scottish aristocratic family. His father was W. A. 
Lane Fox of Hope Hall in Yorkshire, and his 
mother, Lady Caroline, was a daughter of the 
eighteenth earl of Morton. He changed his name 
to Pitt-Rivers in 1880, when he unexpectedly in- 
herited the estates of his uncle George Pitt, the 
second baron Rivers. Owing to his own exceptional 
endowments, he and his descendants were to enter 
into a more intellectual aristocracy, for he him- 
self married a daughter of Lord Stanley of Alderley, 
and one of his daughters became the wife of the 
well-known archeologist John Lubbock, afterward 
Lord Avebury. 

He began his career in the conventional aristo- 
cratic tradition, going to Sandhurst Military Col- 
lege and afterward being commissioned in the 
Grenadier Guards. 

He was, in fact, a successful soldier, distin- 
guishing himself in the field in the Crimean War 
and retiring with the rank of lieutenant general. 
Quite early, however, his originality and gift for 
research made themselves felt and he was charged 
with tasks which involved investigating the history 
of small arms in the British Army and, subse- 
quently, their improvement. k 

His work in this field had important military 
results, but far more important was the influence 
it had on his own thinking. This was a time an 
biological evolution was on everyone's mind, an 
Pitt-Rivers’ study of muskets and rifles made hm 
realize that principles similar to those of biolo 
evolution could be applied to human —_ 
They, too, evolved in a sequence of minute a 
provements—or, occasionally, followed re 
lines of development which led to their disap- 
pearance. 

While he thought, he also bought—and al 
in. He had soon amassed large collections of sp E 
mens, some ancient, some the handiwork E 
viving primitive peoples (or savages, as he ee si 
called them}. His house in London. bec eat 
crowded with weapons, skulls, stone in ete 
pottery, and other works of primitive art pen te 
that he decided to found a public ao `The 
ranged according to his evolutionary syst of it in 
authorities established the greater part ers him- 
Bethnal Green in east London, and Pitt-Riv ae 
self wrote the illustrated catalogue in ma 1874). 
opening in 1874 (South Kensington Muse 


In a paper presented to the Anthropological In- 
stitute in that year, he pointed out that, while 
archeology had imposed classifications on prehis- 
toric artifacts, nothing of the kind had been at- 
tempted in ethnology and museums had hitherto 
been arranged by regions (see Evolution of Culture), 
His own collections were now exhibited to show 
the development of forms and their transmission 
from one people to another. 

Although he classified the objects of material 
culture, he insisted on the mental processes behind 
them, seeing each category as representing an idea 
—the idea of the spear thrower, of the bow—each 
original idea subsequently being modified by lesser 
ideas for improvements. He carried the biological 
analogy so far as to say that human ideas can be 
divided into genera, species, and varieties. 

The collections soon grew too large for the ac- 
commodation at Bethnal Green and were trans- 
ferred to Oxford, where the Pitt-Rivers Museum, 
still arranged on evolutionary lines, now plays an 
important part in the teaching of anthropology at 
the university. 

Pitt-Rivers had long been interested in ancient 
cultures, and when he inherited the vast (29,000 
acres) Rivers estates around Cranborne Chase in 
Dorset, the focus of his attention inevitably shifted 
from ethnology to prehistory. For the Chase lay in 
the heart of chalk country exceptionally rich in 
prehistoric remains, and he found himself the pos- 
Sessor of large numbers of monuments dating 
from the Stone Age down to Roman times. It is a 
Proof of his gifts and of his stamina that, begin- 
ning in a new field on retirement from military 
Service, when he was over fifty years old, he was 
able to make himself one of the world’s pioneers 
in scientific archeology. 

Almost at once he set to work on the excavation 
of burial mounds, earthworks, and prehistoric vil- 
lages. And the precision and thoroughness which 
he had developed in the army he applied to such 
effect that his excavating techniques were, from 
the first, as accurate as those demanded by modern 
Standards. In particular he saw that things still 
Incomprehensible to him would be understood by 
later workers and that it was therefore a duty to 
record everything he observed in meticulous de- 
tail. This he did, regardless of either time or cost, 
and as a result it has indeed proved possible for 
modern archeologists to utilize his findings. 

He was an innovator, too, in paying attention 
especially to the exploration of settlement sites and 
in excavating them completely. These laborious 
and expensive undertakings were in striking con- 
trast to the hasty opening of burial mounds which 
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was then still prevalent. They provided almost the 
first reliable evidence concerning the living con- 
ditions of early man in Britain. 

He not only recorded every type of find on plans 
and sectional drawings that were far ahead of 
their time but also had exact models carved in 
mahogany of all the principal excavated monu- 
ments. Soon he had again accumulated so much 
material that only a museum could house it. He 
had one built not far from his house at Rushmore, 
and it is still maintained, with its fine display of 
local antiquities, in the midst of the Dorset 
countryside. 

He had set altogether new standards in excava- 
tion, and he did no less in scientific publication. 
Between 1887 and 1898 he published his dis- 
coveries in four privately and sumptuously pro- 
duced volumes entitled Excavations in Cranborne 
Chase. The importance of Pitt-Rivers’ work re- 
ceived adequate public recognition even in his own 
day, although his reputation has certainly in- 
creased as archeology has caught up with this 
extraordinary pioneer. In fact, he became the first 
inspector of ancient monuments in Britain after 
the passing of the Ancient Monuments Protection 
Act in 1882. He was made a fellow of the Royal 
Society, vice-president of the Society of Antiquaries 
of London, and was a most energetic president of 
the Anthropological Institute. 

Although he had made himself a scientist, Gen- 
eral Pitt-Rivers remained very much a soldierly 
aristocrat. He caused his numerous assistants to 
wear his colors and on occasion to ride to excava- 
tions on penny-farthing bicycles, following humbly 
in the wake of his high dogcart. 

As a landowner he was, however, somewhat ec- 
centric. He planted his estate at Rushmore with 
exotic trees and shrubs and stocked it with wild 
animals, which are said sometimes to have terrified 
visitors and countryfolk, He also provided a brass 
band to play for his tenants on Sundays—and 
expected them to attend. 

He remained true to his rational and scientific 
principles to the end. He ordered his body to be 
cremated after his death and left his head to the 
museum of the Royal College of Surgeons. This 
museum was destroyed by a bomb during World 
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[For discussion of the subsequent development of Pitt- 
Rivers’ work, see ARCHEOLOGY.] 
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Planning is engaged in by all sorts of institu- 
tions; among them, business enterprises and gov- 
ernments. This article mainly deals with govern- 
ment planning. Present-day planning is rooted 
partly in socialism, partly in modern economic 
ideas as expressed in national accounts, Keynesian 
theories, and anticyclical policy generally. The 
latter expresses the desire to understand and influ- 
ence the economy—the national economy, to begin 
with—as a whole. This desire was strengthened 
by the great depression, when it was clear that 
something had gone wrong. At the same time, 
planning, based on market analysis and similar 
techniques, has been increasingly applied in big 
business enterprises. 

: Over-all government planning was not applied 
in western Europe before World War 1. Restricting 


ourselves to democratically ruled countries, we 
may say that the first examples of over-all plan- 
ning were found in the United Kingdom, where 
the Economic Section of the Cabinet Offices pre- 
pared estimates of the national product and some 
of its components as a basis for the organization 
of the war effort. 

Immediately after the war, when a number of 
democracies started to function again, the Nether- 
lands and Norway followed the British example— 
each in its own way and for the purpose of postwar 
reconstruction. France soon followed in 1946, In 
a sense, some national planning was imposed on 
all member countries of the Organization of Euro- 
pean Economic Cooperation (OEEC), created to 
administer the European Recovery, or Marshall, 
Plan, which started operations in 1948. Not long 
afterward Italy began an exercise in planning, 
known as the Vanoni Plan, for a balanced develop- 
ment of north and south. This plan was not given 
official status but served as an important back- 
ground to Italian policy. In Sweden and Denmark 
planning developed gradually out of business cycle 
policy: the annual reports, for instance, of the 
Swedish Konjunkturinstitut hesitantly progressed 
in presenting estimates of the next year’s inflation- 
ary or deflationary gaps, adding suggestions for 
counteracting them. Several years later, in 1959, 
Belgium established its Programming Bureau. 

The remaining countries did not have official 
agencies in charge of planning, but all OEEC 
members fully cooperated in the planning exercise 
of that organization for the period 1955-1960 
(OEEC 1957). The planning issue became rele- 
vant for Germany when, in its Action Program, the 
Commission of the European Economic Com- 
munity stated the necessity for some planning at 
the community level (European Economic Com- 
munity 1962). The topic was widely discussed 
(Albrecht 1964; Plitzko 1964), and these discus- 
sions showed the differences in terminology a 
ideology still prevailing in western Europe, mainly 
in Germany, though even there they were more 
seeming than real. 

This article will discuss the definition of H 
ning which seems to be the most practical one; He 
theory of planning; the concepts of economic up i 
and economic policy serving as the backen a 
planning; the methods of short-term and none os 
planning; the organization and procedure of ae 
ning activities; the supervision of plans and 
testing; and the impact of planning on se 

Definition and theory of planning. Present 
western European countries have “mixed ves ii 
than purely “capitalist” or “free-enterprise 
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omies. They have public sectors of considerable 
size, and they exert various types of controls over 
the private sectors. This state of affairs is a product 
of a pragmatic policy but is consistent with modern 
economic theory. If the question is asked what 
economic order—that is, what set of institutions— 
will maximize welfare under the constraints im- 
posed upon us by nature—that is, the laws of tech- 
nology and psychology—the answer is a mixed 
order (Tinbergen 1959; Myrdal 1960). 

The old idea of Paretian welfare economics, ac- 
cording to which free enterprise would produce the 
optimum, is tenable only if the laws of technology 
exclude external effects and indivisibilities [see 
WELFARE ECONOMICS]. Where external effects and 
indivisibilities do exist, activities characterized by 
either of them should not be left to private enter- 
prise, A set of tasks for the community, represented 
by government, can be derived on the basis of this 
distinction. They consist of activities showing ex- 
ternal effects or indivisible means of production 
and of a number of otherwise determined policy 
tasks. Among the former we have public produc- 
tion of energy, transportation services, education, 
and information. Among the latter we encounter 
the regulation of unstable markets, the redistribu- 
tion of income (as required even by the Pareto 
optimum conditions), the organization of the 
Monetary system, and the determination of the 
optimum level of savings and of an optimum level 
of total demand (high and stable employment). 
Taken together, these tasks constitute such a com- 
Plicated set of activities that their efficient execu- 
tion requires preparation. Planning is just that. 
(Over long periods the most desirable develop- 
ment of an economy should coincide more or less 
with the theoretical concept of optimal growth 
[see Economic GROWTH, article on MATHEMATICAL 
THEORY], ) 
pee will not use the word “planning” in the sense 
ype order or policy, such as a command 
aS with detailed intervention. Depending on the 

Pee the tasks assumed by the government, 
ning of these tasks will also be more or less 
xtensive, Independently of the scope of the tasks, 
‘ead their preparation requires, in principle, 
stead} E of general goals for the most de- 
cas economic development of the country. The 
that pen a activite then must be conducive to 
E = desirable development. To the extent that 
PEDA optimum development requires govern- 
is arrives, it represents a task-setting device 
Private Prepublioyinsiitu gna; The delineation of 
rather yonun is not, as a rule, coercive but, 

> icative.” It constitutes a background for 
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the appraisal—in the future—of the actual accom- 
plishment of the private sector. This does not 
necessarily mean that all deviations from the plan 
must be considered “errors” of the private sector. 
There may have been good reasons to deviate from 
the plan. 

Moreover, the difference between the role of the 
plan for the public sector and for the private sec- 
tor should not be exaggerated. Only in the abstract 
can government completely control all its activities. 
In reality it consists of so many levels and units 
that its activities cannot all be determined at the 
top. The lower units, such as the municipalities, 
can be influenced mainly by indirect means, and 
there is not so much difference, from this point of 
view, between municipalities and private enter- 
prises. Moreover, a certain degree of codetermina- 
tion may be given to the private sector in the forma- 
tion and the execution of the plan. 

Planning should be seen as a manifestation of 
the ever-growing tendency consciously to organize 
human activity. In this process there is a continu- 
ous search for efficiency in its broadest sense. In 
the particular case of economic planning this 
means, on the one hand, that social or group needs 
are increasingly recognized as aims and, on the 
other hand, that no means, including planning, 
should be used where not positively contributing 
to these aims. We observe a gradual expansion of 
the fields to which planning is being applied; thus, 
education is now rapidly becoming more planned 
than some decades ago, and there is a tendency to 
extend planning to ever larger geographical areas. 
Finally we observe the integration of hitherto iso- 
lated and incoherent pieces of planning into a 
broader, more coherent frame. Thus, town and 
country planning are now becoming integrated 
into national economic planning. 

Economic order and economic policy. We have 
already stated that the optimum economic order 
must be described as a certain set of institutions. 
Examples of such institutions are private enter- 
prises, public enterprises, markets, social insur- 
ance schemes, tax systems, monetary systems, and 
hierarchies of public authorities. Institutions in 
turn are characterized by certain rules of conduct 
or as instruments of the society's economic policy. 
Examples are tax rates and social insurance bene- 
fits. Apart from a static concept of an optimum 
order, there is a continual process of striving for 
and managing the optimal order, which we usually 
call economic policy. 

In a general way, any economic policy may be 
described by its aims and its means. Both aims 
and means may be qualitative or quantitative. 


104 PLANNING, ECONOMIC: Western Europe 


Most institutions as such must be described in 
qualitative terms; their rules of conduct or policy 
instruments can sometimes be described in a quan- 
titative way. Thus, the level of a tax rate, of an 
import duty, or a public investment has a quanti- 
tative aspect. We will use the word “target” for a 
quantitative aim and the word “instrument” for a 
quantitative means and reserve the words “aims” 
and “means” for situations where both qualitative 
and quantitative aspects are present. 

There is a causal nexus from means to aims, 
with intermediary consequences of the use of 
means which may be called intermediate or derived 
aims, as distinct from final aims. Thus, the ulti- 
mate aim may be higher utility or satisfaction for 
the citizenry, but higher consumption may be the 
first derived aim, higher incomes the second, 
higher employment the third, and higher invest- 
ments the fourth. Clearly, short-term policies will 
be characterized more by derived or “nearer-by” 
aims than long-term policies will be. We will put 
the dividing line between short-term and long-term 
policies at somewhat more than a year. The stand- 
ard example of short-term plans will be an annual 
plan and that of long-term plans a five-year plan. 
Often a five-year plan will be classed as a medium- 
term plan and the phrase “long-term plan” or 
“perspective plan” will be used to indicate 15- to 
25-year plans. In western Europe the emphasis has 
been on the shorter range, mainly because of the 
already developed character of the economies. For 
less developed economies, perspective plans are 
more important than for developed economies. 

For short-term policy in western Europe a well- 
known set of targets has been (1) full employ- 
ment; (2) balance of payments equilibrium; 
(3) a level of net investment somewhere near 15 or 
20 per cent of national income; (4) an increase in 
the share of lower income groups in national in- 
come; and (5) stable prices. It should be noted 
that full employment has been interpreted vari- 
ously, but often as a level of unemployment less 
than 2 or 3 per cent of the labor force. It should 
also be noted that the fourth aim has seldom been 
expressed in figures and, finally, that price sta- 
bility has not been attained. 

A compilation of the targets or “objectives” of 
economic policy in Western countries, including 
the United States, was made by Kirschen and his 
collaborators (Kirschen et al. 1964, p. 148). They 
added a listing of instruments, making a two-entry 
table indicating which instruments are supposed 
to help attain which objectives. The table contains 
16 objectives: (1) full employment; (2) price sta- 


bility; (3) improvement in the balance of pay- 
ments; (4) expansion of production; (5) promotion 
of internal competition; (6) promotion of coordi- 
nation; (7) increase in the mobility of labor; 
(8) increase in the mobility of capital; (9) interna- 
tional division of labor; (10) satisfaction of col- 
lective needs; (11) improvement in the distribution 
of income and wealth; (12) protection and priori- 
ties to regions or industries; (13) improvement in 
the pattern of private consumption; (14) security 
of supply; (15) improvement in the size or struc- 
ture of population; and (16) reduction in working 
hours. 

The main instruments of economic policy in 
western European countries have been public 
finance and social insurance, the regulation of un- 
stable markets, and some quantitative restrictions 
in foreign trade. A fuller list is given in the table 
just discussed. It is subdivided into 17 instruments 
of public finance; 17 instruments of money, credit, 
and the exchange rate; 16 instruments of direct 
control; and 11 types of changes in the institu- 
tional framework. 

These numbers probably do not exhaust the 
number of instruments available. One may enter 
into much more detail by distinguishing between 
a number of sectors or industries, and both in war- 
time and postwar Western economies and in com- 
munist-ruled economies, a very large number of 
instruments consist of investment encouragement 
or quantitative restrictions for single industries. In 
many instances there is a choice among several 
instruments to attain the same target. 

In principle, both the aims and the means are 
the subject of autonomous choices of the com- 
munity: questions of taste, in a way. The restric- 
tions on this autonomy are that the means and the 
aims should not be mutually inconsistent; this 
implies that the means should, in the degree used, 
lead to the aims. In practice the autonomy 150. 
little consequence because of the similarity 1M 
tastes among different nations. As a rule the aims 
are not very different among nations; the autonomy 
on the side of means usually consists of ae 
some extreme means, such as forced labor an 
detailed intervention. The choice among the au 
sixty instruments mentioned by Kirschen depen 
largely on the effects expected from each. m 

The knowledge of these effects is equivalent 
knowledge about the operation of the ear 
Kirschen’s table suggests some qualitative ie 
edge of this kind in that it indicates, for aaa 
jective, the instruments supposed to effect it. ant 
is needed for planning purposes, however, is 4" 
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titative knowledge. This has been accumulated for 
the last half century by an ever-increasing volume 
of statistical and econometric research. 

The remainder of this article will discuss the 
various elements of the planning process, that is, 
of the preparation of economic policy. The main 
elements may be defined as the tasks, the methods, 
the organization, and the procedure of planning. 
What is meant by the tasks will be clear from 
what has been said in the present section. They 
consist primarily of making up plans, either year 
by year or for longer periods. In addition there are 
preparatory studies, especially those concerning 
alternative methods of production in some indus- 
tries and those concerning the structure of con- 
sumer demand. There may also be special studies 
of the effects of the introduction of new institu- 
tions or of particular measures—for instance, to 
increase productivity or to regulate a given market. 
Studies about the consequences of European inte- 
gration represent another example of an element 
in the planning process. 

The methods of short-term planning. By the 
“methods of planning” will be understood the tech- 
nical methods used to estimate the framework of 
figures constituting the hard core of any plan. This 
quantitative job must always be preceded by a 
qualitative analysis of the changes in the social 
order deemed desirable and of the significance that 
should be attached to the economic phenomena 
that are to be estimated quantitatively. For short- 
term plans the changes in the social order cannot 
be large, since such changes usually take time. 
The main emphasis will be on the quantitative part 
of the process, which may be seen as the necessary 
adjustment of the economy to short-term changes 
in world markets and to autonomous changes in 
Private behavior, with a view to maintaining full 
employment and achieving some of the other tar- 
Sets mentioned in the preceding section, including 
the Portion of a long-term plan of development per- 
oe to the year considered. As already stated, 
aan knowledge of many reactions of the 
cate au to autonomous changes and to induced 
app Se of such reactions are demand and 
Pi ae ea (both with regard to prices and 
Aes de ami in a number of markets, 
Koda = market for consumer goods, for export 
Salae E so on. The main problem is to find the 
beiko the instrument variables—taxes, public 
i ern credit and wage policies, ete.—given 
Soa targets and the changes in external or 
inthe eae data. We will use the latter term here 

Stricted sense of not including changes in 
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instrument variables. Sometimes such exogenous 
variables are indicated as “other data.” 

Essentially there are two methods of solving the 
main problem: trial and error and the mathemat- 
ical method. By the latter is meant a systematic 
use of symbols and equations (or more generally 
constraints, including inequalities) expressing the 
reactions mentioned before and a number of tech- 
nical relationships, which must be satisfied simul- 
taneously by all the figures in the plan. As a rule, 
combinations of both methods will be used; one 
may even argue that certain portions of mathe- 
matics do not surpass the level of trial and error. 
Yet the weight given to one or the other of the two 
extremes differs among countries, and the Nether- 
lands Central Planning Bureau is known for a 
rather extensive use of simple mathematical 
models for the framework of planning. The models 
used (Netherlands . . . 1956; Tinbergen 1956) can 
be characterized as generalized Keynesian models, 
since the Keynesian consumption equation is one 
of the most important. 

Since the Netherlands is a small country, export 
and import demand play a considerable role. Sup- 
ply equations have been given the special shape of 
price fixation equations. Capital as a factor limit- 
ing production does not enter the system: it is 
assumed that in the short run capital goods are 
redundant, but labor does enter as a limiting factor. 
Public finance is described in some detail, distin- 
guishing various types of revenues. Investment is 
split into private and public, and the private into 
investment in fixed equipment and in stocks. The 
models used often consist of some thirty or forty 
equations. They are adapted to the particular pur- 
pose they have to serve. The models used for the 
general plan differ from those used for special 
studies (see below). They do not contain financial 
flows, that is, changes in the main types of finan- 
cial assets, since so little is known about the pro- 
pensities to hold various kinds of assets. Financial 
flows are dealt with separately, using balance equa- 
tions and a number of less formalized assumptions 
about the changes in single asset holdings (see 
below). The formal model refers to the main com- 
modity flows, prices, and current payments of 
incomes, expenditures, taxes, etc. 

The application of the model starts with an 
estimate of the initial position, that is, the situa- 
tion in the year preceding the plan year. This 
position, of course, has to be in part estimated and 
is presented in the form of national accounts and 
the balance of resources and expenditures. 

The next step consists of an estimate of the 
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probable changes in exogenous data (in the re- 
stricted sense defined before, that is, excluding 
policy instruments). An important part is played 
by estimates of the changes in exports, taking into 
account income changes of the country-groups of 
destination, changes in the price levels of competi- 
tors and of Dutch exports, incidental changes in 
trade policies, etc. Another important datum is the 
level of private investment to be expected in the 
coming (plan) year. Investment estimates are 
based partly on econometric relations but, because 
of the latter’s unreliability, also on a direct survey 
of a large number of firms. A number of items of 
public expenditure are also considered data, and 
figures for these are obtained from the treasury. 
Agricultural supplies are usually known from crop 
statistics for the current year and are estimated at 
the “normal” level for the plan year. Seasonal 
fluctuations in building and demand for fuel are 
also assumed to be normal. 

The coefficients of the equations reflecting re- 
actions, technological or institutional relationships, 
etc., are taken from econometric and statistical 
studies and are revised periodically. 

With these figures given and with given targets, 
the system of equations can be used to determine 
the most desirable values of the instrument varia- 
bles and of the economic variables not representing 
targets. The results are discussed with various 
ministries, and as a rule the published plan will 
be the result of these discussions, avoiding a devia- 
tion of policy and plan. The influence the plan 
exerts is mainly through these discussions. Some- 
times alternative figures are added in order to show 
the possible effect of changes in instrument varia- 
bles. Thus, the effects of possible changes in wages 
other than the ones envisaged or of public expendi- 
ture for different purposes have been shown in 
some plans. 

Once the current money flows have been esti- 
mated, financial flows (changes in assets) are 
estimated so as to produce the most desirable de- 
velopment. The main problems involved in annual 
planning may be summarized, using the schema 
of B. Hansen (1955) as follows. From the table 
of resources and uses we derive the permissible 
volume of private consumption as the residual be- 
tween production on the one hand and investment 
and public expenditure on the other (assuming 
balance of payments equilibrium on current ac- 
count). From permissible consumption we derive 
the necessary tax level. From the financing equa- 
tion for private investment we derive the necessary 
increase in banking credit to business, From the 
balance sheet of the commercial banks we then 


derive the necessary change in reserve require- 
ments, which constitutes the main instrument of 
monetary policy. 

The methods of long-term planning. France 
has given attention to long-term planning to a 
greater extent than has the rest of western Europe, 
The French example is now being followed by other 
countries, including the United Kingdom, the Neth- 
erlands, and Belgium. Long-term planning has not 
been entirely absent in the other countries but has 
been applied somewhat less systematically, 

The method of the French Fourth Plan, as set out 
by Cazes (1962, pp. 33 ff.), was that of successive 
approximations. In a first approximation, a projec- 
tion was made for a set of 17 sectors—presumably 
using input-output coefficients in addition to those 
already mentioned in “The methods of short-term 
planning”—employing three alternative assump- 
tions with regard to the general rate of growth: 
3, 4.5, and 6 per cent annual growth. In a second 
approximation, after the government expressed a 
preference for a rate of 5 to 5.5 per cent, more 
detailed figures were prepared by “vertical com- 
mittees” covering 27 main branches and 300 sub- 
branches. The figures were checked by “horizontal 
committees” dealing with general aspects such as 
financing of investment, current demand versus 
supply, manpower and education supply, and 
so on, 

The method is not too different from methods 
several authors have recommended to developing 
countries; one difference is in the relative scarcities 
of the factors of production. Capital is less scarce 
and labor more scarce in developed than in devel- 
oping countries. This will come up in the concrete 
appraisal of single projects and in the choice of 
technologies. For developing countries this ap- 
praisal of projects is often added as a third phase. 
Lately, a geographical subdivision of the nation 3 
be planned for has been added and a correspon a 
ing phase in long-term planning methods distin 
guished. 

Long-term planning methods have also es 
developed for special purposes, such as the prena 
ration of institutional reforms or educational wee 
ning. In the Netherlands, an extensive study ve 
made of the introduction of unemployment insu 
ance in a new form and of the ince 
profit sharing by workers. In both cases mal T 
matical models were used in which es 
peared characteristic for the particular pro! pa 
studied; for instance, in the first example te ees, 
tributions to be made by employers and anne a 
the benefits to the unemployed, etc. A farh 
ample of special models is the model used to 
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the consequences of Benelux integration (Verdoorn 
1960). 

Organization and procedure of planning. The 
“organization of planning” considers the hierarchic 
relations within the group of persons and adminis- 
trative units charged with the tasks of planning. 
The “procedure of planning” considers the nature 
and time order of outside contacts made during 
the execution of planning activities. 

In the description of the organization, the dis- 
tinction will be made between the external and the 
internal organization of the main administrative 
unit charged with planning. This unit will some- 
times be an office, like the Netherlands Central 
Planning Bureau, within one of the ministries; 
sometimes it will have a more independent status, 
as in France, where it is called a commissariat. In 
a number of developing countries it is a ministry 
by itself. The level may thus be different, but as a 
tule it has some interdepartmental character. It 
may either be organized under the prime minister 
or, if elsewhere (treasury, ministry of economic 
affairs), have the right to deal with all ministries 
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without special permission of the minister con- 
cerned. The ideal situation, in the author’s opinion, 
is for the planning unit to be directly responsible 
to the prime minister. The preceding remarks refer 
to the external organization. 

The internal organization is determined by the 
number and nature of the divisions and subdivi- 
sions and the division of labor among them. It also 
depends upon the size and the background of its 
staff. As a rule, different divisions will deal with 
long-term and short-term planning work, since ex- 
perience teaches that charging the same persons 
with these two types of work leads to neglect of the 
long-term tasks—they are usually considered less 
pressing than short-term ones. Research in the nar- 
rower sense of basic scientific analysis, as distinct 
from the applied type of planning proper, will be 
entrusted to a separate division or be the task of 
the long-term planning division. Current advisory 
work can best be entrusted to the short-term plan- 
ning division. Sometimes subdivisions will be estab- 
lished to deal with individual sectors of economic 
life. 


Table 1 — Schedule of the French Fourth Plan 


PLANNING COMMISSARIAT 
AND RESEARCH SERVICE 


GOVERNMENT 


SOCIAL AND PROFESSIONAL 
CONSULTATION 


Preliminary studies for 1965 and 
1975 (1959 and beginning of 1960) 


Investment and Planning Section of 
the Economic and Social Council 
(spring 1960) 


Directives of government to comms e l 


sariat; appointment of members of 
Committees of Modernization 


Elaboration of accounts for 1965 de- 
rived from government directives 


Uuly 1960) 


L~~ > Start of work of 27 Modernization 


Committees (summer 1960); reply to 
questionnaire (February 1961) 


First check of physical and financial 
consistency of work by committees 
(March 1961); elaboration of a pro- 
visional synthesis for 1965 on the 
basis of a growth rate of 5 per cent 


Decision by government on rate of << 


Growth and distribution of public in- 
Vestments (April 1961) 
L 


Approval of draft by government 
(September 1961) 


—> Continuation of work on the basis of 
new growth rate of 5.5 per cent 


Final synthesis (May 1961); draft plan 


> Consultation of High Planning Coun- 


cil (October 1961); consultation of 
Economic and Social Council (No- 
vember 1961) 


Parliamentary vote on draft law ap- had deg Se 


Proving Fourth Plan 


Source: Adapted from Cazes 1962, p. 34. 
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The planning unit may have branches for indi- 
vidual sectors, for instance, in some specialized 
ministries (agriculture, manufacturing industry, 
trade, transportation, education) or for individual 
regions (state planning units in a federation), But 
the organization may also be different, if the tasks 
are entrusted to the corresponding ministries or 
lower public authorities. In our terminology the 
contacts with these branches are then no longer 
internal but external, and form part of the “pro- 
cedure” of planning. 

Clearly, the number of persons engaged by the 
central unit depends on which of these two forms 
of organization has been chosen. Usually in west- 
ern European countries this number is not very 
high. In France the Commissariat employs only 
about forty persons; the Netherlands Central Plan- 
ning Bureau is somewhat more centralized and 
employs some one hundred persons, whereas the 
Norwegian unit is smaller. In the French system, 
however, about three thousand persons, distributed 
over some three hundred working groups, collabo- 
rate in the committees dealing with the single 
sectors (Cazes 1962, p. 35), This number far sur- 
passes the number of committee members in the 
Netherlands, which amounts to less than one 
hundred. 

We now come to the subject of the procedure 
of planning. In France, as the preceding figures 
show, outside contacts are highly developed. The 
time order of the various contacts is well illustrated 
in Table 1. 

In the Netherlands, sector plans have occasion- 
ally been discussed with representatives of some 
sectors but not systematically prepared by working 
groups, as in France. Some steps in this direction 
have recently been taken. Until now, since the em- 
phasis is on macroeconomic means of economic 
policy, the usual procedure, once agreement has 
been reached provisionally with the various gov- 
ernment departments, has consisted mainly of con- 
sultation with central organizations of employers 
and employees, and with independent experts, both 
in the Socio-economic Council and in the Central 
Planning Commission. Apart from the committee 
work, which is far more voluminous in France than 
elsewhere, so far the procedure is not very different 
from that of the French. Major changes in general 
government policies in the Netherlands (where 
wages are, to some extent, under the supervision 
of the Sovernment) have as a rule been preceded 
by advice given by the Socio-economic Council, 
based on research and plans made in the Central 
Planning Bureau. 


Supervision and testing of plans. In western Eu- 
rope, planning agencies are not charged with the 
supervision of the implementation of plans. There 
is far less scope for such supervision by a separate 
agency than there may be in developing or in 
communist-ruled countries, since in the latter such 
an important part is played, within economic pol- 
icy, by the individual public investments envisaged, 
Such projects may be in need of progress super- 
vision by a special agency, although this is a debata- 
ble issue. The general view held in western Europe 
is that planning units are advisory bodies and that 
supervision proper is precisely the government's 
responsibility, that is, the responsibility of the ex- 
ecutive agencies, Indirectly, any deviation of actual 
policies from those recommended by the planners 
will show up in the next plan, since this always 
starts out with a survey of the year preceding the 
plan year. Routine statistics will show any such 
deviations; but there may be a need for some spe- 
cial statistics on the progress made in big invest- 
ment projects. Such statistics have been developed 
in the Netherlands during the last decade. It has 
become customary also, in the Netherlands and 
elsewhere, to report not only investment planned 
for the coming year by private business but also 
actual investments completed in the year before 
and estimated investments in the current year, 
therefore covering three years in all. 

If supervision proper is not considered a task of 
the planning unit, testing of previous plans is very 
much so. By testing is meant an analysis of ac- 
tual compared with recommended development. In 
principle, actual development of any variable may 
be considered the result of three types of factors, 
each of them also used to get the estimated “most 
desirable” development of the same variable: 
(a) the values of the exogenous variables (exclud- 
ing policy instruments); (b) the values of the in- 
strument variables; and (c) the values of the co- 
efficients of the model used. As a consequence, 
deviations between planned and actual values may 
be ascribed to (a) deviations in the values of oe 
exogenous variables, that is, forecasting errors ie 
these variables; (b) deviations in the values of 
instrument variables, that is, between actual i 
recommended policies; and (c) deviations in a 
values of the coefficients, or errors in the mode” 
If we like, we can combine (a) and (c) ra 
them forecasting errors in the endogenous 
ables. 4 

It is possible to give a special form to m Be 
parison between predicted and observed V o ane 
either exogenous or endogenous variables by 
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ing the number of cases where, in three successive 
years, turning points were either (i) correctly pre- 
dicted, or (ii) wrongly predicted, i.e., predicted 
where none occurred, or (iii) not predicted, i.e., 
where they did occur. The reason for choosing this 
particular test is that with so-called naive forecast- 
ing methods, turning points will never be found: 
it is essential for these methods that they extrap- 
olate the direction of change observed between 
the first two years. Thus, Theil (1958) found that 
the Netherlands Central Planning Bureau correctly 
predicted 13 out of 14 turning points in a number 
of variables over a number of periods, wrongly pre- 
dicted another 5 and did not predict the one left 
of the 14 observed turning points. The record for 
a number of forecasts made by Scandinavian insti- 
tutions was that out of 21 turning points, (i) 15 
were correctly forecast, (ii) 3 wrongly forecast, 
and (iii) 6 were not forecast. 

A similar test consists of scrutinizing whether 
acceleration or deceleration occurred. Out of 91 
Dutch cases, 71 were predicted in the correct way 
(either acceleration or deceleration); out of 74 
Scandinavian cases, 62 were characterized in the 
correct way. 

From other studies (by C. van der Panne, in 
Dutch) it was found that relatively good forecasts 
between 1949 and 1955 were made of price in- 
dexes, indirect taxes, public employment, and the 
value of exports, whereas relatively poor forecasts 
were made of invisibles, imports and exports, in- 
vestments, productivity, and the salaries of civil 
servants. 

The impact of planning on society. To con- 
clude, we may ask the general question “What has 
been the impact of planning of the western Euro- 
pean type on society?” Clearly the effects, if any, 
Must be the justification of the efforts and other 
Sacrifices made. For the time being, the answers 
to this question can only be approximative; and 
even this phrase may be a euphemism. 

RR a markets with well-defined regu- 
aa a the impact of such regulations may some- 
rae e estimated with a fair degree of accuracy. 

4 E economy as a whole only little can be said. 
Aes pn point to be made is that planning has 
unpla led in avoiding the main inconsistency in 
the en economies of the pre-1914 type, namely, 
cas Aoaeiaa of productive capacity as a 
Tee neg of business cycles and of structural 
ae ria. It is highly probable that the disap- 

TEG r the business cycle after World War 1 
Daini Obtained with the aid of macroeconomic 

ng of the type described in this article. A 
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crude estimate of the contribution to national prod- 
uct to be attributed to anticyclical policy can be 
obtained from the estimates of unused capacity 
prevailing in pre-1914 economies. Two measures 
are conceivable: the extent of unused labor and 
the extent of unused equipment. The figures for 
unused labor are represented by unemployment per- 
centages. Figures on capacity are very uncertain, 
as shown by recent studies by Hickman (1965, 
p. 114). This is due in part to the fact that produc- 
tion at lowest cost is well below capacity produc- 
tion. Therefore our estimate must be seen as only 
a first crude approach to the problem. But a fair 
guess of unused capacity may be 10 to 15 per cent, 
meant as an average for unplanned societies. This 
guess implies that anticyclical policy may have 
contributed 10 to 15 per cent to national income. 

The further question may now be asked, “What 
degree of detail in planning is optimal?” It is the 
author's contention that the optimum degree of 
detail in planning is below the degree of detail 
applied in the communist-ruled societies. Some 
support is given to this contention by a recent 
assessment of the fluctuations in economic activity 
in various countries over the period 1950-1960 
(Staller 1964). These fluctuations appear to be 
roughly 4 per cent for all communist countries, 
1.5 per cent for the Soviet Union, 4 per cent for all 
Western countries, and 7.5 per cent for the United 
States. The gain from more detailed planning as 
applied in the Soviet Union appears to be less than 
3 per cent. This points to decreasing returns for the 
intensification of planning, if one tries to imagine 
the much higher level of planning effort in the 
Soviet Union as compared with western Europe. 
The question is in need of more precise research. 


J. TINBERGEN 


[See also ECONOMETRIC MODELS, AGGREGATE. ] 
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1 
EASTERN EUROPE 


The nations of eastern Europe, defined to encom- 
pass Poland, Czechoslovakia, East Germany, Hun- 
gary, Yugoslavia, Bulgaria, Rumania, and Albania, 
are ruled by Marxist-Leninist parties exercising dic- 
tatorial powers in the name of the proletariat. While 
the leaders of these parties may differ among them- 
selves in points of doctrine or in their foreign 
policy, they are all committed to public owner- 
ship of the basic means of production and to the 
pursuit of certain economic goals, including chiefly 
the promotion of heavy industry by means of state 
direction and control. The economies of eastern 
Europe, with the partial exception of Yugoslavia, 
are centrally managed by a complex apparatus of 
government and party officials similar in most re- 
spects to that prevailing in the USSR. Yugoslavia 
differs from the other communist states in that its 
economic organization is substantially more decen- 
tralized; in the Yugoslav system, which will be 
described in the last part of this article, market 
prices play a significant role in determining the 
allocation of resources under the over-all guidance 
and supervision of state policy. 

Early stages. Nationalization of large-scale in- 
dustry, banking, and foreign trade in Czechoslo- 
vakia, Poland, and Yugoslavia proceeded rapidly 
after liberation from Nazi occupation and was yis 
tually completed by 1946. In the countries alli 
with the losing side in the war—East Germany, 
Hungary, Rumania, and Bulgaria—which weit 
burdened with heavy reparations payments, main y 
in favor of the Soviet Union, comprehensive H 
tionalization measures were delayed until 1947 o 
1948. In the area as a whole the “socialization d 
local industry and retail establishments and pie 
crafts, which were either placed under state m: 
agement or reorganized as cooperatives, pes 
chiefly between 1949 and 1951, after which 0 
a minority of these enterprises were eee Ae 
remain in private hands. Since then the nu ime 
of private enterprises has varied from in’ a the 
as state policy became more or less liberal fp raw 
issuance of licenses and in the allocation © 
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materials to the private sector. Agriculture, for a 
time, represented a notable exception to the policy 
of rapid socialization. Over three-quarters of exist- 
ing farms remained in private hands until the late 
1950s in all countries but Bulgaria, where a ma- 
jority of farms were already collectivized in 1951. 
By 1962, however, collectivization was virtually 
completed in every east European country, with the 
exception of Poland. 
By 1949-1950 the principal features of the So- 
viet economic model had been adopted in the 
management of the socialized sector of the east 
European states. The resulting economic systems 
were Virtually identical up to 1956. Each Politburo, 
ostensibly under the guidance of the Central Com- 
mittee of the Party, formulated the general eco- 
nomic policies of the country, The Secretariat of 
the Central Committee supervised the execution of 
these policies by government organs. Responsibility 
for drafting long-term plans (from four to six years 
in the several countries) and for elaborating cur- 
rent (one-year) plans was vested in a Planning 
Commission formally attached to the Council of 
Ministers. In the late 1940s the economic minis- 
tries, which supervised the management of the 
state-owned industries, were organized according 
to broad sectors (e.g., industry, light industry, agri- 
culture). In the course of time, these ministries 
underwent progressive fragmentation. By 1955, for 
instance, Czechoslovakia’s machine-building and 
equipment industry was managed by three separate 
ministries. Enterprises were directly subordinated 
to industrial departments of the ministries, called 
chief administrations or central boards (e.g., in 
Poland, the Central Board for Agricultural Ferti- 
lizers under the Ministry of the Chemical Industry). 
In conformance with Soviet practice, enterprises 
were normally placed on khozraschét: they be- 
pane autonomous accounting and financial entities 
Pih subject to the directives they received from 
i Superior authorities, were expected to mini- 
ees show profits, and pay taxes to the cen- 
i udget; each had its own account at the na- 
onal bank, which financed its short-term credit 
tequirements. 
ee to 1953-1954, the trend was toward ever 
highs Hensia planning under centralized 
dustry 3 The bulk of materials consumed by in- 
mission a rationed ‘out by the) Planning Com: 
med a an ministries on the basis of 
prises wer imates. In theory, all state-run enter- 
Process ~ supposed to take an active part in the 
FE cae Se the plans: on the basis of 
lesek ol figures handed down by their minis- 
» they drew up counterproposals, which usually 
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proposed more ambitious output targets than those 
they had received and which also called for more 
materials and other resources than the control 
figures provided for. When the Planning Com- 
mission summed up the “counterplans,” they fre- 
quently turned out to be internally inconsistent. 
The changes in targets and materials allocations 
the Planning Commission eventually had to make 
to harmonize the plans rendered most of the pre- 
liminary paper work nugatory. 

The above system, known as “planning from 
below” or “counterplanning,” was replaced in 
Czechoslovakia in 1953 and in Poland in 1954 by 
a new scheme known as “planning from above,” 
where enterprises were formally cut out from the 
planning process. Henceforth, the option to consult 
subordinate enterprises was left to the discretion 
of the ministries. They could do so, if they wished, 
to obtain information on the consumption of ma- 
terials or on productive capacities at the plant 
level; but they could also rely on statistical data 
obtained through other channels. This more cen- 
tralized procedure for elaborating the plans was 
accompanied by a gradual reduction in the work- 
load of the Planning Commission, as an increasing 
number of balancing and allocation decisions were 
delegated to the ministries and to lower adminis- 
trative organs. The new scheme was designed to 
enable the central authorities to cut the time spent 
on the mechanics of plan setting and to concen- 
trate their efforts on improving the substance of 
the plans. 

To this reform, which took place about the 
time of Stalin’s death, there corresponded certain 
changes in economic policy which also affected 
the role of the plans as instruments for governing 
the pace and pattern of economic growth. Prior to 
1953, industrialization proceeded at an extremely 
rapid pace in every nation of the area, irrespective 
of its initial level of development. The long-term 
plans promulgated during the Korean War over- 
taxed the capacities of the individual economies, 
which could not simultaneously build up their 
heavy industries, including armaments manufac- 
tures, expand their output of consumer goods, and 
promote their agriculture. The yearly plans, which 
were bent toward the achievement of the long-term 
plans, also overcommitted resources. Hence, pri- 
orities had to be established to determine which 
sectors would suffer the shortfalls. Each year, re- 
sources in short supply were deflected from light 
industry, residential construction, and agriculture 
and toward metallurgy, machine building, and 
other branches of heavy industry. Since the yearly 
plans were often poorly balanced, crucial allocation 
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decisions had to be taken in the course of the plan- 
year on an ad hoc priority basis. The rationing of 
consumer goods, which prevailed in the entire area 
in the early 1950s, was symptomatic of this ab- 
normal state of affairs. 

Increased emphasis on consumer goods. In 
the summer and fall of 1953, however, a New 
Course was launched by the ruling parties of 
eastern Europe, in line with developments in the 
Soviet Union, which significantly raised the priority 
standing of consumers’ goods and services. Con- 
sumers’ rationing was generally abolished either 
just before the death of Stalin (as in Poland) or 
shortly thereafter (as in Czechoslovakia and Ru- 
mania). From then on, shortfalls in the output of 
consumer goods manifested themselves in declin- 
ing inventories in retail stores, in queues, and in 
overt or covert price increases; in countries such 
as Czechoslovakia, which are dependent on food 
imports to supply their population, agricultural 
failures gave rise to balance-of-payments crises. 
Now that the claims on resources of heavy and 
light industry and of agriculture were more nearly 
equal, an allocation system based on ad hoc pri- 
ority decisions was too blunt an instrument to 
carry out state policy, The current plans had to 
have sufficient “authority” to guide the economy 
throughout the year. 

The need for a priori consistency of the plans 
compelled the planners to budget their resources 
more realistically. The trend toward more careful 
balancing was reinforced after the Poznan riots 
of June 1956 and the Hungarian revolution in 
October and November of the same year. Since that 
time the ruling parties in the region, but especially 
in the more developed nations, have recognized 
as a political constraint on the formulation of na- 
tional economic plans that the total level of con- 
sumption and the average level of urban real wages 
must not be adversely affected by investment 
efforts or by circumstances of any sort. (It is re- 
markable, for instance, that private consumption 
increased slightly in Czechoslovakia in 1963, even 
though national income declined by 3-4 per cent 
during that year.) Another factor making for a 
more systematic attempt to achieve consistency in 
the current plans after 1953 was the increasing 
relative scarcity of labor available to industry, not 
only in Czechoslovakia and East Germany, where 
the pool of surplus labor from agriculture had been 
exhausted some time previously, but even in Ru- 
mania, where the intake of labor from the coun- 
tryside was greatly reduced, compared with the 
first industrialization campaign after 1949. (The 
area-wide shortage of housing and other urban fa- 


cilities contributed to the decision to slow down 
rural-urban migration.) The increasing scarcity of 
manpower reduced the planners’ ability to correct 
mistakes in the allocation of materials or invest- 
ment funds by lavishing surplus labor on the sec- 
tors that had not received their share of critical 
resources. 

In the years since 1956, the economic reforms 
carried out in certain of the east European states 
have broken the previous pattern of institutional 
uniformity and have given rise to discrepancies 
from the Soviet model as well as to divergencies 
among the individual states of the region. Before 
describing the reforms that eventually caused the 
systems to diverge, it may be in order to consider 
the problems the central planners of these econ- 
omies continued to face in common. 

Principal planning problems. The principal 
problem the central planners have to solve consists 
in preparing an internally coherent set of balanced 
estimates for materials and equipment. The “ma- 
terial balances” actually worked on by the Planning 
Commission in each country are those for “funded” 
materials and equipment, that is, for the crucial 
items which it rations out with the approval of 
the Council of Ministers. In Czechoslovakia these 
items made up less than half of the total consump- 
tion of materials in the mid-1950s. In both Czech- 
oslovakia and East Germany, this proportion was 
higher in heavy than in light industry and in the 
production of raw materials and semifabricates 
than in processing sectors. What is also signifi- 
cant is the share of each material’s “consumption 
for productive purposes,” calculated according to 
standard norms based on approved engineering 
practice. In Poland, according to a special survey 
conducted in 1959, this share was over 90 per cent 
for steel semifabricates and for cement, 84 per 
cent for sulfuric acid, 67 per cent for bituminous 
coal, between 60 and 65 per cent for tann 
leather, salt, soda ash, and pine lumber, and x 
little as 5 per cent for refined petroleum produc i 
The principal gaps in the availability of ne 
tion at the planning center are, first, the ea 
cients relating the inputs of nonfunded mate a 
to the outputs of funded or nonfunded goods # = 
second, the coefficients corresponding to the “a i 
sumption of funded products not subject to et 
ard norms. 5 

The reliability of the coefficients themselves Bos 
often been limited. Allotments of materials bdi- 
made to economic ministries, which then he 
vided them among their central boards. pier 
latter ultimately parceled out their quotas itra- 
their subordinate enterprises. At each admin 
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tive level—whether functional or regional—lower 
organs bargained for more generous rations of 
materials from their immediate superiors. If the 
central authorities persisted in drafting overambi- 
tious plans, then there was pressure from above, 
at each echelon, to cut back requirements below 
what enterprises considered a safe minimum. 
While the range within which this bargaining can 
affect final allotments has become more narrow 
with the passage of time, the bargaining process 
has not been eliminated. 

Decreasing or increasing returns to scale have 
also been a source of instability in the technical 
coefficients. If an industry has surplus capacity 
that may be utilized during periods of peak de- 
mand for its products at a high unit cost—due to 
the operation of worn or obsolete equipment—its 
average input coefficient will tend to rise with the 
level of its output. When the product was homo- 
geneous—electric power, for instance—the coordi- 
nating organ on occasion made rough adjustments 
in the coal- or oil-input coefficients for varying 
levels of power output; but where the product-mix 
was complex, this task defied bureaucratic solution. 

Another factor bearing on the stability of co- 
efficients, about which next to nothing is known at 
the planning center, is the possibility of reducing 
the consumption of material inputs by the substi- 
tution of other factors, Enterprises were frequently 
forced to adapt to materials shortages by expending 
more labor on their products. At best, the central 
authorities could make a rough adjustment in the 
allotments of materials to enterprises in anticipa- 
tion of these economies; but neither input-output 
nor any of the planning techniques. in use at pres- 
ent are capable of predicting with any degree of 
Precision the enterprise's ability to make these 
adjustments without running into acute shortages 
of material inventories, such as may eventually 
Cause production breakdowns. 
bi the absence of electronic computers, which 
ae available in planning practice in any 
ae of eastern Europe during the period under 
Bean ration, the preparation of an internally con- 
g Pe plan is a laborious and time-consum- 
ine, Given the limited information on input 
the ead and productive capacities available to 
ne ee Commission, failure to make optimal 
D za information in order to relate the total 
tities of ae in the various sectors to the quan- 
adds a pee product available for final demand 
Principal pints source of potential error. The 
occurrence a problem in this regard is the 
interdepend feedbacks on output levels due to 

encies in the technology matrix. If the 
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planners proceed from a known final demand to 
calculate a mutually consistent set of output levels, 
they will find that as soon as they have added 
enough output to an initial level to satisfy ma- 
terials requirements derived directly from final 
demand, they will need to make a secondary ad- 
justment to meet the added demand for materials 
caused by this last increase in output. Theoreti- 
cally, it may take five or six rounds of adjustment 
before a satisfactory degree of consistency has 
been achieved (satisfactory at least in terms of 
the range of error to which the coefficients are 
subject). In practice, however, feedbacks were 
reduced by two factors. It was known in advance 
that a certain number of sectors—in eastern Eu- 
rope these were generally the sectors producing 
raw materials and semifabricates—would be op- 
erated at capacity; their output being known in 
advance, no feedback effects had to be allowed for 
in these sectors. The only feedbacks were those 
associated with sectors that were free of capacity 
restraints in the normal range of their operation, 
and these feedbacks were likely to be small. Fur- 
thermore, the balancing usually started with trial 
levels of gross output for all sectors. If these out- 
put levels were not too far removed from the final, 
mutually consistent targets, large feedbacks were 
avoided. Since there was flexibility on the final 
demand side, insofar as the planning authorities 
were more or less indifferent between various 
closely related bills of goods, fewer output adjust- 
ments were necessary; this also helped to cut down 
on the computing workload. 

While it was possible to prepare a single bal- 
anced version of the plan, there was usually no 
time to draw up alternative versions or to trace the 
effects of a change in demand for one or more 
endproducts on the outputs that were not tightly 
restricted by capacity limits. 

Another computing task which, while present- 
ing no theoretical difficulty, was still extremely 
demanding on the limited bureacratic facilities of 
the central planning organizations is the reconcilia- 
tion of the material and equipment balances with 
the “synthetic” plans. These include the costs and 
profits plans of industry and trade, investment 
plans, the state budget, and all other macroeco- 
nomic plans that may be affected by the material 
balances (via reductions in material costs, larger 
sales due to increased outputs, etc.). Repeated 
references in the economic literature of the area 
to national economic plans whose component parts 
were not harmonized testify to the difficulty of 


elaborating “complex” plans. s 
To construct efficient as well as consistent plans, 
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the Planning Commission would have to collect 
information on alternative production processes 
open to enterprises and to choose from among 
these processes the ones most likely to economize 
on factors in short supply. In practice, the cen- 
tral planners had no systematic procedure either 
for collecting or for processing this information. 
(There was no analogue in current practice to 
linear or nonlinear programming.) This deficiency 
was especially serious insofar as it affected invest- 
ment and foreign-trade decisions, which normally 
involve choice either among domestic processes for 
producing a given good or between domestic pro- 
duction and imports. (Since exports alone in coun- 
tries such as Czechoslovakia, East Germany, and 
Hungary represent between 30 and 45 per cent of 
national income, the correct allocation of resources 
between industries supplying the national market 
and industries oriented mainly toward exports is 
crucial.) Since the mid-1950s, economic calcula- 
tion, based on comparisons of capital outlays with 
economies in operating expenses and on compari- 
sons of the domestic costs of exportables with 
their foreign-currency prices, has become an im- 
portant alternative to the traditional method of 
balanced estimates in arriving at suboptimal de- 
cisions. These calculations, however, are under- 
mined by defects in domestic prices, to the extent 
that the latter are used to compute costs and 
returns. 

As late as 1965, the price systems of the east 
European socialist states, with the single exception 
of Yugoslavia, were still grounded on the principle 
that the price of each producer good should be set 
at its average cost of production among the enter- 
prises responsible for the bulk of its output, plus 
a small profit margin calculated as a percentage 
of costs. Since price reforms were promulgated 
only once every three to five years, and relative 
costs tended to change rapidly, the structure of 
current costs soon diverged from established price 
ratios. Even current cost levels did not express 
Opportunity costs, if only because neither interest 
charges on capital nor rents on the use of land 
were accounted for. It should also be noted that 
prices of imported goods, owing in part to the arti- 
ficiality of exchange rates applied to foreign prices, 
were improperly related to the domestic prices of 
home-produced substitutes, 

1956-1965: divergencies among states. During 
1956 the excessive centralization of the command 
economies came in for sharp public criticism, par- 
ticularly in Poland and Hungary. The immediate 
reforms that were promulgated at the time, how- 
ever, went no further than the extension of certain 


rights and prerogatives to the directors of enter: 
prises, in line with recent Soviet initiatives in this 
direction. Prior to mid-1957, when the Soviet gov- 
ernment abolished most ministries in charge of 
industrial sectors and subordinated most industrial 
enterprises to regional economic councils, the or- 
ganization of the east European economies was still 
closely patterned after the Soviet. It may fairly be 
said that the first important divergence stemmed 
from the failure of the east European states to fol- 
low the Soviet example and shift from a functional 
to a regional hierarchy of economic organs. 

Bulgaria and East Germany. Only two coun. 
tries—Bulgaria and East Germany—eventually 
adopted, in whole or in part, the new Soviet design. 
Both of these countries abolished their industrial 
ministries. In East Germany, the supervision of 
industrial sectors was entrusted to specialized de- 
partments of the state Planning Commission, 
whose role as chief planning organ was reinforced, 
from 1958 on, by new executive responsibilities. 
Enterprises of national significance were grouped 
in associations (Vereinigungen Volkseigener Be- 
triebe) directly subordinated to the Planning Com- 
mission. Economic councils were also formed in 
regions (Bezirke) and districts (Kreise), but, in 
contrast with the situation of the councils in the 
Soviet Union, they had direct authority only over 
enterprises of local importance. In view of this 
crucial limitation on the power of the councils, the 
reforms of 1958 and 1959 did not alter the essere 
tially functional character of East Germany's in- 
dustrial organization. 

In Bulgaria, however, out of 800 large enter- 
prises formerly attached to ministries, 700 were 
given over to district national councils. The remain- 
ing 100 enterprises were placed under the central 
control of the Committee for Industry and T 
cal Progress, the Committee for Construction ai 
Architecture, the Ministry of Transportation an 
Communication, the Ministry for Education and Cul- 
ture, and the Central Union of Cooperatives. In 
conjunction with the reorganization of the see 
omy, 30 districts, corresponding to natural wi 
nomic regions, were carved out of the 12 pone 
districts. But even in Bulgaria the regionalizat oe 
was not complete, inasmuch as the branch ue 
mittees, in particular the Committee for Te A 
and Technical Progress, retained excl ae 
sponsibility for issuing methodological ne 
to enterprises and continued to PESE jae 
“directional” functions, described in officia trial 
guage as “the fulfillment of the plan of oe 
enterprises for the development of new vation, 
for inter-enterprise cooperation and specializ 
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for the introduction of technical progress, for the 
timely manufacture of essential machines and 
equipment earmarked for construction projects of 
national importance, and for the distribution of 
machines and equipment among districts in cases 
where the districts concerned have not reached 
agreement with each other.” 

In Albania, Czechoslovakia, Hungary, Poland, 
and Rumania, the old system of industrial manage- 
ment by specialized ministries has been maintained 
to the time of writing in 1967. Within this system, 
however, Czechoslovakia, Poland, and, to a minor 
extent, Hungary have introduced reforms tending 
toward the devolution of administrative powers on- 
to lower organs and toward the increased auton- 
omy of socialized enterprises. In 1964 and 1965, 
Bulgaria and East Germany also took further ini- 
tiatives in this direction, The only countries that 
have adhered strictly to the old, tightly centralized 
system for planning and managing industry are 
Rumania and, as far as is known, Albania. 

Czechoslovakia. The most comprehensive of 
the reforms tending toward decentralization were 
undertaken in Czechoslovakia for the first time 
in 1958 and 1959 and again in 1966 and 1967. 
In 1958, enterprises were consolidated into “pro- 
ductive economic units”—operating as enterprises 
in their own right or as associations of enterprises 
—which were directly responsible for the entire 
production of a sector; in certain cases where pro- 
duction facilities were spread over a large num- 
ber of plants, the productive economic units were 
entrusted with the guidance and supervision 
(“gestion”) of the entire output of the sector, in- 
cluding by-products originating in other units. The 
balancing and distribution of less essential prod- 
Ucts formerly carried out in the Planning Commis- 
an ae in the ministries devolved on the produc- 
iB ore units, which were expected to keep 
ates i touch with their customers and to coordi- 
ae production decisions with each other, 
Astri Supervision of higher authorities. The old 
ties a temained in operation, but their activi- 
‘ioe a oe less toward routine direction of 
the gen ae enterprises than toward ensuring 
peduclic He progress and the rationalization of 
centives n in their sector. Two new types of in- 
ee. created: (1) to recompense the 

aie Staff and workers, the average wage of 
creases aie was linked to labor-productivity in- 
of the aa bonus fund was set up on the basis 

ae ment of the enterprise’s profit plan; 
Prise, it wa n the financial autonomy of the enter- 
of the year : authorized to keep a significant share 
-to-year increases in its registered profits, 
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which share could be raised by higher authorities if 
the enterprise agreed to take on more challenging 
tasks. As a result of the new system of “enterprise 
stimulation” and of certain changes in financial 
regulations, the share of decentralized investments, 
in large part financed from enterprise profits and 
depreciation allowances, rose to 60 per cent of 
total investments in 1960 and 1961. 

While the 1958-1959 reforms enlarged the au- 
tonomy of Czechoslovak enterprises in a number of 
areas, price setting for all important producer and 
consumer goods remained the exclusive prerogative 
of the central authorities. No alteration was made 
in the practice of pegging producers’ prices to an 
initial level of average unit costs and keeping them 
in force for several years, regardless of changes in 
supply and demand relations. In 1963, when the 
reforms were for the most part abandoned and 
recentralization took place, the failure to adapt the 
price system to the decentralization was widely 
criticized. The collapse of the reforms and the 
eventual abandonment of the five-year plan orig- 
inally scheduled for 1961 to 1965 were attributed 
by the partisans of decentralization to the distort- 
ing effects of prices on the decisions of enterprises, 
to the plan’s excessive tautness, which made de- 
mands on the construction sector and export indus- 
tries widely in excess of their capabilities, and to 
a crisis in the balance of payments, due in large 
part to shortfalls in agriculture. The proponents of 
tight central controls, including the highest Party 
authorities, laid the failure to an excessive devolu- 
tion of powers on lower organs and to a slackening 
of discipline. In late 1964 and early 1965, after a 
year and a half of debate, the views of the decen- 
tralizers apparently prevailed. The Central Com- 
mittee of the Communist party called for another 
round of decentralizing reforms. This was effected 
in 1966-1967. 

The newest Czechoslovak reforms, in a signifi- 
cant departure from all previous attempts at re- 
modeling the economic system, now provide for 
some flexibility in producers’ prices. Prices set by 
free negotiation between suppliers and purchasers 
will apply to goods representing an estimated 7 
per cent of total industrial output. The bulk of 
these freely priced items are produced in consumer 
goods industries. In addition, lower and upper 
limits within which prices are allowed to fluctuate 
have been imposed on a wide range of goods. Re- 
cent experience has shown that prices quickly gravi- 
tate to the upper limit, so that the significance of 
this innovation is rather restricted. These limits 
and the prices of all other goods will still be set by 
central organs. In general, the quasi-monopolistic 


116 PLANNING, ECONOMIC: Eastern Europe 


the imposition of charges on the fixed capital and have been assigned a special status Their mam: 
the inventories of enterprises, at rates of G and 2 agers are for the most part rewarded » 
per cont respectively, to be paid into the state a bonus system based on profits; some of them 


rate of 5 per cent of the grows value of fixed assets they require to adapt the structure of thelr ovtpat 
and of working capital, ) to market demands. A few have been given pam 
‘The chief incentive to enterprises now consists mission to alter thelr sales prices. These experk 
in the part of “gross income” (equivalent to value ments, if they are successful, may pave the way i 
addod) left to them after deduction of capital comprehensive legislation covering the bulk of 
charges and of various other obligations to industry in the countries mentioned. 
sate. This residue is earmarked for decentralized Yugoslavia, The “Yugoslav model” has exerted 
Investments, contingency reserves, housing projects considerable influence on the character of the eam 
and other social amenities, and wages, salaries, and nomic reforms at present contemplated in tt 
bonus payments to the staff of the enterprise. experimenting countries, It may be observed that 
In the spirit of the new system, plans are to be the first steps taken toward decentralization i 


implemented via financial pressures and induce- Yugoslavia in 1950 and 1951 were also aimed at 


ments rather than by direct commands. Output relaxing the rigid system of distribution in Mgt 
targets and limits on the consumption of materials industry. Retail trade establishments, formenty 
and other inputs, however, may still be issued to bound by the plan to specific sources of sappii 
enterprises by central planning authorities. Al- were allowed to choose their suppliers among 
though it is not yet apparent how widely the cen- wholesalers and manufacturers 

tral organs will avail themselves of their preroga- These first reforms, in the case of Yugouarta, 
tives, there was considerable apprehension among were approximately coeval with the formatat, M 
the proponents of the reforms that this permissive July 1950, of the workers’ councils, which airas 
regulation might be used to reimpose the petty entailed a more radical departure from Soviet prem 
tutelage over enterprises that prevailed in former edent. (In Poland and Hungary, workers’ 
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which 
became known as the “debate over Polish in Poland by 1950.) The members of the Yagmiar 
fconomic motel.” The Uconomic Council, whose councils, which represent a litte ower J 
membenhip inchaded Poland’s most influential of the total number of employees in 
economists, heued a number of “theses” on the economy, are elected from among the 
seonmmnic model and on price-setting policy which the management staff of each enterprise. The miii 
proposed the extabtinbment of a partially decen- Prerogative of the workers’ council is to deserment 
tralied economic system. guided by prices shaped the share of the profits remaining at the 
under the indarno of supply and demand. How- of the enterprise to be distributed as hana 
ever, these heses were not approved by the Party extra income to the staff 
amd never frentved legislative sanction. The decen- plowed back as investment 
Od oot o ae ee al in 1059 à voice in appointing and 
Od WA gm as far as the reforms. directors. In general, it ts expected 
Moreover, these mesures were eroded by centrip- over-all orientation of the enterprise 
ttal forces bermeen 1900 and 196). The associa. Composition of its output, technologies 
Gone of enterprises, created in 1950 to replace the and price-setting decisions 
od omarai boards fermerty operated as functions! Prior to 1952° the infieence of 
Gepertmmenns of the ministries, gradually cessed to on output and price policy was 
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were willing to pay for borrowed funds; but this 
system could not be allowed to remain in effect 
under conditions where prevailing prices provided 
such an inadequate reflection, in terms of the plan- 
ners’ priority scale, of the relative importance to 
the national economy of the different sectors. 

In 1965, the Yugoslav economy, despite the in- 
troduction in recent years of various recentralizing 
measures taken in conjunction with the govern- 
ment’s efforts to subdue inflation, remained far 
more decentralized than any other east European 
economy, Nevertheless, a number of Yugoslavs 
concerned with economic affairs, including high- 
placed officials in Croatia and Slovenia, have voiced 
their support for further decentralization along 
market lines and for the elimination of some of the 
formal and informal controls hemming in the man- 
agement of socialized enterprises. Opposition to 
these views emanates from economists and officials 
concerned with the development of the less indus- 
trialized regions of the country, such as Macedonia, 
Montenegro, and Bosnia and Herzegovina, where 
socialized enterprises are not yet capable of with- 
standing competition from abroad or from the 
more advanced republics to the north and where, 
in the opinion of these “centralizers,” funds ought 
to be invested on a priority basis even if market 
criteria cannot be met, As in the rest of eastern 
Europe, divergent views on the goals of develop- 
ment policy and on the speed at which they might 
be attained underlie the conflicts among the pro- 
ponents of different institutional means for inte- 
grating the national economy. 
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[See also COMMUNISM, ECONOMIC ORGANIZATION OF; 
ECONOMIC DATA, article on THE SOVIET UNION AND 
EASTERN EUROPE; ECONOMIC THOUGHT, article on 
SOCIALIST THOUGHT.] 
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I 
DEVELOPMENT PLANNING 


This article confines itself to development aE 
ning in economies with substantial private pens 
prise sectors. In this context, a development P 2 
is a document which sets out the chief pee 
that the government intends to take in © pe 
raise national output per person. The typical ist g: 
opment plan will include most of the follo pore 
(1)a survey of current economic condition a 
cially national income, productivity, foreign 


plan- 
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and trends in each major industry; (2) a survey 
of the current social situation, especially popula- 
tion changes, education, health, housing, and social 
security; (3) an evaluation of progress achieved 
under the preceding plan; (4) a statement of 
general objectives of economic and social policy; 
(5) estimates of growth, or targets, for each major 
economic or social component during the period 
covered by the plan; (6) suggested measures de- 
signed to raise the rate of economic growth, espe- 
cially measures to stimulate saving and investment 
and to increase productivity, and measures to im- 
prove the institutional framework of economic 
activity, such as land reform or reorganization of 
the markets for commodities, labor, or capital; 
(7) a program of government expenditures, capital 
and recurrent. 

The plan normally looks several years ahead. 
Typical periods are three, four, five, and six years. 
Longer plans are sometimes made for 10, 15, or 
even 20 years ahead; these are sometimes called 
“perspective” plans. The shorter (three- to five-year) 
plans include statements of what the government 
intends to do. A 15- or 20-year plan cannot serve 
this purpose; its object is partly to try to identify 
long-term trends, as a guide to planning services 
which require a longer perspective, such as har- 
bors, hydroelectricity, roads, and education. A five- 
year period is too short for perspectives in such 
matters but is also too long for specific commit- 
ments to some others. Hence, governments nor- 
mally make a new capital budget every year, and 
this, rather than the plan, is the final commitment 
to expenditure. Similarly, statements relating to 
Policy are not final until embodied in legislation. 

The first documents resembling development 
Plans were published just after World War 11. Most 
Countries had established economic controls during 

e war, including rationing of some consumer 
goods, raw materials, or foreign exchange, and 
this experience gave to the idea of planning a de- 
Aa ey which it had not achieved 
a e war. In Europe the emphasis of these 
{bitha as on reconstruction, with equal attention 
aan cae and public sectors, but in the less 
plas r countries the prime concern of the early 
eo Sk oe government policy making, 
ah Stablish priorities for government expend- 
eee is nothing novel in surveys of current eco- 
tvs z) Sonal conditions; many governments 
evera a 7 such surveys from time to time and 
he fan do this annually, whether or not there 
in Bree plan. There is also nothing novel 

cing measures designed to raise the rate 
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of economic growth, What development planning 
introduced was a comprehensive review of public 
expenditures looking several years ahead, with pub- 
lication of a priority list. Before this concept was 
introduced, it was not usual for governments to 
review all their likely expenditures for more than 
a year ahead; reviews stretching five or ten years 
ahead were a novelty and were an improvement of 
government procedures. Despite much subsequent 
elaboration of planning techniques, medium-range 
projection of expenditures continues to be the main 
object of a development plan. 

With time, the emphasis of a development plan 
has tended to shift away from the review of govern- 
ment expenditures and toward other measures de- 
signed to raise the rate of economic growth. The 
general economic survey aspect and concern for 
the economy as a whole have increased. Some 
development plans are now devoted mainly to dis- 
cussing such subjects as industry, agriculture, 
mining, investment, saving, and the balance of 
payments, and a few give only a sketch of proposed 
government expenditures. 

The nature of a plan for industry or agriculture 
or mining depends, in the first place, on whether 
the sector is under public ownership and opera- 
tion, as in countries under communist control. 
Where public ownership and operation are in ef- 
fect, a plan acts as an authorization to managers 
of industrial plants, telling them how much capital 
they may invest, and even perhaps how much of 
each commodity to produce and how much labor 
and material to use. Exactly what such a plan 
must contain is now under active discussion in 
communist countries. 

If industry depends on private investment, a 
development plan can exert negative control but 
not positive control; i.e., it can prohibit investment 
in certain fields, but it cannot force persons to 
invest in other fields if they do not wish to do 
so. Hence, in the private sector, “plan” is not quite 
the right word for what a development plan tries 
to do; it is as much concerned to analyze, inform, 
persuade, and indicate inducements which the 
government offers as it is to describe the restric- 
tive controls with which the economy will be man- 
aged. 

How comprehensively the private sector is treat- 
ed varies from plan to plan. If the government in- 
tends to take the initiative in inducing private 
investors to follow a particular course, or in 
shaping industrial institutions (e.g., marketing), 
or in providing finance through credit institutions, 
the plan may contain an extended discussion of 
each major industry, indicating what the govern- 
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ment hopes will be the outcome. The closer the 
relations between government and commodity pro- 
duction, the more extensive must be the sections 
of the plan which deal with individual commod- 
ities. 

Beyond this, a development plan may make a 
projection, for the whole economy, of output, con- 
sumption, saving, investment, imports, and exports 
in every sector for, say, five years ahead. The 
primary purpose of such a projection is to provide 
a consistent background for policy. It ensures that 
assumptions about investment are consistent with 
assumptions about domestic savings, foreign bor- 
rowing, and the budget surplus; that ¢ach of these, 
in turn, is consistent with assumptions about in- 
come, taxes, government expenditure, and demand 
and supply in each major sector. The use of an 
input-output matrix is particularly helpful in econ- 
omies where industries use each others’ products 
as raw materials or equipment, since it is otherwise 
difficult to assess the full ramifications of an ex- 
pected change in demand for any particular com- 
modity (e.g., to trace the effects of an expected 
increase in the demand for bicycles on the steel, 
iron ore, rubber, coal, oil, chemical, and other in- 
dustries which produce components or components 
of components). A projection is not a plan; it is a 
background exercise which planners do to clarify 
the interrelationships in their thinking. It is worth 
doing in complex economies, where interrelation- 
ships are not easily held in the head, but is less 
useful and hardly necessary in small countries with 
simple economies. 

A forecast for the whole economy is also the 
basis of what is now called “indicative planning.” 
The argument here is that it helps investors in 
any one industry to know what is proposed in all 
other industries, since these others generate de- 
mand for its product. If all investors believe that 
the economy will grow only at an average annual 
rate of 3 per cent, they will keep their own invest- 
ment levels low, and so bring about a low rate of 
growth. If, on the contrary, they all expect an 
annual growth rate of 10 per cent, they will all 
make large investments to cope with this rapidly 
growing demand, and so the growth rate will be 
high. Consequently, if, after consultation with all 
concerned, the government issues a projection 
based on the highest rate of growth that can be 
achieved with the resources likely to be available, 
this “indication” of what is possible in each indus- 
try may itself serve to induce a higher level of 
private investment than would otherwise occur. 
France is the leading practitioner of indicative 
planning, which is gaining ground mainly in 


western Europe because the sophisticated relation. 
ship between government and private enterprise 
which the exercise requires is more common and 
the statistics more readily available in the advanced 
than in the poorer countries. 

Policy objectives. The elements which go into 
deciding what to put into a development plan can- 
not be described in a few words. The following sec- 
tion sketches briefly a few of the principal concerns 
of development planners. 

At low levels of development, ignorance is the 
principal obstacle to development: ignorance of 
physical resources, ignorance of markets, and 
ignorance of techniques. The first object of the 
plan must be to reduce ignorance. 

Development planners begin with surveys. The 
land should be mapped and surveyed for minerals, 
water, and fertile soils. Adequate staff must be 
provided for the departments of geology, hydrology, 
meteorology, survey, and soil survey. Equally im- 
portant is research into the utilization of these 
natural resources, especially agricultural research, 
and research into industrial uses of local products. 
Next comes economic, statistical, and market re- 
search. The planner’s first task is to see that all 
these fields of survey and research are covered, 
since it is these that will yield potential develop- 
ment projects. 

Development flows from applying skills to nat- 
ural resources. A policy for improving skills is 
therefore fundamental to planning. This covers all 
levels of training and education, including general 
(primary, secondary, university) as well as oes 
tional (trade schools, farm institutes, specialized 
institutions) education, institutional as well as on- 
the-job training, and adult education (especially 
agricultural extension) as well as schooling for the 
young. Since a poor country cannot afford all the 
education which is offered in rich countries, choles 
have to be made. It is possible by paso fects mi 
tional income to estimate roughly how many i 
there will be for persons with various kinds | 5 
special skills, and provision can be made for foe 
ing them. Projecting the effects of general be 
tion is more difficult and not, perhaps, $0 ha a 
since in most poor countries today the deon be 
places in school exceeds any figure that cou 
reached by trying to estimate the productivity 
general education expenditures. 3 

Infrastructure plays an important role in ae 

A A estment. 
ing (or its absence in repelling) new inv (rail 
Electric power, water, and communica oa T 
ways, roads, ports, telephones) often Eo adde 
as much capital as industry and agricu’tu ‘dvance 
together. Because of the need to build in a 
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of demand, such investments are especially risky 
in less developed countries. Thus, the governments 
of such countries play a larger role in providing 
jnfrastructure, or in guaranteeing investment in 
such undertakings, than is necessary in more de- 
veloped countries. 

The planner’s objective is to discover the most 

productive projects, public or private, and to ensure 
them priority. Productivity cannot be calculated 
simply in terms of current prices, since current 
prices do not necessarily reflect the true relative 
scarcity of resources or the true net benefit to the 
economy as a whole, and also may not sufficiently 
reflect future gains. A large part of the literature 
of development economics discusses the desirable 
criteria for investment; the substitution of “ac- 
counting prices” or “shadow prices” for current 
prices when calculating costs and benefits to the 
economy as a whole; and the assessment of benefits 
and costs deriving from the interdependence of in- 
vestments. Such calculations show that the set of 
projects which private investors would choose if 
seeking to maximize their profits is not necessarily 
the set which maximizes national income. Hence, 
if the government seeks to maximize national in- 
come, it must offer inducements to invest in some 
Projects and impose restrictions on investment in 
Others, Administering such inducements and re- 
Strictions is an important art, in which the poorer 
countries tend to be relatively unskilled. 
} Investment criteria include capital-intensity, that 
is, the ratio between the capital invested in a proj- 
ect and the number of persons whom it employs. 
In countries plagued with unemployment, open or 
disguised, labor-intensive techniques are preferable 
to capital-intensive techniques if the difference in 
Cost per unit of output is not very great; for lower 
Productivity per man may yield a larger total out- 
Put if more men are employed. 

Another consideration is the distribution of in- 
vestment among the geographical regions of the 
Country. It is desirable to concentrate investment 
ees growth potential is greatest, but this may 
lig see e modified in favor of reducing regional 
work fae in per capita income or of providing 

where unemployment is relatively high. 

ee an adequate infrastructure and adequate 
mane aa of feasible investments, the planners 
beige or induce entrepreneurs to invest. 
investi and inducement are seldom needed for 
ae oe s within the scope of small-scale enter- 
a eee os underdeveloped countries there is 
oeren of small businessmen to invest in 
Fite il or small industrial enterprises, such as 
S, printeries, trucking, bottling plants, 
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cinemas, and the like. What is lacking is experience 
in building and operating large factories or mines. 
Most governments are anxious to attract outside 
entrepreneurship for such undertakings and at the 
same time to arrange training programs in which 
their own people may acquire managerial skills. 

Most countries start developing by exporting 
agricultural products or minerals. The income thus 
generated creates demands which are met in the 
first instance by imports, but which also present 
an opportunity for domestic production of substi- 
tutes for imports. The possibilities of import sub- 
stitution are limited, however, partly by natural 
resources and partly by the economies of specializa- 
tion. When these limits are reached, further devel- 
opment will bring an increase in imports which 
has to be matched by an increase in exports. Devel- 
opment planners therefore need to watch the effect 
of development on imports and to keep the import 
propensity in balance with the export propensity, 
either by increased efficiency in import substitution 
or by increased efficiency in exporting. Failure here 
brings a balance of payments crisis and structural 
inflation. 

Failure of agricultural output to expand at an 
adequate rate is normally one of the main causes 
of slow economic growth or of structural inflation. 
Small farmers are quick to take up profitable crops 
and have produced large supplies of such commer- 
cial crops as cocoa, coffee, bananas, rubber, and cot- 
ton. Now that population is increasing more rapidly 
in the less developed than in the more developed 
countries, the challenge to farmers is, rather, to 
meet domestic needs by increasing the productivity 
of existing crops (especially domestic foods), and 
this seems harder to achieve. Measures to raise 
agricultural productivity (especially new breeds 
and varieties, use of fertilizers, disease and insect 
controls, irrigation, and new methods of processing 
and storing) are the core of any good development 
plan, since increases in agricultural output are 
needed to raise living standards, to provide an ex- 
panded market for industrial output, and to serve 
as a basis for higher savings, taxes, and earnings 
of foreign exchange. 

Before World War 11, governments of poor coun- 
tries spent mainly on day-to-day administration of 
law and order. Planning can easily double the re- 
current budget and add as much again in public 
expenditure on capital account, if the money can 
be raised, Public expenditure increases partly be- 
cause an infrastructure of services is required to 
support an increased output of agriculture or manu- 
factures, but it increases just as much because the 
public demands some services for their own sake, 
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irrespective of their effect on productivity (e.g., 
curative medicine, general education, social secu- 
rity benefits). How much to spend on public services 
is a political rather than an economic decision, in 
the sense that from the economic standpoint the 
public should have as much service as it is willing 
to pay for. 

When development planning began, most poor 
countries were spending less than 20 per cent of 
gross national income on gross capital formation 
and public expenditure together, whereas the rate 
of growth and standards of public service they now 
demand would require at least 30 per cent. Foreign 
investment and foreign aid contribute, but at the 
beginning of the 1960s the net inflow from these 
sources averaged only about 4 per cent of the 
poorer countries’ incomes; the main part of the cost 
must be met from domestic sources. Domestic 
private saving is low and has to be supplemented 
by public saving out of taxes and other revenues. 
Rapidly increasing demands for public expenditure 
and for public savings have forced equally rapid 
increases in taxation; in this sense, creating a good 
tax structure with adequate rates is fundamental 
to success on both the economic and the political 
levels. What development planning tries to ensure 
is that the moneys so raised will be spent intelli- 
gently, but this depends even more on political 
institutions and good sense than it does upon eco- 
nomic techniques. 

Development plans are full of figures, but policy 
matters more than figures. The public sector, to 
which most of the figures relate, is seldom as large 
as 20 per cent of a poor economy, and development 
cannot be ensured even by the most intelligent 
planning of the public sector. In the last analysis, 
economic growth depends on the willingness of 
thousands or millions of private citizens to make 
some change: to change their jobs, acquire new 
skills, try out new techniques, or invest in new 
resources. The secret of successful development is 
to provide a framework which induces people to 
make the best use of the Opportunities which exist 
in their economy. 

Effectiveness. Development plans vary widely 
in the extent to which they are taken seriously, A 
few are followed faithfully by the governments 
which have issued them, a few cease to be consult- 
ed within six months of publication, and the rest 
range between these extremes. The hazards of the 
plan derive from three sources: (1) lack of realism 
and commitment; (2) differences between those 
who make the plan and those who make economic 
decisions; (3) difficulties in forecasting. 

The original purpose of planning was to select 


priorities in government expenditure. However, a 
five-year plan is not an expenditure authorization; 
normally authorization occurs only in the annual 
budget. Hence, what a government actually author- 
izes in due course is not necessarily tied to what 
it has put into the plan; and it cannot be assumed 
that the plan is a true statement of what it intends 
to do, even at the moment when the plan is issued, 

If this is true of the public-sector part of the 
plan, it applies even more to the private sector, 
Nobody is committed by what the plan says about 
the private sector. Hence, those who prepare the 
plan are not constrained by having to put into ef- 
fect what they are saying. 

Why does a government issue a plan to which 
it does not intend to adhere? Such a plan usually 
predicts a larger achievement than is possible with 
the available resources. This may be done for one 
of two reasons, either as a political smoke screen 
or as a means of attracting larger resources. 

A development plan has something of the same 
Status as a party’s political program. It is not a 
final commitment (only the annual budget author- 
izes), so the government can make promises which 
it would like to carry out if it had the resources, 
without stopping to count the actual resources too 
closely at the moment of publication. Obviously a 
document published in this frame of mind does not 
deserve to be called a plan, since the essence of 
planning is matching needs against resources and 
determining priorities. k 

The second reason for publishing a plan in ex- 
cess of available resources is to attract more 
resources. A large gap between needs and resources 
may attract either internal or external resources. 
One problem of poor countries is that people have 
not learned to pay large sums in taxes. Their gov- 
ernments have in the past taxed only for eae 
tration, war, and personal enrichment, so it 18 
axiomatic that paying taxes is a useless burden. 
The modern welfare state, supplying schools, n ‘ 
pitals, agricultural education, fae 
roads, village water supplies, and so on, 1s 2 D 
phenomenon; the public is accustoming itse] n 
demanding these services from the governm T 
faster than it is accustoming itself to paying be a 
The strategy of making a plan in excess pees 
sources demonstrates to the public that it can es 
more of what it wants if it is prepared to eh: 5 
taxes. The government of the typical under share 
oped country has substantially increased the ti 
of taxes in the national income over the pas g. 
years, and this has certainly been aided by par ih 
tice of setting high targets for the public sec 
development plans. 
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The same strategy may work with administrators 
of foreign aid. These administrators look for gaps 
in a development plan which they think they can 
fill. A plan which showed resources equal to desired 
expenditures would leave no room for foreign aid. 
This problem can, however, be solved without vio- 
lating the chief purpose of planning, which is to 
determine priorities. A good development plan as- 
signs degrees of priority to the projects which it 
includes. Hence, there is always a list to use for 
drawing out more taxes or more foreign aid. 

Implementation. The plan may be well de- 
signed in every sense but yet may fail because it 
is made by people who cannot put it into effect. 
This is obvious in regard to the private sector; what 
the planners write may bear no relationship to 
what investors mean to do. But it applies also to 
the public sector. A government is not monolithic; 
power to make decisions is diffused among many 
ministries and departments. Many a “good” plan 
fails because it is ignored by the very ministries 
which are supposed to implement it. Hence, the 
machinery for implementing the plan is of crucial 
importance, 

If the plan is to have a good chance of being 
implemented, the agency which makes the plan 
must meet three tests: (1) it must have the sup- 
port of the head of the government; (2) it must 
allow all the leading decision makers in the econ- 
omy to participate in drawing up the plan; and 
(3) it must control crucial decisions at the stage 
of implementation. 

Governments usually establish a special agency 
for making a development plan. Its size varies ac- 
cording to the size of the country, the complexity 
of the plan, how the preparatory work is shared 
with other government agencies and departments, 
and the extent of the planning agency's responsi- 
bility for implementation, Normally ministries and 
departments prepare their own proposals and have 
Preliminary survey and estimation done by their 
own engineering and other experts; but if the 
qcPartments are weak in these techniques, a good 
ie of Preparatory technical work will fall on the 
bs ney, which must then have its own architects, 
4 Sineers, accountants, and other experts. The 
poe should also be able to rely on the statistical 
Ribs the government’s economic advisory 

» and the development corporations for a 
ae of the statistical and economic material 
aes pai may have to supplement this toa 

e a esser extent with its own investigations. 
CT a no formula for dividing the preparatory 
ies ween the planning agency and other gov- 

agencies, other than that the planning 
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agency will find itself having to do such necessary 
work as the others fail to do. Similarly, implemen- 
tation depends to some extent on the competence 
of the other agencies, There are cases where the 
planning agency has become the sole authority for 
executing all public works, cases where it merely 
records how implementation is proceeding, and 
extreme cases where it has nothing whatsoever to 
do with implementation. 

Since the plan accords priorities to ministerial 
proposals, rejecting many outright, the planners 
must have authority vis-à-vis the ministries. Typi- 
cally the planning agency is responsible to the 
prime minister or president, and its chief execu- 
tive officer is frequently a well-known administra- 
tor brought in from outside the civil service and 
paid above the civil service rates. Unless the 
agency's head has the confidence and support of 
the head of the government, the agency will have 
no effective influence on what goes into the plan. 

This is not enough, however. The planning 
agency needs the support not only of the head of 
the government but also of his strongest ministers. 
A typical device is to have a special subcommittee 
of the cabinet for “development plan affairs” which 
includes the minister of finance and the chief min- 
isters for industry and agriculture, However, the 
problem is not confined to ministers. Department 
heads and civil servants are also powerful and can 
sabotage a plan if they do not agree with it. The 
only way to have fair certainty that the plan will 
be effective is to bring into the planning process 
all the leading persons in the government appara- 
tus, ministerial and administrative, who have the 
power to make important decisions, and produce a 
plan which is, as far as practicable, acceptable to 
them. 

The same goes for the part of the plan relating 
to the private sector. A plan for private investment 
in which the private investors have had no say is 
not likely to have much effect. Hence, persons 
knowledgeable and influential in all the industries 
where substantial change is desired must be con- 
sulted in the making of the plan. 

The machinery for consultation varies in struc- 
ture and formality. In some small countries all 
the chief decision makers can sit around the same 
table, whereas large countries need a network of 
planning committees including officials and private 
citizens. Typically, not enough opportunity is given 
for consultation. The planning agency makes a 
plan which is unacceptable to important decision 
makers outside its circle, and they simply ignore it. 
This is especially apt to happen in countries with 
a federal structure, where the plan covers many 
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functions which are state or provincial responsi- 
bilities without adequate assurance that state and 
provincial authorities are willing to play the role 
which the plan assigns to them. 

If the plan is to be effective, the planning agency 
must have some control over its implementation. 
In the public sector, implementation is governed 
by the annual capital budget, which is the au- 
thority for expenditure. The planning agency is 
in the strongest position when the making of the 
annual capital budget is assigned to it rather than 
to the authority which makes the annual recurrent 
budget (the ministry of finance or the budget 
bureau). This division brings its own problems. 
Since capital expenditure brings recurrent expend- 
iture, the planning agency must consult the budget 
bureau when making the capital budget, and the 
budget bureau must consult the planning agency 
when making the recurrent budget. If, as is some- 
times the case, the planning agency has no control 
over the annual capital budget, the plan is soon 
forgotten, and actual expenditure may bear little 
relation to the plan forecast. 

The part of the plan relating to the private sector 
is not so easily controlled by a single agency. Im- 
plementation may involve the issue of licenses (for 
building, imports, or investment), tariffs, subsidies, 
recommended legislation, and many other kinds 
of action, vested in half a dozen ministries. The 
planning agency should be represented on all com- 
mittees which make decisions involved in imple- 
menting the private sector of the plan, though it 
cannot hope to have a controlling voice, 

Implementation does not depend entirely on 
machinery. The most important prerequisite is a 
realistic plan, If the plan makes assumptions which 
are far from reality, civil servants and private 
persons are forced, willy-nilly, to ignore it when 
making important decisions. 

However realistic the plan may be at the time 
it is made, its influence will decline unless it is con- 
tinually revised. Events turn out differently from 
what was expected; new physical resources are dis- 
covered; the financial situation changes; and so on. 
With the passage of time the relevance of the pub- 
lished document must inevitably decline. Hence, 
the plan must be flexible. The private-sector part 
of the plan is necessarily flexible. If investors turn 
up with good schemes not previously considered, 
they will not be turned away; and if other investors 
decide to abandon proposals which were highly 
regarded at the time of making the plan, they can- 
not be forced to continue, Usually the government's 
concern is only to ensure a high level of private 
investment; whether this is in one project or an- 
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other is likely to be of secondary importance, 
except in industries considered (rightly or wrong- 
ly) to be “strategic” in some sense. Not all govern- 
ments see the situation in this light; some believe 
that their officials’ assessment of investments is 
better than the private investor’s and try to induce 
or coerce the private investor to keep in line with 
the published plan; they have only limited success, 

The public-sector part of the plan is revised every 
year for the annual budget. Whether conditions 
have changed or not, the government is continu- 
ously besieged by ministers trying to bring forward 
schemes which are not in the plan, some that 
were previously rejected, some that are new. As 
time passes, the published plan must become less 
relevant. Some countries wil] openly abandon a 
plan two or three years before it is due to expire, 
and issue a new one in its place; others establish 
new priorities without formally abandoning the 
plan; still others abandon both the plan and any 
consistent priorities, 

A plan must not be considered to have failed 
merely because things turned out differently. Its 
value lies in organizing consistent thinking at the 
time it is made. Constant revision is as important 
a part of planning as is the process of making the 
original document. The fact that the economy will 
not turn out as forecast is not an argument against 
planning. Decisions cannot all be postponed be- 
cause we cannot forecast the future correctly; plan- 
ning helps to clarify what we expect, to reveal its 
inconsistencies, and therefore to improve the proc- 


ess of making decisions. 
W. ARTHUR LEWIS 
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I 
INTRODUCTION 


Social planning involves the drawing up of plans 
for future action in regard to social institutions and 
resources. A “social” plan is designed to meet the 
needs of a society, which means, in many cases, an 
entire nation. This usage, in which social planning 
is equivalent to societal planning, is generally 
accepted by social scientists (see, for example, 
Myrdal 1959); but social planning is sometimes 
also used to mean planning by a group as opposed 
to planning by an individual. Also social planning 
is sometimes viewed as complementary to, rather 
than inclusive of, economic planning; in that case, 
social planning means the planning of a society's 
noneconomic activities. 

Recent history. In the present century, and 
especially since World War 1m, there has been a 
rapid growth in the importance of social planning, 
both as an idea and as an institutional complex. 
Although the index to the Encyclopaedia of the So- 
cial Sciences does not refer to social planning 
as such, its article entitled “National Economic 
Planning” concludes: “. . . the direction of modern 
Society would seem to be toward a planned econ- 
omy” (Lederer 1933, vol. 11, p. 205). In spite 
of Controversy, often at a high level of intellectual 
abstraction, on whether so comprehensive a con- 
oe n planning is either possible or desirable, the 
a oł social planning appears to have been insti- 

“onalized over the greater part of the modern 
Ba and in the programs of international organi- 
Ei a so that, whatever its logical and philosoph- 
th cee it has achieved de facto recognition. 
caren may be compared to the ideas of progress, 
ian fe and welfare, stemming from the same 
Sa end of modern life toward industrialization 

technical rationality. 
ie the earlier part of the industrial era the 
e a cology of laissez-faire was in the ascend- 
+ Governments did indeed make social and eco- 


PLANNING, SOCIAL: Introduction 125 


nomic interventions, but on the whole it was only 
old-fashioned autocrats and certain socialist minor- 
ities who thought in more total terms. Military 
general staffs were, in the pre-1914 period, the 
bodies most likely to draw up “plans.” Since modern 
warfare depends on industry, military planners 
could not ignore industrial and civil questions. 

World War 1 involved a far more comprehensive 
mobilization by the major belligerent governments 
of all their resources, economic and social as well 
as military, than had ever before been contem- 
plated. National emergency brought home the im- 
portance of, for instance, national levels of health 
and education to a total war effort. In some coun- 
tries the war was followed by revolution, and in 
all the major European countries involved there 
was a recognition, at all political levels, of the 
need for social and economic reform. The idea of 
social planning, and especially of national eco- 
nomic planning, emerged into full daylight during 
the interwar years. The problem of the business 
cycle, and unemployment, strengthened this tend- 
ency and led to increased state intervention and 
planning even in the United States, where, how- 
ever, faith in individual private enterprise remained 
strong. 

World War 1 repeated and intensified the effects 
of World War 1. Not only was there total planning 
for the war itself, but plans for postwar reconstruc- 
tion began to materialize, in Great Britain and some 
other countries, almost as soon as the war started. 
After the war, international organizations multi- 
plied, and social planning became international. 
Moreover, by the mid-1960s nearly all colonial 
territories had gained independence, and a large 
group of new nations had started programs of na- 
tional development, in which the idea of a “plan” 
was almost de rigueur. 


Scope and theory 

As early as 1841, Friedrich List had written: 
“Tt is the task of politics to civilise the barbarous 
nationalities, to make the small and weak ones 
great and strong, but, above all, to secure to them 
existence and continuance. It is the task of national 
economy to accomplish the economical develop- 
ment of the nation, and to prepare it for admission 
into the universal society of the future” (1841, 
p. 142 in 1904 edition). While his doctrines served 
the purposes of militaristic nationalism in Germany 
after Bismarck and in Japan after the Meiji resto- 
ration, they also contributed, if indirectly, to Soviet 
Russian state planning. “Historically, Friedrich List 
preceded Marx as the father of the theory of plan- 
ning; Rathenau, who organized the first modern 
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planned economy in the Germany of the first world 
war, preceded Lenin, whose approach to the prob- 
lem of planning in Soviet Russia was consciously 
based on the German precedents” (Carr 1951- 
1953, vol. 2, p. 363). List also emphasized the 
distinction between present and future advantage 
from the national standpoint. He not only justified 
protective tariffs for young industries but stressed 
the importance of developing “immaterial” produc- 
tive forces, such as technical knowledge and skill. 
Thus even during this early period it was seen that 
while the core of social planning consists in a plan 
for balanced capital investment in industry and 
agriculture, national plans must also allocate re- 
sources for education, health, housing, and other 
kinds of social investment. 

Planning and social values. Decisions on prior- 
ities to be adopted cannot simply be based on 
economic calculations; essentially they are political 
decisions based on social values. The decisions 
embodied in a plan are made for political reasons; 
they also have long-term political effects (Diesing 
1962, p. 231). Although the models of development 
currently used by national planners are economic 
and based on quantitative analysis, it can be ar- 
gued that only when the economic model has been 
transformed into a general social model can policy 
conclusions logically be inferred from it (Myrdal 
1959, p. 166). 

It is relatively easy to decide that capital should 
be invested, for instance, in a steel plant with a 
given capacity and location. An economic decision, 
such as the siting of an industry, can to some 
extent control the direction of social trends, such 
as urbanization and population movement. But 
even in societies with comprehensive state plan- 
ning, it has not been possible to achieve full control 
over internal migration, let alone over demographic 
growth. The noneconomic part of social planning 
must at the present time largely consist in attempt- 
ing to foresee some of the ways in which economic 
decisions will affect social behavior and some of 
the ways in which social behavior will impede or 
facilitate the implementation of the economic deci- 
sions. Thus, social planners must take logical ac- 
count of nonlogical motivation (Pareto [1916] 1963, 
vol. 3, chapter 11), “Radical planners have to cope 
with the paradoxical situation that the success of 
their intellectual labor, which is an achievement 
of reason and logic, is best secured if the indispen- 
sable consensus rests on a non-logical basis” 
(Speier 1937, p. 476). 

How rational can planning be? But these diffi- 
culties are not new. They are inherent in govern- 


ment and in the framing and execution of policies 
and programs generally. In what, then, does the 
added problem of the social planner consist? It 
would seem to stem from the more ambitious at- 
tempt of the planner to arrive at decisions ration- 
ally. By comparison with earlier procedures, this 
attempt involves more systematic fact-finding be- 
fore the plan is formulated, more systematic coor- 
dination of separate decisions and policies, and 
more explicit formulation and phasing of objec- 
tives. Another feature of planned, as against un- 
planned, national development is the greater stress 
on evaluation of results achieved and on objective 
measurements of success or failure. In a nutshell, 
the problem of social planning is how to insure that 
it at least approximates to its rational intentions— 
a problem that falls fairly and squarely in the lap 
of the social sciences. 

Social planning as ideology: pro and con. The 
idea of comprehensive social planning has been 
criticized as a pseudo-scientific or “scientistic” delu- 
sion stemming from Comtean positivism, which in 
turn is seen as stemming from the Ecole Polytech- 
nique instituted in Paris after the French Revolu- 
tion. The desire to apply engineering technique to 
the solution of social problems is seen as a dan- 
gerous aberration (Hayek et al. 1935, p. 210; 
Hayek 1944 passim; 1952, pp. 94, 105), “All such 
general plans of social reconstruction are merely 
the rationalization of the will to power. For that 
reason they are the subjective beginnings of fanat- 
icism and tyranny” (Lippmann 1937, p. 365). 

Such strictures were a reaction not so much 
against Bolshevik or Nazi total planning (though 
these of course were conspicuous features of the 
interwar political scene) as against the idea that 
liberal democracies could, and indeed should, seek 
to avoid maladjustment (socioeconomic and psy- 
chological) by democratic planning (Zweig 19%% 
Mannheim 1935; 1950). Early in the 1930s Pa 
Mannheim propounded a new stage of mouen 
the level of planning,” in which the balance of = 
planned and unplanned activities of society m 
about to be decisively tipped in the direction , 
the former. In this view, liberalism appeared ai 
transitional phase between two forms of pe 
order” of which the earlier was that of ne 
Christendom, while the new phase was the ie y 
of “the growth of a coherent and co-ordinate Le 
tem of social techniques” within the moderi 
tional state (Mannheim [1935] 1940, pP: °° 
362). ; e 

Although freedom and democracy, in Eo 
these words have in the liberal West, were un 


edly highly valued by Mannheim, his penchant for 
portentous phraseology laid him open to charges 
of “historicism” (a belief in inevitable laws of 
historical development) and “holism” (a belief that 
social reconstruction should be all-embracing). 
Karl Popper sharply attacked Mannheim’s concept 
of planning as utopian and totalitarian; instead, 
Popper advocated piecemeal social engineering, in 
which social arrangements are changed “by small 
adjustments and readjustments” rather than by 
trying to redesign society as a whole. However, he 
did not exclude the possibility “that a series of 
piecemeal reforms might be inspired by one general 
tendency, for example, a tendency towards a greater 
equalization of incomes. In this way, piecemeal 
methods may lead to changes in what is usually 
called the ‘class structure of society” (Popper 
1957, pp. 66, 68). 

While a Popperian social planner might fail to 
see the wood for the trees and a Mannheimian 
might lose his way in the wood, it is doubtful that 
we really have to choose between these two ap- 
proaches. The controversy, begun in the 1930s and 
concluded in the 1950s, seems increasingly unreal. 
Without wishing to suggest a unilinear trend, one 
cannot fail to observe the continued increase in 
the number of plans and planning agencies, both 
in advanced industrial and in underdeveloped coun- 
tries. There has, however, been no corresponding 
development of a general theory of social planning. 
For hints of such a theory one must look at com- 
parative cross-national studies by economists, his- 
torians, political scientists, and sociologists (Shils 
1963; see also Duveau 1954; Bettelheim 1959; 
Ossowski 1959). 

In the rich countries planning takes on a less 
Comprehensive and programmatic character than 
in the poor countries, which are compelled to at- 
tempt what in the light of the history of the rich 
Countries appears as a short cut (Myrdal 1959, 
P. 159). In some underdeveloped countries (for 
example, India) social planning is combined with 
$ democratic political system of free elections 

ased on universal suffrage. In others (for exam- 

po China) there is planning along with what has 
; a described as a “mobilization system,” in which 
BaT of development are high and possibly 
ann istic and stress is placed on militancy and 
Sige. organization (Apter 1963a). The more 
is itious the plan, the more necessary it will seem 
ae upengte consensus by fostering a sense of emer- 
(S Sp in the face of external and internal danger 
‘ ones 1937). If the plan succeeds, perhaps this 
sphere of “political religion” can be relaxed 
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(Apter 1963b). It may be surmised that “total” and 
“piecemeal” theories are the poles between which 
actual ideologies of social planning vary. 


Practice and organization 


Historical priority for comprehensive national 
planning in peacetime must go to the Soviet Union. 
After initial hesitations, planning machinery was 
effectively established by 1925 (Carr 1958-1964, 
vol. 2, p. 490); the first five-year plan started in 
1928. In Germany, the first Nazi four-year plan 
started in 1933. Fascist Italy launched a plan in 
the same year, which also saw the start of the 
Roosevelt administration’s scheme for the develop- 
ment of the Tennessee Valley in the United States, 
while in 1934 the Turkish government adopted a 
five-year plan for the industrialization of the coun- 
try (Zweig 1942), and there was published in 
India, not yet independent, a pioneering first pro- 
gram of planned development (Visvesvaraya 1934). 

In 1950, soon after independence, the Indian 
government set up its Planning Commission. In- 
dia’s first five-year plan covered the period 1951- 
1956. This was probably the first instance of so 
influential a planning body being set up within 
the framework of a parliamentary democracy, and 
its constitutional status has been criticized (Chanda 
1958, p. 92; Jagota 1963). Ghana has had three 
five-year development plans, the first dating from 
1951, when it was still a British colony (Gold 
Coast); the second and third plans were instituted 
after the country became independent in 1957. 
Since 1950 both the number of newly independent 
nations and the number of national economic plans 
have been increasing rapidly. In 1964 there were 
about 125 nations, of which some 93 were currently 
defined as “developing.” Nearly all these had devel- 
opment plans (Moyes & Hayter 1964). 

Wide variations in political context and in the 
capacities of nations to provide expert staff for 
planning bodies must be borne in mind, but the 
trend toward the institutionalization of social plan- 
ning is unmistakable. Comparative study by social 
scientists and historians of this new wave of na- 
tional plans has barely begun but must surely in 
coming decades become a leading concern of 
macrosociology. 

The example of India. The Indian govern- 
ment’s Planning Commission has nine members, 
drawn from the central political leadership and 
including the prime minister, the finance minister, 
the defense minister, and the minister for plan- 
ning. It is assisted by a National Development 
Council (which includes the chief ministers of the 
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states) and has a secretariat of 369, excluding the 
240 attached to the Program Evaluation Organiza- 
tion and the Community Projects Administration 
(Indian Institute of Public Administration, New 
Delhi, 1958). The staff includes economists and 
other social scientists and promotes relevant re- 
search projects in the universities and research 
institutes. It is also assisted by numerous foreign 
advisers, either on a bilateral or an international 
basis. 

In broad terms, the members, staff, and advisers 
of the Planning Commission constitute the intel- 
lectual, theoretical, perspective-seeking wing of the 
political elite. Their function of preparing middle- 
term and long-term plans serves to remove many 
of them from the cruder pressures of everyday 
political existence. There is thus an institutional 
differentiation between long-term and short-term 
politics. Can the long-term prevail over the short- 
term? What is the interaction between the two? 
These are important questions for future empirical 
investigation. 

Planning as propaganda. National plans are 
“approved” by parliaments and other national 
bodies. To this extent they are statutory, though 
in practice their standing is somewhere between a 
legal prescription and a political aspiration. “Con- 
sensus is the key phenomenon of macrosociology” 
(Shils 1963, p. 23). In 1954 the Indian Planning 
Commission requested state governments to ar- 
range for the preparation of district and village 
plans, especially in relation to agricultural produc- 
tion, rural industries, and cooperation (India [Re- 
public], Planning Commission, 1956). In some 
states not only were village plans requested but 
each family was asked to prepare its plan. It is 
hardly surprising that this kind of “planning” re- 
mained a device of political Propaganda and even 
as such had little impact. Nonetheless the propa- 
gandistic significance of the national plan, and its 
place in the ritual of “political religion,” are not to 
be ignored. The function of a plan is not only to 
allocate national resources but to mobilize national 
effort. 


CHARLES MADGE 


[Directly related are the entries PLANNING, ECONOMIC; 
POPULATION, article on POPULATION POLICIES. Other 
relevant material may be found in Foon, article on 
WORLD PROBLEMs; Housinec; Pusric HEALTH; and 
in the biographies of List; MANNHEIM. ] 
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Il 
REGIONAL AND URBAN PLANNING 


In its generic sense, planning is a method of 
decision making that proposes or identifies goals 
or ends, determines the means or programs which 
achieve or are thought to achieve these ends, and 
does so by the application of analytical techniques 
to discover the fit between ends and means and 
the consequences of implementing alternative ends 
and means. Urban (or city) and metropolitan plan- 
ning apply this method to determine public invest- 
ment and other policies regarding future growth 
and change by municipalities and metropolitan 
areas, 

City planning has existed ever since man began 
to build towns and to make decisions about their 
future, In most societies, but particularly in the 
United States, there has been little consensus about 
oe decisions. The diverse classes, ethnic groups, 
te interest groups that live in the city have dif- 
: ee conceptions of how the city ought to grow 
nd change and of who should benefit from the 
Policy and allocation decisions. Consequently, these 
nee have attempted, directly or indirectly, to 
T ence the ends, means, and techniques of plan- 
lo z oe even the role of the planners. A socio- 
Ae shea ie of American city planning must 
are. 10 plans with what ends and means for 
nee interest group. Since the variables in this 
Ene gm are affected by changes in the population 
a Power structure of the American city, the 

ysis is best carried out historically. 
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Planning as civic reform 


American city planning can be said to have 
begun with the laying out of the infant country’s 
first cities, usually by engineers who mapped grid 
schemes of rectangular blocks and lots, largely for 
the benefit of land sellers and builders (Blumenfeld 
1949). Most American cities came into being with- 
out prior planning, however, and with only spo- 
radic attempts to regulate their growth. By the 
middle of the nineteenth century, the provision of 
utilities and other municipal services could not 
keep up with the rapid increase in population, and 
the cities became overcrowded and congested, with 
vast slums in which epidemics, unchecked crime, 
and political corruption were commonplace. 
Shortly before the Civil War, these conditions 
stimulated the formation of a number of civic 
reform movements, which were the forebears of 
contemporary city planning. 

The reform groups were made up of predom- 
inantly Protestant and upper-middle-class civic 
and religious leaders whose major end was the 
restoration of order. They sought physical order 
through slum clearance and the construction of 
model tenements to improve housing conditions 
and through the park and playground movement, 
which tried to preserve the supposedly health- 
giving features of the countryside by building 
parks and other recreational facilities in the 
crowded areas. They sought social order through 
the erection of educational and character-building 
facilities such as schools, libraries, and settlement 
houses, hoping that these would Americanize the 
immigrants and make them middle class, and thus 
eradicate crime, vice, and even the harmless forms 
of lower-class hedonism. They promoted political 
order through the “good government” movement, 
which advocated nonpolitical methods of urban 
decision making to eliminate the new political 
machines and the fledgling socialist movement. 
They were attempting to maintain the cultural and 
political power they had held before the arrival of 
the immigrants by imposing on the city the physi- 
cal and social structure of Protestant, middle-class, 
preindustrial America. 

The means by which they proposed to achieve 
these ends included new legislation to regulate and 
control city growth; the use of public administra- 
tion and, later, scientific management procedures 
to run the city; and the establishment of “facilities,” 
such as parks and settlement houses, which would 
improve living conditions and alter the behavior 
of their users. After the reformers had built a few 
model facilities with private funds in the slums of 
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several eastern cities, they realized that they lacked 
the resources to alter the entire city by this ap- 
proach, and so they began to use their considerable 
social status and what remained of their political 
influence to propose that the cities take over their 
programs as municipal functions. 

The reformers’ efforts were supported by archi- 
tects, who had created the City Beautiful movement 
during the 1890s to develop park, civic center, 
and other schemes to enhance the city, and by 
downtown business and property interests, who 
sought higher land values and also favored effi- 
ciency in government to minimize taxes. Others, 
especially upper-middle-class homeowners, were 
disturbed by the invasion of commercial and indus- 
trial establishments, low-status residents, and 
slums into previously “good” neighborhoods. These 
groups called on the city to pass zoning legislation, 
which would prohibit such invasions by regulating 
the kinds of buildings and land uses permitted in 
different zones, Zoning not only encouraged the 
Kind of order, beauty, and efficiency sought by its 
advocates but also segregated land use and popu- 
lation by class. 


The rise and fall of the “master plan” 


By the end of World War 1, planning and zoning 
had become municipal responsibilities. Since their 
advocates usually opposed the political machines, 
these functions were incorporated in quasi-inde- 
pendent city planning and zoning commissions, 
headed by lay boards of civic leaders and business- 
men. These commissions were staffed principally 
by civil engineers and architects, who were called 
city planners. The new agencies and professionals 
codified and further operationalized the ends and 
means they had inherited from the reformers, prin- 
cipally through the “master plan,” which proposed 
both the traditional solutions and some new ones 
in a single comprehensive scheme. 

The typical master plan, which has changed 
relatively little since the first one was drafted in 
1914, portrays a future ideal: a city without slums, 
divided into zones for each major land use, with 
efficient highway and mass transit systems, vastly 
increased amounts of open recreational space and 
other public cultural facilities, and served by a 
system of neighborhood, district, and downtown 
retail and civic centers. The proposals for new fa- 
cilities and rearranged Jand-use and transportation 
patterns are synthesized into a master plan map, 
with proposals for implementing this map through 
a Zoning ordinance to order land use as prescribed 
by the plan, building codes to discourage slums, 
subdivision regulations to guide the development 
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of vacant land, and governmental reorganization 
schemes to coordinate the proposed changes (Bas- 
sett 1938; Dunham 1958; Kent 1964), 

No master plan has ever become a blueprint for 
the growth of the city, although individual recom. 
mendations have often been implemented. Perhaps 
the main reason for the failure of the master plan 
was its assumption of environmental or physical 
determinism. Like the nineteenth-century reform- 
ers, the master planners assumed that people's 
lives are shaped by their physical surroundings and 
that the ideal city could be realized by the provision 
of an ideal physical environment. Since they were 
architects and engineers, they believed the city was 
a system of buildings and land uses that could be 
arranged and rearranged through planning, with- 
out taking account of the social, economic, and 
political structures and processes that determine 
people's behavior, including their use of land. 

The ends underlying the planners’ physical ap- 
proach reflected their Protestant middle-class view 
of city life. As a result, the master plan tried to 
eliminate as “blighting influences” many of the 
land uses and institutions of lower-class and ethnic 
groups. Most of the plans either made no provision 
for tenements, rooming houses, second-hand stores, 
and marginal loft industry, or located them in 
catch-all zones of “nuisance uses,” in which all 
land uses were permitted. Popular facilities that 
they considered morally or culturally undesirable 
were also excluded. The plans called for many 
parks and playgrounds but left out the movie 
theater, the neighborhood tavern, and the club- 
room; they proposed churches and museums but 
no night clubs or hot dog stands (Wood et al. 
1966). 3 

The units into which the plan divided the city 
were determined by transportation routes and other 
physical criteria, and did not reflect established 
social groups or ecological conditions of change 
and growth. Neighborhood boundaries ignored 
class and ethnic divisions in the population, an 
the planners made a conscious effort to break uP 
ethnic enclaves in order to achieve nineteen + 
century Americanization goals. The planners ee 
tainty about how people ought to live and how 
city ought to look resulted in a nearly static plan, 
a Platonic vision of the city as an orderly as 
finished work of art. The only land uses a 
grammed for future growth were those favore 
middle-class residents, high-status industrial ie 
commercial establishments, the real estate intere 
catering to both, and the tax collector. ee 

The planners’ unwillingness to recognize he 
native values also made them unable to se 
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role of politics in implementing the plan. Believing 
that their solution was the best blueprint for the 
future, its formulators thought that they needed 
only to publish their report, obtain support from 
the civic leaders and businessmen who sat on the 
boards of the planning commission, and then per- 
suade elected officials that the plan expressed the 
public interest. The planners’ opposition to partisan 
political methods of decision making convinced 
them that the plan was “above politics” and that 
anyone who rejected it was acting from selfish, 
and therefore evil, motives. 

The master planners did not realize that the 
ends they sought were opposed by many voters, 
that most city residents place less value on parks 
and open space than planning ideology dictated, 
and that most residents do not center their life on 
the elementary school, as the neighborhood unit 
plan proposed. Worse yet, the plan advocated a 
middle-class life style for all, but it did not recom- 
mend economic programs to enable low-income 
people to move out of tenements and buy single- 
family houses. As a result, master plans have rarely 
generated any widespread enthusiasm among the 
voters but have always aroused considerable polit- 
ical opposition from the groups who would have 
to pay economically, socially, and politically for 
the proposed changes. 

The master planners were also hampered by 
conditions not of their making. For one thing, their 
authority to plan stopped at the city limits, although 
the growth processes which they sought to control 
Cover a much wider area, Also, the planners were 
poorly funded, so that their conclusions were often 
based on whatever data, however inadequate, were 
already available. Yet their methodology was ulti- 
mately also a function of their own ends, means, 
and techniques. Their belief in physical determi- 
nism limited their analysis to the determination of 
the land-use implications of their demographic and 
economic projections, and their faith in the solu- 
tions they proposed discouraged consideration of 
the fit between ends and means and of the conse- 
quences of these means (Meyerson & Banfield 
1955; Altshuler 1965). 

_Yet although master plans had little impact on 
city development, some of the planners’ ideas were 
adopted. For example, their arguments that exces- 
sive land coverage inevitably produced traffic con- 
ae that off-street parking was necessary, and 
the TEH too much land for commercial use in 
fieh LE of increasing tax income would also leave 
ae, and lying needlessly unused were eventually 
an Pted in principle by city officials. Being politi- 

Y weak, however, the planners could not stand 
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up to the powerful economic interests who opposed 
the implementation of these principles, and being 
apolitical in their approach, they could not develop 
compromise solutions, which would have salvaged 
at least some of their proposals. 

Master planning activity reached a new high 
after World War u, especially in areas of rapid 
suburbanization. At the same time, however, plan- 
ners working in the cities began to lose confidence 
in the master plan. As cities began to grow and 
rebuild again, many of the development proposals 
that came before the planning commission con- 
flicted with the master plan, and the conflict was 
usually resolved in favor of the new proposals. 
Some planners argued that, in order to be useful, 
the master plan had to be constantly revised and 
updated, but others began to suggest that planning 
was not only a method but also a process of deci- 
sion making and that master planning was only 
one tool among many to be employed in this 
process. 


The emergence of rational planning 


Since 1950 the city has increasingly become the 
residence of a small number of rich people and a 
rapidly rising number of poor nonwhites. The 
latter are forced to live in ghetto slums, and these 
slums, as well as the pathologies associated with 
poverty, are reducing the livability of the city for 
the middle class and creating new problems for 
city officials. Low-income neighborhoods and resi- 
dents cannot pay the taxes needed to provide them 
with municipal services, and the exodus of middle- 
class residents, stores, and industries to the suburbs 
is also depriving the city of its most profitable 
sources of tax income. Some help is now coming 
from the federal government, which is providing 
financial support for new city programs. City plan- 
ning has been funded by federal subsidies as well, 
particularly for transportation and urban renewal 
plans. 

Transport and metropolitan planners. Master 
planners have concerned themselves with transpor- 
tation ever since car ownership began to increase 
urban congestion and suburbanization. After World 
War 1m, however, the new spurt in car buying and 
the growth of suburbia resulted in the building of 
expressways, subsidized by federal grants, in every 
American city. These made it more convenient to 
use cars and thus encouraged the exodus to sub- 
urbia even more. As a result, they not only failed 
to reduce congestion but also took more customers 
away from the already declining mass transit sys- 
tems. Consequently, by the late 1950s a number of 
large cities began to formulate massive transporta- 
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tion planning studies, which aimed to determine 
the location of future expressways and to revitalize 
mass transit as part of a metropolitan-area trans- 
portation system serving the city and its suburbs. 
Since new transportation facilities stimulate fur- 
ther urban development and thus affect land-use 
patterns and the economic base of the cities 
(Mitchell & Rapkin 1954), the transportation plan- 
ners were in effect formulating a new kind of mas- 
ter plan (Fagin 1963). 

In this process, they introduced several innova- 
tions into planning. Well financed and able to 
measure the amount and flow of traffic with relative 
ease, the planners obtained large masses of data 
and brought in the newly available computer to 
analyze them. Moreover, since the aim of their plan 
was limited to serving the transportation needs of 
the area and the achievement of the economically 
most productive and most efficient land-use pat- 
terns, they were working with a smaller number 
of ends than were the master planners, This en- 
abled them to formulate a number of alternative 
schemes, rather than a single one, using the com- 
puter and later the simulation model to choose 
a final plan from among the alternatives (“Land 
Use...” 1959). Because the studies were staffed 
by transportation experts, economists, and opera- 
tions researchers rather than by architects and 
master planners, the ends and means of traditional 
city planning were of lesser importance. 

Finally, their plans were metropolitan. Because 
so much of the traffic flow was generated in the 
suburbs, transportation planning could not end at 
the city limits, and regional planning bodies were 
set up to do the studies. Metropolitan planning had 
been advocated by city planners and public admin- 
istrators for more than a generation. The control of 
growth and the achievement of order and efficiency 
required a metropolitan government to coordinate 
municipal services and planning for the city and its 
suburbs in a single supergovernment, which would 
do away with the duplication and conflict among 
the many hundreds of local bodies. 

The appeal for metropolitanism fell on deaf ears, 
however. The suburbs rejected any plan which 
would require them to share their power with, or 
give up their local authority to, the city. The cities 
were predominantly Democratic, working class, 
and Roman Catholic, and the suburbs were pre- 
dominantly Republican, middle class, and Protes- 
tant; thus political, class, and religious conflicts 
were endemic, and they were magnified when cities 
became increasingly nonwhite (Banfield & Grod- 
zins 1958). Moreover, the suburbs had no great 
interest in metropolitan planning or coordination. 


Their middle-class homeowners could well afford 
the duplication of municipal services and were un- 
willing to accept any infringement on their local 
autonomy, not to mention the possible arrival of 
lower-status and nonwhite city residents in ex- 
change for a slight saving in taxes. These consider- 
ations may even prevent the implementation of the 
regional transportation systems now being planned, 

Urban renewal programs. The second stimulus 
for change in city planning theory and practice has 
come out of urban renewal programs. In 1949 the 
federal government set up the urban renewal pro- 
gram, which funneled considerable amounts of 
money to the cities, with a view to the elimination 
of slums (Woodbury 1953; Wilson 1966). Intense 
political opposition to public housing prevented it 
from making any significant inroads on the slums, 
and thus the federal government turned to private 
enterprise, hoping that it would rebuild on land 
that had been cleared and reduced in price by 
federal grants. The program received strong local 
support from the cities, which expected that new 
building programs would increase their tax reve- 
nues, and from downtown businessmen, who saw 
urban renewal as a way of replacing the slum 
dwellers of the inner city with more affluent cus- 
tomers and, later, as a way of modernizing down- 
town districts with federal aid. 

The theory behind urban renewal was the tradi- 
tional nineteenth-century one that if slum dwellers 
were relocated in decent housing, they would give 
up their lower-class ways and the social patholo- 
gies thought to “breed” in the slums. Since inex- 
pensive housing was in short supply, however, and 
the private redevelopers of cleared sites were build- 
ing only luxury housing, the displaced were often 
forced to move into other slums, where they usually 
had to pay higher rents (Anderson 1964; Hartman 
1964; Greer 1965). In clearing entire neighbor- 
hoods, urban renewal also destroyed viable social 
communities, thus saddling the displaced with 
emotional costs as well (Gans 1962; Fried i 
In the larger cities, the unanticipated consequence: 
of slum clearance created so much political ae 
sition that by the start of the 1960s, when Me 
market for luxury housing had been sated, ka 
promiscuous use of the bulldozer was halted. a 
newal agencies are now experimenting with ai 
bilitation, so as to minimize relocation, od low- 
Proposing new subsidies for both builders an iol 
income residents in order to increase the r , 
low-cost housing (Frieden 1964; Abrams 19 A r 

Initially, renewal had been carried out Aes 
project-by-project basis, but once the choice a 
desired by private builders had been used uP: 
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planners were asked to develop more comprehen- 
sive urban renewal programs, The selection of 
future renewal sites required the determination of 
the best re-use of the sites on the basis of city-wide 
considerations; and the federal government pro- 
vided planning grants to the cities to create “work- 
able programs,” which had to demonstrate how 
individual projects fitted into a larger, more com- 
prehensive renewal plan before federal renewal 
funds were made available. Subsequently, the fed- 
eral government broadened its requirements, for 
example, by including social considerations in the 
choice of sites, and set up the “community renewal 
program” (Grossman 1963). 

In many cities the community renewal programs 
resemble the traditional master plan, but in others 
the failure of urban renewal to help the slum 
dwellers has called attention to the need for solving 
the more basic problems of the low-income popu- 
lation. This concern, together with the “War on 
Poverty,” is encouraging planners in a number of 
cities to develop quite different kinds of plans. Thus 
there have been innovations in the choice of ends 
and means as well as in the techniques of plan- 
ning. Urban renewal is no longer conceived solely 
as a process of eliminating slums, but as a means 
of dealing with the problems that force poor and 
nonwhite people into them, Some community re- 
newal programs are exploring programs of job 
creation, opening up housing, educational, and 
other opportunities to nonwhites, and improving 
social services in the hope that these, together with 
housing programs, will eliminate the deprivation 
of the slum dwellers as well as the slums. New 
ends, such as equality of opportunity and redistri- 
bution of public resources to the poor, are com- 
plementing those of order and efficiency, particu- 
larly since it is being realized that social order 
can be maintained only through greater economic 
and political equality. The means are new as well, 
Since, to an increasing extent, programs to help 
People are being added to those for changing the 
physical environment. In 1966, the federal govern- 
ment formally integrated social and physical plan- 
ning for urban renewal by developing the Demon- 
Stration Cities program, later retitled the Model 
Cities program, through which 60 to 70 American 
Cities were to be funded to rebuild and rehabilitate 
oe somplotety. the physical and social struc- 

their major slum ghetto neighborhoods 

(Taylor et al. 1966). 
pe lems planning. ý The extension of economic 
5 roe opportunities bears little resemblance 
a itional planning solutions, and the out- 
e of the planning process is no longer en- 
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visaged as a master plan but as a set of incremental 
or developmental programs that will improve pres- 
ent conditions, formulated so that they can be 
plugged into the ongoing decision-making processes 
of city officials (Fagin 1959; Mitchell 1961; Web- 
ber 1963a). Physical determinism is thus being 
replaced by a broader systems approach, which 
seeks to deal with the causes of the problem. In 
this conception of the planning process, land-use 
studies are less relevant, and the community re- 
newal planners are turning to surveys of the pres- 
ent behavior and future wants of the populations 
for whom they are planning, analyses of the qual- 
ity of opportunities and social services now avail- 
able to them, and economic as well as political 
studies to determine the feasibility and the results 
of the programs and policies they recommend. 

This conception of planning also presages a 
change in the role of the planner. The city planner 
is no longer a nonpolitical formulator of long-range 
ideals but is becoming an adviser to elected and 
appointed officials. Planning commissions are being 
reorganized into planning departments that take 
their place in the mayor’s cabinet, and conse- 
quently the planner’s traditional antipathy to poli- 
tics has decreased. 

The planner’s new role and his rising influence 
are partly a result of changes in the city and in 
city hall. The new problems of the city and the 
gradual replacement of the working-class machine 
politician by the middle-class, college-educated 
politician-administrator, supported by profession- 
ally trained bureaucrats, have made city govern- 
ments much more responsive to expert advice, 
while the reduction of class differences between 
the planners and the new politicians has improved 
communication between them. In addition, down- 
town business and property interests have become 
more favorable to planning, partly because they, 
too, face new problems they cannot solve them- 
selves and partly because the planners share their 
concern with the revitalization of the central busi- 
ness district, Moreover, planners are becoming 
more sympathetic to the city and are giving up 
their traditional antiurban ideology. As the suburbs 
threaten to engulf the city and to endanger the 
survival of the upper-middle-class cultural and 
civic institutions that have traditionally been lo- 
cated in the city, the planner has become an advo- 
cate of urbanism, supporting schemes to bring the 
middle class back into the city. 

Finally, with federal funds rapidly becoming a 
major source of support for local planning and 
state planning agencies and metropolitan ones 
also participating in local planning activities, the 
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planner is no longer so closely tied to the small 
group of local businessmen and civic leaders who 
have traditionally supported him against his politi- 
cal opponents. This has encouraged him to pay 
more attention to the needs of other interest groups 
in the city, and, since the advent of the “War on 
Poverty,” particularly the low-income population. 
His greater concern with people has also made him 
more aware of the diversity of values among resi- 
dents and more in favor of direct citizen participa- 
tion in the planning process, rather than depending 
on the traditional appeal for citizen endorsement 
of the planner’s decisions (Wilson 1963). 

The rational programming approach. The 
changes in the conditions under which planners 
work have been complemented, and even preceded, 
by changes within the planning profession and in 
the recruitment of planners, especially the entrance 
of social scientists into city planning. During the 
1930s, social scientists helped to conduct national 
and regional planning studies in various federal 
agencies, After World War 1, the University of 
Chicago established a planning school which taught 
national as well as city planning and was the first 
to stress social science rather than architectural 
techniques (Perloff 1957). Subsequently, this cur- 
riculum spread to other planning schools, and stu- 
dents with social science backgrounds streamed 
into them, so that today architects and engineers 
are fast becoming a minority in these schools. 

The Chicago school approached planning as a 
method of rational programming. Briefly, it argued 
that the essence of planning is the deliberate choice 
of ends and the analytic determination of the most 
effective means to achieve these ends, these being 
means which make optimal use of scarce resources 
and which, when implemented, are not accom- 
panied by undesirable consequences. The Chicago 
planners argued that ends are not imposed by 
planning ideology or by a priori determinations of 
the public interest, but by political and market 
Processes and by other forms of feedback from 
those affected by planning. According to this view, 
means and consequences are determined through 
predominantly empirical analyses, and by other 
studies that test the fit of means to ends and predict 
the consequences of these means (Meyerson & 
Banfield 1955; Meyerson 1956; Davidoff & Reiner 
1962; Perloff & Wingo 1962). 

Rational programming has much in common 
with concepts of planning and methods of rational 
decision making being developed in political sci- 
ence, public administration, and management; 
thus, it is reducing the differences between city 
planning and planning for other clients and ends. 
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In turn, this has enabled city planning to use the 
personnel and approaches of other disciplines, in- 
cluding operations research (Branch 1957; Whea- 
ton 1963), decision theory (Dyckman 1961), cost- 
benefit analysis (Lichfield 1960), input-output 
studies (Isard et al. 1960), information theory 
(Meier 1962), and simulation models (“Urban De- 
velopment Models” 1965), as well as sociological 
and manpower analyses for understanding the be- 
havior, attitudes, and ends of the clients of plan- 
ning [Willmott & Cooney 1963; Reiner et al. 1963; 
see also ADMINISTRATION, article on ADMINISTRA- 
TIVE BEHAVIOR; REGIONAL SCIENCE]. 

The social scientists and the rational program- 
mers owe no allegiance either to the master plan 
or to physical determinism. Aided by research 
findings, which indicate that the portions of the 
physical environment with which city planners 
have traditionally dealt do not have a significant 
impact on people's behavior (Rosow 1961; Wilner 
et al. 1962), and by studies of social organization 
and social change, which demonstrate that eco- 
nomic and social structures are much more im- 
portant than spatial ones, the rational programmers 
devote their attention to institutions and institu- 
tional change rather than to environmental change 
(Webber 1963b; 1964). 

The new conception of planning has also af- 
fected general land-use planning, creating a greater 
concern with the social and economic functions of 
land use and leading to incremental policy formu- 
lations for rearranging it (see, for instance, Whea- 
ton 1964). Even urban design, traditionally based 
primarily on aesthetic considerations, is now pay- 
ing attention to the social processes that shape 
what city planners call the “urban form” (Lynch 
1960; Wurster 1963). 

New towns. Nevertheless, traditional master 
planning and land-use planning have received new 
support through the revival of another nineteenth- 
century concept—the “new town.” Originally con- 
ceived as a way of moving urban slum ee 
into the countryside and halting the Lae 
cities (Howard 1898), the new town is a eee! 
self-sufficient and independent community ee 
beyond the city limits; it provides local enbi oE 
ment opportunities for many of its residents, affic 

> Prig: tys tr 
reducing their journey to work, the city burbs 
congestion, and the alleged defects of the ea 
as so-called “bedroom communities” (Stein 1 Tate 
The hope is that if the master plan and Ee 
solutions cannot work in established P 
might be applied on the tabula rasa of a PE al 
It seems likely, however, that the same P ene 
and market forces that prevented the implem 
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tion of the master plan will also frustrate portions 
of the new-town schemes, particularly if they do 
not meet the wishes of the people who will have 
to be attracted as home buyers (Eichler & Kaplan 
1967). 


Social versus physical planning 


In the 1960s rational programming is rapidly 
achieving dominance at the level of planning the- 
ory and is being advocated and used by professors 
and researchers in the profession. However, it has 
not yet been accepted by most city planning com- 
missions and departments. Instead, the city plan- 
ning practitioners have typically reacted to the new 
theory—and the new urban problems—by adding 
programs of “social planning” to the traditional 
land-use approach, now often called “physical 
planning.” “Social planning” is used to describe 
planning for people or, by implication, for low- 
income people, probably because physical plans 
have catered so largely to building programs for 
more affluent city residents. The dichotomy be- 
tween physical and social planning is theoretically 
unjustifiable, however, because physical plans af- 
fect people, rich and poor, as much as do social 
plans. Even so, planning agencies are now adding 
social planning divisions to their staffs (Perloff 
1963; 1965; Dyckman 1966). 

The first social planning activities were intended 
to meet the requirements of the federal community 
Tenewal program and to correct the inequities of 
slum clearance for the slum dwellers, but increas- 
ingly they are being carried out to coordinate city 
Planning and renewal with the community action 
activities and other programs of the “War on Pov- 
erty.” Frequently, however, this so-called social 
Planning is only a modernized version of previous 
attempts to impose middle-class ways on the low- 
come population, with “human renewal” educa- 
tional and social work schemes to “rehabilitate” 
the slum dwellers while—or even in lieu of—im- 
Proving their living conditions, thus subverting the 
antipoverty efforts into yet another device for 
maintaining the present inequality of the low- 
oe population (Gans 1964). This possibility 
T led to proposals for bringing planners into 
De er contact with the low-income population. 

i Vidoff (1965) has suggested that because most 
eae are employed as technicians to pursue the 
nis oe of the so-called Establishment—that is, 
e all, the political party, and the business com- 
mes Saas planners ought to become “advo- 
ane anners,” that is, technical consultants and 
a spokesmen for the low-income population. In 

eral U.S. cities, planners have begun to work 
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with local civil rights groups and with community 
organizations in poor neighborhoods; they have 
formed a national professional association, Plan- 
ners for Equal Opportunity; and they are question- 
ing orthodox planning beliefs about how to plan 
for the slum ghetto (Piven & Cloward 1966). 

The politicization of planning. These activities 
are leading to the politicization of the city planning 
profession. It appears that the profession is being 
split into progressive and conservative wings: the 
former calling for social planning to reduce eco- 
nomic and racial inequality, and the latter de- 
fending traditional physical planning and the 
legitimacy of middle-class values. The rational pro- 
grammers are a third wing, seeking to develop an 
approach that makes it possible to plan for all 
interest groups, but they, too, are split over the 
issue of working with or against the Establishment. 

In their day-to-day practice, city planning agen- 
cies tend to favor the conservative wing and the 
traditional land-use approach to planning, although 
they are both threatened and influenced by the 
progressives and the rational programmers. Not 
only are the planning agencies in existence to con- 
duct land-use planning, but most of the directors 
and senior staff members are trained in this ap- 
proach, and some of the leading practitioners in 
the profession still come from architectural and 
civil engineering backgrounds. Rational program- 
mers and progressives are more often found among 
the junior staff and have less influence on agency 
activities. Moreover, since planners are employees 
of municipal bureaucracies, they are expected to 
conform to agency policy on the job, and in some 
cities they are even restrained from off-hours par- 
ticipation in activities and groups that question or 
oppose agency policy. This naturally reduces the 
impact of progressive ideas on local planning 
activities. 

The future of planning. The future direction 
of city planning is at present uncertain. As archi- 
tecturally trained planners become a minority in 
the profession, city planning will undoubtedly 
make more use of the concepts and methods of 
rational programming and will pay more attention 
to the social and economic problems of the city. 
Traditional physical planning is still needed to 
modernize the city, to raise its efficiency in hous- 
ing and moving workers and residents, and to pre- 
vent further physical and aesthetic deterioration. 
The ends of physical planning may be most rele- 
vant to the suburbs and to the newer communities 
of the western United States, although even they 
do not always favor the particular kinds of order, 
efficiency, and attractiveness that physical planners 
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seek. In the older American cities, however, pov- 
erty, unemployment, racial and class discrimina- 
tion and their pathological consequences are not 
only the most crucial problems but also are major 
causes of the low quality of city living, the middle- 
class exodus, and the city’s financial difficulties 
(Gans 1965; Wilson 1966). The inequalities and 
pathologies of the urban low-income population 
must, therefore, be eliminated, before the attrac- 
tive, efficient, and slumless city for which physical 
planners are striving can be realized. When these 
traditional planners can be persuaded of the valid- 
ity of this approach, it may be possible to achieve 
a synthesis of the so-called social and physical 
planning approaches and thus to create a city 
planning profession that uses rational program- 
ming to bring about real improvements, not only 
in the lives of city residents but also in the con- 
dition of the cities themselves. 

HERBERT J. GANS 


[See also Crry; Housinc; NEIGHBORHOOD; REGION; 
TRANSPORTATION; and the biography of GepvEs.] 
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Til 
RESOURCE PLANNING 


A plan is a course of action laid out in ad- 
vance, A resource is an opportunity in the environ- 
ment that has been identified and appraised by a 
population of potential users. Resource planning 
is a general term for the process of laying out 
better programs for the development of natural 
resources than are formulated from the opportun- 
istic, short-run expediting of independent resource 
use projects. 

Resource planning is an outgrowth of the prin- 
ciples for preserving natural resources and utiliz- 
ing them economically [see CONSERVATION]. When 
optimal use cannot be achieved automatically 
through the workings of the price system, but only 
through adjusting technologies on a regional, na- 
tional, or international scale, then some form of 
centralized, long-range planning must be adopted. 
Long-range planning is often undertaken for other 
reasons than the rationalization of resource de- 
velopment, but resource planning almost always 
becomes a major component of such national and 
regional plans. 

Resource planning has not been fully centralized 
in any society but is distributed among depart- 
ments and bureaus responsible for agriculture, 
forestry, fisheries, mining, education, recreation, 
and regional development. After national plans be- 
come explicit, they still contain an assortment of 
specialized programs initiated and implemented by 
these functional units, as well as by cooperatives 
and private corporations, that are only partially 
controlled and coordinated by the national plan- 
ning effort. No typical administrative structure 
exists as yet, but it is evident that to be effective, 
planners need almost daily contact with agencies 
collecting relevant, reliable information. In so- 
cialist states, and in many others in addition, the 
hydroelectric potentials and the resources below 
the ground are state-owned, so that these resources 
must either be developed by state enterprises or 
licensed, mainly to large international firms. Al- 
though both approaches are amenable to planning, 
the incentives and controls used are very different. 

Thus, in Soviet planning an extraordinarily heavy 
emphasis was placed upon reconnaissance, pros- 
pecting, and survey of mineral deposits. A huge 
centralized, information-gathering operation was 
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organized which was able to pinpoint the geo- 
graphic sources of raw materials to be fed into 
the expanding industrial complex (Shimkin 1953). 

In Pakistan, Egypt, Brazil, and elsewhere, future 
progress hinges upon the development of water re- 
sources, Then the natural unit for analysis becomes 
the basin, Where and when most projects are un- 
dertaken depends upon the long-range program 
for watershed development [Krutilla & Eckstein 
1958; Maass et al. 1962; see also WATER. RE- 
SOURCES]. 

Workable plans are built up of sequences of 
projects undertaken by specified agencies working 
within the public sector, the private sector, or, in- 
creasingly, some combination of both. The projects 
are selected and shaped according to criteria set 
within the context of a national strategy for sur- 
vival, welfare, and growth. Additional criteria may 
be important at the international, regional, or 
agency level. The initiation of the projects is 
scheduled to meet anticipated requirements. 

Experience has shown that resources essential 
to the functioning of a modern society may also 
be tapped through the channels of international 
trade. The technology of extraction, which is trans- 
ferable to sites throughout the world, is in most 
cases sufficiently advanced to meet the rigorous 
specifications for commodity grades and standards 
established by modern industry and the markets 
of modern countries. Transport costs have been 
declining simultaneously, Consequently, the rela- 
tive richness of the resources found in accessible 
parts of the world, rather than the political bound- 
aries that separate supply from use, will usually 
determine the sequence with which the respective 
Opportunities are exploited. 

Military demands for national self-sufficiency, 
particularly in case of mobilization for total war, 
have quite seriously deflected the rational, trade- 
based program for the use of world resources. 
Much resource planning in the United States be- 
tween 1910 and 1950, for example, was intended to 
meet various projected national emergencies, The 
reason for this emphasis on disaster planning was 
the experience with rapid shifts in supply in the 
international market that occur in response to po- 
litical crises. Raw materials and manufactured in- 
termediates with a heavy resource component 
(fuels, refined metals, vegetable oils, paper, sugar, 
etc.) have a history of reacting to threats of con- 
flict with sharp upward price movements and 
irregular supplies, It is not surprising that in many 
countries during World War 1 elaborate resource 
planning efforts were made part of the plans for 


total mobilization of productive capacity for war, 

Self-sufficiency for military reasons lost its co- 
gency in the 1950s. The introduction of megaton- 
size nuclear weapons into international military 
calculations has rendered meaningless all prior 
approaches to strategic resource planning, since 
short-term emergency needs can be met better with 
inventories and strategic stockpiles of quite man- 
ageable magnitudes. Installations to develop sub- 
marginal reserves can no longer be justified for 
reasons of national survival. 

A transition to the new way of thinking was 
marked by the U.S. Presidents Materials Policy 
Commission report, Resources for Freedom (1952). 
National boundaries were superseded to an im- 
portant extent and were replaced by a group of 
political units subscribing to common procedures 
for trade and to conciliation of international dis- 
putes. Autarchic arguments were retained, but they 
were moved to a larger arena. The focus of sub- 
sequent policy proposals has shifted to the use of 
resources to improve the levels of living. The re- 
ports of the United Nations Scientific Conference 
on the Conservation and Utilization of Resources 
(United Nations 1953) provided the basis for such 
planning. 

Economic and social development. When re- 
source planning leads to economic development, 
the significance of natural resources will be Te- 
duced. This planning will result in a decline in the 
share of human effort applied to natural resources 
which support economic growth. The primary Oc 
cupations (e.g., agriculture, forestry, fisheries, min- 
ing) will shrink in over-all importance as the 
planning becomes effective. Even though the 
planning thus far conducted in the twentieth cen- 
tury has been uncoordinated, piecemeal, and jes 
rochial, the effort devoted to these primary activi- 
ties has dropped from a share that was oe 
than 50 per cent of total product to a new sie 
that is considerably less than 10 per cent of | 
total in the most advanced economies. In editou 
economically feasible substitutes for sere 
materials have been found. The ready availal mee 
of a variety of substitutes implies that are oe 
of the major world mineral reserves (as is i ah 
eventual prospect for fossil fuels and phosphal 
need not result in a reversal of this progress. ried 

Because a large share of the world has Re o 
resource scarcity catastrophes since the Sr 
Malthus, a strong trust in automatic, almost a 
cal, improvements in technology exista, aoe 
source economists, The law of dain 5 west 
has been repealed at the macro level, in the 
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at least, by the regularly repeated introduction 
of cost-reducing technical innovations (Boulding 
1955). However, planners cannot make extrapola- 
tions on the basis of faith in the results of research. 
They must project the development of known re- 
serves with tested technologies to fill the needs of 
expanding populations. When this exercise is car- 
ried out, as by Landsberg and his colleagues (see 
Resources for the Future 1963), the future for 
such resource-rich territories as North America 
appears secure for at least a generation. Careful 
matching of supply and demand suggests that no 
critical shortages should occur during the twentieth 
century. 

However, the same exercise addressed to the 
needs and the accessible reserves in the rest of 
the world reveals the real challenge facing resource 
planning. Many parts of the world can project 
ahead no further than a decade before foreseeable 
needs will begin to outrun apparent supply. The 
technology thus far evolved in the West will be 
unable to cope with the transformation of still 
lower-grade resources into the manyfold increase 
in the output of raw materials that will be required. 
The foreign exchange needs for the equipment, the 
skilled personnel required for its operation, or the 
organization involved in distributing the output 
will reach unprecedented dimensions. Long-range 
development planning, attempting to plot a course 
toward adequate levels of living, is therefore forced 
to resort to new technologies which do not yet exist 
but could be evolved from the scientific knowledge 
now available in journals, laboratories, and pilot 
Plants somewhere in the world. 

The soil and water resources available for the 
Production of food and fiber are a case in point. 
Land reclamation, fertilizer use, and improved 
crop varieties are being introduced quite rapidly 
into the less-developed regions. The foreseeable 
limits of the combined effects of the known tech- 
nologies are a doubling, occasionally a trebling, of 
the output available to a population. But the de- 
mand for food often doubles within a generation 
as a combined result of population growth and 
qoewation: The anticipated crisis will occur 
rahe a bad crop year and will become disastrous 
one or more poor years follow each other. The 
a ie are all too great that in the late 1960s 
Tatin, 970s the stresses triggered by uncontrolled 
hore e will overwhelm the government in one or 
ae of the densely populated territories; Bengal, 
Tele China, and Java are particularly vul- 
ign <a ecause their respective populations will 

too large to be supported by the world’s 
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surpluses. Tens of millions of unfortunates would 
die in the course of the crisis; the tragedy Malthus 
envisaged would be enacted on its most massive 
scale. 

In the laboratories, however, a set of principles 
for the utilization of photosynthesis under opti- 
mum yield conditions is unfolding which promises 
a tenfold increase of output per unit area, These 
modes of plant culture have not been perfected, 
primarily because the societies best equipped to 
work out the details in the technology have been 
experiencing food surpluses. Nor have the incen- 
tives for rapid development, parallel to those 
granted to weapons technology, been authorized. 
Solving production problems is not sufficient 
either, because new foodstuffs will have to be 
converted into an acceptable diet. Many process- 
ing and marketing innovations are needed to ac- 
complish such acceptance. The prospective diffi- 
culties of supply are no longer nutritional, but 
arise primarily from the strongly conservative 
character of food preferences in most cultures. 
Heavy investments in public education are required 
to bring about significant changes in diet. 

The inadequacy of contemporary Western en- 
ergy conversion technology is more subtle. The 
crisis previously anticipated when fossil fuels were 
depleted will probably be avoided, largely because 
of the discovery of nuclear energy sources. How- 
ever, nuclear energy cannot be safely adapted to 
the operation of independent vehicles, such as 
small ships, aircraft, trucks, buses, and automo- 
biles, without huge increases in cost. Very likely 
microbiological processes for the conversion of solar 
energy will be able to bridge the fuel gap (Golueke 
& Oswald 1963). Attempts of this sort are held 
back by the diffuseness of solar energy; eventually 
much more of the earth’s surface may have to be 
used for the production of fuels than for that of 
food. 

Long-range resource planning encounters a few 
other instances demanding shifts to new, science- 
based technologies (e.g., substitutes for paper and 
some replacements for nonferrous metals). Doubt- 
less, the expanding investments in scientific work 
will provide many unexpected opportunities, al- 
though they are almost as likely to reveal unsus- 
pected constraints in the application of what is 
already known and projected to be used. Once the 
important new findings appear, the affected plans 
must be adjusted to take them into account. 

These newest technologies for resource utiliza- 
tion are highly instrumented. As greater depths 
and more stressful environments are penetrated, 
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an increasing need for sophisticated equipment is 
experienced. These processes are more susceptible 
to human error than the older technologies are and 
therefore must be designed in automated or cyber- 
nated versions which reduce the risks. Low wage 
scales do not significantly modify the technical 
solutions, Therefore, resource-oriented production 
in the future is expected to become much less 
labor-intensive. The operation of the instruments 
and controls, however, does depend heavily on the 
transmission of large quantities of data through 
a communications network and the rapid process- 
ing of such information at decision centers. 

Information and organization. Planning quickly 
becomes routinized in a large bureaucracy. When 
a new project for producing raw materials (or 
services) from a natural resource is prepared 
within a planning context, it undergoes a series 
of systematic tests of feasibility. One of the first of 
these hurdles is an estimation of its likelihood of 
becoming obsolete. The newest scientific findings 
and their influence on competing producers, on 
consumers, and on the producers of substitutes 
are assessed, Also, consultants who have access to 
bodies of unpublished knowledge are asked to sub- 
mit appraisals, Similar checks are made concern- 
ing whatever factors are scarce in the economy. 
How much would this proposal worsen the scarcity 
of foreign exchange, technical skills, transport ca- 
pacity, etc.? As a result, the revised project de- 
sign is more closely linked to the characteristics 
of the market, it is automatically alerted to a wider 
variety of danger signals emanating from the en- 
vironment, and a series of monitoring and control 
systems are incorporated into the design which 
combine information acquired from the outside 
with data generated from within the operation. 

When projects are meshed into national pro- 
grams and regional plans, a coordinating agency 
is needed to administer them. The implementation 
process involves gathering information about prog- 
ress, interpreting the findings, and instructing the 
operating units about desirable adjustments in 
plans. The emphasis in this description is upon 
the information-processing features of planning be- 
cause most of the agency effort is expended in this 
direction. Extra information introduced into re- 
sources planning results in a reduction of physical 
waste and a stabilization of the cost of producing 
raw materials. 

It should be noted in passing that the planning 
agency also has an important political function, 
since it must resolve conflicts between government 
units, between individuals, and between the ob- 
jectives of both groups. That feature of its duties 


is not considered here because a high degree of 
consensus among the involved parties is a pre- 
requisite for the acceptance of plans. 

Resource planners now encounter situations 
where the best resource-conserving projects do not 
deal directly with the scarce resources, or with 
their recognized substitutes, but with the expansion 
of “knowledge creation” capabilities and of spe- 
cialized information transmission systems, For 
example, the best way of reducing the energy re- 
quirements of transportation may involve expand- 
ing the telephone system, so that on the average 
several telephone calls (which expend virtually 
no energy and very little other scarce natural re- 
sources ) may save a trip. Similarly, an information 
service warning based upon knowledge about the 
effects of changes in weather may reduce spoilage 
and waste. 

A different natural resource is being exploited 
in these instances, one that as yet is rarely recog- 
nized in the resource literature. The electromag- 
netic spectrum is vital to high capacity telecom- 
munications. This resource is still as free as the 
sea, requiring only international agreement on the 
allocation of channels. Recently, moves have been 
made to organize spectrum conservation measures 
(Ryan 1959), particularly in the vicinity of large 
industrialized metropolitan areas. Stray radiation 
acts as a kind of pollutant, causing noise in the 
channel and errors in the communication process. 
Fortunately, however, huge capacities remain to be 
exploited, and the underlying sciences have an- 
nounced recent findings (e.g., lasers, wave guides) 
that open up further cost-reducing opportunities. 
Forecasts of the growth in demand (Litchman 
1963) suggest that teledata flows will double every 
two years; they can increase at this rate until 1980 
and still remain within technological capabilities. 

Human resources. The fact that human Te- 
sources can substitute for natural resources at the 
margin has been recognized for at least a century 
[see CAPITAL, HUMAN; CONSERVATION, article on 
ECONOMIC AsPECcTs], but the principles for aE 
ing and conserving this resource have not yet F 
very precisely formulated (“Needs and Resources A 
Social Investment” 1960). Nevertheless, TE 
planning is expanding most rapidly in this E 
tion. Highest priorities are given to formal a 1i 
tion systems and public health units. Schoofs 
braries, and clinics allow a society to draw up Jy 
a world-wide fund of knowledge and, with gy 

i Eran f that know: 
predictable results, distribute parts o: pison 
edge to individuals (Machlup 1962; Har redite 
Myers 1964). Disease prevention programs ie ne 
the frequency of loss of informed individu 
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fore they have made their full contribution. Edu- 
cated persons, working collectively, can extract a 
better living from a given environment than un- 
educated persons can. 

Human capital may be accumulated by expand- 
ing the number of persons acquiring education and 
by making sure that the subjects taught are rele- 
vant to future states of the society. The role of 
human capital is analogous to that of raw mate- 
rials inventory. Optimal investment in human re- 
sources implies quite large-scale experiments in 
search of formulas for cooperation. An important 
share of the effort associated with the founding of 
an organization may therefore also be categorized 
as human capital formation. Thus, plans will at- 
tempt to balance the human investments in health, 
education, and communications; and the timing 
of the investments will be phased according to the 
magnitude of the organizing effort required to 
launch the specialized agencies desired. Fast-paced 
organizations of well-informed people will usually 
make more efficient decisions regarding resource 
use than will organizations that depend too heavily 
on computers. 

Because of these complex substitutabilities, 
much of the most productive effort expended in 
resource planning is devoted to organizing the 
spread of relevant knowledge. The United Nations 
conference on science and technology (United 
Nations . . . 1963) was intended to be a bench- 
mark in the development of resources. 

The resource planning strategy. Thus the over- 
all mandate for planners is outlined: The richest 
nonrenewable resources should be used first. Or- 
ganizations are created to implement the projects 
and arrange for the distribution of output. As 
lower-grade reserves must be drawn upon, costs 
tise, and the organizations are forced to seek more 
efficient technologies. They also pay attention to 
eee of scrap and to fabrication techniques 

use less of the resource. As still lower 
grades are resorted to, the organizations must ex- 
ee an more effort on reprocessing scrap and 
sie acturing, intermediate products from sub- 
eae 2 easier supply. Eventually the salvage 
ace ons so efficient that exploitation prin- 
fae annot be distinguished from those applying 
ana a eed renewable natural resources. The 
sea ati Somay, be abandoned; but the organiza- 
terials $ . skills in the conversion of raw ma- 
its ai ody of technological knowledge, and 
a ities for meeting the specifications of 

omers, remains. 

patie sustained ; yields from renewable re- 
re also obtainable only through improved 
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knowledge and organization. Communications sys- 
tems development makes possible the coordination 
of utilization over much larger areas. 

Ultimately, the asset value of the extracted re- 
sources must be transformed into self-renewing 
human and physical capital of even larger value, 
if the effects of the law of diminishing returns 
are to be postponed indefinitely. The worth of hu- 
man resources, however, is also subject to laws. 
If population continues to grow, the effort to edu- 
cate, organize, investigate, and communicate must 
reach saturation at some point, and the marginal 
value of men will become negative. 

So far it has not been possible to discern any 
natural laws which identify these saturation limits 
for human organization or communication, but 
the requirements of a healthy human organism do 
impose constraints. A “minimum adequate stand- 
ard of living,” which provides just enough free- 
dom from environmental stresses to allow social 
and cultural organization to become as effective 
as is presently known, can be recalculated as a 
per capita requirement for resource inputs. The 
ultimate limit appears to be the energy incident 
from the sun and its thermodynamic degradation 
in the course of storage and transport. The earth 
will probably be able to support indefinitely about 
ten to twenty times its present population (say, 
30,000 million-80,000 million) when employing 
the stock of scientific knowledge which has yet to 
be translated into technology (Meier 1956). 

It will take only three to five generations at 
present rates of growth for world population to 
reach these saturation levels. Resource planning 
has hitherto accepted population as given and not 
subject to manipulation, but henceforth it must 
become closely linked with social planning that 
endeavors to bring births and deaths into balance 


(Brown 1954). 
RICHARD L. MEIER 
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WELFARE PLANNING 


An inquiry into planning for the social services 
is handicapped by conceptual ambiguities as to 
the meaning of the terms “social services” and 
“social welfare.” With refreshing candor, Donnison 
recently observed that “the distinctive feature of 
our subject is neither its body of knowledge (for 
most of this could be incorporated into other disci- 


plines), nor its theoretical structure (for it has 
very little), and we are not interested in method- 
ology for its own sake. We are concerned with an 
ill-defined but recognizable territory, ‘the develop- 
ment of collective action for the advancement of 
social welfare’” (1962, p. 21). 

But the global concept of “collective action for 
the advancement of social welfare” obscures the 
clashing conceptions of the political purposes of 
social services. At one extreme is the view that 
social services “assign claims from one set of 
people who are said to produce or earn the national 
produce to another set of people who may merit 
compassion and charity but not economic rewards 
for productive service” (Titmuss 1965, p. 14), 
So defined, social services are regarded as a burden 
to be carried by the productive institutions of so- 
ciety. Thus they constitute a residual function of 
government and philanthropy, and social service 
agencies should, ideally, strive toward self-liquida- 
tion. At the other extreme, Titmuss defines social 
services as “all collective interventions to meet cer- 
tain needs of the individual and/or to serve the 
wider interests of society; [these] may now be 
broadly grouped into three major categories of wel- 
fare: social welfare, fiscal welfare, and occupa- 
tional welfare” ([1958] 1959, p. 42). 

This discussion of the purposes of the social 
services has immediate import for planning, be- 
cause estimates of total expenditures for social 
welfare clearly depend on the definition we accept. 
In the U.S. Social Security Administration's annual 
estimate of total welfare expenditures in the public 
and private sectors, social welfare is defined as 
“activities that directly concern the economic and 
social well-being of individuals and families (Mer 
riam 1965, p. 3). Consistent with this definition, 
Ida Merriam includes those agricultural programs 
that make surplus food available to needy persons 
(school lunch program, food stamp program, oes 
Lampman, on the other hand, employs a ey 
encompassing definition of welfare in his asse > 
ment of the scope of money and nonmoney ert 
fer payments. Public transfers include that ia i 
of earned income (factor income) that is ae, 
payment for current production having any salle 
Thus, in the agricultural sector he defines Pi at 
payment for nonproduction as a welfare ski 
Moreover, Lampman urges that the Social Se ms 
Administration’s definition of public aaa 
expanded to include “subsidies and eae that 
alter prices paid by the consumers and taxi a. 
reduce disposable money income” (1966, Ae 

According to this more inclusive defini ; fan" m 
estimate of U.S. expenditure for “social we! 
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1964 is about $97,000 million in transfer pay- 
ments and transfer goods and services. By contrast, 
the more limited definition of the Social Security 
Administration leads to an estimate of $118,000 
million, (No aggregate estimate is available for 
the conception of welfare favored by Titmuss.) 
This surprising finding arises because the broader 
view excludes health and education expenditures 
that are paid for directly by current users and are 
thus not regarded as extrafamily transfers. Lamp- 
man insists that the concept of welfare be limited 
to public transfer expenditures. He argues that 
there is no rationale for including personal expend- 
iture for dental care while excluding personal ex- 
penditure for housing, as is done in the Social 
Security expenditure series. Direct payments by 
consumers for health services alone amounted to 
$17,000 million. 

Not only does the size of the social welfare sec- 
tor vary according to the conception of welfare we 
employ, but the pattern of distribution of welfare 
services is affected even more dramatically. In 
Lampman’s pioneer study of the American welfare 
system (1966, p. 8), he estimates that almost 
40 per cent of the total amount allocated for trans- 
fer payments, or $38,000 million, goes to those 
individuals who would be classified as poor before 
they received any transfer payment. Assuming that 
the poor contribute approximately $8,000 million 
in taxes and private contributions, he thus con- 
cludes that they receive a net benefit of $30,000 
million. 

In a more limited analysis of the distribution of 
housing benefits among various income groups, 
Alvin Schorr comes to almost the opposite con- 
clusion. He observes that in 1962 the U.S. federal 
government spent $820 million to subsidize hous- 
ing for individuals who would be classified as poor; 
this estimate includes direct subsidies for public 
housing and additional housing payments for 
those receiving public assistance, as well as savings 
from income tax deductions. By contrast, the 
amount the government refrained from collecting 
i Icome taxes by granting tax concessions to 
fae owners (a predominantly middle- and upper- 
Theret, group) is estimated at $2,900 million. 
a fore, in regard to housing subsidies, the fed- 
a (Boreal spent more than three times as 
ee for those who are not poor than for those 
a are poor. In this way, Schorr concludes, the 
e N gives more to those who have more” 
ie Hike 437). The main difference between the 
as Lampman and those of Schorr is their 
led ent of tax deductions. Lampman asserts that 

Stribution of social welfare benefits must be 
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seen both in relation to “who pays” and to “who 
benefits.” Tax deductions in his scheme are re- 
flected in a smaller “pay” figure, and they are of 
interest only to the extent that they alter the bur- 
den on the rich as compared to the poor. By con- 
trast, Schorr assigns tax concessions to the benefit 
rather than the cost side of the ledger, which per- 
mits him to compare the relative benefits of tax 
concessions between those who are poor and those 
who are not poor. 

The answer to the question of who benefits from 
America’s welfare state must remain inconclusive, 
not alone because of conceptual differences, but 
also because of the limitation of available data on 
the cost of employee fringe benefits and the size 
of tax concessions and how these are distributed 
by income groups. As indicated above, there are 
two main positions regarding the redistributive 
effect of social services. On one hand, Lampman 
suggests that, in the aggregate, welfare systems 
are redistributive and that they work for the ad- 
vantage of lower-income people. On the other hand, 
students of British social policy, for instance, have 
concluded that the major beneficiaries of England’s 
welfare state are the middle- and upper-income 
groups; thus Abel-Smith (1958) does not hesitate 
to assert that the high-cost sectors of social serv- 
ices, such as education, health, and housing, are 
instruments for the multiplication of advantage 
ahd privilege in an advanced industrial society. 
The rediscovery of poverty in an affluent society 
has riveted attention on the distributive effects of 
social policy: who are the beneficiaries of public 
largess? This question provides a useful context in 
which to begin planning. (There is little scholarly 
concern in continental Europe, South America, or 
the developing countries for the study of social 
policy; I have therefore been forced to rely heavily 
on the experience of the United States and the 
United Kingdom. ) 

The planning process. Planning is traditionally 
defined as a method of rational decision making 
that counterposes means and ends in an attempt 
to assess how these can be best brought together 
at the least cost and with maximum effectiveness. 
This formulation tends to emphasize the technical 
problems of deciding on means; it implicitly as- 
sumes that the goals are clearly defined and capable 
of being measured. But this technical bias can be 
minimized, and the conception of planning recast, 
if one treats either the objectives or the implemen- 
tation of the plan as problematic. The approach 
that focuses on implementation raises the question 
of feasibility and shifts the matter of planning 
away from the more dispassionate technical analy- 
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sis of means and ends to a preoccupation with the 
realities of the environment as constraints on im- 
plementation. Similarly, the focus on objectives, 
including both the short-term and long-term aims 
of social policy, confronts the issue of ideology— 
the conception of the good society. 

Planning can also be seen in terms of three 
interrelated stages: policy development, with an 
emphasis on the progressive modification of ends 
and means to achieve feasibility; policy implemen- 
tation, concerned with the administrative circum- 
stances that create disparities between purpose and 
performance; and policy evaluation, which attempts, 
through the rigors of social science technology, to 
document these disparities. The results of these 
evaluations in turn serve as an information input 
for the modification of established policy. Thus 
planning is a continuous and circular process, 
rather than an occasional event initiated to solve 
a specific problem. 

The following discussion of planning will focus 
on the clarification of the goals of welfare and on 
the instruments for maximizing them; that is, it 
will deal primarily with substantive issues rather 
than with procedural questions. This approach will 
enable us to explore further some of the concep- 
tual problems raised above in connection with the 
definition of welfare. Two interrelated tasks of 
planning—allocation and coordination—will be 
considered. 


The nature of allocative decisions 


Allocation is concerned with the problem of how 
to divide limited resources among competing claim- 
ants. In order to simplify this analysis, we will 
consider these allocative decisions in relation to 
specific tasks, such as the apportionment of money 
among items in an annual budget or among the 
components of a development plan. Allocative tasks 
can be roughly sorted into three categories that 
vary according to the types of claimants involved: 
that is, resources are allocated, first, among the 
social, economic, and physical sectors; second, 
among the various sectors of the social welfare 
field; and, finally, among the various types of pro- 
grams within a single specialized social sector. 

Social, economic, and physical planning. First, 
we will be concerned with the social sector as a 
claimant for resources in competition with the 
economic and physical sectors. Consider the recent 
proposal in the United States for creating a dem- 
onstration cities program, in which 60 to 70 cities 
would request federal funds to develop a coherent 
attack on slums by combining physical and social 
renewal. According to the preliminary model pro- 
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grams for large cities, the estimated net cost of a 
neighborhood renewal program of 25,000 housing 
units is approximately $350 million, of which $100 
million would be set aside to meet social costs— 
roughly a 3 to 1 ratio between these sectors, The 
expenditures set aside in the federal budget and 
in the demonstration cities program are examples 
of allocative decisions among different sectors of 
the economy. 

How physical, economic, and social planning fit 
together will vary according to the prevailing con- 
ception of the function of social planning, At least 
four different views of the functions of social plan- 
ning can be identified. 

Welfare as a burden. One view of social plan- 
ning sees welfare as a burden. According to this 
conception, since social welfare expenditure is 
heavily financed by taxation, we must pay special 
attention “to the specific sources of revenue used 
to finance it, to insure that the minimum of dis- 
couragement to enterprise and savings results from 
any given volume of welfare spending” (Burns 
1954, p. 139). Eveline Burns suggests that we must 
be prepared to pay the costs of the depressive effect 
on enterprise and initiative arising from welfare 
expenditures, if these services enjoy a high com- 
munity priority and are regarded as necessary. 
She exhorts the profession of social work to dem- 
onstrate that welfare services are indeed necessary 
but accepts the position that aggregate welfare 
costs inhibit economic growth. This view directs 
attention to the question of how much welfare we 
can afford without overburdening the economy. 

Likewise, in developing countries some have felt 
that funds expended for the social sector detract 
from the maximum growth of the economy. Agri 
cultural experts have sometimes said that it 1s 
almost immoral to spend money in the social sec: 
tor when the country has insufficient food for its 
population. According to this view, agricultural A 
velopment warrants the highest priority, and 7 
allocation of funds for the social sector is at bes 
a distraction that may be politically necessary: 

Welfare as handmaiden. Recently, the one 
tion of welfare as burden to economic ee 
being replaced with a view that both economic ae 
physical planning are best advanced by an ve 
ment in the social sector. One formulation 0. 
view of welfare as “handmaiden” has been hee 
by economists concerned with investment in ‘hu he 
capital.” Schultz suggests that inequalities a O 
distribution of personal income are most li T 
be affected by such investments; he r arily 
“the structure of wages and salaries is P on- 
determined by investment in schooling, health, 
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the-job training [etc.]” (1962, p. 2). [See CAPITAL, 
HUMAN. | 

Physical planners have also become interested 
in a rapprochement with social planning. Although 
a physical plan may be immediately accepted as 
desirable and necessary, it cannot readily be im- 
plemented without the endorsement of citizens 
affected by it or without making provision for those 
who will be displaced by the renewal plans. Thus 
social planning has come to be defined in terms 
of the processes of relocation and citizen participa- 
tion, namely, those activities that are necessary to 
facilitate the physical renewal plan. However, social 
planning for urban renewal and planning for in- 
vestment in human capital are perhaps best seen 
as handmaidens for physical and economic devel- 
opment, respectively. 

Welfare as complementarity. A third view holds 
that raising the level of living and increasing the 
investment in the social sector can only proceed 
in conjunction with greater economic growth; 
efforts to redistribute limited resources result only 
in the redistribution of poverty. 

Interdependence arises when activities in one 
sector can serve either as objectives or methods 
for activities in another sector. Thus we sometimes 
may use primarily economic tools to achieve social 
objectives, as when we create employment oppor- 
tunities and training opportunities in order to re- 
duce youth unemployment and raise personal and 
family income, including those families receiving 
Public assistance. Conversely, methods that are 
Primarily social may be employed to promote eco- 
nomic objectives, as illustrated by the efforts to 
demonstrate that health improvement can make a 
major contribution to economic growth (see, for 
example, Mushkin 1962). Finally, complementarity 
1s regarded as essential if more resources are to be 
added to the social services sector and if low- 
income groups are to receive a larger share of these 
Tesources. As Donnison states: 


ease feature of the present concern for equality 
beak endency for some of its most forceful exponents 
onal such stress upon the redistribution of the na- 
T that its rate of growth is liable to be 
kala en. . , . Redistribution can only be achieved by 
and ta more schools, hospitals and houses, recruiting 
ers, Sate more teachers, doctors and social work- 
Where bY Concentrating the increase in the areas 
growth is most needed. A high rate of economic 
justices ae be achieved without alleviating social in- 
igh rat ut injustices cannot be remedied without a 
Tate of growth. (Donnison 1966, p. 7) 


eine as a means of social control. The 
rth view of welfare programs claims that funds 
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earmarked for such programs are intended largely 
to reduce social deviancy and thereby to protect 
the political and economic stability of the com- 
munity. Hence, according to this view, the resources 
of the “war against poverty” in the United States 
are increasingly being used as a means of tran- 
quilizing the unrest in urban slums. For example, 
the riots in 1965 in the Watts section of Los An- 
geles, California, brought forth sustained efforts to 
bring large amounts of federal funds into the 
community for social welfare programs in order to 
avert future violence. The jobs and training gen- 
erated in the United States by the Neighborhood 
Youth Corps will be made available to youths in 
the urban ghettos, based on the theory that the 
provision of summer employment opportunities can 
serve to avert or at least minimize the occurrence 
of riots. 

Allocation among the social services. Allocative 
decisions must be made in regard to the distribu- 
tion of resources among the various sectors that 
constitute the social welfare arena, such as health, 
education, housing, and income maintenance. 
Thus, a developing country may need to choose 
between an investment of resources in the area of 
pensions for old people or educational programs 
for the young. It can, of course, attempt to dis- 
tribute its resources according to some principle of 
balance that fixes the relative importance of these 
objectives, and so cuts short the debate over 
priorities. 

The problems of choice are also confounded by 
the interrelationship among the social services. A 
study of social services in Mauritius illustrates how 
a policy of denying public assistance to the able- 
bodied unemployed created pressure on the medical 
care system. Unemployed persons sought to acquire 
sickness certificates in order to qualify for welfare 
payments. Thus a policy choice in the area of pub- 
lic assistance had the effect of creating excessive 
demand on the medical care system (Titmuss & 
Abel-Smith 1961, p. 81). 

Allocation within a single social sector, Alloca- 
tive decisions must also be made within any one 
specialized, functional area, such as health, educa- 
cation, or housing. Two types of problems can be 
broadly identified: the distribution of resources by 
domain and by clientele. Questions of domain con- 
cern the distribution of resources by type of service. 
For example, to what extent should resources be 
allocated for curative as contrasted with preventive 
programs? The question of clientele concerns the 
issue of determining who should be the major ben- 
eficiaries of the program. The development of uni- 
versal programs designed to reach the total popu- 
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lation does not avoid the need to make choices. 
It is widely acknowledged that upper-income groups 
avail themselves more of education and other high- 
cost social services (although the case for other 
sectors has not been as well documented) than do 
lower-income groups, “with the result that these 
services . . , represent in their mode of operation a 
redistribution of income away from the poor” 
(United Nations . . . 1965, p. 53). For example, 
as a rule of thumb, America spends less than half 
as much educating the children of the poor as the 
children of the rich (Jencks 1966, p. 18). Since 
universal programs result in inequalities, allocative 
policies aimed at redistribution may be necessary. 
This is becoming increasingly recognized in legis- 
lation, such as the Elementary and Secondary 
School Act of 1965, designed to distribute federal 
funds to schools with a substantial number of low- 
income families. 

If we accept the traditional compartmentaliza- 
tion of programs as given, allocative decisions may 
typically be concerned with the distribution of re- 
sources by functional domain. However, allocations 
assigned in terms of traditional sectors have severe 
limitations; this approach often produces ineffective 
and inefficient solutions of social problems, Con- 
sider the following case-in-point. National levels of 
health, defined in terms of low infant mortality 
rates, are more highly correlated with measures of 
income, school enrollment, and calorie consump- 
tion than they are with the number of physicians 
in the country. This Suggests that, to maximize 
health, it may be necessary to invest in nonhealth 
programs. “An extremely important requirement 
for improving health in a given country may be 
the improvement of the nutrition of children, if, 
as frequently happens... , malnutrition . . . is 
leading to numerous diseases of adulthood, as well 
as to excessive morbidity and mortality among chil- 
dren” (United Nations . , , 1965, p. 55). Thus, if 
planning to maximize health takes into account 
only such matters as health facilities and health 
personnel, it may be ineffective or may achieve its 
aims in a very inefficient fashion. 


Politics of allocation 


The problems of allocation are the abiding issues 
of social policy planning. They pose the fundamen- 
tal questions of the Purposes and effectiveness of 
the social services, However, while the allocative 
questions are the most crucial, they are also the 
most intractable and the least well understood. By 
and large, we have developed neither the admin- 
istrative machinery nor the technology for making 
allocative decisions, So broad a generalization may 


seem arbitrary at a time when there is great in- 
vestment and interest in policy-oriented research 
that goes beyond the traditional needs—resources 
approach into the more sophisticated cost-benefit 
analysis (Levine 1966). But most of these ap- 
proaches fail to resolve at least two fundamental 
problems endemic to the social services. 

First, programs for converting social values into 
economic terms continue to have a disingenuous 
ring to them. A cost-benefit analysis for an urban 
renewal program that attempts to convert the loss 
of aesthetic pleasure of enjoying greenery into some 
monetary equivalent must make assumptions that 
are arbitrary, forced, and, in the end, unconvinc- 
ing. We have no universally accepted social arith- 
metic for converting values into dollars. Measures 
of social achievement lack a common scale of 
measurement that would permit us to add the ben- 
efits of programs, to delete the costs of various social 
problems, and to assign relative values, weights, 
and priorities to different social programs. In short, 
we have no rules to guide us in choosing among 
educational, health, welfare, housing, and other 
programs. Most attempts at planning for social 
services are forced to bypass the issues of priority. 

The second problem concerns the nature of social 
objectives. Economic models tend to be fashioned 
on the assumption that economic policy is analo- 
gous to a theory of rational consumer choice; 
according to this theory, economic man seeks to 
maximize his “utility” or preferences within the 
constraints of his income and the prevailing prices 
of the goods and services that he wishes to acquire. 
But the concept “need” does not lend itself to this 
type of rational calculation: what a man PE 
is by definition his preference, but social nee ‘ 
imply necessities, and a man may not choose wha' 
he needs or what others think he needs. He may 
also lack the freedom to choose, because he we 
the necessary funds, information, or access to a 
services he requires. Moreover, the economic mo R 
is oversimplified, based as it is on the maximize 
tion of a single goal, which is defined as ae 
and measured by preference functions and in i 
ference curves. [See Urixiry.] Social life tends ai 
be more complex than this. Allocative aee 
involve the pursuit of multiple social goals, s ach 
of which are in partial conflict either with “ie 
other or with economic and physical objectives, cial 
all of which, as this discussion of the goals of K i 
policy has attempted to illustrate, are difon 4 
identify and select. Thus we cannot, in ns i 
develop a single best social plan. We are dri re are 
the more unsatisfactory conclusion that thei 


eg'tO 
only multiple plans, which will vary according 
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which items we accept as constraints for the 
achievement of certain objectives. For example, 
we can ask what level of inflation we are willing 
to accept in order to achieve full employment, thus 
treating price stability as a constraint and full 
employment as the goal; or we can ask the reverse 
question. 

All of these problems confound the task of al- 
locative planning. The secretary-general of the 
United Nations has effectively summarized these 
issues in a report to the U.N. Social Commission: 
“,, social planning does not have the unity and co- 
herence of economic planning. . . . Social allocation 
is ultimately a matter of judgment, operating among 
incommensurable values” (United Nations . . . 
1965, pp. 37, 39). These judgments, of course, 
are not advanced as arguments against the system- 
atic gathering of evidence-and data, which can lead 
to judgments based on information. But they do 
suggest that, in the last analysis, allocative deci- 
sions are based on value judgments—that is, there 
is no planning without ideology and politics. It is 
possible to identify at least four ways in which 
such decisions are made. 

First, tradition plays an important role. In an 
analysis of the allocative process used by the 
United Fund organizations, Arthur Vidich observes 
that “the formula of tradition tends to work, as no 
unit is apt to feel unduly cheated if last year’s 
Proportions are carried over into this year” (1963, 
chapter 2, p. 6), Second, preference or pressure 
can play a significant role in the determination of 
allocative decisions. Since, as we have seen, there 
is no objective way to measure the relative need 
for housing as against education or health, knowl- 
edge of the preferences of consumers could, in 
Principle, serve as one effective means for resolv- 
ing these dilemmas of choice. Preferences can be 
Secured either through some form of planning that 
is concerned with the channeling of the needs and 
aspirations of potential service uses (i.e., planning 
for citizen participation in policy decisions) or 
through the use of survey research methods that 
attempt to gather information on preferences and 
then treat these as inputs for planning decisions. 

hus, citizen participation in social planning and/or 
ae, development of measures of consumer pref- 
ike an can serve as purposive techniques for mak- 
a foe judgments. One of the unique features 
Eivo merican “war on poverty” is the attempt 
tial ete in policy making those who are poten- 
ie sumers of antipoverty programs. There is, 
ning ae the unfortunate tendency in most plan- 
has orts to involve the citizens only to the extent 
Sking them to endorse plans already prepared 
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in advance rather than to use citizen groups for 
the initiation of policy ideas (Wilson 1963). There 
has been little attempt to use consumer preference 
surveys in the determination of policy. Admittedly, 
the development of such surveys poses a complex 
technical task, but their potential as an instrument 
of policy has not been exploited. 

To the extent that preferences do play a role in 
policy development they usually emerge when or- 
ganized interest groups for the aged, veterans, and 
other recipients of social services exert pressure on 
those who control allocative decisions. In the vol- 
untary sphere, the selection of fund-raising pro- 
grams that are likely to appeal to the community 
is, of course, one way of implicitly taking prefer- 
ences into account, although these preferences are 
not necessarily those of the consumer. However, 
allocative decisions that are based on considera- 
tions of compassion, the popularity of the program, 
or community pressure may have little to do with 
the severity or pervasiveness of the social problems 
for which funds are raised, and they fail to take 
into account the needs of socially unpopular groups 
—that is, the ones that, like the American Indians, 
are unable to exert much pressure on their own 
behalf. 

Allocative decisions are often avoided altogether, 
since policy, in response to different pressures, 
tends to develop piecemeal—a process that is often 
called proliferation by welfare planners. Prolifera- 
tion, however, can itself be regarded as a strategy 
of allocation, particularly at the early stages in the 
development of new programs. Thus if we seek to 
expand the amount we spend in the area of, say, 
manpower training, we tend to proliferate many 
new programs and to créate new agencies, which 
often lack adequate coordination with previously 
established efforts. Each program may even be sup- 
ported under a separate piece of social legislation, 
as in the case of current training programs for 
disadvantaged youths in the United States. This 
form of allocation by accretion may seem a very 
irrational process after a sufficient number of these 
programs have been developed, but some prag- 
matists believe that it can also be an effective 
means of mobilizing resources for these services. 
Others, however, regard the way in which federal 
manpower programs have developed as the result 
of vested departmental interests and their clienteles 
(see especially Levitan 1966). Whether regarded 
as strategy or necessity, administrative dysfunc- 
tions arise when fragmented but interrelated serv- 
ices are not brought together into some coherent 
whole. This leads to pressure for coordination. But 
coordination increases visibility, and when new 
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funds are sought in competition with other claim- 
ants there may be pressure to further fragmentize 
services as a technique for securing resources. How- 
ever, specialization may arise from other pressures 
as well. 


Planning for coordination 


The United States has an enormously complex 
and varied system for distributing social services. 
First, there is a three-tier vertical system, in which 
some services are distributed by sponsors adminis- 
tratively located at the national, state, or local 
levels. The three hierarchical tiers are bound to- 
gether by financial, administrative, legal, and pro- 
fessional loyalties. The ties may be loose, as in the 
case of federated structures, where local operations 
are autonomous and create a national body to serv- 
ice their needs; or tight, as in the case of corporate 
structures, in which the locals are branch offices 
of a national agency. Within the boundaries of any 
one tier, there is a horizontally organized system, 
which can be sorted by auspices (public, voluntary, 
or private), or by functional specialization (health, 
education, housing, etc.), or by the type of clien- 
tele serviced (classified by age, problem, income 
grouping, etc.) and by the skill performed (teach- 
ing, medicine, social work, etc.). 

Moreover, the instruments or means for carrying 
out these functions tend to take on varied forms. 
The government may directly operate services, or 
it may purchase them from voluntary and private 
agencies or professionals in the form of subsidies, 
grants, loans, or vendor payments. The latter pro- 
cedure helps to support private and voluntary activ- 
ity. The government may subsidize the facilities 
themselves, as in the case of low-cost housing pro- 
grams; alternatively, it may provide individuals 
with subsidies, grants, or transfer payments so 
that they can purchase services in the open market, 
as in the case of unrestricted rent subsidies. 

Even so condensed a summary of the organiza- 
tion of social services Suggests its great variety and 
complexity. The components of the structure are 
not only complex but are also interrelated; pro- 
grams cannot be treated in isolation, for they affect 
each other in at least two ways. First, the work 
loads of agencies in the same community do not 
vary independently. Thus, a protective agency, 
eager to conform to the principle that a child 
should not be removed from its own home, may 
assume that it has achieved its goal by decreasing 
its petitions to the courts, but as its court referrals 
decline, other agencies may begin to refer the cases 
of the protective agency to the courts, with the 
result that the total volume of court referrals re- 
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mains unaffected. Second, many different activities 
are relevant to the solution of a single problem, 
Thus, as we have suggested, to promote the objec- 
tive of health we may need to create indirect pro- 
grams that are directed at expanding income, 
housing, education, nutrition, etc. The complexity 
and the interrelatedness of the social service sys- 
tem creates obvious pressures for coordination to 
reduce competition and promote coherence. 

But the term “coordination” itself remains curi- 
ously vague and abstract, and it is more often used 
as a slogan than as a strategy for solving a specific 
problem in the organization of social services. Co- 
ordination strategies are, in the main, bound bya 
common problem: how to bring services into better 
harmony without reducing the autonomy of the 
component agencies and professions. In the follow- 
ing discussion of coordination, I will develop a 
typology of types of problematic situations that 
create the need for coordination and then examine 
the nature of the coordinative strategy that has 
been created, as well as the problems that are still 
unattended or are created by the very efforts at 
coordination. 

In examining the problems of coordination, it 
may be useful to set our discussion within the 
market terminology of a supply system of social 
service agencies and a demand system of con- 
sumers or recipients of services—that is, a social 
service delivery system. This analogy should mot 
be pressed too far, however, for many services 
do not follow a demand pattern and are imposed 
on individuals regardless of their preferences, as 
in the case of programs for delinquents or the 
mentally ill. Use of the term “service” minimizes 
the function of welfare as an instrument of ae 
munity protection. Furthermore, “consumer de 
mand” is too global a term, for it masks one of the 
most interesting characteristics of the delivery T 
tem: that the recipients of social services can E 
selves be classified, according to the agency whic! 
distributes the services, as clients, patients, Er 
tomers, victims, or deviates. These labels also E 
professional dispensers into a status nie z 
thus can affect the quality and the nature © be 
treatment that individuals in similar circum 
will receive. Moreover, many services ee a 
only as devices for meeting demand but jewel 
rationing systems designed to limit either te 
ume or the quality and extensiveness of the aln 
that each person receives. Services a A 
scarce resources in terms of money, faci He E 
professional skill. Despite these ea it per- 
above analogy is nevertheless useful pe 
mits us to classify coordinative problems 
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of disjunctures between the supply and the demand 
system. Within this frame of reference, six “deliv- 
ery problems” will be examined. 

Service inundation. The same client is often 
known to many community services, each of which 
may send a worker into the home of the family 
and may hold out to the family a set of expectations 
for desired social performance and exert sanctions 
for nonconformity. In the United States in the late 
1940s, Bradley Buell’s studies of “problem families” 
(Buell et al. 1952) directed attention to the issue 
of service inundation. He suggested that 6 per cent 
of the families in St. Louis were receiving more 
than half of the services offered by the community. 
However, in Buell’s studies the term “services” was 
never adequately defined, for it included long-range 
continuing contact with the agency, as well as brief 
contact only at the point of intake. This ambiguity 
in the definition of services and the absence of data 
on types of services used make it plausible to sug- 
gest that Buell’s findings can be restated as follows. 
A small proportion of families make contact with 
many agencies, most of which refer the family to 
other agencies, instead of offering them quality 
services. Thus families in need bounce from agency 
to agency, with no one assuming responsibility and 
accountability for them; the experience of these 
“multiagency families” reflects as much the frag- 
mentation of the social service system as the pathol- 
ogy of the families. This fragmentation of services 
became the subject of debate within the social 
work profession in the early 1960s, and attention 
shifted from the problem of how the lack of coordi- 
nation leads to clients becoming “overserviced” to 
how the same lack created the problem of clients 
who were “underserviced,” or rejected and un- 
wanted, 

Strategies to reduce “inundation” appear to be 
Concerned with rationalizing the function of the 
Professional agents and with developing coordina- 
tion through a more coherent personnel ‘policy. 
Planning is thus directed chiefly at substituting 
Simultaneous visiting for serial visiting and at cut- 
ting down duplication of effort by workers located 
in different social agencies who perform similar or 
oe tasks (for example, when each agency 
bite its own family history to determine eligi- 
on or a specific service, although one history 

Serve the needs of most agencies). 
oo broad strategies for minimizing inundation 
hae identified —reducing the number of workers 
ER a the same family, or enhancing the com- 
e tion among workers. The Service of Coordi- 
ieee in Paris pushes the first principle to its 
extreme. It achieves coordination by three 
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simple but radical principles: only one social worker 
is assigned to a given family; each worker is re- 
sponsible for all families in a defined geographic 
area; and no work is done twice (Schorr 1965b, 
p. 38). Thus, if a family caseworker believes that 
a family is in need of a supplementary financial 
allowance, she can secure the additional grant 
without a second investigation by a welfare worker. 
In 1967, the feasibility of reorganizing personal 
social services was being discussed in England by 
the Seebohn Committee. The changes most fre- 
quently discussed seemed to require combining the 
various separate departments and services into one 
ministry of social welfare, with status equal to that 
held by the existing ministries of education and 
health. This new ministry, or department, was to 
be organized on the basis of the skill to be per- 
formed, and was to be founded on the principle of 
merger rather than of coordination, so that the in- 
dependence of established agencies could be re- 
spected. Exactly how the boundaries of such a 
welfare service might be drawn and which func- 
tions were to be included had not yet been agreed 
upon. However, it was thought that such a de- 
partment would probably service other major 
departments by allotting social workers to courts, 
schools, hospitals, clinics, welfare departments, and 
other such agencies. One of the major purposes of 
this reform was seen as the coordination of scarce 
manpower skills (Titmuss 1966; Great Britain... 
1966). 

In the United States, numerous attempts at ex- 
perimenting with similar programs can be identi- 
fied. For example, under the auspices of the New 
York City Youth Board an attempt has been made 
to create an interdepartmental neighborhood serv- 
ice center. The organizing principle of this service, 
as described by its director, is the assumption by 
one worker of responsible concern for the total 
family (Lampkin 1961, p. 8). Most such efforts 
have succumbed under the pressure of established 
institutions after having survived, with little suc- 
cess, for only short periods. 

Somewhat less radical than such administrative 
surgery are strategies that attempt coordination 
through increased communication. The social serv- 
ice exchange is an example of an attempt to create 
a central repository of information. Such exchanges 
operate under the assumption that if workers could 
identify the other community agencies that have 
contact with their clients, they would, in fact, con- 
tact them and work out some formal or informal 
coordination. When many voluntary agencies were 
distributing money rather than services, this seemed 
an efficient way of preventing the same person 
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from securing aid twice. Case conferences provide 
another example of an organized structure whereby 
many workers from different agencies have an op- 
portunity to meet to discuss their contacts with the 
same families, Often such conferences lead to a 
decision that one worker should assume responsi- 
bility for maintaining the ongoing contact with a 
family and so be able to keep the other agencies 
informed of the family’s progress. (For a detailed 
case study of the inundation problem and the strat- 
egies used by community agencies to reduce it, see 
Willson 1961.) 

These strategies seem to emphasize cooperation 
of workers rather than coordination of services, A 
major difficulty with these strategies is that many 
agencies that have significant contact with the 
family refuse, or are unwilling, to participate in 
joint deliberations or in the common sharing of 
staff. Nevertheless, the search for a polyvalent 
social worker continues as one strategy for reduc- 
ing the inundation that specialization seems to 
create, 

Problems of access and procurement. Whereas 
inundation involves multiple contacts by many 
agencies with the same family, the problem of 
access concerns the obverse problem of how to 
secure needed services within a highly fragmented 
and specialized service system, in which each 
agency defines its own service boundaries. When a 
family needs a nursing home service for an aged 
member, remedial services for a child having diffi- 
culty with reading, or a summer camp, it is likely 
to encounter the problem of access. In the situation 
where the consumers initiate the requests for these 
services, the social service delivery system comes 
closest to the demand situation of the market sys- 
tem; this is most likely to occur in cases where the 
recipients can be classified as customers, clients, 
or patients rather than victims and deviates. Access 
problems are exacerbated when a family has little 
income to purchase these services and must rely 
on the nonmarket system of public and voluntary 
social services, 

Three broad strategies to facilitate access can be 
seen. Some coordinative Strategies focus on an 
effort to educate the client, thus assuming that 
lack of access is a function of lack of information, 
Typically, they rely on devices such as information 
and referral services to inform consumers as to 
where available services can be secured. Scattered 
evidence shows that “arrival” rates from these re- 
ferral services tend to be quite low (Furman et al. 
1965). Such findings suggest the need for supple- 
mentary approaches, 

A second approach is based on the assumption 
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that although information is necessary, it is insuffi- 
cient. According to this view, the referring agent 
must himself play a more active role as negotiator 
of the service jungle on behalf of the service con- 
ice that is requested. Some experimentation with 
knowledge of how the system works and with some 
power to accomplish his special aims. The political 
ward leader is a prototype of this kind of expediting 
agent, for he is often willing and able to apply 
pressure on appropriate agencies and to follow 
through to assure that his client receives the serv- 
ice that is requested. Some experimentation with 
indigenous workers serving as expediters has at- 
tempted to follow what is at least implicitly a sim- 
ilar model (Brager 1965). 

A third strategy attempts to assure access by 
altering the structure in which social services are 
performed. In the United States, a new federal 
program providing neighborhood service stations, 
financed under the Community Action Program of 
the Economic Opportunity Act of 1964, has tried 
to create a new structure within the local com- 
munity, in which many community agencies estab- 
lish branch offices in the same facility. Thus a wel- 
fare department, an employment referral agency, 
a family guidance program, etc., will all locate 
their services in a single structure, providing some- 
thing like a social service variety store, in which 
a range of services are available under a single 
roof. Ideally, a single intake worker is available to 
route the client to the appropriate service. This 
strategy is a form of coordination by physical pro- 
pinquity; a client need only stop in one center to 
secure the necessary range of services he requires. 

These coordinative strategies typically suffer 
from what has been called the problem of “green 
tape”: where the total volume of services in a S 
munity remains constant, the strategy may only 
alter the pattern of claims (Rein & Riessman me 
That is, it may provide preferential treatment fe 
its constituency at the cost of assuring that ie 
other individuals will not be able to secure acces 
to these services. 

Access for community agencies. 
access must be seen not only in term 
for citizens to secure services but also in fone pi 
the need of agencies to secure social sevi 
behalf of their clientele, especially those who t 
victims and deviates. Community agencies ai 
service an unwanted or unpopular population er 
the problem of access when they try to secur H 
them the services they need. Such agencies ae 
that in order to carry out their mission they sat 
the cooperation of other community agencies. esc 
a public welfare department must secure 
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operation of family counseling, employment, and 
health services in order to implement the national 
policy of reducing dependency. Similarly, a redevel- 
opment agency involved in the problem of the 
physical renewal of the city may need a range of 
community services in order to make possible the 
relocation of difficult-to-place tenants, such as those 
with very large families or with special problems. 
When an agency tries to secure different types 
of services on behalf of its clientele, these efforts 
may be labeled as “comprehensive planning.” How- 
ever, comprehensiveness is often the last stage in 
a series of earlier efforts to solve the problem of 
access. The agency may start first by referring its 
clientele to other community agencies; but if the 
referral process fails, it may find itself impelled to 
undertake these tasks in its own operation. Thus a 
settlement house may create a psychiatric clinic, 
a health service, or a remedial reading program. If 
funds are available, the agency may purchase the 
service from established community agencies, or 
alternatively, it may finance programs intramurally. 
Occasionally, new structures are created to pro- 
vide or facilitate services for an unwanted popula- 
tion. Thus the comprehensive planning financed 
in the United States under the Juvenile Delinquency 
and Youth Offenses Control Act of 1961 was ori- 
ented to securing relevant services on behalf of 
those who are in danger of becoming delinquent— 
that is, disadvantaged youths living in urban 
ghettos. In this context, the term “comprehensive- 
ness” is best interpreted as the effort to get com- 
munity agencies to provide services on behalf of 
the population of disadvantaged youths. Similarly, 
the 1962 amendments to the U.S. Social Security 
Act encouraged the creation of services for the 
underserviced population receiving public assist- 
ance. Planning for access could involve purchasing 
the needed services from established agencies. 
However, because resources are limited, such plan- 
ning often relies on the use of small-scale demon- 
stration projects, in the hope that the agency in 
Which the demonstrations are placed will come, in 
time, to adopt the reforms it has started. Alter- 
natively, it can work outside of established institu- 
tions (parallelism), or it can operate the services 
directly, All approaches have been used. 
A major problem created by comprehensive plan- 
ee is that it leads, in the short run, to competition 
or resources, as each community agency tries to 
ae the relevant services for a specialized pop- 
Fee at risk—victims of relocation, the poor, 
‘antaged youths, public assistance recipients, 
ie ‘rama ill, etc. Without a major increase in 
otal amount of resources available, these agen- 
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cies may find themselves in competition for limited 
personnel and facilities. 

Resource competition. One way of defining co- 
ordination is to say that it ensures that a varied 
range of services, frequently technically different 
or separately administered, are planned so as to 
operate as interrelated parts of a total treatment 
process in meeting family need or in remedying 
breakdown. It is surprising how little theorizing 
has been done about the kind of program schedul- 
ing that is necessary to contribute to meeting 
family need and to preventing social problems. 
Most sociological thought and research has tended 
to fragment the individual between the worlds of 
work, school, family relationships, community, and 
peer relationships. But much less work has been 
done on how the scheduling of events in these 
various worlds affects the capacity of an individual 
to function in one or another of them. Thus what 
is obscure in this approach to coordination is the 
failure to specify how varied services are or ought 
to be linked if they are to meet need or remedy 
personal and social breakdown, Without some 
scheme for ranking the relative importance of 
needs or problems, and without some procedure 
for determining the priorities to be allotted to vari- 
ous services, the task of integrative planning must 
proceed without benefit of theory and thus without 
benefit of any rules for reducing competition 
among claimants for scarce resources. Moreover, 
in the social services there is no theoretical analogy 
to the general medical practitioner who treats the 
“whole” person, exploiting the skills of specialists 
but retaining accountability for the total condition 
of the patient. While many professionals claim 
unique competence to manage the task of central 
coordination of social services, none have been as- 
signed to fill this role. Thus to talk of “human 
need” or the “whole person” obscures the dilemma 
of choice confronted in developing strategies of 
coordination. 

Clearly, what is needed is a system of priorities 
for allotting resources that is based on function, 
age, social problem, etc. Although we lack the 
means for resolving the value and jurisdictional 
issues of resource competition, we do have plan- 
ning instruments that can either formally or in- 
formally perform the function of arbitrator and 
coordinator of competing requests for service. The 
U.S. Bureau of the Budget at the federal level is, 
of course, one case-in-point. Acting as the planning 
arm of the executive branch, it frequently inter- 
feres with the operation of specialized departments, 
encouraging them to cooperate and to avoid dupli- 
cation of effort. But in the end the final allocative 
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decisions are made in Congress through what is 
essentially a political process. 

The problem of discontinuity. Social service 
can become disjointed or discontinuous if there is 
a failure to provide component services that are 
necessary to complete the cycle of change. This 
situation arises when programs assume coherence 
only when they are linked with other activities. An 
obvious case is that of job training, which includes 
the tasks of recruiting, screening, training, and 
placement. Unless these components relate to each 
other, the program lacks coherence. In practice, we 
often create separate agencies to carry out these 
specialized functions. Consider the case of man- 
power training for disadvantaged youths in the 
United States. The screening and recruiting tasks 
are, in part, the function of a community action 
program; a pre-skills training program (training- 
for-employability) might be carried out by the 
Neighborhood Youth Corps program or the Con- 
servation Job Corps; the actual training task is 
often done by a program administered under either 
the Manpower Development and Training Act or 
the Urban Job Corps; and, finally, the job place- 
ment of graduates of the training program may 
be administered by the state employment office 
which, in turn, carries the dual responsibility of 
job placement and job development. 

Problems of discontinuity can arise at any point 
where there is an interruption in the flow of the 
person through the total system, starting with re- 
cruitment and ending with placement. One of the 
typical bottlenecks has been the transition from 
pretraining into training programs because of the 
lack of training facilities. and another has been 
the transition from training into job placement, 
often because of the failure of placement agencies 
to pay sufficient attention to the problems of job 
development. 

Examples of discontinuity can be found in the 
flow of patients through a mental hospital program, 
Continuity of care is defined by Schwartz as “plan- 
ning of a patient’s treatment so that the help given 
at any point is part of a total program. ... A num- 
ber of programs for in-patients, out-patients, and 
ex-patients have continuity of care as a goal, 
though phases of treatment are administered by 
Separate agencies, It is rare for a single program 
to encompass all phases of a mental hospital 
patient’s career” (Schwartz et al. 1964, p. 257). 
Continuity of care is indeed a crucial issue, since 
psychiatrists acknowledge that a patient’s release 
from a mental hospital does not necessarily mean 
that he is cured. Thus post-hospitalization activi- 
ties can be legitimately defined as parts of treat- 
ment. Indeed, the theory of community psychiatry 


rests on such assumptions [see MENTAL DISORDERS, 
TREATMENT OF, article on THE THERAPEUTIC 
COMMUNITY]. 

Strategies to reduce discontinuity seek to max- 
imize coherence among specialized agencies that 
perform part of what is, in effect, a single task, 
It should be noted that the goal of “coherence,” 
which means the provision of closely related sery- 
ices, contrasts with “comprehensiveness,” which 
emphasizes the provision of a range of different 
services. In examining strategies to reduce discon- 
tinuity it is useful to specify at least three inter- 
related tasks: service entry; training or treatment; 
and reabsorption or placement. Whatever the 
ideals, planning for continuity tends, in practice, 
to focus on either the entry or the absorption prob- 
lem. Thus job training programs for disadvantaged 
youths emphasize recruitment, largely because 
training and placement opportunities have been 
limited. On the other hand, planning for mental 
health or the reduction of delinquency directs atten- 
tion to the problem of absorption, largely because 
of the concern about high recidivist rates. 

The increased concern for community care, or 
outpatient treatment, makes the problem of ab- 
sorption quite crucial for hospital and correctional 
institutions. Ideally, the goal is to provide the full 
range of services. In the United States the Com- 
munity Mental Health Facilities Act of 1964 has 
attempted to initiate planning that focuses on the 
creation of structures which can process individ- 
uals from diagnosis, through treatment, to rehabili- 
tation, and back to treatment institutions if neces- 
sary. They have encouraged the erection, where 
appropriate, of a single facility that offers in- 
patient, outpatient, and aftercare programs. 

Planning for continuity can, however, create 
problems of duplication and fragmentation. Con- 
sider the following case-in-point. The U.S. Job 
Corps had, by the summer of 1966, enc 
projected average annual cost per person enrolle 
of about $7,500, and another $600 per person 
enrolled when the capital costs were taken me 
account. In order to protect this investment, it Mies 
considered necessary to create special ae 
counselors who would assist the graduates of Be 
program in securing jobs. The Bureau of se 
Budget questioned whether these placement oe 
selors should be assigned to the Job Corps or to ; 
state employment service, and it was ae 
agreed that they would be placed in the Emp’ a 
ment Service but that the counselors would g 
Priority to the graduates of the Job Corps. Ore 

Entitlement and protection. One of the dered 
intractable problems of delivering services = ad 
by bureaucratic agencies to clients is that 0 
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suring that clients receive the services that they 
are entitled to, and a related problem is that of 
protecting clients from unfair practices of the in- 
stitutions which are designed to serve them. The 
entitlement question arises when individuals are 
denied services because of decisions that are based 
upon the judgment or discretion of the profes- 
sionals who operate the services. For example, some 
potential clients are denied public assistance pay- 
ments because they are judged to be “ineligible.” 
The definition of eligibility is typically subject to a 
variety of interpretations. 

The problem of protection arises when bureauc- 
racies either abuse their power or carry out their 
mission in such a fashion as to inadvertently create 
difficulties. For example, many school systems have 
developed ability groupings, or a “track system,” 
that sort individual pupils into four tracks—honors, 
college preparation, regular, and basic. The so- 
called basic track is designed for those individuals 
who have subnormal intelligence and a demon- 
strated incapacity to perform intellectual tasks. A 
recent restudy of 1,273 youths assigned to the 
basic track in Washington, D.C., revealed that 886 
youngsters were placed in the wrong track; that 
is, they were found to be capable of doing “regular” 
work (Hechinger 1965). 

A great deal of interest has been shown in cre- 
ating instruments for assuring entitlement and 
consumer protection. I have defined these as co- 
ordinative strategies because they attempt to coordi- 
nate the preferences and rights of clients with the 
performance of institutions. For example, in the 
United States there has been some discussion about 
the feasibility of employing an ombudsman, who 
would act as an impartial agent to receive com- 
plaints made by clients against the service bureauc- 
Tacies. Neighborhood legal services sponsored by 
the U.S. antipoverty program have broadened the 
definition of legal services for the poor so as to in- 
clude not only protection in criminal cases but 
also protection from the operations of service 
agencies. For example, in New York City the Mobi- 
lization for Youth agency has developed a vigorous 
Program that provides legal counsel for public 
assistance clients who have been designated as in- 
oe because of residence requirements; all of 
aa ae! that have been brought to court with the 

d of this program were finally judged to be 
eligible for assistance. 
ce structures. Coordinative planning re- 
Geet a planning structure that can collect and 
the aoe the resources of various agencies to solve 
sing ER delivery problems. However, most plan- 
te e uctures have a common constraint, in that 

annot reduce the autonomy of community 
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agencies, nor can they control the base budgets of 
agencies, although they can supplement these 
budgets as an incentive for cooperation. For ex- 
ample, the antipoverty program in the United 
States has created new community centers for 
comprehensive planning (community action agen- 
cies). These centers will function as funnels for 
federal funds that are now being made available 
directly to localities, as contrasted with the earlier 
pattern of grants-in-aid that were allocated through 
the states. However, the question of the appropri- 
ate structure for these new coordinative planning 
instruments remains highly controversial. Thus, 
important policy issues have emerged in connection 
with several problems, which can be briefly sum- 
marized as follows: how to create a viable struc- 
ture that can overcome the jurisdictional problems 
which arise in forging a coalition of established 
but autonomous service institutions; how to pro- 
mote radical democracy which respects the pref- 
erences of service consumers; and how to create 
rational problem solving based upon research and 
planning. Moreover, when these three elements 
are combined, they tend to conflict. Most planning 
structures struggle only with the effort to reconcile 
the clash between established power and rational 
knowledge, since the consumer of the service is 
seldom formally involved in planning at the policy 
level. In contrast, these new community action 
programs are confronted with all three problems, 
but especially with the conflict between the estab- 
lished institutions and the consumers of social 


services (Marris & Rein 1967). 
MARTIN REIN 


[Directly related are the entries SOCIAL WORK; WEL- 
FARE ECONOMICS; WELFARE STATE, Other relevant 
material may be found in AGING; BLINDNESS; COM- 
MUNITY, article on COMMUNITY DEVELOPMENT; 
DELINQUENCY; Housinc; MEDICAL CARE; MENTAL 
HEALTH; POVERTY; UNEMPLOYMENT INSURANCE; 
and in the biographies of BEVERIDGE; WEBB, SIDNEY 
AND BEATRICE.] 
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PLANTATIONS 


A plantation is an economic unit producing 
agricultural commodities (field crops or horticul- 
tural products, but not livestock) for sale and 
employing a relatively large number of unskilled 
laborers whose activities are closely supervised. 
Plantations usually employ a year-round labor crew 
of some size, and they usually specialize in the 
production of only one or two marketable products. 
They differ from other kinds of farms in the way 
in which the factors of production, primarily man- 
agement and labor, are combined. 

Typically, the plantation is large as farm or- 
ganizations go, but it does not derive its essential 
character from size alone. Many so-called family 
farms are larger in area and often in value of 
product than some plantations, and the larger 
mechanized wheat and cotton farms of the Ameri- 
can West are the equal of plantations in these 
measures and in capital employed as well. 

Plantation organization of agricultural produc- 
tion is not a necessary consequence of large owner- 
ship units, nor does it require large land ownership. 
When extensive methods of cultivation are appro- 
priate, the large holding may be farmed as a unit 
by highly mechanized methods and with a small 
labor force. When more labor-intensive methods 
are economical, the landlord may lease out his 
property to tenants who make most or all of the 
management decisions. On the other hand, a plan- 
tation may lease a number of small ownership 
units to form an economic unit, or numerous land- 
owners may pool their land to form a cooperative 
plantation. A 

Another characteristic frequently associate 
with plantations is a relatively high degree of ye 
tical integration, even of self-sufficiency. This rt 
result when required inputs or processing facilities 
are not available locally and hence must be Hs 
plied by the plantation itself; or when ees 
plants built to supply an international ed 
establish their own plantations in order to a is 
an adequate supply of raw-material inputs. ia 
cal integration and self-sufficiency are, of cours 
not limited to plantations. 


3 romi- 
Privately operated plantations are most Pi 


nent in the growing of tropical tree crops and 
other perennials, particularly if sugar can be in- 
cluded in this category. Of the principal export 
crops of the tropics, coffee, tea, bananas, sugar, 
and rubber come primarily from plantations; and 
plantations are major factors in the production of 
palm products. But small holders produce about 
one-half of the world’s oil palm products and are 
the principal producers of cocoa, rice, and, in the 
tropics, of peanuts and cotton. Small holders in 
the tropics also produce significant supplies of 
bananas, coffee, sugar, rubber, and copra. Planta- 
tions, moreover, are not confined to tree crops or 
to tropical crops. In the tropics, for example, they 
have been important producers of manioc (tapioca), 
and in the temperate zones, of cotton. The ap- 
parent association of plantations with perennials 
results, at least in part, from the difficulty of 
mechanizing the production of these crops; in the 
tropics this difficulty stems largely from the 
shortage of skilled labor. Degree of mechaniza- 
tion, quality of the labor supply, and the impor- 
tance of tree crops may, of course, all be inter- 
related. 

Private plantations were the principal source of 
increasing supplies of tropical agricultural com- 
modities throughout the seventeenth, eighteenth, 
and nineteenth centuries. The early twentieth cen- 
tury brought a great expansion of tropical planta- 
tions to supply European and American demand 
for such exotic commodities as coffee, tea, cocoa, 
bananas, palm and coconut oil, and rubber. The 
Private plantation was still a major source of tropi- 
cal commodities in the 1950s, especially in Asia 
and Latin America. 
> Post-World War m developments. The planta- 
tion depends on a plentiful supply of cheap labor, 
not in the sense that its cost is low in relation to 
its productivity but in the absolute sense that 
wages are low because skills are few. For the 
Plantation derives whatever economic advantage 
it has from its ability to mobilize unskilled labor 
to achieve greater economic return. Other special 
advantages of the plantation derive not from its 
greater efficiency in production but from the abil- 
ity to exploit market imperfections or to manip- 
ulate them to its advantage. As the range of 
€conomic Opportunities for rural labor in the un- 
derdeveloped world has increased with economic 
oe and the relaxing of administrative restric- 
an that closed various occupations to specified 
ful groups, it has seemed probable that plenti- 
ee of unskilled labor might no longer be 
ie e to plantations at wages they could afford 

Pay. However, pessimistic predictions of the 
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economic future of plantations made after World 
War u have not been fulfilled, and their number 
and importance have not been much altered by 
market forces; political factors, on the other hand, 
have been a major influence. 

As most of the colonial territories of the world 
achieved political independence in the first two 
decades after World War 1, the economic power 
inherent in large foreign-owned plantations was 
frequently regarded as a threat to the autonomy 
of the new national governments. As a conse- 
quence, new restraints were imposed on private 
plantations, when they were not expropriated out- 
right, and political favors that they had enjoyed 
in the past were withdrawn. 

While the new governments’ concern for inde- 
pendence in the realm of economic decisions made 
profitable plantation operation increasingly diffi- 
cult, another set of circumstances threatened their 
existence as integral units. For a variety of reasons, 
including the desire to assure tontinued popular 
support, nostalgia for earlier ways of rural life, 
and a desire to reduce the excessive movement 
from farm to city with its consequent high rate 
of urban unemployment and civil unrest, there 
was a strong sentiment in many of the new gov- 
ernments for subdivision of large plantations into 
small, individual farming units. Subdivision might 
also, of course, be a device for destroying the threat 
to national sovereignty that private plantations 
were thought to offer. 

At the same time that the shift in political 
power made the future of private plantations more 
precarious, the publicly owned plantation and 
semipublic plantation found increasing favor as 
devices for consolidating political power and for 
accelerating economic growth. Plans for establish- 
ing state farms, collectives, and cooperatives after 
the Soviet or Israeli model were widely advocated 
and frequently adopted. Those large farming units 
that were established to introduce laborsaving 
mechanical cultivation and harvesting fall outside 
our classification to the extent that mechanization 
was achieved. Often, however, the reduction of 
labor inputs fell far short of anticipations, and the 
new government farms, like private plantations, 
continued to employ relatively large numbers of 
unskilled laborers working under close supervision. 
In concept, government plantations of tree crops 
and perennials were very much like the private 
ones. 

A type of quasi-plantation, sometimes called 
group farming, also enjoyed a certain vogue in 
the years immediately after World War 1. Out- 
standing examples are the Gezira Scheme in the 
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Sudan, which had its beginnings in the 1920s, and 
the paysannats indigènes of the Belgian Congo, 
which had their greatest expansion in the 1950s. 
In both, actual farming operations were carried on 
by large numbers of individual farmers—share 
tenants in the Sudan, holders under customary 
tenure in the Congo—whose activities were so 
regulated as to permit the economic employment 
of resources beyond the means of individual 
farmers. 

Economic advantages. Under certain circum- 
stances the plantation, either private or public, 
enjoys distinct economic advantages over other 
types of farm organization. When it is desirable 
to introduce a new technology requiring a radical 
change in cultural practices, the plantation sub- 
stitutes supervision—supervisory and administra- 
tive skills—for skilled, adaptive labor, combining 
the supervision with labor whose principal skill 
is to follow orders. The new technology may center 
on a new crop, af for example, rubber in Malaya, 
or it may simply enhance new methods for pro- 
ducing a familiar crop, as with palm oil in the 
Congo. Among a literate and educated population 
with knowledge of a wide range of farming meth- 
ods and high receptivity to innovation, new tech- 
niques can spread fairly rapidly in response to 
economic incentives, extension services, and the 
efforts of commercial suppliers of inputs and buy- 
ers of products, New techniques and new crops 
have also been adopted in this way when the farm 
population was illiterate and supporting services 
were few but economic incentives were strong and 
the new techniques not difficult to understand. It 
was in just this way that the cocoa industry of 
southern Ghana was established. This method of 
generating new supplies takes time—how long de- 
pends on circumstances outside the control of any 
individual. 

Under the same circumstances, the plantation 
employs a few highly skilled managers who com- 
mand the full technology and supervisory skills as 
well. By combining as many unskilled farm la- 
borers as possible with each skilled supervisor— 
manager, the plantation can initiate the desired 
change at once. 

A somewhat analogous situation obtains when 
the agricultural population is only partially com- 
mitted to the market economy, Then money income 
may be desired only in order to meet specific re- 
sponsibilities, such as taxes or bridewealth, or to 
purchase individual items that cannot be obtained 
otherwise; amounts greater than are needed for a 
Specific payment or purchase are likely to be little 
valued, This Phenomenon, sometimes called target 


demand, is most likely to occur in a society with 
only rudimentary development of the market sys- 
tem. Where it is found, it may be difficult to secure 
a steady flow of raw materials from farms within 
easy reach of processing plants or ports as one 
group of farmers after another satisfies its immedi- 
ate need for cash. A plantation located near the 
demand point can employ a succession of laborers 
who come at their own expense in search of em- 
ployment to satisfy their specific needs. Plantation 
laborers so engaged may be tied by contract to 
assure employers of their services for a minimum 
period of time. 

Given the widespread familiarity with the uses 
of money that has been acquired in the twentieth 
century and the increasing familiarity with, and 
demand for, manufactured goods that has marked 
the years since World War 1, it is unlikely that 
target demand continues to provide the plantation 
with special advantages. 

When introduction of a new technique requires 
heavy investment of soil or water conservation and 
the expected period of amortization is long, the 
recapture of economic benefits requires an eco- 
nomic unit larger than the small farm. The planta- 
tion is one way to achieve this. Other devices are 
available. Projects of this sort have been carried 
through by community effort, with costs borne by 
owners of land that gains from the investment or 
by development districts, with power to tax. Land 
taxes, however, are ineffectual when land is not 
owned in fee simple; they are impracticable, be- 
cause of the cost of collection, when landholdings 
are excessively small. Group farming schemes like 
those mentioned, under which occupation is at the 
pleasure of a central authority, facilitate the impo- 
sition of the burden of such large investments on 
those who benefit directly from them. 

Installations of the type discussed here can be, 
and have been, constructed by private bee 
that rely entirely, as the typical development A 
trict relies in part, on the sale of services to rene 
costs and to realize an economic return. But pee 
the legal code sanctions debt slavery, the sales igs 
tract is likely to be difficult to enforce Ks ea 
farmers have few assets other than their own 1a 
and that of their families. 1 

The complementarity between agricultural ar 
essing plants and farm producing units Be 
one consideration in the establishment of poe 
tions. Many of the major tropical export cae a 
undergo preliminary processing shortly a E 
vesting; for some of them—sugar, for exa Pi 
processing plants large enough to pape in- 
mum costs require a relatively heavy Cap. 


vestment and volume flow of output. Others, like 
black tea, demand careful handling and strict con- 
trol of processing if a quality product is to be 
obtained. 

It is possible, of course, to obtain adequate sup- 
plies of raw product for such processing plants 
from independent farmers if they are sufficiently 
skilled and commercially motivated and if the price 
offered is sufficiently high. If not, the processor 
may find advantage in direct production through 
a plantation organization. In some instances, an 
attempt to recapture costs sunk in an inconvert- 
ible, fixed plant may lead the processor to engage 
in plantation production that would be uneconomic 
otherwise, particularly if income tax laws permit 
him to pool returns from the two enterprises. 


Market imperfections favoring the plantation, 


may result either because merchants in the normal 
channels of trade cannot handle the necessary 
volume of product and maintain its quality as it 
moves from producer to processor or port or be- 
cause, through collusion or monopoly, merchants 
are able to enforce their prices on a multitude of 
small farmers. 

Imperfections of the first kind are critical when 
the quality of the product is highly sensitive to the 
way in which it is handled after the harvest. This 
may explain the prominent part that plantations 
play in production of bananas for export. When 
imperfections of the second kind are present, plan- 
tations that control a sufficiently large supply can 
bargain on equal terms with collusive buyers. 
These, of course, are advantages of size that may 
be shared by large mechanized farms. They are 
most likely to be found, however, in the same eco- 
nomic environment that favors the plantation 
organization; imperfect development of the mar- 
ket mechanism and an uninformed rural popula- 
tion tend to be found together. 

_ Large supplies of high quality can be obtained 
from small producers by paying appropriate pre- 
miums for products that meet the desired standards. 
The most striking illustration of what can be done 
F the achievement of the Nigerian marketing 
aoe in upgrading supplies of palm oil coming 
tom Small producers simply by paying a sufficient 
ye (perhaps an uneconomic premium) for 
il of the desired grade. Proper handling of the 
payee after it leaves the farm can be encouraged 
KE taking measures to improve the general market- 
Bae or by removing the product from normal 
ae oe of trade and entrusting it to a coopera- 

Im arketing association or government agency. 

i patsctions in the capital market probably 
Conomic opportunities in favor of plantations 
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more than do imperfections in the product market. 
They are particularly acute in societies where small 
holders cannot offer their farms or groves as se- 
curity for loans, either because they are forbidden 
to do so by law or because customary land tenure 
makes no provision for alienation. The plantation 
has access to international capital markets that 
cannot be drawn on by small operators and would 
probably not be available to farm credit banks or 
perhaps even to national governments seeking 
funds for small holders. 

Economies of scale in the advanced agriculture 
of the United States result primarily from the use 
of laborsaving machinery that can be most effec- 
tively employed on moderately large areas, But 
even mechanized farms reach a point of constant 
costs at moderate size; for production of most crops 
it is markedly smaller than the larger plantation. 
Where plantations are important, the limited alter- 
native occupations open to the rural population 
make laborsaving devices less attractive. Indivisi- 
bility of machine-capital dictates a minimum size 
of efficient operation in the industrialized coun- 
tries. In economies where the plantation finds an 
advantage in combining large amounts of unskilled 
labor with skilled supervisor-managers, it is the 
indivisibility and high cost of skilled supervision 
that make small operations unprofitable. 

Economies of size also result from other causes, 
essentially from the ability of the large unit to pro- 
vide itself with services that are supplied by other 
agencies in the economically advanced countries. 
In the industrialized countries agricultural research 
is liberally financed by government; in the poorer 
countries of the world it may be undertaken by 
private corporations for the direct benefit of their 
own plantations, an expenditure that would not 
be feasible if the operation were small. Where 
roads and bridges are not provided by the state, the 
plantation may build them, just as it may find prof- 
itable the provision of rail transport, communica- 
tion systems, docks and warehouses, and supplies 
of water, electricity, and other utilities that are 
available from the state or from regulated com- 
panies in the developed countries. This category 
may also include such services as soil analysis and 
application of fertilizers and pesticides, which in 
North America and western Europe can be obtained 
from vendors of the products used, from indepen- 
dent specialists, or from the government. 

It is apparent that economies of size which the 
plantation enjoys and the competitive advantages 
it is able to seize in the product and factor markets, 
all derive from the same circumstances that make 
employment of large gangs of supervised laborers 
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economically attractive—the rudimentary nature 
of the economic order and the limited dissemina- 
tion of knowledge to the population. Imperfect 
knowledge, restricted access to the factors of pro- 
duction, and restricted access to product markets 
create a climate in which this particular form of 
farm organization thrives, 

Political advantages. Other reasons besides ef- 
ficiency in production have caused political admin- 
istrators to look on plantations with favor. They 
have been encouraged as a means of pacifying 
colonial areas and territories being brought under 
a central authority because their capacity to sus- 
tain themselves in economically isolated areas and 
to create the essential elements of infrastructure 
have enabled them to assume some of the respon- 
sibilities of internal security, communication, and 
public services that would otherwise fall on gov- 
ernment, Plantation concessions intended to serve 
these purposes of economic development and exten- 
sion of political power have sometimes included 
police and juridical authority as well. Thus planta- 
tions may serve as an arm of the central govern- 
ment, sometimes the only arm of government in 
frontier areas, In some instances, the exercise of 
political authority has been the principal, and 
farming only the nominal, activity of plantations, 

The plantation type of organization may also be 
attractive to governments seeking revenues to be 
used for fostering economic growth, for enforcing 
or extending political authority, or for the personal 
aggrandizement of the chief of state. It is much 
more difficult for the plantation to resist taxation 
than it is for the small farmer among so many or 
for the shifting cultivator buried in the bush coun- 
try. The taxes to be wrung from a single planta- 
tion are also much greater than those to be gotten 
from the small peasant cultivator who consumes 
the greater part of what he grows and whose small 
cash income may be spent as soon as it is received, 
or buried in the ground. 

Criticisms of the system. A principal criticism 
made of the plantation, as contrasted with a farm- 
ing system dependent on small, independent farms, 
has been that wage laborers on plantations have 
less incentive to perform their tasks well than does 
the independent farmer, whether owner or tenant. 
To the extent that this is true, the obvious explana- 
tion would seem to be that reward corresponds 
more closely to effort when the farmer operates 
his own farm than when he enters into wage em- 
ployment. It is also argued, however, that the small 
farmer exerts greater care in the cultivation of his 
own land because of his emotional attachment to 
it and that the biological character of farming and 
the lack of uniformity of the natural resources it 


employs require that decisions be made currently 
in the field by farmers who have intimate knowl- 
edge of their own farms, of the microclimates in 
which they lie, and of the crops they grow and 
injuries to which they may be susceptible, 

Attachment to the land and to a particular par- 
cel of land is a widespread, though not universal, 
characteristic of human society. But the economic 
advance of all the industrial countries has been 
marked by migration from farm to town and bya 
decline in the farm labor force. In the developing 
countries today, this movement to the cities is no 
less marked. Obviously, attachment to the land, 
however real, has not been strong enough to over- 
come for many the appeal of higher income, greater 
amenities, and sociability that city life can offer. 
The plantation can, and often does, offer to its 
employees many of the advantages sought in the 
town—schools, medical services, shops, entertain- 
ment, modern housing and utilities, even a certain 
social life—and they sometimes provide individual 
farmsteads on which the laborer and his family can 
work for their own interest when not engaged in 
work for wages in the field. Nevertheless, the prob- 
lem of the relationship between effort and reward 
remains, just as it does in all wage employment. 

Even on the small independent farm, however, 
the connection between the work the farmer does 
and the income he receives is not as simple or as 
certain as the incentive argument suggests. For 
sons and daughters in the farm household it need 
not be close at all if tasks and rewards are assigned 
by the parents. Small wonder, then, that farm chil- 
dren are often attracted into wage employment 
where payment always bears some relationship to 
the laborer’s contribution. 

For the farmer himself, effort and skill may be 
assumed to correlate with income over the years, 
but uncertainties of weather, of pests and ba 
and of price often cause the year-to-year correta- 
tion to be erratic. Wage employment can be an 
attractive alternative if it provides a similar arenes 
level of income, if wages are related to the vora 
productive contribution, and if reasonable secu 7 
of employment can be assured. It is where a 
has not been the case that the incentive pee 
carries weight; and far too often plantation a als 
have been absolutely low and wages of indivi m 
not much influenced by productivity. The pes 
formance of collective and state farms in the ate 
Union seems to be a case in point. The alee: 
ernment, in its drive to build the industrial “Tange 
of the economy, made heavy demands on its rie 
farms, with consequent low returns fe Jan- 
workers. Wages of workers on many privat as 
tations, too, have often been no more than € 


to support the laborer himself, with no provision 
for wife and family, and contract employment has 
tended to divorce effort from reward. 

There seems to be no inherent reason why plan- 
tation wages cannot be as effective an incentive as 
industrial wages. If the plantation is unable to pay 
at rates comparable to those that can be earned in 
other employment, there must be considerable 
doubt that it is an efficient way of organizing farm 
production. 

The criticism of plantations that rests on the 
need for the individual farmer’s recognition of 
microvariation of soil and climate and the ability 
to identify the condition of growing crops is sim- 
ilarly unconvincing. On the plantation this is the 
responsibility of the skilled supervisor, and the area 
over which he can have this knowledge, though 
limited, is considerably larger than that of the 
small farm with which the plantation usually com- 
petes. If he can implement his production deci- 
sions, there would seem to be no reason why he 
cannot obtain results as good as or better than the 
small farmer's. 

Prospects for the system. From the standpoint 
of national welfare, the plantation can make a 
Positive contribution to economic and technical 
change at that stage of economic growth at which 
it has in fact enjoyed its greatest prosperity—when 
an ill-informed peasantry in an elementary market 
system is faced with the need for rapid transforma- 
tion of farming practices. The plantation brings 
forth desired increases in production more rapidly, 
and it can transmit the new technology to the 
agricultural population generally through the ex- 
perience of farm laborers in its employ. The experi- 
ence of the southeastern Ivory Coast, where African 
farmers now dominate a coffee industry originally 
established by private European planters, despite 
discriminatory legislation favoring the European, 
18 a case in point. As a device for rapid transmis- 
Sion of knowledge about new production methods 
at low cost, the plantation will continue to play a 
Useful role in the developing countries until the 
marketing and communication system is much 
more efficient and until the mass of independent 
farmers are in a position to tap directly the store 
of agricultural and economic knowledge. 

Plantations can achieve these results because 
aa er primarily on the wage contract to obtain 
ne iance in societies where compliance with a 
A Bech purchase contract is difficult to enforce. 
i aa society is progressively modified through the 
Hee of the market system, and restrictions on 
E to factors, markets, and information 
the en, the economic justification declines. But 

plantation may persist beyond the time when 
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it is economically more efficient than other forms 
of farm organization because of the political and 
economic power it wields. This same power, plus 
the pressure of sunk costs, may make the output 
of the plantation less responsive to new economic 
and technical changes than would be a comparable 
output from a large number of smaller independent 
farms. Like all large organizations, its size permits 
broad implementation of decisions: when these 
are wise decisions, the appropriate response can 
be more rapid than that of a number of indepen- 
dent firms; when unwise, they can inhibit response 
and delay adjustment. A farming system made up 
of a large number of small farmers experiences a 
multitude of decisions, some wise and some foolish, 
and relies on the market to reward those who have 
chosen well and punish those who have not. When 
the plantation prospers, it is because the capacity 
of its management to make and implement wise 
decisions is very much greater than that of inde- 
pendent farmers. 

WILLIAM O. JONES 


[See also ASIAN SOCIETY, article on SOUTHEAST ASIA; 
ECONOMY, DUAL.] 
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PLATO 


Plato, son of Ariston, a member of the Athenian 
nobility, was born in 427 B.c. and died at the age 
of eighty in 347 B.c. Perhaps the greatest thinker 
of all times, he was not only a philosopher but the 
founder of political theory (he was himself in- 
volved in practical politics) and of sociology; he 
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was, moreover, a physicist and a cosmologist. His 
influence, direct as well as indirect, upon European 
(and thus American) thought is incalculable. 
Whether his influence was on the whole beneficial 
or not is a question which has recently become 
highly controversial. For his political philosophy 
is authoritarian and hostile to democratic ideas, 
as when he said, “The wise shall lead and rule, and 
the ignorant shall follow” (Laws 6908), just as his 
social theory is collectivistic and hostile to individ- 
ualistic ideas: “You are created for the sake of the 
whole, and not the whole for the sake of you” 
(Laws 903c). He identified individualism with 
egoism and group egoism with altruism, overlook- 
ing the fact that people may be unselfish not only 
for the sake of “the whole” (the collective, the 
state) but also for the sake of other individuals. 

Plato’s deep interest in the problems of politics 
and of society seems to have had two roots. One 
was a family tradition of assuming political re- 
sponsibilities. (His father claimed descent from 
Codrus, the last king of Athens, and his mother 
from Dropides, a kinsman of Solon.) The other 
was the terrifying experience of the political and 
social disintegration which affected not only Athens 
but the whole of the Greek world during the later 
years of the Peloponnesian War (the “Decelean 
war” of 419-404 B.c.), This period coincided with 
Plato’s formative years and culminated for him in 
the trial and death of his friend and teacher 
Socrates in 399 B.c. 

The Peloponnesian War (or wars; 431-404 B.c.) 
was not merely a war between the two most power- 
ful city-states of Greece; it became, one might say, 
the first ideological war, and it involved some of 
the first large-scale ideological persecutions. The 
clash was between the ideologies of a tribalist and 
authoritarian (and perhaps even totalitarian) 
Sparta and the maritime trading empire of demo- 
cratic Athens (the “Delian League”). It became the 
more terrible because some of the leading families 
of Athens and its democratic allies were tradition- 
ally antidemocratic and oligarchic, and sympa- 
thetic to Sparta, (Thus Aristotle mentioned in his 
Politics, 13104, an oligarchic oath which even in 
his time was still in vogue, as he said; it consisted 
of the formula “I promise to be an enemy of the 
people, and to try my best to give them bad 
advice.”) 

When the Spartan King Lysistratus captured 
Athens in 404, he instituted there an oligarchic 
puppet government, under Spartan protection, 
known as the Thirty Tyrants. The Thirty were led 
by two of Plato’s maternal uncles, the highly gifted 
Critias and the much younger Charmides. During 


the eight months of their reign of terror, the Thirty 
killed scores of Athenian citizens—almost a greater 
number of Athenians than the Spartan armies had 
killed during the last ten years of the war (Meyer 
[1884-1902] 1953-1958, vol. 5, p. 34). But in 403, 
when Plato was 24 years old, Critias and the 
Spartan garrison were ‘attacked and defeated by 
the returning democrats. Originally only 70 men, 
led by Thrasybulus and Anytus, the democrats es. 
tablished themselves first in the Piraeus, where 
Plato's uncles were killed in battle. For a time, their 
oligarchic followers continued the reign of terror 
in the city of Athens itself, but their forces were 
in a state of confusion and dissolution. Having 
proved themselves incapable of ruling, they were 
ultimately abandoned by their Spartan protectors, 
who concluded a treaty with the democrats, The 
peace treaty re-established democracy in Athens. 
Thus the democratic form of government had 
proved its superior strength under the most severe 
trials, and even its enemies began after a few 
years to think it invincible. č 

As soon as the restored democracy had re-estab- 
lished normal legal conditions, a case was brought 
against Socrates for “corrupting the youth”; its 
meaning was clear enough: he was accused of 
having corrupted Alcibiades, Critias, and Charmi- 
des, who were thought responsible for the defeat 
of Athens and for the bloody regime of the Thirty. 
In his defense Socrates insisted that he had no 
sympathy with the policy of the Thirty and that 
he had risked his life defying their attempt to 
implicate him in one of their crimes. He also made 
it clear that he preferred death to being prevented 
from speaking his mind freely to the young. Found 
guilty, he became the first martyr for the right of 
free speech. 

Such were the tumultuous times of Plato's most 
important formative years. They led him in ne 
time of maturity to pose his fundamental problem: 
Society, and the body politic, are sick. How can 
they be cured? Pea 

Beginning of literary activity. That the gat 
ical events described influenced Plato in the sen it 
indicated is, of course, conjectural. Indeed, n 
should be stressed that almost everything Ea 
the evolution of Plato's thought, the ares 
his works, and the events of his life is conjectu o 
Our sources, so far as they are consistent, ee 
be largely interdependent. Thus, we cani andl 
tain that this story of Plato’s life is not ae of 
What is probably the oldest source, the ie 
Plato's Letters, may well be an ancient n a 
Even the most informative “Seventh rere “(Cer 
many scholars accept as genuine, is suspect: 


tain other works transmitted under Plato’s name 
are almost certainly forgeries.) Yet even though 
the “Seventh Letter” is probably a forgery, it ap- 
pears to be very old, and the writer must surely 
have been well informed about the facts of Plato’s 
life to get his forgery accepted. For the temporal 
order of Plato’s works, we have now what appears 
to be very good evidence from the statistics of 
minor stylistic peculiarities (“stylometry”). This 
method (which in the main leads to groupings 
rather than to a definite sequence) may fail, how- 
ever, in cases in which Plato revised or rewrote his 
books. (We seem to have some independent evi- 
dence for the revision, by Plato, of at least one of 
his works, the Theaetetus; see Popper [1945] 1963, 
vol. 1, addendum 1.) These uncertainties should 
be kept in mind throughout this account. 

Most of Plato’s literary work consists of “Socratic 
dialogues”—dialogues, that is, in which Socrates is 
the main speaker and the superior intellect. So- 
cratic dialogues were composed by several other 
writers, notably Xenophon; yet most of Plato’s dia- 
logues bear the stamp of supreme originality, and 
we may therefore conjecture that it was Plato who 
invented this literary form. If so, then the view 
expressed by some scholars that it was the tragic 
death of Socrates which turned Plato into an author 
—into a writer of Socratic dialogues, to commem- 
orate (and defend) his friend and teacher—is not 
only attractive but also likely to be true. This view 
would also suggest that the Apology of Socrates— 
Plato’s report of Socrates’ defense and of his con- 
demnation—was Plato’s first work. Admittedly, 
there is important evidence against this: the 
Apology is a masterwork, and quite a number of 
the early dialogues are, in comparison, immature. 
On the other hand, it is not so very unusual that 
an author's first work shows him at a peak which 
he is not soon to reach again; and in this partic- 
ular case, the unique personality of Socrates and 
the immediate impression made by his defense in 
Court (Plato made it quite clear that he was pres- 
ent) may be more than sufficient to explain how 
She of the greatest and most moving works of all 
literature could be the first fruits of a literary 
Novice, 

At at any rate, until we have good evidence to 
the contrary, it seems reasonable to accept the 
Apology as a true portrait of the historical Socrates 
and as a faithful report of the proceedings (hun- 
haa of eyewitnesses of these proceedings must 
en ale when the Apology was first pub- 
pha, : A a marvelous portrait, and the first e 
and eda, in view of Xenophanes and Pericles 

s) and the greatest manifesto of what 
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may be called “critical rationalism’—the charac- 
teristically Socratic view that we ought to be aware 
of how little we know, and that we can learn by 
means of those critical discussions to which all 
theories and beliefs ought to be made subject. 
Though this is hardly a view which could ever 
become a generally accepted one, its influence upon 
Western thought (and Western science) has been 
of decisive importance. 

Socrates’ critical rationalism is not skeptical, nor 
does it take pride in reason or cleverness: he be- 
lieves in truth and in human beings, of whose 
intrinsic fallibility and intrinsic goodness he is 
equally convinced, Moreover, he is loyal to the 
democratic laws of Athens, and he loathes the 
crimes of the Thirty: he is a democrat, though one 
who is not greatly impressed by the democratic 
party leaders; he is a man passionately interested 
in other men; he is ready to die for the right of 
free discussion but despises the art of flattering the 
people. 

Three periods of Platonic works. It is here pro- 
posed to divide Plato's work into three periods. The 
first, or Socratic, period develops Plato's portrait of 
Socrates as a man, teacher, and lover of truth. Its 
dialogues (I mention only the Crito, the Protagoras, 
and the Meno) are opposed neither to democracy 
nor to the value of the individual. 

In the second period, Plato's attitude, imputed to 
the Socrates of his dialogues, is different: Plato 
now blames democratic Athens—nay, democracy 
itself, the rule of the many, of the mob—for having 
murdered Socrates. This mob rule threatens every 
just man, who is like “a man who has fallen among 
wild beasts, unwilling to share their misdeeds and 
unable to hold out singly against the savagery of 
all” (Republic 496c). 

This shows that the social body is sick. Plato 
has found his problem: how to heal the sick body 
of society. 

The problem itself involves a theory—the organic 
theory of the state and of society. (The origin of 
this most dubious and ever-influential theory is 
Oriental. ) 

Plato’s new and very personal version of the 
organic theory of society is his analogy between 
the city-state and the soul of man: the city-state 
is the soul writ large, and the soul is a state in 
miniature. He thus originated the psychological 
theory of the state and also the political theory of 
the soul. The state is class-divided. Its structure is 
characterized by an unstable equilibrium between 
the ruling classes, consisting of the rulers and their 
helpers (or auxiliaries), and the ruled classes, the 
money-earning classes and the slaves. Similarly, 
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the structure of the soul is characterized by an 
unstable equilibrium—indeed a schism—between 
its upper functions, reason and will, and its lower 
functions, the instincts or appetites. (It is interest- 
ing to note that Marx and Freud were unconscious 
Platonists. They also were anti-Platonists in accept- 
ing Plato's scheme and inverting it, Marx by de- 
manding the emancipation of the workers, Freud 
by demanding the emancipation of the instincts or 
appetites. ) 

The challenge of his problem led Plato to an 
almost superhuman intellectual effort. He devel- 
oped not only a diagnosis and a therapy but ( espe- 
cially in his third period) a whole cosmology on 
which he based his diagnosis, and a theory of 
knowledge on which he based his therapy. 

His social diagnosis goes deep. He is not satis- 
fied with blaming democracy, which for him is a 
symptom rather than the malady itself. For the 
malady is social revolution—the revolutionary 
change which has overcome society and which has 
led to the dissolution of the old patriarchal society 
in which everybody knew his place and was happy. 
Society is in a process of degeneration: change is 
evil; stability is divine, 

The stages of political degeneration are seen by 
Plato in the history of the Greek city-states. They 
begin with a golden age of hereditary kingship— 
the rule of the one, the best, the wisest—and an 
organic division of labor: the wisest rule, the cou- 
rageous help them to keep order and defend the 
state, and the people work (in a variety of occu- 
pations). From here we move through aristocracy 
(or timarchy), the rule of the few who are the best, 
to the rule of the many, democracy. In the Republic, 
democracy is shown to lead only too easily to a 
final state of decay: to the rule of the ruthless 
demagogue who makes himself the tyrant of the 
city. 

What are the causes of political degeneration? 
According to the Republic, the main work of Plato’s 
second period, it is the racial degeneration of the 
ruling class which undermines its fitness and its 
determination to rule. According to the Laws, the 
main work of his third period, the main cause of 
social change is culture clash, which is an unavoid- 
able concomitant of the development of industry 
(such as the Athenian silver mines), of trade, of 
Possessing a harbor and a fleet, and of founding 
colonies. All this shows astonishing insight, as 
does the remark that population pressure is one 
of the main causes of an unsettled society. (It 
seems not unlikely that Plato connected population 
increase, or increase in quantity, with racial degen- 
eration, or decrease in quality: his view of the few 
who are good and the many who are bad may have 


suggested this to him (Laws 710p, 740D-741a, 
838E). 

So much for Plato’s sociological diagnosis, 

The therapy which Plato advocated—his polit. 
ical program—fits the diagnosis: Arrest all social 
change! Return (so far as this is possible) to the 
patriarchic state! Strengthen the stability and the 
power of the ruling class, its unity, and its will to 
rule! For Plato formulated the following diag. 
nostic—sociological law of revolution: “Change in 
the constitution originates, without exception, in 
the ruling class itself, and only when this class 
becomes the seat of dissension,” or when its will 
to rule is sapped, or when it is defeated in war 
(Republic 545p, 4658). Thus, the proper education 
of the ruling class becomes a major instrument of 
politics; the degeneration of the ruling class must 
be prevented by eugenics; the unity of the ruling 
class must be strengthened by a radical commu- 
nism (confined to the ruling class) that involves 
the common ownership of women and children: 
nobody may know who his actual parents are, and 
everybody must look upon all the members of the 
older generation of their class as their parents. 
(This startling communism is the only major point 
of Plato's program given up in the Laws as demand- 
ing too much, even though it is still declared to be 
ideally the best form of society.) Culture clash 
must be prevented, and it is said in the Laws that 
the city must therefore possess no harbor and no 
fleet, and that no citizen may possess means for 
traveling: the currency must be a token currency 
without intrinsic value (Laws 742A-C), though the 
government will possess a treasure of “general 
Hellenic currency.” Religion and rites are to be 
developed as important instruments for pee 
change, and no variation in them may be tolerate . 
(This view, which anticipates the idea that o 
is opium for the people, is the more remarkable a 
the absence of any churchlike organization 
Greece. ) 

In his third period (especially in the mer 
Plato no longer used Socrates as his main spea if 
it seems that he had become conscious that he a 
moved far away from Socrates’ teaching. ee 
veloped the political ideas of his middle ae 
further and gave them (especially in the pent oe 
and the Timaeus) a cosmological backgroun Jiti 
deepest cause of racial degeneration and Pa A 
decay is that we are living in a world per divine 
which the world is moving away from its i 
origin; every change makes it less like its oan 
model—the divine Form, or Idea, in whose 
it was created. his 

In this third period, Plato developed ae a 
theory of knowledge. In his first period, i 


optimistic theory which made it possible for every 
man to learn (Meno 81B-D). In his second and 
third periods, only the highly trained philosopher 
can attain true knowledge—knowledge of the di- 
vine Forms or Ideas. 

Later life. It is difficult to relate the works of 
Plato’s second and third periods to his later life, 
whose most important events, according to tradi- 
tion, were his journeys (one to Egypt and three 
to Syracuse), the foundation of the Academy, and 
the Academy's (and Plato's) participation in Syra- 
cusan high politics: his friend Dio invaded Syra- 
cuse, supported by other members of the Academy, 
and overthrew the Dionysian dynasty. Dio was 
murdered by Callipus, another member of the 
Academy, who in turn was murdered by the Pytha- 
gorean philosopher Leptines. (At least nine pupils 
of Plato’s Academy made themselves tyrants of 
some city or other.) 

Traces of these journeys, such as allusions to 
Egyptian institutions, and of some others of these 
events, may be discerned in Plato's work, but most 
of these interpretations, though interesting, are 
highly controversial. 


The influence (for good or ill) of Plato's work 
is immeasurable. Western thought, one might say, 
has been either Platonic or anti-Platonic, but hardly 
ever non-Platonic, 

Kart R. POPPER 


[See also CONSTITUTIONS AND CONSTITUTIONALISM; 
ELITES; JUSTICE; NATURAL LAW; POLITICAL THEORY; 
SOCIAL CONTRACT; STATE; UTopranisM, article on 
UTOPIAS AND UTOPIANISM; and the biography of 
ARISTOTLE. ] 
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PLURALISM 


In the context of public affairs and political 
thought, pluralism refers to specific institutional 
arrangements for distributing and sharing govern- 
mental power, to the doctrinal defense of these 
arrangements, and to an approach for gaining 
understanding of political behavior. Political plu- 
ralism is therefore a historical phenomenon, a 
normative doctrine, and a mode of analysis. As 
the exclusively proper way of ordering and explain- 
ing public life, it remains the heart of the liberal 
ideology of the Western world. 

Six general propositions are integral to the polit- 
ical theory of pluralism: (1) individual fulfillment 
is assured by small governmental units, for they 
alone are representative; (2) the unrepresentative 
exercise of governmental power is frustrated when 
public agencies are geographically dispersed; (3) 
society is composed of a variety of reasonably 
independent religious, cultural, educational, pro- 
fessional, and economic associations; (4) these 
private associations are voluntary insofar as no 
individual is ever wholly affiliated with any one 
of them; (5) public policy accepted as binding on 
all associations is the result of their own free inter- 
action; and (6) public government is obliged to 
discern and act only upon the common denomi- 
nator of group concurrence. 

These propositions may be employed as em- 
pirical generalizations, as components of a model 
for analysis, or as an outline for reform. They also 
define the exemplary state as one within which 
public authority will properly devolve on a plu- 
rality of groups. The public realm will become one 
in which coexisting groups naturally complement 


one another, and little government is necessary, 
The role of government will be limited to presery- 
ing an equilibrium that generally emerges spon- 
taneously. This natural equilibrium, its parts in 
amiable competition with one another, will be 
granted independence from external controls and 
will not have to be made accountable to either the 
sovereign individual or the sovereign state. 


Development of the doctrine 


The theory of political pluralism has not been 
advanced in these idealized terms. In the hands of 
European thinkers at the turn of the century, 
political pluralism received a sophisticated formu- 
lation that at once clarified its liberal aspirations, 
its ideological sources, and its ultimate limitations, 

In the period preceding World War 1, Europe 
witnessed a burgeoning of intellectual interest in 
reconstructing institutions for the protection of in- 
dividuals—individuals who seemed increasingly 
lost in the mass, debased by the industrial division 
of labor, and exposed to manipulation by an un- 
representative state. It was widely felt by different 
syndicalists and socialists (and by some sociol- 
ogists, psychologists, and criminologists) that the 
emergence of the sovereign nation-state under con- 
ditions of industrialism posed a threat to the in- 
dividual by destroying his natural social habitat. 
The state was seen to reinforce conditions under 
which men, while technologically interdependent, 
became emotionally and intellectually estranged 
from one another. Society had become a mere ag- 
gregation of individuals; men were no longer mem- 
bers of a true community. If they were bound 
together, it was not by their natural ties but by 
artificial devices: an ascending industrialism, 4 
centrally controlled economy, and a legal order 
that made all coercion appear legitimate. Since 
these conditions were felt to be deplorable on a 
ground that they were not natural to man, bas 
question was how to recover man’s natural pae 

The first task, it seemed, was to reintegrate ane 
resocialize the masses, Given the view that a 
mitigated individualism was as reprehensi aa 
statism and that the former would indeed lea : 
the latter, it became reasonable to argue for ae 
nizing the rights of a plurality of an i 
Poised between the individual and the te 
newly vindicated cluster of groups might A a 
man to develop his true potentialities, to fin it 
self, and to be himself. Intermediary ae en 
would provide a sense of community while T. 
ing their members against undue state ae, pre- 

This line of reasoning led to a renewe ae 
ciation of the preindustrial landscape dotte 


innumerable groups. The desire for a new social 
pluralism made it irresistible to plead for reorgan- 
izing productive units in the image of the medieval 
guild, for redesigning communities in the image 
of the village, or for reforming the social system 
generally in the image of an obviously “functional” 
regime, The modern world being so obviously “dis- 
functional,” the lost feudal world of guilds, corpo- 
rations, churches, monasteries, universities, muni- 
cipalities, and estates could take on the attributes 
of a utopian society. By integrating individuals in 
such a society, wholeness might be restored. 

Thus it was thought possible to re-establish what 
Aristotle had considered the authentic human com- 
munity—a compact, purposive group of men who 
would “know each other’s characters,” sharing a 
cultural background and a common political out- 
look. More important, it would also be possible 
to move beyond the territorial limits Aristotle had 
envisaged for the ideal state, for a whole variety 
of such groups could now be confederated within 
a single governmental complex. By providing for 
a comprehensive organizational frame and by ex- 
tending the territory of a republic, a diversity of 
homogeneous groups could be accommodated with- 
in the heterogeneous state. Questions of general 
policy comprehensively affecting the public order, 
questions that Greek political theory could not con- 
front because it did not consider the technique of 
federalism, could thus be explicitly met. Individual 
tights could be safeguarded despite modern social 
and industrial changes. 

To guarantee the rights of individuals, it seemed 
primarily essential to fortify the rights of private 
associations. Small, close-knit, face-to-face groups, 
So it was held, constituted man’s true communities, 
Since the modern state had grown large and im- 
Perious, it now became appropriate to dismantle 
it, or at least to have it do no more than (1) foster, 
within its own boundaries, those small groups that 
alone preserve the habits of self-government and 
(2) administer the remaining public order so as 
to keep the peace between these coexisting groups. 

_ These views found particularly cogent expres- 
Sion in English political thought. A number of 
English thinkers, prompted by their respect for 
individual freedom, were aesthetically and morally 
dismayed by what they saw in their social and 
ieee environment. They grew indignant over 
A ugly and degrading effects of industrialization, 
eh its inhuman discipline. They witnessed 
ae Proceeded to document the gradual concentra- 
fait economic power and the concurrent loss of 
aa idual autonomy. A seemingly laissez-faire 

°nomy that was postulated on individualism ac- 
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tually encouraged the exercise of power by a social 
class that performed no socially useful function. 

While all this seemed clear to the critics of the 
ascending state, they did not feel that the state 
itself might be employed to assert individual rights 
against an increasingly concentrated economic 
order. They were dissuaded from relying on posi- 
tive state action both by their liberal past and by 
the alarming developments taking place in imperial 
Germany. There, a nationalistic and militaristic 
state socialism circumscribed individual freedom 
just as effectively as an economic elite had done 
elsewhere. 

The impulse of English pluralist thinkers re- 
mained consistently to protect the individual 
against the corrupting influence of monolithic 
power—against whatever force threatened to en- 
tangle and destroy him, whether political or econ- 
omic. Because power was ever subject to abuse, 
they felt that the very possibility of its unified 
exercise must be frustrated. Because the existing 
state was increasingly the instrument of the domi- 
nant class ruling in its own interest, the neutraliza- 
tion of the state became imperative. And because 
the state was increasingly unrepresentative and 
irresponsible, it had to be fragmented—that is, 
pluralized. 

In behalf of the dignity of the individual person, 
then, but aware of his fate when an unrestrained 
minority had the power to mechanize work and 
to organize the market, two generations of English 
pluralists attacked whatever political theory pre- 
sumed to justify an exploitative capitalism. Fred- 
eric Maitland (1900), John Neville Figgis (1913), 
Arthur J. Penty (1906), and S. G. Hobson (1920) 
were followed by Harold J. Laski (1919), R. H. 
Tawney (1920), and G. D. H. Cole (1920a). All 
were prepared to challenge the theories that bound 
individuals and groups to the state, whether the 
state was defined as the embodiment of the general 
will, as legally omnipotent, or as the highest mani- 
festation of disembodied reason. Rousseau, Austin, 
and Hegel—none of them escaped the onslaught of 
the pluralists. [See CoLE, G. D. H.; Ficeis; LASKI; 
MAITLAND; TAWNEY.] 

On the one hand, the pluralists reacted against 
nineteenth-century liberalism and utilitarianism, 
which had placed the individual in a social vacuum, 
abstracting him from his associations and making 
him the sovereign calculator of his interests. Here 
the pluralists realized that such a detached indi- 
vidual was all too quickly coerced and reintegrated 
by Bentham’s sovereign legislator and by Austin’s 
positive law [see AUSTIN; BENTHAM]. And on the 
other hand, they reacted against Continental ideal- 
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ism, in particular the Hegelian doctrine that the 
power then being exercised by the real state omi- 
nously intimated the all-powerful ideal [see HEGEL]. 

Exposing the pretensions of German idealism, of 
the Austinian theory of sovereignty, and of Spen- 
cers defense of a laissez-faire state, the English 
pluralists sought to direct attention to the social 
reality and the political facts that philosophy and 
jurisprudence had generally begun to blur [see 
SPENCER]. They found philosophy out of touch with 
historical experience; it appeared increasingly arti- 
ficial, contrived, and “unrealistic.” Determined to 
be empirical, they revolted against the prevailing 
fictions, in particular against the fiction of state 
sovereignty. The pluralists aspired toward a theory 
that took its bearing by the observed facts of 
political life, the most fundamental of which was 
the group nature of all politics. Thus, beneath the 
fictions of philosophy, they rediscovered man’s as- 
sociations, What really aroused and commanded 
the individual's interest and loyalty was a plurality 
of groups, and the modern state was not to be 
found among them. Neither the positive state of 
idealism nor the negative state of classical liber- 
alism could satisfy genuine human needs. Men 
were far more inclined to feel obligated to their 
club, their church, or their union than to the 
sovereign state, History could be enlisted to show 
that conflicts between the cluster of these groups 
and the state were not necessarily resolved in favor 
of the state. Groups effectively preserved their pre- 
rogatives; they fought the state, modified it, par- 
celed it out. 

The theories of Rousseau, Austin, and Hegel 
seemed to be placed in jeopardy by the redis- 
covered facts. These facts buttressed the value 
assumptions of pluralist thinkers and supported 
their belief in the wholesome nature of group life. 
Their case could, therefore, be historical as well 
as moral, empirical as well as normative. Since the 
fagade of British public life concealed a reality al- 
ready approximating the pluralistic ideal, political 
theory needed merely to stress and nurture what 
was still hidden. The pluralists called attention to 
the associations that, they claimed, the liberal state 
had unjustly relegated to’ the periphery because, 
like the church, they represented a vanquished 
enemy, or because, like the trade union, they repre- 
sented new economic interests, Theory could ad- 
vance disguised as history. 

This general appreciation of the significance of 
groups was given unintentional support by Otto 
von Gierke [see GERKE]. Ironically, his work had 
been intended as a contribution to German nation- 
alism: it attempted to revive institutions native to 


Germany and to repudiate the alien doctrines of 
Roman law. Roman law had regarded the corpo- 
ration as but a legal personality that owed its exist- 
ence exclusively to state action. Against this, 
Gierke showed that the source of law was actually 
not the omnipotent state, but men acting through 
groups. The corporation, having flowered on Ger- 
man soil without state aid, was, he argued, an 
irreducible entity—not a creature of the law, not 
a fictitious personality made or unmade at the 
pleasure of the state. It was a living organization 
with its own will and consciousness (Gierke 1913). 

Although there certainly was no need in England 
to react against Roman law, Maitland found ap- 
plication for Gierke’s ideas, Supported by Gierke's 
research, he saw the state as but one of a number 
of associations, with no right to pre-eminence. 
Thus Gierke's work, while not presuming to attack 
the doctrine of state sovereignty directly, helped 
the pluralists to perceive the natural autonomy— 
and therefore the natural rights—of a plurality of 
disparaged or unrecognized private associations. 
Figgis used Gierke’s ideas to vindicate the rights 
of the church; Laski and Cole, to defend various 
economic groupings. Should group and state ever 
come into conflict, the question of whom to obey 
could therefore be an open one; it could become 
legitimate to side with the group that represented 
the individual against the state. 

The belief in the representative nature of the 
groups, as opposed to the state, was based on two 
premises. First, a voluntary association would quite 
naturally enlist the interests of those who estab- 
lished and maintained it. Second, the group's man- 
agers would be professionalized and hence socially 
responsible. By giving men freedom where it mat- 
tered, namely within associations that encom- 
passed their day-to-day economic and productive 
relations, men could be counted on to participate 
in politics. They would cease being estranged from 
one another and from the public order. iste 

Whatever label might be attached to it—incu 
trial democracy, economic federalism, occupation 
representation, functional corporatism, oF a 
socialism—the new confederation of ae & 
harmoniously coexisting groups could, it pe poe 
grow out of the old state. The state would “A ra 
a passive coordinating authority, acting mer he 
response to group desires. Sound ete ate 
would emerge without independent, positi ae 
action. At most, the state would be a publics 
corporation. a 

A full theoretical treatment of the state oe 
mere public-service corporation was given iei im 
by Léon Duguit (1913) [see Ducurt]- 


terested in defining the basis of law, Duguit took 
note of his country’s assumption of responsibility 
toward the community through its system of de- 
centralized administrative courts. He saw the state 
not as the autonomous source of law but as the 
neutral servant of the community. Its commands, 
he observed, appeared imperative only by virtue 
of a legal fiction. Realistically seen, public enact- 
ments grew out of prevailing social groupings. 
Those who exercised governmental public-service 
functions, Duguit said, did so under laws actually 
expressive of that same underlying social solidarity 
by which governments first arose. 

Although an idealist residue remained in Du- 
guit’s formulations—just as it was to remain in 
Laski’s—the notion of state sovereignty was foreign 
to his thought. Both he and the English pluralists 
deprived the state of that primacy that idealist 
theory had postulated. After all, they had seen that 
men respond more strongly to various subnational 
associations, regions, and productive units than to 
the state, They found sovereignty divisible and al- 
legiance to the state contingent and qualified. They 
argued what American statesmanship—insisting 
on bills of rights, on a separation of powers, and 
on the institution of federalism—had concluded 
more than a century before. They affirmed that the 
state could have no inherent natural right either 
to impose its will upon governmental competitors 
or to oblige men to obey the law of the state merely 
because the state so desired. 

In adhering to this position, English pluralism 
gave a rational cast to political movements that 
professed to sponsor mass participation in politics, 
and that sought to draw interest groups into the 
Policy-making process by providing for representa- 
tion through trade associations, labor unions, cham- 
bers of commerce, and consumers. Insisting on the 
diffusion of the power to make public policy, plu- 
ralism encouraged movements for “economic de- 
mocracy,” “functional representation,” and “codeter- 
mination in industry.” By helping to rationalize the 
oclnesjof State rights, home rule, and decentral- 
zed administration, it established a link with the- 
Ane of government by soviets, by syndicates and 

rtels, or by fascist corporations. 

Critique 
A ee the advocates of pluralism had taken 
bea a at society in the name of realism, they 
thin ae succumbed to new fictions. For one 
life = ey ignored the complex reality of group 
aes thus failed to deal with the possibility of 
peas tyranny over individuals. To them, it ap- 
that groups necessarily represent human 
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purposes and functions, realize individual values, 
and thereby make freedom meaningful. They de- 
pended on an individual rationality and interest 
in politics that they did not subject to empirical 
inquiry. Nor did they consider the possibility of 
manipulating individuals through groups that were 
themselves subservient to state power, a possibility 
realized by the corporatism Mussolini instituted in 
1926, Furthermore, they were so concerned about 
the abuse of governmental power that they pre- 
ferred to risk political stalemate, inviting what 
Laski frankly called contingent anarchy. Having 
devitalized the state, they saw it as a neutral um- 
pire who would never act unless there were a 
common denominator of group interests. They saw 
it as carefully protecting the lawmaking preroga- 
tive of groups, and not presuming to take the 
initiative in the field of public policy. Government 
was considered an agency facilitating agreement, 
but not an independent source and initiator of 
policy. Pluralists realized that this might well make 
for deadlock, but inaction seemed better than 
action when action might be directed toward some 
group-transcending purpose discerned by the lead- 
ership. 

Still, by the 1930s it appeared necessary to some 
of the pluralists themselves to reintroduce what 
they had previously banished: a unified purpose 
above and beyond the will of a plurality of groups. 
They had rejected such ideas as common good, 
community interests, and general will. Yet they 
found it scarcely possible to conceive of the politi- 
cal process without the purposeful, helping hand 
of the state, especially as domestic group competi- 
tion and foreign threats endangered the viability 
of a pluralistic political order. Thus both Laski and 
Cole were finally driven to recognize needs more 
fundamental than a vibrant group life. They were 
to argue for both leadership to give expression to 
these needs and a state equipped to satisfy them. 
Thus, under the pressures of modern group life 
and revolutionary technological and economical 
developments, pluralists’ theory began to grow ir: 
relevant. 

During the first half of the twentieth century, 
various administrative reforms that appeared to be 
modeled on the theory of pluralism made the pre- 
dicament of its proponents increasingly explicit. 
The national economic councils of Europe in the 
1920s, Fascist corporatism, the Industrial Recovery 
Administration during the New Deal period, the 
use of advisory committees for departments of 
government, reliance on worker control of industry 
in Germany, Poland, and Yugoslavia after World 
War m—all these indicated that to implement gov- 


168 PLURALISM 


ernmental policy along geographically decentral- 
ized, federalistic, and pluralistic lines means to 
transfer the power to govern to the commanding 
elements within the most prosperous, entrenched, 
and unscrupulous groups. Rather than liberating 
individuals, it reinforces the power already cen- 
tralized and consolidated under private auspices. 
Inasmuch as the proponents of pluralism were 
dedicated to the integrity of the individual person 
and to the extension of his freedom, they inevitably 
became embarrassed by the ostensibly voluntary 
and representative business corporation, agricul- 
tural organization, trade association, labor union, 
and professional group. Maturing under modern, 
industrial conditions, associations assumed a scale 
and an organizational structure making them oli- 
garchical in their actual operations, and likely to 
smother the individual as effectively as the state, 
Furthermore, associations could preclude state ac- 
tion in areas in which liberty might yet be extended. 


American analytical pluralism 


Although empirical evidence jeopardized the 
case for pluralist institutions, it was slow to touch 
academic research, especially in the United States. 
In the United States, no systematic theoretical de- 
fense of political pluralism has ever been formu- 
lated. So secure and “natural” was the pluralistic 
foundation of government that it could be accepted 
as the axiomatic point of departure for political 
practice and public discourse. Alexis de Tocque- 
ville’s reflections on self-governing “intermediate 
bodies” capable of countervailing both an atomistic 
and a totalitarian state could become the exclusive 
basis for public philosophy: it seemed to be at once 
descriptively accurate, normatively desirable, and 
analytically fruitful, 

Thus, even though the structures of industry and 
of the economy have undergone radical changes 
during the past century, transforming the very or- 
ganization of Society, a considerable part of Ameri- 
can academic research continues to follow the lines 
first suggested by Arthur F. Bentley’s Process of 
Government (1908) [see BENTLEY]. Sustained by 
an analytical model Suitable for understanding the 
behavior of smal] groups, it treats large groups as 
if they were homogeneous, voluntary, single-inter- 
est associations. It variously seeks to give descrip- 
tive accounts of group life and to construct abstract 
frameworks charting the flow and counterflow of 
group pressures. It postulates (1) that the dynam- 
ics of social change are determined by interacting 
groups, (2) that government is but a responsive 
instrument for Keeping stable the natural equi- 
librium of competing interests, and (3) that public 
policy is most usefully understood as the product 


of the free play of group pressures. Thus some 
American academic research identifies political de- 
cisions with the group process, and the group 
process with the significant reality of public life 
[see FOLLETT; POLITICAL GROUP ANALYSIS]. 

So thoroughly has the group basis of politics 
been accepted that scholars could characterize the 
goals of Populism, Progressivism, and the early 
New Deal as irrational whenever these goals were 
too inclusive to be reducible to the special interests 
of the groups that advanced them. Whenever 
groups upheld their policies as being in the public 
interest generally, these policies could be pointed 
to as either screens for self-interest or as mani- 
festations of irrationality—rationality being identi- 
fied with self-interest expressing itself through a 
pluralistic political system. This approach has pre- 
sented individual as well as public purposes as 
mere functions of interacting groups and ignores 
those aspects of individual goals, technological 
tendencies, and social forces that defy the tech- 
niques of group analysis. 

To the extent that public policy, academic re- 
search, and such institutional arrangements as 
federalism adhere to the pluralist position and 
foreclose alternatives, they may be viewed as a 
conservative reaction against the presumed effects 
of a mass society. This reaction is unified by the 
unstated assumptions that (a) the social disinte- 
grations, maladjustments, and alienations fostered 
by large-scale technology and a mass society are 
indubitable evils; and (b) mass society, being un- 
justifiably coercive, has intolerable implications for 
individual development, while small-scale technol- 
ogy and a pluralist society have not. These unde- 
bated premises remain operative in many ee 
that cripple responsible public government, oe 
they also remain lodged in a well-endowed segmen 
of research in the social sciences. However, ivi 
cal pluralism as an ideology has lost most of a 
explicit apologists and only lingers quietly ae 
submerged, inarticulate ingredient of Western 


eralism. 
Henry S. KARIEL 


[See also INTEREST GROUPS; POLITICAL GROUP Ae 
sis; VOLUNTARY ASSOCIATIONS. Other Beate, 
terial may be found under ORGANIZATIONS; POL! 
PARTICIPATION; SOCIAL MOVEMENTS.] 
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POISSON, SIMEON DENIS 


Siméon Denis Poisson (1781-1840) was born at 
Pithiviers. He entered the Ecole Polytechnique in 
1798 and did so well there that the school exempted 
him from his final examinations and immediately 
appointed him assistant in mathematical analysis 
in 1800, He became successively an associate pro- 
fessor and a full professor (this in 1806) there, 
and then was called to the Bureau des Longitudes 
and to the Institut de France in 1812. After the 
Bourbon restoration, he was appointed, in 1816, 
professor of mechanics in the Faculté des Sciences 
at Paris and was then appointed to the Royal Coun- 
cil of the University, where he took charge of the 
mathematics curriculum for all the secondary 
schools of France. In 1837 Poisson was ennobled, 
and in 1840 he succeeded Thénard as dean of the 
Faculté des Sciences. 

Poisson’s principal interests were mechanics, 
mathematical analysis, physics, and the theory of 
probability, Although we are primarily concerned 
here with his work on probability and statistics, it 
is important to mention the fundamental nature of 
his contributions on such subjects as the invari- 
ability of the major axes of planets (1811), the 
distribution of electricity on the surfaces of bodies, 
capillary phenomena (1831), and the mathemati- 
cal theory of heat (1835a). 

It was toward the end of his life that Poisson’s 
interests focused on probability and statistics. His 
most important work in this field is Recherches sur 
la probabilité des jugements en matiére criminelle 
et en matière civile, précédées des règles générales 
du calcul des probabilités (1837), but slightly 
earlier he had published several brief papers that 
were very important for their time, since they con- 
tained the bases of a precise statistical method for 
the social sciences. 

Poisson, like Laplace before him, sought to estab- 
lish the probability that a juror may err in arriving 
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at a verdict of guilt. Laplace had tried to formulate 
a hypothesis that does not do violence to common 
sense; he assumed that the probability that the 
juror will not be mistaken is governed by a law of 
equal distribution between 4 and 1. From this law 
he deduced the probability that the verdict is cor- 
rect as a function of the majority obtained. Poisson 
sharply criticized this procedure (1835b) and laid 
down the principle that any chain of reasoning in 
this type of problem should be based on the obser- 
vation of such facts as demonstrate the existence 
of a law of large numbers. He attacked the prob- 
lem by going back to the sources and examining 
the texts of laws as well as legal statistical docu- 
ments, Thus, he analyzed the records of the Cours 
d'Assises. The juries of the Cours d’Assises had 12 
members, and although before 1831 the majority 
required for conviction was 7 to 5, from that date 
it was 8 to 4. Poisson noted that before 1831 the 
annual proportion of acquittals was 0.39 in the 
mean, always remaining between 0.38 and 0.40, 
and the proportion of convictions by a vote of 
7 to 5 was 0.07. He then claimed that even before 
the required majority was changed in 1831, the 
effect of the new rules could have been predicted: 
“The proportion of convictions . . . had to be 
0,61 — 0.07 = 0.54. If a vote of 8 to 4 was required, 
the ratio of acquittals to trials would thus be 0.46. 
This is in fact what happened during the year 
1831” (1836a), 

This sort of reasoning seems simple and obvious 
to us today, but at the time it was received with 
skepticism, Thus, immediately after Poisson read 
his paper before the Academy of Sciences, Poinsot 
objected violently to “the idea that a calculus could 
be applied to the moral Sciences . . .” (Poinsot 
1836). 

In Poisson's view, the ordinary law of large num- 
bers (which he evidently knew in the case of Ber- 
noulli’s urn) was primarily an experimental fact 
that should be taken as the foundation of statistics. 
It is important to remember that at that time the 
axioms used to construct the calculus of probabili- 
ties were different from the axioms used today, 
probability being defined, after Laplace, as the ratio 
of favorable cases to the total number of cases, 
assuming all these cases to be equally probable. 
Such a definition certainly does not preclude suc- 
cessful work, and Poisson used this definition in 
discovering his eponymous distribution. But the 
definition cannot deal with statistical problems, so 
that it was necessary for Poisson to enunciate what 
might be called a physical law. 

The Poisson distribution (or law). The prob- 
ability of having r successes in the course of n 


independent trials for an event whose probabil- 
ity is p is given by the binomial term Pr(r) = 
(?)p"(1 — p)". If, in this formula, P approaches 0) 
and n approaches infinity, with np> À finite and 
not 0, we find, by using Stirling’s formula, that 


A n! Ar ANES 
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We are thus led to consider the discrete distribution 


Pr(0) = e 
Pr(1) = de® 
Pr(2) = Mek 


Pr(r) = x en 


which is called the Poisson distribution (better, 
Poisson’s law) or rare events distribution. This last 
term has the advantage of indicating that a Poisson 
distribution may be applicable when a large group 
is considered, each member of which has only a 
small probability of incurring an accident (as, for 
example, the number of men killed every year by 
the kick of a horse in a regiment of the Prussian 
army). The term “rare” is, however, ambiguous, 
and one should avoid the error of supposing that 
the absolute number of events is necessarily small. 
The limit proof sketched in (1) above is incom- 
plete, and we are indebted to Cauchy, a colleague 
of Poisson’s, for the notion of characteristic ne 
tion, which makes it possible to establish rigorous y 
the tendency of the binomial distribution to ap- 
roach that of Poisson. 
P The characteristic function of the binomial 
law is 


[1 — p + per = [1 + A (et = 1] 


00, 
This tends toward 4(t) = exp[A(e** — 1)] as pasts 
and the latter is the characteristic function © 
Poisson distribution. - 
The cumulant generating function for n 
son distribution is clearly w(t) “ME P 
which shows that all the cumulants are $ poisson 
The quantity A is called the parameter of EIN 
law; in particular, both mean and varianc 


Poisson distribution tables. Various tables and 
charts of the frequencies of the Poisson distribution 
have been prepared. The cumulative distribution 
function for the Poisson distribution may readily 
be expressed in terms of the incomplete gamma 
function T given in separate tables (for example, 
Pearson & Hartley’s Biometrika Tables for Statis- 
ticians 1954), for if X is the number of successes, 
then Pr(X > r) =T[A(r)]/T(7). 

Generalized Poisson distribution. Let Xo, X, 
+++, Xp, ++ be quantities in arithmetical progres- 
sion having the form x, =a + pr, where p is any 
positive number, and let X be a stochastic variable 
such that 


Nd 
Pr(X = x) = it ew 


we then say that X obeys a generalized Poisson dis- 
tribution (or law). Other, and deeper, directions of 
generalization are discussed in the next section [see 
also DISTRIBUTIONS, STATISTICAL, article on MIX- 
TURES OF DISTRIBUTIONS]. 

Poisson process. Suppose that the number, X(t), 
of events taking place between instant 0 and in- 
stant t obeys a Poisson law (thus, the number of 
calls received by a telephone exchange between 0 
and t might obey a Poisson law). Clearly, X(t) can 
be considered as a random variable with integral 
values which, as t varies, engenders a nondecreas- 
ing random function with nonnegative integral 
values, called a Poisson process. 

Stationary Poisson process. Without knowing 
anything a priori about the distribution of events 
between instants 0 and t (for example, again, the 
number of calls received by a telephone exchange 
between specified times), we can grant that the 
number AX = X(t + At) — X(t) of events between 
t and t + At is independent of X(r) for 7 < t. This 
hypothesis, whatever its legitimacy, means that the 
occurrence of an event between t and t + At is in- 
dependent of the previous occurrence of a similar 
event; under these conditions X(t) is a stochastic 
function with independent increments. Let us grant, 
further, that the events under consideration consti- 
RHD stationary phenomenon. Then, by dividing 
i eral (0,t) into n partial intervals, X(t) can 
tis garded as the sum of n independent random 

ariables, all obeying the same law. 

oe if it is assumed, as it is natural to do, 
at Whatever the (0,t) interval, there is a zero 

Saw that the number of events is infinite 

4 ie zero probability that several events will occur 

ie aps instant, it can be shown that, syuaevel 

ia a e; the random variable X(t) obeys a'Poisson 

ty ving the parameter A = at, where a is a posi- 
© constant. 


POISSON, SIMEON DENIS 171 


The foregoing process is called the stationary 
Poisson process and is characterized by the con- 
stant a, known as the density of the process. It can 
be shown to have the following properties: (1) over 
a given time interval (7,7 + À) the events are dis- 
tributed at random and independently of each 
other; (2) if t is a given instant and t + v the in- 
stant at which the first event subsequent to t 
occurs, the probability density of the stochastic 
variable v is f(v) = ae”. 

R. FERON 


[Directly related are the entries PROBABILITY; QUEUES. 
Other relevant material may be found in DISTRIBU- 
TIONS, STATISTICAL; and in the biographies of BORT- 
KIEWICZ and LAPLACE.] 
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POLANYI, KARL 


Karl Polanyi (1886-1964 ), whose concept of sub- 
stantive economics did much toward integrating the 
study of economics with that of society more gen- 
erally, was born in Vienna and grew up in Buda- 
pest. He studied law and philosophy in Budapest 
and later, for a short while, practiced at the bar. 
In 1908 he helped found the Galilei circle, a center 
of the intellectual ferment that provided Hungary 
with its liberal and socialist leadership during and 
after World War 1 and that was to remain effective 
as a spiritual influence for a long time. After mili- 
tary service, Polanyi moved to Vienna, where in 
1920 he met his wife, Ilona Duczyńska, who had 
played a distinguished role in the Hungarian revo- 
lution of 1918, He became foreign editor of the 
Osterreichische Volkswirt, then Austria’s leading 
economic publication, comparable in scope to the 
London Economist. Although he never belonged to 
any political Party, he considered the socialist ex- 


periment that took place in Vienna between the 
two world wars to be “one of the high points of 
western civilization.” With the rise of fascism he 
lost his post and, foreseeing the European cata- 
clysm, moved to England. There he earned his 
living lecturing for the Workers’ Educational As- 
sociation as an extramural tutor for Oxford Uni- 
versity; he traveled and held classes in the small 
towns of Sussex and Kent. In 1940 he was invited 
by the International Institute of Education to give 
lectures on the international situation in colleges 
throughout the United States. 

Between 1940 and 1943 he held the post of 
resident scholar at Bennington College in Ver- 
mont, and there he wrote the first of his two 
principal books, The Great Transformation (1944), 
He went back to England but returned to the 
United States in 1947 to become visiting professor 
of economics at Columbia University, a post he 
held until his retirement in 1953. In the years that 
followed, he lived part of the time in New York 
and part of the time at his little house in Pickering, 
Ontario. He continued his research in collabora- 
tion with a group of younger scholars working in 
the fields of economics, anthropology, history, 
and sociology. Together they wrote a symposium 
volume, Trade and Market in the Early Empires 
(Polanyi et al. 1957). 

Polanyi’s last years, when he was in his seven- 
ties, were extraordinarily productive. His work 
culminated in a study of the economic anthro- 
pology of the west African kingdom of Dahomey 
during the eighteenth century, posthumously pub- 
lished as Dahomey and the Slave Trade (1966). 
He also helped his wife and a number of Ca- 
nadian poets with the preparation of an Be, 
of Hungarian writing covering the period 193 
to 1956 (see Duczyńska & Polanyi 1963). His 
major concern during his last years was the a 
ervation of world peace. All his efforts were ee 
on founding an international journal for a 
comparative study of economics and politics m 
was to serve the cause of peace. Under the a 
tinguished international sponsorship of Ragn ii 
Frisch, P. C. Mahalanobis, Gunnar Myrdal, Joan 
Robinson, Hans Thirring, and others, Coes 
was founded. Polanyi lived to see the first ie 
through the press; he was buried on the day 
first copies appeared. 

Substantive and formal economies. 
Polanyi’s scholarly work was the study of hip be- 
of the economy in society—the relations r 
tween the arrangements for the prode A aE 
acquisition of goods, on the one hand, and d cul- 
religion, and other forms of organization an 


The core of 
the place 


ture, on the other. Since the study of these relation- 
ships transcends modern economic theory, Polanyi 
suggested that it be designated substantive eco- 
nomics to distinguish it from formal economics. 
His point was that the word “economic” is used 
in two very different senses, which must be borne 
in mind to avoid perceiving all economies—primi- 
tive and archaic especially—simply as crude vari- 
ants of modern industrial ones. 

“Economic” in the substantive sense is used by 
Polanyi as a synonym for “material.” For example, 
when anthropologists talk about the economic as- 
pects of primitive society, they simply mean the 
arrangements for acquiring, producing, or using 
material items or services for individual or com- 
munity purposes. According to this meaning of 
the term, all societies, whatever their size, tech- 
nology, or political structure, have an “economic” 
system—that is, structured arrangements for the 
provision of livelihood. In the formal sense, “eco- 
nomic” means to “economize” or to be “economical” 
—to choose among alternatives for the purpose 
of maximizing output, profit, or gain in exchange; 
or to minimize the cost of producing something, 
within the context of material “scarcity,” relative 
to what the economist calls the demand. 

In the capitalist, market-integrated economy 
and in conventional economic theory, the two 
meanings of “economic” are fused, since in capi- 
talism the market institutions serve both to pro- 
vide the material means of existence and to en- 
force economizing activities on the participants: 
to earn their livelihood people must abide by the 
Tules of the market. Economic theory reflects this 
Separation of the economy from other social in- 
Stitutions by making market transactions almost 
Its sole concern; it has thus become essentially a 
theory of valuations—of prices and their mutual 
interdependence. The market economy, however, 
is a very special case, historically and anthropo- 
logically. Preindustrial societies frequently have 
economies in which the structured mode of pro- 
viding the means of existence does not consist of 
economizing institutions. Polanyi’s reason for dif- 
ip aang the two meanings of “economic” was 
fier _ he called the “economic prejudice — 
ing hie tl G perception of all economies (includ- 

primitive and the archaic) as variants of 
oor industria] ones, and the translation of all 
ean Institutions into market-economizing 
wotila e sought conceptual categories which 
eee both the analysis of the relation of 
Sits to social organization and the direct 
son of economies. 
By confining itself to market phenomena, he 
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felt, economic theory has become inadequate in two 
ways: it has removed from its ambit the social 
organization which links the economy to the cul- 
tural, psychological, and political structure of so- 
ciety; and by its exclusive concern with industrial 
capitalism, it has forced the analysis of other so- 
cieties into a conceptual framework—in modern 
terminology one would say “model’—that does not 
fit them. 

Modes of organizing economies. Polanyi’s anal- 
ysis of the uses of money, the forms of external 
trade, and the role of markets in different econ- 
omies illustrates how devices that are superficially 
similar, such as money and foreign trade, have 
different functions in market and nonmarket econ- 
omies: the fact that both the Soviet and American 
economies make use of money, foreign trade, 
markets, and trade unions does not mean that 
either these institutions or their underlying organ- 
ization are the same. This is also true of the struc- 
ture and function of money, markets, and foreign 
trade in primitive and archaic economies. 

The problem of the place of the economy in 
society is the main topic of Polanyi’s two principal 
books: The Great Transformation (1944), which 
deals primarily with contemporary society, and 
Trade and Market in the Early Empires (Polanyi et 
al. 1957), which deals chiefly with primitive and 
archaic societies and their remnants. In the latter 
book Polanyi created a conceptual framework for 
analyzing preindustrial societies in which markets 
had little significance. 

In his view the market economy is only one of 
the three historical modes of organizing econ- 
omies; the others are reciprocity and redistribution. 
Polanyi did not consider his types as evolutionary 
stages, although some did develop earlier than 
others. Nor are the types mutually exclusive: in 
any economy two, and sometimes three, of the 
types of transactions are usually present, although 
one type tends to be dominant. In politically cen- 
tralized primitive societies, such as the Bantu, and 
in archaic societies—such as the Inca, the Nupe of 
Nigeria, eighteenth-century Dahomey, the indige- 
nous kingdoms of east and south Africa, and the 
pre-Christian Middle East—redistribution was the 
dominant pattern of integration (or transactional 
mode), but gift giving and market transactions 
were frequently present. In some economies such 
as those of the Tiv and the Trobriand Islanders, 
where reciprocity was the dominant mode of trans- 
action, petty markets were present as well. 

Planning and freedom. Polanyi first encoun- 
tered the problem of the relationship of the econ- 
omy to society when studying the British indus- 
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trial revolution; this revolution appeared to him 
not only to multiply man’s wealth but also to 
threaten the very fabric of society. Polanyi argued 
in The Great Transformation that a laissez-faire 
capitalistic market economy is not socially viable. 
The attempt to make the fear of hunger and the 
quest for profit the governing motives of the econ- 
omy is socially divisive and humanly destructive. 
The European and American upheavals of the 
1920s and 1930s—communism, fascism, and the 
“New Deal—were emergency transformations of 
market societies which had become economically 
and socially intolerable. 

Polanyi’s theoretical work never failed to be in- 
formed by his desire to solve what he considered 
the crucial problem of modern society: How can 
society regain control over the forces of the econ- 
omy that were relegated to the autonomous market 
during the industrial revolution without abandon- 
ing freedom? Modern society, in his view, is to 
some extent compelled to compel, and he wished 
to distinguish the economic realms that require 
planning and control from the cultural spheres 
that require freedom. Polanyi thus held a modified 
socialist position, in antithesis to the economic de- 
terminism of both the orthodox left and the Mises- 
Hayek school, both of which are based on the 
same premise, although they prognosticate dia- 
metrically opposed outcomes. 


Even the wide range of Polanyi’s writings hardly 
reflects the enormous breadth of his interests in 
the humanities, in arts and letters, and in the his- 
tory of the day. It was his custom at the end of a 
day’s work to discuss political events with friends 
and collaborators, and it was in these informal 
conversations that the astounding analytical and, 
at times, prophetic power of his unorthodoxy re- 
vealed itself most fully. 

Hans ZEISEL 


[See also ECONOMY AND SOCIETY; TRADE AND MARKETS. | 
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through the army), special 
ing of command personnel, 
mand personnel produced 
systems whose personnel were 
the influence of the local politics 4” 


that have plagued many departments in the United 
States. However, such police systems have also 
been associated with successful suppression of po- 
litical dissidence and have traditionally developed 
in societies where the state has been confronted 
with serious problems of organized, armed political 
unrest, 

The constitutional preference in the United 
States has been for a police under local control 
that would be available neither to the duly consti- 
tuted central government nor to any organized 
group that might seize control of the federal ap- 
paratus. This constitutional situation has been ac- 
companied in the United States by special problems 
of securing efficiency and honesty in police opera- 
tion. However, recent very rapid improvements in 
the quality of police practice, especially in the 
larger cities, indicates that local control per se need 
not be incompatible with high-quality police. 

In addition to municipal police, there are county 
sheriffs’ departments which in large parts of the 
United States constitute full-fledged police units. 
State police units exist in all but a few of the 
states. These concentrate on highway patrol but 
may perform general police functions in rural 
areas and are also important forces available for 
suppressing disorder. At the federal level in the 
United States, the tendency has been to create 
specialized police units to deal with specific prob- 
lems, such as counterfeiting, narcotics, customs, 
and immigration, The specialized units are in addi- 
tion to the Federal Bureau of Investigation, which 
has much more general functions in enforcing fed- 
eral law, 

k The English police system stands in many ways 
intermediate between that of the United States and 
the more centralized pattern of some Continental 
countries. The London Metropolitan Police are re- 
sponsible directly to the central government, while 
other departments are under a loose inspectorial 
control which is coupled with a system of monetary 
ee ie from the central treasury (Cramer 

When the Allies occupied Germany and Japan 
at the end of World War n, they introduced police 
systems patterned after the Anglo-American con- 
Stitutional structure. In Japan the change from a 
completely decentralized police introduced by Oc- 
cupation authorities in 1947 to a much more cen- 
tralized one in 1954 illustrates the difficulties 
involved in adapting the police systems of one 
Country to the conditions of another. Decentraliza- 
tion resulted in underfinancing of local units; police 
oe on large private donations, including some 

tom organized gangster groups; and near helpless- 
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ness of local units in the face of organized gang 
violence and later in the face of a systematic and 
disciplined campaign of political rioting directed 
against the central government (Nakahara 1955). 
In contrast, the disorders and riots that confront 
the police in the United States are rarely led by 
disciplined cadres, nor have they been directed at 
overthrow of the constituted government. 

Police organization and social control. The es- 
tablishment of uniformed, armed, and organized 
police as a response to the problems of social con- 
trol in modern societies creates a number of prob- 
lems, such as the political neutralization of the 
police, organizational creation and maintenance of 
discipline and commitment to legality in behavior 
toward the citizen, and development of efficient 
and impartial enforcement of the law. The means 
evolved to solve some of these problems may have 
negative results on others. Moreover, the control, 
organization, and behavior of the police will reflect 
the differences in civic culture between countries: 
for example, differences in the degree of consen- 
sus about the values represented in law and in the 
institutional supports for police authority, on the 
one hand, and traditions of police restraint on 
the other. The civic culture will, in turn, reflect dif- 
ferences in the nature and degree of heterogeneity 
of different societies, the degree to which laws rep- 
resenting minority viewpoints are passed, and the 
way in which law enforcement through the police 
relates to other forms of authoritative control in 
the particular society. 

Location of governmental centers. Of obvious 
though often overlooked importance is the relation- 
ship between patterns of urbanization and the loca- 
tion and nature of governmental centers. In many 
countries—for example, England, France, Ger- 
many, and Argentina—the capital city is also the 
metropolis or at least one of the largest cities. The 
location of national government physically in a 
large city means that problems of civil disorder 
can become very serious from the point of view of 
the governing regime. When such disorder is po- 
liticized, the government is accessible physically, 
and even relatively minor disorders can seem to 
pose fundamental threats to the regime. Where 
the regime is represented in the person of a king 
domiciled in the metropolis, sensitivity to disorder 
will be even greater (Chapman 1953). 

One of the major differences between the United 
States and many other industrialized, urbanized 
countries lies in the simple fact that the national 
capital is not a major metropolis and until recently 
was not even a large city. Furthermore, the capitals 
of the states, which under the federal constitutional 
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system share sovereignty, are often not the largest 
cities in each state. Since it is the large cities that 
have been the major sources of politically relevant 
disorder, the United States has thus had less need 
of the highly disciplined riot police of other coun- 
tries. This geographical isolation of governmental 
elites from the mass of the population has helped 
maintain the stability of democratic institutions, 
despite the volatile history and extreme heteroge- 
neity of the United States; while there have been 
many riots, the mob has not been a prominent actor 
on the political stage. Thus simple geography has 
partially performed the function of insulating elites 
from the mass—a precondition of modern democ- 
racy, according to theorists of mass society ( Korn- 
hauser 1959), From this point of view, distance 
can be a partial substitute for force. 


Police bureaucratization 


Bureaucratization—an organizational technique 
whereby major centers of civic power become neu- 
tralized from the point of view of the governing 
regime—has been of great significance in the de- 
velopment of modern police. Bureaucratization is 
a device whereby commitment to the occupational 
organization or occupational community and to its 
norms of subordination and service takes prece- 
dence over extraoccupational social commitments. 
Thus, in modern societies the political neutrality 
and legal reliability of the police is less a matter 
of the social sources of recruitment than of internal 
organization, training, and control. This, of course, 
is no less true of the organization of, for example, 
tax-collecting services or public education. In this 
sense, then, the police do not differ from the other 
large-scale civil services. Nevertheless, the situation 
is particularly crucial with respect to the police, 
because they are commonly called upon to enforce 
unpopular laws and, of course, because they are 
armed and organized. Perhaps this fundamental 
significance of police bureaucratization can be seen 
by attending to the fact that given well-organized, 
well-disciplined, and internally well-regulated po- 
lice, it is necessary for civil authorities to assure 
themselves only of the political loyalty and dedica- 
tion to legal process of the controlling police elites. 

Such problems are not confined simply to the 
unstable governments of some developing societies 
or to the often conflict-ridden regimes of continen- 
tal Europe. A major device whereby American 
Negroes have been able to secure better police 
protection as well as relief from police suppression 
has been the political capturing of city govern- 
ments, or at least the development of sufficient 
political influence to be able to persuade city offi- 


cials to appoint top police officials, chiefs, com- 
missioners, etc., who are more sympathetic to the 
civic aspirations of the Negro population. Of 
course, the successful translation of elite perspec- 
tives toward minority groups throughout police 
rank and file is not to be taken for granted; never- 
theless, it is true that wherever a determined top 
police administration backed by a city government 
which has been under pressure from powerful 
Negro groups has seriously attempted change of 
racial law enforcement, the efforts have generally 
been successful. To a very considerable extent, 
American minority groups have been able to secure 
civic respect from the police to the degree that they 
have been able to generate political influence. 

Negroes represent the last American minority 
group still undergoing this process. The develop- 
ment is particularly slow in parts of the southern 
United States, where law enforcement and mainte- 
nance of caste relations have traditionally been 
partly synonymous. Although police professionali- 
zation and bureaucratization are often strongly re- 
sisted by the white political structure, they have 
begun to appear as one of the major gains of the 
so-called Negro revolution (McMillan 1960). Per- 
haps in the long run, however, the most significant 
feature of police bureaucratization is that it tends 
to prevent the police from substituting their pri- 
vate attitudes or private occupational cultures for 
legal norms in law enforcement. One has only to 
cite examples of county sheriffs and sheriffs’ dep- 
uties in parts of the rural South, where not only 
personal animosities but also racial attitudes have 
played a part in law enforcement, to show that 
recruitment of amateurs from among the local pop- 
ulace involves serious dangers. 

Very broadly speaking, it may be pointed out that 
the process of internal bureaucratization of the 
police tends to make recruitment a less significant 
factor in the reliability and neutrality of the police. 
However, when a particular governmental regime 
is highly unpopular with some particular segment 
of the population, and when the political reliability 
of the police has not been achieved by bureaucrati- 
zation, it may be particularly important to recruit 
from specialized segments of the population whose 
loyalty arises from extraorganizational, social com- 
mitments as well as from adherence to the police 
occupation. Thus, during the early stages of polit- 
ical development in some modern multitribal states, 
the police may be recruited almost entirely from 
one particular tribe. In this way, the regime counts 
on intertribal tensions and frictions of the com- 
munity as a device for assuring that the police will 
enforce the law against members of other tribes. 


Many of the most significant problems of modern 
police, however, lie precisely in the balance be- 
tween bureaucratic isolation, on the one hand, and 
integration into the local community, on the other. 
Among the most significant consequences of this 
dilemma that have been pointed out by American 
researchers is the fact that isolation from the com- 
munity without careful bureaucratic organization 
can result in police occupational cultures which 
can become private subcommunities heavily in- 
volved in illegally determining the distribution of 
law enforcement and the use of violence (Westley 
1953). Isolation to a degree is obviously necessary. 
However, isolation may be accompanied by public 
contempt, which has been so frequent in the United 
States, with its generally low regard for public 
service. Thus, when isolation is not accompanied 
by organizational control, training, and indoctrina- 
tion, it can, on occasion, result in a situation where 
the police are, to be sure, apolitical but where their 
conceptions of appropriate conduct and appropriate 
relations between police and populace can become 
an autonomous and disturbing element, even in a 
predominantly civil and legal social structure 
(Banton 1964). 


Problems of modern police organization 


The organization of the police into disciplined 
bureaucracies is dependent upon certain social con- 
ditions. However, if such organization is successful, 
it can, in turn, create problems related to the link- 
ages between police and community, the nature of 
policing in modern urban societies, and the prob- 
lems of decentralized decision making peculiar to 
police work (Wilson 1963). Most detailed research 
on these matters has been done in the United States 
and England, especially the United States. Devel- 
opments in the United States are especially useful 
for illustrating the general conditions which make 
for both successful police organization and newer 
problems in policing. 

Professionalization of the police. Bureaucrati- 
zation of the police, is, of course, importantly 
facilitated where public employment is highly re- 
garded and well paid, and where police service can 
provide prestigeful lifetime careers. It is under 
these circumstances that the necessary personal 
commitment to the service and the necessary 0C- 
cupational morale are most likely to develop. The 
United States has had particular difficulty in this 
regard. The state as an abstract entity has not been 
regarded with the special mystique of some coun- 
tries, nor has it been invested with the special sym- 
bolic aura of the English crown. In addition, the 
public service occupations in general have not been 


POLICE 177 


of the highest prestige; this is generally the case 
in market-oriented societies which are traditionally 
devoted to the pursuit of private fortune. Judging 
from recent research, however, there has been con- 
siderable improvement in this respect since the 
early 1900s, and this trend can be expected to con- 
tinue (Janowitz et al. 1958). In addition, the in- 
creasing concentration of population in large cities 
has allowed the development of larger departments 
with the possibility of more rigorous selection of 
personnel, better training, and the development of 
a more self-consciously professional administrative 
corps among police commanders. 

Until recently it has been difficult for police 
commanders in the United States to secure suffi- 
cient independence from local politics to develop 
systems of recruitment, promotion, and assignment 
based upon merit rather than political preferment. 
These difficulties in securing professional auton- 
omy have been especially pronounced in the older 
and more heterogeneous cities of the North and 
East, where the combination of unpopular laws 
and ethnic minority involvement in both machine 
politics and municipal services has often led to 
complex linkages between organized crime, the 
police. and ethnically based political machines. 
The major cities in the South, Southwest, and West 
have had fewer of these problems. During a period 
of rapid social change, underdeveloped public serv- 
ices in general, great heterogeneity of population, 
and legislatures dominated by rural interests who 
often wrote their own sectional and class norms 
into formal law, such connections between police, 
politics, and community were inevitable. Indeed, 
they served to ease social change and the urbani- 
zation and civic assimilation of the large immigrant 
populations, who were able to acquire relatively 
quick access not only to local politics but also to 
the police arm of government. 

Patterns of police authority. Under conditions 
of increased professionalization of municipal po- 
lice, the problems of internal discipline, efficiency, 
honesty, and respect for formal legality are increas- 
ingly being solved. Their solution, however, raises 
other difficulties related to police~community rela- 
tions, on the one hand, and new determinants of 
discretionary decision making, on the other. Where 
the police, as in England, have been able to rely 
on long-standing patterns of deference to the sym- 
bols of legitimate authority, coupled with a long- 
established cultural homogeneity and crowned 
symbolically by the monarchy, relations between 
police and populace reflect the gradual develop- 
ment of customary forms of behavior as well as 
formal legal restrictions on police and populace 
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alike. Under such circumstances, police authority 
is supported by the institutional environment, such 
as the law courts, as well as by popular attitude and 
social custom. On the other hand, where patterns 
of deference are not well developed, as in the 
United States, and the police are seen as simply 
doing a job rather than as representing an insti- 
tutional order, the legal system in general receives 
less support from custom and other institutions of 
society; in turn, the legal system is likely to repre- 
sent the deep ambivalence toward authority which 
characterizes the larger culture. Thus, in the United 
States police powers are severely limited as com- 
pared with those of the Continent, and police 
authority is not buttressed by developed custom 
within the bounds of formal law, as has been the 
case in England (International Conference... 
1962; Banton 1964). 

This combination of lack of supportive deferen- 
tial custom, the relatively unprivileged position of 
the state, severe legal restrictions on police powers, 
and high crime rates—especially high rates of vio- 
lent crime, on the one hand, and organized crime, 
on the other—has posed especially difficult prob- 
lems for the police in large American cities, As a 
consequence, certain informal practices that are 
illegal or of doubtful legality, which are probably 
found to one degree or another among police every- 
where, have received special attention from Amer- 
ican social researchers, Among these are the wide- 
spread use of discretionary powers that are poorly 
based in formal law and the noticeably declining 
problems of corruption (LaFave 1965; Skolnick 
1966). 

Increased concern with these practices, together 
with the rapidly growing political and civic aware- 
ness and influence of Negroes, the increasing 
bureaucratization and professionalization of the 
police, and the increasing necessity to regulate an 
affluent and mobile citizenry (especially in traffic 
control, which cannot always be identified with tra- 
ditional categories of criminal conduct) have led to 
an awareness that older patterns of police practice 
and older patterns of contact between police and 
community must give way to rationally calculated 
and organizationally inculcated forms of conduct. 
Rather than seeking to share communal sentiments 
and reflect them in their behavior, the police must 
try to develop ways of managing interpersonal con- 
tact so as to minimize challenge to police authority 
as well as degradation of the citizen, 

Symptomatic of this change is the increasing 
amount of social research, including public opinion 
surveys, with implications not only for general 


understanding of police but also for police practice 
(Great Britain...1962a; 1962b; Clark 1965; 
Preiss & Ehrlich 1966; Bordua 1967). This replace- 
ment of custom by rule and management has often 
been described by sociologists as characteristic of 
modern mass societies. It seems likely that as def- 
erence patterns decline in other societies, many 
features of recent American experience will devel- 
op there, 

Discretionary decision making. In democratic 
societies the problem of assuring that the police 
are organized so as to be a reliable instrument for 
the maintenance of order and the suppression and 
prevention of crime, while at the same time assur- 
ing that they exhibit restraint and sensitivity to citi- 
zen rights, means that organization of the police 
cannot be like that of an assault battalion which 
on command can be relied upon to do its duty 
regardless of cost to itself, its enemies, or the sur- 
rounding social fabric. Nor can it be organized to 
respond purely to external stimulation in an auto- 
matic and “programmed” fashion. Police organiza- 
tion involves elements of military command from 
the center, as well as elements of automatic re- 
sponse to stimuli at the periphery; however, the 
peculiar conditions of modern policing require 
above all that discretionary judgment within a 
framework of rules be widespread. 

Because rules cannot specify all eventualities, 
the decision making of police in a democracy is 
determined by morale, on the one hand, and by 
the relationship to the local community culture, 
on the other. The problem of morale is essentially 
one of assuring a commitment to central organiza- 
tional and legal purpose as a basis for discretionary 
decision. The problem of relation to the local com- 
munity is essentially one of maintaining a sensi- 
tivity to the culture of the community as it involves 
questions such as modes of exercising authority 
and kinds of offenses which can be proceeded 
against only at great social cost; it also involves 
a sharing by the police of the general societys 
sense of the significance of individual dignity. 

Involvement in the community. The balances 
between discretion and command and between 
isolation from and involvement in the general cul- 
ture are, in some ways, the same problem, since 
it is participation in the general life of the cian 
munity that helps the police fill in the ambigui- 
ties of formal law and departmental regulation. 
Integration into the community also prevents the 
police from substituting their own occupational 
and organizational culture for the law or for the 
moral sentiments of the local populace. Such com- 


munity integration is facilitated by cultural ho- 
mogeneity and by the absence of wide disparity 
between the formal law and social custom. 

The involvement of police in the community 
can, however, be overdone, as evidenced by many 
of the difficulties of developing restrained law en- 
forcement in racially mixed communities in the 
United States. Where dominant community senti- 
ment in the segments of the population holding 
power dictates denial of legal rights and protec- 
tions to Negroes, integration of the police into local 
culture may defeat the specific aims of legal policy 
as well as the general aim of equal dignity for all 
citizens, In general, the use of law as an instru- 
ment of social change requires sufficient isolation 
of the police from local involvements that they can 
act in accordance with new legal directives that 
are likely to meet resistance. 

In stable and homogeneous societies, there is 
likely to grow up a body of customary practice 
linking police and citizenry which serves to supple- 
ment and refine the formal law and the use of 
police authority. Where such custom has been of 
long standing, thus making both citizen and police 
behavior predictable and mutually tolerable, the 
basic trust of the citizenry in the police is likely 
to make demand for more elaborate formal con- 
trol of the police unlikely. However, where social 
change is rapid and the conditions for such cus- 
tomary adaptation are not present, demands for 
more control are likely to be prominent. The recent 
movement for civilian police review boards in the 
United States is an example of one such demand 
deriving from such a lack of stabilized trust. 


Deployment of police resources 

Police operations vary greatly, depending on a 
number of factors in addition to those already men- 
tioned. Among these factors are the kind and 
amount of illegal behavior, as well as its social 
organization; the degree to which uniformed police 
are expected to carry out the general functions of 
public protection and regulation in areas other 
than dealing with crime and disorder; the nature 
and distribution of communication facilities in the 
population; and the relative use of mechanical 
and electronic aids in crime detection and law 
enforcement. 

In some societies the police have many duties 
and become a general administrative arm of gov- 
ernment. This may be true either because of a 
general conception of the role of central govern- 
ment or because of the relative scarcity of other 
trained administrative services. In the United 
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States, where crime rates are high and where 
traffic regulation consumes an important share 
of police resources, police commanders have at- 
tempted to limit police operations to the central 
ones of protection of life and property through the 
suppression of crime and disorder. In addition, in 
their attempts to develop a more prestigeful and 
professional service, leading police spokesmen in 
the United States have tended to look to the tra- 
ditional professions, such as law and medicine, 
where specialization and the ability to limit func- 
tions have been accompanied by high prestige. This 
effort is especially significant in the United States, 
where autonomously developed performance stand- 
ards and specialized technical competence have 
been sources of occupational prestige, whereas 
performance of generalized service to the state has 
not. Performance of functions outside the areas 
of crime control and the maintenance of public 
order is not necessarily incompatible with a high 
standard of police service, however. Indeed, the 
public image of the police may be more favorable 
where police service includes considerable sup- 
portive activity in addition to more repressive 
services. 

Problems of organization and deployment of 
police are closely related to the nature of illegal 
behavior, its social organization, and the social 
channels whereby information on illegal activity 
is made available to the police. In modern, urban- 
ized societies much routine information concerning 
crime comes from the citizen complainant, who is 
usually the victim of the crime. The spread of 
telephones in the population, coupled with the use 
of two-way-radio-equipped patrol cars in the larger 
cities of the United States, Canada, and Scandi- 
navia, has made possible very rapid reaction to 
citizen calls. This reaction is usually mediated 
through a central radio dispatch center, which also 
enables considerable control over the activities of 
policemen by superiors. 

In the United States the high incidence of crime 
—especially crimes against the person—plus the 
lack of tribal or other means for dealing with crime 
results in a very large volume of work for urban 
police departments. This large volume is partially 
due to the lack of private security arrangements 
of a nonpolice nature; homesteads are not walled, 
and there is nothing like the famous concierge 
arrangement of France. Thus there is a need for 
the police to react very rapidly in a very decen- 
tralized fashion, as well as a need for centraliza- 
tion of police communication. For this kind of 
police work, deployment and allocation of police 
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resources are determined largely by the willingness 
of the citizenry to request police protection. 

Other police operations, such as vice control, are 
less reactive in nature and, indeed, may have no 
victim or complainant in the ordinary sense. Under 
these conditions, police work involves a more ini- 
tiatory strategy, and decisions as to deployment 
and resource allocation are much more under the 
control of police authorities themselves. Such 
“service” crimes as gambling, prostitution, and 
liquor or narcotics sales involve considerable or- 
ganization on the part of criminals; moreover, they 
have no victims willing to complain and act as 
prosecuting witnesses, and they can take place in 
premises protected by legal guarantees of privacy 
(Stinchcombe 1963). It is also characteristic of 
these offenses that they are likely to involve viola- 
tion of laws which do not necessarily represent a 
high degree of social consensus. This combination 
of circumstances, coupled with the large amounts 
of money which result, makes corruption of the 
police more likely than in the case of other types 
of crime, and corruption can function much like 
a private tax. It also means that removal of police 
from politics is more difficult, since the social 
differences which underlie the law will make them- 
selves felt in political demands for more strict or 
more lenient enforcement. 

Among new techniques being used by the more 
highly mechanized police systems of large United 
States cities, and to a lesser degree in Canada and 
Scandinavia, are elaborate electronic computer in- 
Stallations used to search records, sort through 
modus operandi files, and provide very rapid re- 
sponses to inquiries from field officers concerning 
Suspicious automobiles. These new techniques of 
communication, information retrieval, and dis- 
patching are significant not only with respect to 
the speed and effectiveness with which police 
respond but also with respect to the effective- 
ness of command and control within departments 
(Bordua & Reiss 1966). 


Organizational discipline 

Over and above control through the usual tech- 
niques of rule making and supervision, many de- 
partments in the United States have been led to 
maintain undercover units designed to monitor the 
efficiency and honesty of the police themselves, 
Such internal investigation units have been effec- 
tive when combined with firm control in reducing 
corruption and greatly increasing citizen respect 
for the police. Internal investigation units and 
central communications audits allow organiza- 


tional contro] of dispersed field officers, but they 
may also create problems of morale, since they 
bypass subordinate command levels and imply mis- 
trust. Such extreme measures for internal control 
are likely to become less necessary as conditions 
change in the United States, especially as police 
recruitment, training practices, and overt super- 
vision practices improve. These organizational im- 
provements, coupled with improving public respect 
for police, should help increase the sense of occu- 
pational honor among the police, which is crucial 
in the long-run assurance of competence and hon- 
esty (McMullan 1961). 

The internal control problems of large city de- 
partments have been greatly complicated in the 
United States by the long history of local political 
interference in internal police affairs, As a re- 
sponse to this problem, civil service tenure for police 
has become very widespread in an attempt to pro- 
tect individual policemen from threats to their 
livelihood by political influence. In the process, 
however, it has become quite common for the ul- 
timate power that imposes severe disciplinary 
measures, such as demotion or dismissal, to be 
lodged outside the police department. Dismissal 
for cause may require highly formal hearings, and 
appeal may lie to the regular courts. While this 
system helps eliminate unfair, arbitrary, or politi- 
cally motivated decisions by police command per- 
sonnel, it simultaneously weakens internal control 
exercised along more acceptable lines. This se- 
curity of tenure for the rank and file and the 
necessity for quasi-judicially presented evidence 
is one reason why reforming departments stress 
the need for undercover internal investigation. 
Through such investigations it is possible to secure 
appropriate evidence of wrongdoing and, in most 
cases, induce resignation rather than initiate formal 
proceedings. 

Given the widespread willingness of the pop- 
ulace to corrupt the police—usually in petty ways 
—and the widespread ambivalence toward author- 
ity exercised in the name of the state, it is neces- 
sary in the United States to place the police under 
a far more severe discipline than would be accept- 
able to the general population. Police everywhere 
are, of course, subject to close discipline, yet in 
many societies this is accompanied by either a 
complementary restraint and respect on the part 
of the citizenry or by a definition of the police 
as having a special elite position which helps n 
maintaining their morale. In the United States, in 
a well-run police unit, the rank-and-file officer is 
likely to experience a subordination less stringent 


than that experienced by his counterparts in other 
countries; however, the discipline that he experi- 
ences is far more severe, relative to the patterns 
of authority in the surrounding civil society. 

The organizational discipline—more apparent in 
large or rapidly improving departments than in 
small or static ones—not only sets police apart 
from the general populace but, perhaps more im- 
portantly, makes for a great contrast between the 
values of the police occupation and those of other 
segments of the system for the administration of 
criminal justice. The increasing emphasis on re- 
habilitation, reform, and resocialization of offend- 
ers has been accompanied by a decline in the use 
of legal coercion and a de-emphasis, at least in 
ideology, of ideas of punishment and rigid restric- 
tion, It thus sometimes seems to the police that 
modern ideas of treatment of criminals require 
relaxation of restraint on offenders and an in- 
crease in restraint on officers. Despite the increas- 
ing similarity of education and the decreasing in- 
tellectual isolation of police administrators in the 
United States, the differing conditions of organiza- 
tional control will probably make for continuing 
conflict between police and other aspects of the 
criminal justice system. 

Moreover, it may be anticipated that some of the 
difficulties faced by police in the United States will 
appear or increase in other countries. As wealth 
and education are more widely distributed, tradi- 
tional patterns of citizen deference break down 
and the traditional supports for police authority 
weaken. These changes produce difficulties not 
only in relating police to populace but also in the 
internal control of police themselves. 


Davip J. Borpua 


[See also CRIMINAL LAW; GAMBLING; PENOLOGY; Pux- 
ISHMENT; SOCIAL CONTROL.] 
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POLICY SCIENCES 


The policy sciences study the process of deciding 
or choosing and evaluate the relevance of available 
knowledge for the solution of particular problems. 
When policy scientists are concerned with govern- 
ment, law, and political mobilization, they focus on 
particular decisions. Policy scientists also study the 
choosing process of nongovernmental organizations 
and individuals and consider the significance of 
the current stock of knowledge for specific issues. 
Since an official decision or a private choice is a 
problem-solving activity, five intellectual tasks are 
performed at varying levels of insight and under- 
standing: clarification of goals; description of 
trends; analysis of conditions; projection of future 
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developments; and invention, evaluation, and selec- 
tion of alternatives. The policy sciences integrate 
philosophy, history, science, prophecy, and com- 
mitment. 

Although policy scientists appear in every civili- 
zation, the term dates from the years immediately 
after World War m (Lerner & Lasswell 1951). The 
present discussion is concerned with the profes- 
sional problems of the policy scientists, who in our 
civilization are recruited from legal, historical, so- 
cial, biological, and physical sciences. 

Goals. The value goals of policy scientists dif- 
fer widely from one another. For many scholars, 
especially historians and social scientists, the pri- 
mary lure is knowledge. They seek understanding 
of the spectacle of kings and presidents, ministers 
and generals, popes and prophets. Closely related 
to the pursuit of enlightenment is the desire to ac- 
quire, exercise, and achieve excellence in legal or 
other skills that are directly implicated in some 
phase of decision. When the value of greatest sa- 
lience is power, scholars identify themselves with 
the ambitions of individuals, political parties, na- 
tion-states, or other participants in the internal or 
external arenas of politics. In many cases religious 
and ethical goals predominate, or perhaps special- 
ists aspire to nothing more grandiose than a steady 
income, a respectable job, and a happy home life. 

When a policy scientist deals systematically with 
value goals he meets two challenges: to justify 
(ground) his choices, and to specify abstract con- 
ceptions in operational terms. It is sometimes sug- 
gested that lacking value orientation, the individual 
must search for values. However, by the time any- 
one is mature enough to struggle with fundamental 
questions his predispositions have been shaped by 
years of exposure to the social environments of 
childhood, youth, and early adulthood. Since the 
problem is to clarify rather than to acquire value 
demands, the starting point is self-observation of 
conscious and unconscious value perspectives. In 
what sequence were they acquired? What factors 
of opportunity and capability help to explain why 
they were adopted? Can they be attributed to the 
influence of culture, class, interest, or personality? 

Several methods of self-observation are available. 
There is, for example, the psychoanalytic technique 
of free association, and there are tests devised by 
social, clinical, and individual psychologists. Life 
history information can be obtained by interviewing 
Kinfolk, friends, and acquaintances. The emerging 
self-image can be projected to the future as a means 
of judging the consequences of adhering to current 
preferences and volitions. 

A value problem culminates in the comparison 


of alternative commitments in the social context. 
What “ought” one to want? The fundamental 
choice refers to the degree of value shaping and 
sharing. A policy scientist may commit himself to 
democracy, fraternity, and security, or he may sup- 
port a social order in which power, wealth, and all 
other valued outcomes are in the hands of a self- 
perpetuating caste. 

Value positions may be grounded empirically or 
transempirically. In the latter case the method may 
be theological or metaphysical. The fundamental 
propositions of the theological method assert: “God 
prefers value Y; I ought to prefer God’s will; I can 
know (or have faith in) God's will in this matter; 
hence I prefer Y.” The metaphysical method is 
parallel: “Nature (or history) prefers value Y; I 
ought to prefer what nature (or history) prefers; 
I can know (or feel convinced of) the preferences 
of nature (or history) in this matter; hence I pre- 
fer Y.. 

When value positions are grounded empirically, 
no attempt is made to justify the subjective event 
of commitment by a further subjective event of ref- 
erence to theology or metaphysics. The pattern 
may be “I prefer Y” or “I prefer what he prefers, 
which is Y.” In the former case the ego assumes 
full responsibility; in the latter responsibility is 
masked. 

Since evaluations cannot be eliminated, it is pos- 
sible to modulate them only by methods that reduce 
arbitrariness and produce a relatively disciplined 
result. The policy scientist who applies a contextual 
approach forestalls arbitrary judgment. Arbitrari- 
ness is precluded by the use of principles that pro- 
vide criteria for the content to be considered and 
propose an agenda for guiding attention to the dis- 
covery and assessment of appropriate content. 
Principles of content and procedure formulate 
strategies that render any initial value commitment 
subject to the discipline of suspended and tested 
judgment. 

Policy scientists who are committed to the free 
man’s commonwealth find it difficult to believe that 
anyone who applies a fully contextual approach 
can stop short of making a similar commitment. It 
is particularly persuasive to emphasize that all 
members of the community are better off when 
everyone has an opportunity to develop latent tal- 
ent, subject only to the limitations necessary to 
obviate destructive mutual interference. It must be 
conceded, however, that human judgments are infi- 
nitely diverse and only partially predictable. 

A contemporary policy scientist who seeks to 
specify his preference for an abstraction such as 
human dignity finds that the task is somewhat 


lightened by the provisions of the Universal Decla- 
ration of Human Rights. The declaration is in 
harmony with the following definitions of value 
sharing: (1) shared power is authoritative and 
controlling participation in the making of impor- 
tant decisions; (2) shared enlightenment is access 
to the intelligence required for the discovery of 
common interest; (3) shared skill is opportunity 
to acquire, exercise, and achieve excellence in the 
arts, crafts, and professions; (4) shared wealth is 
access to the benefits of production; (5) shared 
well-being is enjoyment of safety, health, and com- 
fort; (6) shared affection is enjoyment of congenial 
and intimate human relations and love of inclusive 
groups; (7) shared rectitude is a common demand 
to act responsibly on behalf of human dignity 
(McDougal et al. 1963). 

Trends. An inclusive view of the role of policy 
scientists calls for an examination of the part they 
have played in history and in contemporary affairs. 
In some measure the importance of basing policy 
on knowledge has been recognized in every civiliza- 
tion. Confucius, the most influential thinker in 
Chinese history, spent a lifetime preparing himself 
and his students to answer a call that never came 
to him. The idea of a secular scholar—adviser was 
competitive with other advisers, especially with un- 
trained kinsmen and friends, soldiers, magicians, 
and priests. The sayings attributed to Confucius 
gave prominence to three of the five intellectual 
tasks we have identified, since he dealt with the 
clarification of values, the examination of historical 
succession, and the analysis of conditioning fac- 
tors. Confucius took note of local traditions and 
consulted historical records. In modern terms we 
can say that he engaged in field work and historical 
criticism. 

Ancient Indian civilization gave prominence to 
teachers and advisers of the prince who were more 
concerned with the invention and weighing of stra- 
tegic alternatives than with religion or magic. The 
writings of Kautilya, for example, are impressive 
demonstrations of objectivity in the critical evalu- 
ation of principles of efficient statecraft. They were 
designed to instruct the prince, and his advisers, in 
the management of all aspects of foreign and do- 
mestic affairs. 

The world of Greece and Rome led to the exfolia- 
tion of intellectual tasks adapted to the city-state, 
to the confederation and the empire, and to citizen 
assemblies, senates, tyrannies, and autocracies. In 
the Roman and Byzantine empires the most distinc- 
tive intellectual figures were jurists. The roles of 
jurist and theologian were merged in Muslim 
countries, 
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The revival of learning in western Europe 
brought with it the renewal of scholarly proficiency 
in law, philosophy, medicine, and science; universi- 
ties came into existence. Science depended upon 
academies, which eventually made peace with uni- 
versities. Many scientists became official advisers 
and expert witnesses on the significance of the new 
knowledge for policy. With the recent rise of the 
social and behavioral sciences to supplement law 
and philosophy, new specializations are being de- 
veloped in order to mediate between research 
workers and teachers, officials, or revolutionaries. 

Conditions. What factors favor the rise of the 
policy sciences? The coming of urban civilization 
played a crucial part in the history of intellectuals. 
The urban division of labor introduced literacy and 
record keeping and favored new arts and sciences. 
‘As civilizations expanded in the sea of folk so- 
cieties, institutions were adapted to the purposes of 
authority and control. Power passed from kinship 
to territorial units, a change that favored the im- 
personality of legislation, bureaucratization, and 
policy-oriented knowledge. 

Although urban civilization arose before policy 
scientists existed in considerable numbers, many 
basic intellectual specializations—partly explicit in 
folk societies—were carried into the new environ- 
ment. Since value goals were taken for granted in 
simpler communities, no provision was necessary 
for clarifying values. The older generation collec- 
tively performed the historical function. The earli- 
est tendency to specialize had probably been linked 
with the interpretation of omens or with attempts 
to employ magical or religious rites as means of 
coercing the future to yield value indulgences. The 
early cave paintings of southern Europe and Africa, 
for instance, were evidently expected to influence 
the future in favor of good hunting. Side by side 
with coercive spells and related strategies were 
propitiatory rites that were probably assumed to es- 
tablish necessary, if not sufficient, claims on trans- 
empirical forces or spirits to grant value indul- 
gences, either by preventing losses or by bringing 
abundant food, health, successful childbirth, or se- 
curity from enemies. 

The approaches we have described were largely 
manipulative, rather than contemplative, since they 
were expected to compel or induce the future to 
gratify the value demands of groups or individuals. 
The origin of the contemplative frame of mind and 
the pursuit of enlightenment as an end in itself, a 
scope value, cannot be traced. It is probable that 
soothsayers and astrologers would foresee many oc- 
currences that were expected to remain outside the 
realm of manipulation. We can also imagine that 
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speculative minds devised images of man and na- 
ture that seemed credible, even when known tech- 
niques were unable to modify their traits. 

Magicians or priests were ritualists whose prin- 
cipal reliance was exact repetition. Hence repetitive 
skill was a base for influencing all preferred events. 
The distinction between skill and enlightenment 
was latent in the difference between specific infor- 
mation closely related to circumscribed ends and 
comprehensive pictures of social and natural order. 

In the context of urban life empirical observation 
gained importance. When any attempt at manipu- 
lation fell short of success some shadow of doubt 
was cast on fundamental assumptions, and the 
quest for better empirical information was enliv- 
ened. In many cultures the technique of discovery 
was highly internalized, since it was believed to 
depend on cultivating moments of illuminating con- 
tact with the transempirical. However, many em- 
pirical and relatively externalized methods were 
also used. Towers were built to observe the sky; the 
natural habitat was scoured to find unusual plants, 
animals, or inorganic objects; laboratories were 
constructed to assist in disclosing the inner work- 
ing of substances. 

Without pretending to isolate unvarying se- 
quences, it is possible to identify several long-range 
trends that have been conditioned by the cumula- 
tive growth of civilizations. We can follow the rise 
of astronomy from astrology, of chemistry and 
physics from alchemy, of physicians from medicine 
men, and of the social, behavioral, and policy scien- 
tists from early practitioners of witchcraft, divina- 
tion, and sorcery. Empirical methods improve; 
transempirical perspectives grow dim. The move- 
ment has been from the fear and propitiation of 
nature, or of individual and collective personifica- 
tions, to universal religions and secular ideologies. 
Some secular myths appeal to faith in history or 
science or sources of absolute knowledge of coming 
events. The empirical techniques and the tentative 
perspectives of science are of relatively recent 
prominence or weight; they are incompatible with 
claims for absolute knowledge of the future, even 
in the name of science, since they put the accent 
on probability, not inevitability. 

As the globe shrinks into interdependence, rely- 
ing more fully on science and technology, the pol- 
icy sciences gain significance. The scientific pattern, 
diffusing slowly through the world, is at last receiv- 
ing detailed application to the policy process itself. 
Interdependence implies that every participant and 
every item in the social process is affected by the 
context in which it occurs and that the future 
structure of the context is in turn influenced by the 


changing pattern of detail. The growth of factual 
interdependence is followed, even when unantici- 
pated, by subjective awareness of interdependence 
and by demands to obtain a more comprehensive 
and realistic map of the whole, a map capable of 
providing rational guidance for attempts to maxi- 
mize values. The policy sciences play a part in the 
social process equivalent to the role of specialized 
channels of information in individual organisms. 
Besides ministering to the requirements of each 
cell, specialized channels integrate the total organ- 
ism in the context of its environment. 

Policy sciences spread as a function of perceived 
conflict, as when nations, classes, interests, and 
personalities are aware of crises where seemingly 
incompatible demands are put forward. During less 
critical phases there may be less attention to spe- 
cialized advice. 

Degrees of effective centralization and of autoc- 
racy or oligarchy influence the function of policy 
sciences. The decision makers of an autocratic or 
oligarchic body politic strive to maintain their po- 
sition, especially if new, by editing the public map 
of reality and excluding competitive elements from 
access to information. Access is deliberately seg- 
mented so that comprehensive summaries and 
credible reports are restricted to the pinnacle elite. 
The media of public communication become ritual- 
ized and trivialized. Ritualization leads to the man- 
aging of news to provide parables of confirmation 
for the established ideology; trivialization is accom- 
plished by omitting realistic images of the whole 
and by playing up news and comment whose rele- 
vance to serious decision is marginal. 

In such a body politic those who are skilled in 
inquiry are pushed to one side by dogmatists. The 
technique of the dogmatist is to reaffirm the full 
truth of selected doctrinal propositions. Hence if 
public policy culminates in military defeat, the re- 
sponsibility may be alleged to lie not with the doc- 
trine but with false interpreters. Inquiring minds 
are discouraged, since able people must devote time 
and ingenuity to demonstrating that empirical 
facts, estimates, and analyses confirm the basic 
dogma. 

The avenues to power in a centralized autocracy 
or oligarchy put premiums on the arts and crafts 
of the political police, who monopolize or fabricate 
the information on which estimates of loyalty and 
capability are based. Indoctrination is carried on in 
an intimidating context. However, the elite of a 
body politic of this type suspects that subversive 
thoughts and impulses lurk behind any façade of 
seeming acceptance. Hence the use of physical 
means of ensuring conformity is both attractive 


and widespread. Among the oldest physical strate- 
gies is insistence that activity be incessant and ob- 
servable; hence there is the compulsion to apply 
programs of supervised work and play. A related 
strategy favors extremes of physical mobility or im- 
mobility, the former being exemplified in the mix- 
ing of strangers in mass undertakings, the latter 
in the prohibition of unapproved and unsupervised 
travel or change of residence. As modern science 
and technology expand, they are likely to put ever 
more sophisticated physical instruments of compli- 
ance in the hands of political elites—such devices 
for the invasion of privacy as secret electronic 
equipment or narcosynthesis. 

In large bodies politic where power is rather 
widely shared, the policy sciences are as diverse as 
are the public orders themselves. Every decision- 
making organ usually finds it expedient to obtain 
expert advice. Hence all structures of government 
from the world community to a locality tend to 
draw on trained policy specialists. Political parties, 
pressure groups, and other voluntary associations 
develop administrative services and a brain trust. 

An autocratic or oligarchic elite that depends on 
science and technology finds that it must engage 
in a continuing struggle against the liberating con- 
sequences of freedom to theorize and observe, 
which are built-in threats to an established myth. 

Projection. It is possible to estimate our posi- 
tion in the past, present, and future of world poli- 
tics with the aid of “developmental constructs” 
(Lasswell 1935). The first step is to characterize 
in provisional fashion the most important political 
patterns at a recent cross section and to imagine a 
similar slice through the immediate or more remote 
future. The best publicized approximation to a de- 
velopmental construct is the Marx-Engels image 
of our epoch as in transition between capitalism 
and socialism. The sequence is alleged to move 
from the primacy of the bourgeois class to the class- 
less society (via the proletarian society ). 

The Marxist model is not, however, a true devel- 
opmental construct, since it is put forward in the 
name of the inevitability that is alleged to be justi- 
fied by scientific knowledge. A true construct makes 
no claim to the status of a scientific proposition, 
since it puts forward no generalized hypothesis 
about invariant relations among basic factors. 
Nevertheless, it would be a mistake to assert that 
developmental constructs are undisciplined by his- 
torical and scientific knowledge. For instance, they 
are influenced by the extrapolation of trend curves 
into the future. Such extrapolations often show 
when and where conflicting lines of development 
are likely to collide. In attempts to estimate the 
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probable result of collision, the existing stock of 
quantified scientific knowledge is brought to bear 
on the variables involved. When quantified knowl- 
edge is exhausted it is acceptable to use with ap- 
propriate reserve less-organized information from 
expert sources. 

Developmental constructs can be employed as a 
means of directing critical attention to any signifi- 
cant feature of the social process. We referred above 
to a class construct, and suggestive models have 
dealt with cultures, interests, forms of personality, 
and ideological or organizational components of 
institutions. 

Because our present concern is with policy scien- 
tists, we briefly present a model that sets forth the 
role of intellectuals in the world context. The con- 
struct proposes that a distinctive political change 
in our epoch is the rise of the intellectual class to 
power—more specifically, the permanent revolu- 
tion of modernizing intellectuals (Lasswell 1965). 
Relying upon knowledge as their base value, intel- 
lectuals express and manipulate the aspirations 
and dissatisfactions of other elements of society. 
In the last century one section of the intellectual 
class began to specialize in organizing the dissatis- 
factions of peasants and factory workers in the 
name of proletarian socialism. A subsection of this 
group seized power in Moscow in 1917 and liqui- 
dated both the caste system based on land and the 
rising business class. Another subsection seized 
power in Germany (in 1933) in the name of na- 
tional racist socialism and installed a bureaucratic 
garrison—police state. In Japan the officers’ corps 
and small groups of conspirative agitators reacted 
against the parliamentary institutions borrowed 
from western Europe and took power (in 1931 and 
after). In advanced industrial and democratic coun- 
tries the intellectual classes have been less seriously 
provoked to revolutionary violence by lack of up- 
ward mobility in the social system since they have 
been able to advance through the institutions of a 
nonhierarchized society, specializing in propaganda, 
diplomacy, administration, engineering, and related 
skills. In countries of former colonial and nonin- 
dustrial culture the political initiative most obvi- 
ously lies in the hands of party agitators, mass 
party organizers, officials, military, police, and 
engineers. 

The persons who seize or exercise power are not 
necessarily recruited from among policy scientists. 
But they must rely on policy scientists to cope with 
the complexities of large-scale modern civilization. 
The study of decision making has been facilitated 
by modern social and behavioral sciences and by 
several branches of mathematics, physics, and en- 
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gineering, particularly the specialties connected 
with communication. The use of the propaganda 
instrument has been affected by the research and 
advice of political scientists and social psycholo- 
gists, as well as by students of journalism, adver- 
tising, and marketing. Decision analysis owes much 
to the mathematics and statistics of probability, to 
industrial management, and to the political and 
administrative sciences. The fundamental analysis 
of culture and personality—pioneered by anthro- 
pologists, psychiatrists, and specialists on child de- 
velopment—has deepened and widened the relevant 
map of knowledge. 

As the physical and biological sciences become 
more diversified and important, self-appointed spe- 
cialists act as mediators between colleagues of the 
field, the library, or the laboratory, on the one hand, 
and the final decision makers of government, busi- 
ness, and other institutions, on the other. 

If the conditions suitable to democracy are not 
realized, power will eventually be concentrated in 
an elite class (eventually a caste) of intellectuals 
who constitute a ruling oligarchy. In addition to 
using communicative skills for purposes of indoc- 
trination, they can be expected to adapt medical 
skills to check on loyalty and inculcate obedience. 
Not the whole intellectual class but an oligarchical 
segment can be expected to rise to a dominant 
position under these circumstances. 

Alternatives. What strategies are open to policy 
scientists to improve their effective role in decision? 
As a means of limiting the scope of the present 
discussion, we emphasize the problems of policy 
scientists who identify themselves with realizing 
the goals of human dignity on the widest possible 
scale. 

Gathering intelligence. A major requirement is 
an open network of intelligence that supplies the 
data from which a realistic and comprehensive 
map of the whole can be kept current (Lasswell 
1963). Existing institutions survey selected fea- 
tures of the social process on a global or more 
limited scale; the coverage and technique of these 
institutions can be improved. 

Inclusive surveys of voting or fighting, for ex- 
ample, need to be put in a deeper context of con- 
ditioning factors by mobilizing more intensive tech- 
niques of investigation. Prolonged interviews with 
representative samples of culture, class, interest, 
and personality groupings, for instance, disclose 
the changing directions and intensities of internal 
conflict, often unconscious, which are invaluable 
in estimating future political developments. 

The microstructure of variables can be examined 


by experimental methods in laboratory situations. 
New variables may be brought into view, new the- 
oretical models verified, and new procedures of 
measurement and manipulation invented and test- 
ed. When new measurement methods are simplified 
for application in inclusive surveys, the yield from 
intelligence institutions is improved. 

The strategy of prototyping introduces innova- 
tions in social institutions. The practices are not 
broken up into the many variables appropriate to 
a laboratory experiment, since the main object is 
to weigh the role of the practice as a whole in rela- 
tion to other practices in the social process. Proto- 
typing requires the choice of situations that are 
sheltered from the stress and strain of major po- 
litical arenas. Opportunity is needed to develop a 
group whose members are intellectually committed 
to the innovation and whose character systems en- 
able them to live up to their professed perspectives. 
Prototypes of power sharing, for example, can be 
explored in remote provinces or in pluralistic 
associations. 

When prototypes have operated effectively in 
test situations, it is appropriate for the results to 
be made known and for changes to be introduced 
by public authority. This is the strategy of inter- 
vention which, unlike prototyping, is not typically 
controlled by scientists who are free to devote them- 
selves mainly to the advancement of knowledge. 

Taken together, the strategies of surveying, experi- 
mentation, prototyping, and intervention strength- 
en one another and improve the intelligence or 
appraisal operation of which they are part. The 
correlations among the data obtained by survey 
methods often point to factors whose impact can 
be studied more intensively under experimental 
conditions. Prototyping brings into the open the 
characteristic cluster of predisposing factors cur- 
rent in a particular situation and draws attention 
to the problem of improved analysis and control. 
The intelligence network appropriate to the policy 
scientist needs to identify and describe predisposi- 
tions on a global scale. With such knowledge at his 
disposal a policy scientist is able to estimate the 
probable impact of any combination of environing 
factors upon each cluster of predisposition. 

Presenting information. Special methods are 
essential to permit the policy scientist to cope with 
the enormous load of information relevant to his 
contextual task. Modern technology is making avail- 
able procedures of micromodeling that can be used 
to prepare concise, representative, and realistic 
summaries of the social process of the world cony 
munity or of any component context. Audio-visua 


devices supplement print; they can be mobilized in 
chart, map, and model rooms for purposes of re- 
search, instruction, and consultation. 

Haphazard use of audio-visual aids is not an 
efficient application of these instruments. A “de- 
cision seminar” is necessary to provide continuity 
and organization. When first introduced, each item 
needs to be evaluated by seminar members with 
respect to reliability and relevance. Material that 
gains acceptance is assigned to the wall space allo- 
cated to the feature of the social process it best 
represents. The exhibits provide an auxiliary brain, 
supplementing and guiding individual storage and 
recall, Surrounded by selective representations of 
the entire relevant social context, seminar mem- 
bers are able to give methodical consideration to the 
relationship between each detail and selected goals, 
trends, conditions, projections, and alternatives. 

If details are to be systematically compared with 
one another, it is necessary to describe them in 
categories that refer to a theoretical model of the 
whole social context (Lasswell & Kaplan 1950). 
Exhibit space is allocated to the social process 
under study, which may be world inclusive or a 
territorial or pluralistic component. Space is allo- 
cated to depicting the “participants” in the inter- 
action, to the “outcomes” (values) that the partici- 
pants seek to maximize, to “institutional practices” 
that are relatively specialized to each value, and to 
the “resource” setting involved. Value outcomes are 
culminating events in the unending flow of inter- 
action. They can be described for convenience in a 
list of categories that is manageably short and that 
refers to the value-institution processes studied by 
political scientists, economists, and other specialists. 

Illustrations are provided of the events which are 
classifiable as value outcomes and which are re- 
ferred to in appropriate chart, map, OF model 
rooms. Votes and victories or defeats in combat 
are decisions (power outcomes ) which are depicted 
during selected periods by charts which show the 
frequency of contested elections, for example, and 
maps which display the location and frequency of 
collective violence. Typical wealth outcomes are 
exchanges of claims to goods and services. In en- 
lightenment outcomes, information is exchanged. 
The giving or receiving of physical or mental care 
comes within the scope of well-being. The inter- 
change of intimacy and friendship is affections of 
acts of recognition, respect. Evaluations of excel- 
lence are given at skill outcome phases. Tf evalua- 
tions are in terms of theology or ethics, rectitude 
is the value involved. 

Outcome events are located in pre-outcome and 
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post-outcome sequences. In portraying the power 
process, for example, we note the “participants,” 
such as nation-states who act upon one another. 
It is relevant to describe the “perspectives” of all 
participants, which are the subjective events of 
identification, demand, and expectation. Perspec- 
tives are integrated in myths, each of which is 
composed of doctrine, formula, and miranda. A 
world map of communism, for instance, may show 
the first appearance and the subsequent diffusion 
and restriction of Marxist doctrine. Maps of the 
legal systems of the globe show, among other pat- 
terns, the narrowing or enlarging shadow of Ro- 
man law. A map of the distribution of world heroes 
is evidence of unity (or disunity) of world miranda. 
When the level of interactions that involve power 
reaches a discernible minimum, an “arena” is cre- 
ated. In the case of a nation-state the arena is 
highly organized; not so the present arena of world 
politics. Participants in arenas use the assets (or 
“pase values”) at their disposal by employing 
“strategies” to influence outcomes and effects. A 
comprehensive image of the social process depicts 
all values, not simply power, in terms of partici- 
pants, perspectives, situations, base values, strate- 
gies, outcomes, and effects. 

For the purpose of aggregate comparison, value 
shaping and sharing can be summed up in terms 
of “accumulation” and “enjoyment.” To some extent 
everyone participates in shaping power outcomes 
even though the most controlling and authoritative 
decisions may be dominantly shaped and shared by 
a small elite. We are accustomed to distinguishing 
between the production (shaping) of wealth and 
its distribution (sharing). Those who share any 
value may employ it for further shaping (accumu- 
lation) of the same value, or they may enjoy its 
use in obtaining other values. During any time 
period it is desirable to describe the “gross” out- 
come of any value that results from shaping activi- 
ties and the “net” outcome, which deducts the 
quantity of the value depleted in the shaping 
process. Thus a chart of gross skill outcome during 
a period will show changes in the level of excel- 
lence; the net outcome deducts the outlay of skill 
in teaching. Similarly, gross enlightenment out- 
come is the flow of research and news reports; the 
net outcome deducts the losses of stored informa- 
tion. Besides the outcome figures it is important to 
summarize the aggregate accumulation of each 
value. National wealth, for example, is commonly 
depicted in terms of production factors, which are 
a nation’s predispositions and capabilities to pro- 
duce. A rough depiction of the rectitude assets of a 


188 POLICY SCIENCES 


community shows the strength of predispositions 
to conform to standards of responsible conduct 
(conformities minus frequency of nonconformity ). 
The measurement problems that arise in describing 
some values, such as power, wealth, and well-being, 
have been extensively considered in our civiliza- 
tion. The contextual approach of the policy scien- 
tist is bringing into the open neglected dimensions 
of affection, respect, rectitude, skill, and enlight- 
enment. 

For comparative purposes the outcome events 
relatively specialized to any value require separate 
consideration. Under penalty of severe deprivation, 
for example, some decisions limit access to par- 
ticular intelligence reports. Other decisions involve 
the promotional activities of governments, political 
parties, or pressure associations. Some decisions 
lay down fundamental (constitutive) prescriptions 
in regard to power sharing. Decisions may pro- 
visionally invoke prescriptions in the act of char- 
acterizing concrete cases or make final application 
of prescriptions to particular circumstances. De- 
cisions also provide appraisals of the degree to 
which policy objectives have been accomplished 
and perhaps terminate official prescriptions. Among 
the charts that summarize appraisal are reports of 
the case load handled by welfare agencies or the 
proportion of recidivists among convicted offenders. 
Any comprehensive presentation of a social context 
shows that each value outcome has its own rela- 
tively specialized intelligence, promoting, prescrib- 
ing, invoking, applying, appraising, and termi- 
nating outcomes. 

The distinction between conventional and func- 
tional usages is important in gathering and pre- 
senting all social data for policy analysis. The 
terms employed in any community are part of its 
conventions. Functional definitions, on the other 
hand, serve the purposes of scientific description 
and comparison. As a matter of technique in the 
conduct of field investigation, the observer begins 
by classifying situations according to the conven- 
tional usages that most closely approximate his 
functional categories. Thus, local usage may refer 
to various councils and assemblies as part of “gov- 
ernment.” Scientific study of the values and insti- 
tutions in a community may culminate in a reclas- 
sification of these councils and assemblies and in 
the identification of quite different structures as 
power institutions. A secret society, for instance, 
may make the controlling decisions, and research 
may reveal that this is generally expected to be the 
case and is accepted as authoritative. Hence public 
assemblies may be more significant for general 
information (enlightenment) than for power. 


Recruiting policy scientists. Since the policy 
sciences specialize in the decision process and in 
assessing the significance of all knowledge for 
policy purposes, the range of potential recruitment 
is as wide as the intellectual class. At any stage in 
his career a specialist may choose to perform a 
mediating role between his particular group and 
the decision process of the larger community. In 
some cases a scientist or scholar may become 
aware of the fact that his work is largely adminis- 
trative, perhaps as head of a department or labora- 
tory. He may discover latent talent as a clarifier of 
technical information to nonspecialists or as a rep- 
resentative to the general public of scholars and 
scientists. The discovery of these roles may be 
gradual, perhaps beginning with a consultative 
contact with the officials of governments, parties, 
or pressure associations. If the original specialty is 
regarded as highly important for security purposes, 
the initial step may involve covert or overt intel- 
ligence agencies, Conflicting policy views may bring 
him into the arena of promotional activity, where 
he attempts to influence public demand. Legislative 
committees may invite consultation, and in excep- 
tional instances the individual may take an official 
role in prescribing, invoking, applying, appraising, 
and terminating organs. An important part of the 
problem of the policy sciences is to provide supple- 
mentary orientation for a specialist who is begin- 
ning to concern himself with public affairs. 

The chief obligation of the policy sciences relates 
to the decision process itself. At present the aca- 
demic disciplines most immediately involved in- 
clude political science, law, public administration, 
business management, political sociology, and con- 
temporary political history. Graduate schools are 
only partly successful in providing fully contextual 
programs of preparation and in encouraging the 
use of available theoretical and procedural tech- 
niques of the social and behavioral sciences. Aca- 
demic experience is being supplemented by intern- 
ship programs that bring students into intimate 
contact with the decision process. 

The training of policy scientists does not as yet 
include much systematic exposure to suitable Op- 
portunities for study of the self, nor are the con- 
textual potentialities of the decision seminar fully 
used. During the years of active professional life 
there are insufficient opportunities to participate 
occasionally in advanced centers for inventing and 
evaluating policy in the light of new knowledge in 
every major field. 


The importance of long-range and middle-range 
policy thinking is in harmony with present expecta- 
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tions that change will accelerate. One need consider 
only a few long-range possibilities to substantiate 
this point. Weapons of a novel kind lie close at 
hand, including bombs that paralyze temporarily 
without inflicting permanent damage. Teaching 
and research are already in active reconstruction 
as a result of new instruments of storage, retrieval, 
and instruction. Competent biologists foresee that 
the genetic inheritance of man can be deliberately 
modified. We are told that death itself may be 
abolished by the substitution of molecules as they 
wear out. Engineers expect to devise machines that 
simulate or improve on existing forms of life, in- 
cluding man. We are on the threshold of the astro- 
political age, where we are learning to penetrate 
a new habitat that may eventually bring us in 
touch with other advanced forms of life. The pos- 
sibility is not to be ruled out that new forms. of 
energy, possibly parapsychic, will be uncovered. 
Can policy scientists achieve and sustain a level 
of creativity commensurate with the threats and 
opportunities that face mankind? To a phenomenal 
extent the future of man is contingent on perfect- 
ing the permanent revolution of modernizing intel- 
lectuals—a revolution that is presently in the 
process of finding its distinctive aims and appro- 
priate strategic instruments. 
Haroip D. LasswELL 


[See also BEHAVIORAL SCIENCES; DECISION MAKING; 
POLITICAL SCIENCE; POLITICAL THEORY.] 
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The articles under this heading outline the sub- 
ject matter of political anthropology and its in- 
tellectual history. Specific topics in the field are 
discussed in CONFLICT; FEUD; JUDICIAL PROCESS, 
article on COMPARATIVE ASPECTS; Law, article on 
LAW AND LEGAL INSTITUTIONS; SANCTIONS; STATE; 
STATELESS SOCIETIES; WAR, article on PRIMITIVE 
WARFARE; and in the biographies of Freup and 
Maine. Related subjects are covered in MODERNI- 
ZATION; POLITICAL CULTURE; POLITICAL SOCIOLOGY; 
POLITICS, COMPARATIVE; and SOCIETAL ANALYSIS. 


Elizabeth Colson 
M. G. Smith 


1. THE FIELD 
u. POLITICAL ORGANIZATION 


I 
THE FIELD 


Political anthropology as a distinctive branch of 
social and cultural anthropology is a late growth 
stimulated by the publication of African Political 
Systems (Fortes & Evans-Pritchard 1940). As late 
as 1959, Easton’s provocative essay could deny the 
existence of political anthropology as a discipline : 
“To put the matter formally, political institutions 
and practices tend to be viewed in anthropological 
research as independent variables, of interest pri- 
marily for their effect on other institutions and 
practices of the society of which they are part” 
(1959, p. 212). This, he argued, had the effect of 
obscuring “the analytic distinction between polit- 
ical and other forms of social behavior” (ibid., 
p. 213). Here he pinpointed a major difference be- 
tween the political scientist who concentrates on 
a limited range of institutions which he has de- 
fined as political and the anthropologist who at- 
tempts to make comparisons using data derived 
from multipurpose institutions and who expects to 
include in his formulation all elements relevant to 
the attainment of the various ends which he has 
defined as political. Where political scientists have 
analyzed political institutions, anthropologists have 
sought to distinguish political groupings or polities 
and have analyzed the means whereby these group- 
ings attained their public purposes. 

Easton's article sums up the development of 
much of political anthropology through its first 
hundred years, when it was primarily concerned 
with matters of definition and the creation of 
typologies. During this period, anthropologists de- 
veloped a terminology appropriate to the wide 
variety of political systems with which they dealt. 
They clarified the characteristics that distinguished 
different systems and used these as the bases of 
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typologies useful in comparative work. Inevitably 
they dealt with descriptions of political groups and 
political roles as these ideally functioned. This 
directed attention to “constitutions” and mecha- 
nisms for social control rather than to the competi- 
tive aspects of politics. 

If the period from 1940 to 1960 was dominated 
by the synchronic study of political structures in 
a state of assumed equilibrium and by the creation 
of typologies, the period after 1960 showed an 
increasing interest in the development of a theory 
that could deal with change, faction, party, and 
political maneuver. This shift was signaled in 1954 
by the appearance of Edmund R. Leach’s Political 
Systems of Highland Burma, which emphasized the 
existence of political alternatives and the search 
for power as the effective basis for individual 
choice between alternatives. It became still more 
explicit in 1959 when Fredrik Barth attempted to 
apply game theory to the analysis of political or- 
ganization among the Pathans and again empha- 
sized the search for personal advantage (1959a; 
1959b). M. G. Smith followed the same general 
trend in his Government in Zazzau (1960), which 
concentrated upon political maneuvering by vari- 
ous contestants for power within a Hausa state. 
Smith is perhaps the first anthropologist to intro- 
duce an analytical distinction between government 
and politics which is operationally useful. As long 
as anthropologists dealt primarily with models of 
ideal political structures, this distinction could be 
ignored, and the terms were used interchangeably. 
With the move to the study of competition for po- 
litical power, it became vitally important to dis- 
tinguish the implementation of political decisions 
and the carrying out of administrative routine 
from the struggle for control of decision-making 
positions. 

Anthropological interest in political typologies 
and the attempt to correlate these with other fea- 
tures of social life began with the 1861 publication 
of Maine’s Ancient Law. In harmony with the in- 
tellectual climate of his period, Maine attempted 
to deduce evolutionary stages through which de- 
veloped political systems had passed. His primary 
distinction, between societies organized in kinship 
terms and those organized on a territorial basis, 
still has some influence. He argued that in ancient 
societies only kinship existed as a reason for 


. .. holding together in political union. The history of 
political ideas begins, in fact, with the assumption 
that kinship in blood is the sole possible ground of 
community in political functions nor is there any of 
those subversions of feeling, which we term emphat- 
ically revolutions, so startling and so complete as the 


change which is accomplished when some other prin- 
ciple—such as that for instance, of local contiguity 
—establishes itself for the first time as the basis of 
common political action. .. . The idea that a number 
of persons should exercise political rights in common 
simply because they happened to live within the same 
topographical limits was utterly strange and monstrous 
to primitive antiquity. (Maine 1861, pp. 124, 126 
in 1963 edition) 


Maine’s attribution of a kinship basis to societies 
of “primitive antiquity’ was quickly modified by 
Lewis Henry Morgan and his followers to include 
so-called primitive societies of the contemporary 
world, which have been the particular province of 
the anthropologist during the last hundred years. 
This classification still has some currency among 
political philosophers of other disciplines and 
among some anthropologists, despite the repeated 
demonstrations that even very simple societies have 
a territorial base and that kinship is only one of a 
number of conventions that may be used for de- 
scribing territorial relationships. Some indeed have 
gone beyond Maine and accepted the kinship con- 
vention as evidence that territorial groupings are 
actual kinship groups. Lowie (1922), however, 
had early shown the importance of associations 
both as institutions of political integration and as 
organs of government in various tribal societies. 
Later, Radcliffe-Brown (1940, p. xiv) was to sum 
up the accumulating ethnographic evidence in the 
sentence “Every human society has some sort of 
territorial structure.” The interest in segmentary 
lineage systems—a marked feature of the 1940s 
and 1950s—produced new confusion on this par- 
ticular issue (Fortes 1953, p. 30; Mair 1962, pp. 
11-14). Schapera returned to the attack in 1956 in 
his study of South African political systems, Gov- 
ernment and Politics in Tribal Societies. The im- 
portance of territorial organization runs like a 
theme throughout his book. The truth appears to 
be that people who have much the same degree 
of technical and economic development may vary 
in the way in which they conceptualize their po- 
litical relationships, which are always territorial in 
nature. Some express them in kinship idiom; others 
use some other model. The differences, whatever 
they are, that distinguish highly developed political 
systems from simpler systems do not rest upon $0 
easily formulated a dichotomy as kinship versus 
territoriality. 

That Maine’s formula could survive so long Te- 
flects the fortuitous nature of the development of 
comparative political studies. In the late nineteenth 
and early twentieth centuries, ethnographic aC- 
counts of American Indians provided much of the 
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data for the formulation and testing of anthropo- 
logical hypotheses, but ethnographers had little to 
say about American Indian political systems, which 
could only be reconstructed from the memories of 
a demoralized conquered people confined on res- 
ervations under alien rule. Lowie was perhaps the 
only American anthropologist of the period who 
made any notable contribution to political anthro- 
pology on the basis of research among American 
Indians. His book The Origin of the State (1922) 
was most influential in stimulating research in 
regions other than America. The general lack of 
concern for political formulations based on Ameri- 
can ethnography is reflected in the contents of a 
notable collection of essays published as Social 
Anthropology of North American Tribes (1937). 
Only one essay, by John Provinse (1937), was 
primarily concerned with problems of a political 
nature, The development of Australian and Mela- 
nesian ethnography in the early twentieth century 
strongly influenced studies of kinship, religion, 
and social structure in general but did not stimu- 
late research on political problems. American 
Indian, Australian, and Melanesian societies tended 
to be small in scale and without sharply differ- 
entiated political institutions; political and social 
organizations were seen as one and the same. 

It was Malinowski’s experience in Melanesia 
which led him to create the functional school of 
anthropology and to insist that the anthropologist’s 
primary obligation was the study of systems of 
behavior in ongoing societies. This cleared the way 
for a study of political action as well as political 
structures, but Malinowski himself contributed 
little directly to the development of political 
studies. The definitive evaluation of his influence 
on social science (Firth 1957) includes no chapter 
on political studies. He had an abundance of field 
data on the subject, recently analyzed by Uberoi in 
his Politics of the Kula Ring (1962), but no rele- 
vant theoretical framework which would have 
allowed him to bring his observations together in a 
systematic fashion. Political anthropology received 
its real impetus when students trained by Mali- 
nowski and Radcliffe-Brown encountered still func- 
tioning large-scale political units when they began 
to work in Africa in the 1930s. As Fortes has 
pointed out, they were forced to study government, 
whereas their predecessors, who had dealt with 
small-scale societies, had studied social control 
(1953, p. 18). ne 

In Africa they encountered polities organized gn 
centralized states having differentiated political in- 
stitutions of a type which linked them with the 
kingdoms, principalities, and republics of Europe 
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rather than with the undifferentiated, loosely organ- 
ized groupings previously studied by anthropolo- 
gists. Well-defined political offices, a hierarchy of 
authority and communication, and the explicit 
control of organized force were familiar features 
and could be handled in terms of the political 
theories of the day. The problem of political defi- 
nition—the need to develop a minimal formula to 
isolate political from other forms of action—be- 
came apparent only when they sought to compare 
their data with that being produced by anthropolo- 
gists working in a very different type of society. 
Populations comparable in size to those found in 
centralized states and with much the same level 
of technical and economic development existed in 
loose associations without any observable form of 
centralized authority or indeed without specialized 
political offices. Large populations could unite in 
common action and on occasion could identify 
themselves as a common polity, though on other 
occasions they stressed their independence and 
mutual antagonism. Their organization as it ap- 
peared in action was situational rather than main- 
tained by the continuous existence of an ad- 
ministrative structure. Standard definitions of 
government or political organization provided no 
place for such systems, yet the anthropologists 
knew they were faced with an order, which they 
identified with political order comparable to that 
maintained within the more organized polities. It 
became necessary to develop a definition which 
would include both types of polities, and this in- 
volved isolating the essentials of political organ- 
ization. Radcliffe-Brown (1940, p. xiv) proposed 
a minimum working definition: “In studying po- 
litical organization, we have to deal with the main- 
tenance or establishment of social order, within a 
territorial framework, by the organized exercise 
of coercive authority through the use, or the pos- 
sibility of use, of physical force.” 

His emphasis upon physical force and social 
order has been strongly criticized, but the formula 
proved useful to those attempting to discover and 
analyze the mechanisms which underlay cohesion 
and control of violence in the seemingly amor- 
phous stateless societies. Political science had no 
convenient framework for dealing with such sys- 
tems, and it was the challenge they posed to the 
workers from the 1930s to the early 1950s which 
provided the major stimulus toward the develop- 
ment of a field of political anthropology. Fortes 
and Evans-Pritchard (1940) proposed a prelimi- 
nary classification of political systems into three 
types: state, stateless, and band. It was soon 
pointed out that they had treated stateless systems 
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as synonymous with segmentary lineage systems, 
which represented only one of several methods for 
organizing noncentralized systems. Segmentary 
lineage systems remained a focal point of research 
until the early 1950s, when Fallers and others 
who had been influenced by the political sociology 
of Max Weber began to examine the nature of 
authority within traditional African states which 
were then part of the colonial system. Fallers in 
Bantu Bureaucracy (1956) drew attention to the 
clash of values produced by the introduction of 
bureaucratic institutions under colonial regimes 
and stimulated an increasing interest in the current 
political development of the new African nations. 

Prior to this time the dominant theoretical in- 
fluence had stemmed from the school of Durkheim, 
which treated societies as moral systems and em- 
phasized common values, integration, equilibrium, 
and continuity. The major purpose of any society 
was taken to be the maintenance of existing order, 
and political action was assumed to represent 
action governed by this purpose. Even those like 
Gluckman, who sought to deal with conflict, tended 
to treat it as an integrative device serving to main- 
tain existing systems of political relationships. Re- 
bellions served to maintain the system, since they 
only involved competition among rivals for posi- 
tions which remained unquestioned. Politics, Law 
and Ritual in Tribal Society (Gluckman 1965) is 
a masterly summary of this approach which at- 
tempts to combine the insights of Durkheim with 
those of Simmel, whose sociology of conflict is be- 
coming increasingly relevant to anthropological 
work. Conflict, no longer contained within and 
maintaining the system, is becoming a central in- 
terest of those concerned with political questions. 
The shift in interest within anthropology reflects 
a change in the type of data now available for 
observation. Colonial authorities used and main- 
tained indigenous political institutions where these 
were adaptable, but as administrative institutions 
rather than as a means whereby competing inter- 
ests and dissent could be expressed. Political dis- 
sent was not recognized as legitimate within the 
subordinate political units which most anthropolo- 
gists studied. The decline of colonialism, the rapid 
growth of political parties, and the emergence of 
new political regimes in the period after World 
War u radically altered the anthropologist’s con- 
ception of his field of study. Anthropological 
studies dealing with national political parties and 
their impact upon local political events began to 
appear in the late 1950s. In a period of contending 
interests, most studies deal with competition, with 
conflict, and with rapid change. The essays in 
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Political Systems and the Distribution of Power 
(Conference on New Approaches in Social Anthro- 
pology . . . 1965) represent this interest in the 
study of political maneuver in systems whose sta- 
bility is no longer a given. 

ELIZABETH COLSON 
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H 
POLITICAL ORGANIZATION 


Comparative politics is the study of the forms of 
political organization, their properties, correlates, 
variations, and modes of change. The object of 
these inquiries is to formulate general statements 
and hypotheses about the nature and conditions of 
political processes and their relations to other social 
phenomena. 

Political organization has been defined struc- 
turally by reference to institutions that regulate the 
use of force (Radcliffe-Brown 1940; Weber 1922; 
Almond 1960) and functionally with reference to 
social cooperation and leadership (Schapera 1956; 
Mair 1962). Political organization can be described 
in terms of the processes of decision making found 
in a given population (Easton 1953; 1957; Macridis 
1955). All these schemes identify political organ- 
ization with discrete social units, “societies” or 
“political communities,” within which force is con- 
trolled or excluded and valid decision making or 
directed cooperation obtain. Thus political organ- 
ization is restricted to social units having these 
characteristics. However, in some societies, the 
political community is indeterminate. As Mair 
observes: 


. .. some populations regard themselves, and are regard- 
ed by others, as distinct entities, but yet do not recognize 
any person or body of persons as having general au- 
thority to take decisions in matters affecting them all. 
In different contexts, different subdivisions of the 
whole take collective action for the purposes for which 
they are autonomous; so that if such action is the 
criterion of a political community, there are series of 
overlapping political communities. (1962, p. 106) 


If political organization is identified with discrete 
communities, populations that lack such units 
must be said to lack political organization. How- 
ever, since these aggregates persist, they presum- 
ably have some adequate methods for regulating 
their internal and external affairs. They must be 
able to restrict disruptive violence and take some 
kind of action in common. Definitions of political 
organization that cannot accommodate such phe- 
nomena are likely to be inadequate in other re- 
spects as well. We must therefore seek a more 
appropriate analytic framework. 

Power and authority. Max Weber has remarked 
that the common element in our everyday use of 
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such terms as “political events,” “actions,” “insti- 
tutions,” and “processes” consists in their reference 
to “striving to share power, or striving to influence 
the distribution of power, either among states or 
among groups within a state” ([1919] 1946, p. 78). 
Again, in considering the same question, Weber 
noted that these terms “have to do with relations 
of authority within what is, in the present termi- 
nology, a political organization” ([1922] 1957, 
p. 142). Except for an obscure distinction between 
political and politically oriented action, Weber 
seemed content not to pursue the question further; 
however, power and authority differ sharply in 
their nature, although they often overlap in their 
distribution; and if they are the elements of polit- 
ical organization, it is necessary to distinguish 
them and to study their interrelations with special 
care. 

Power is the capacity to take autonomous action 
in the face of resistance from persons, groups, 
rules, or material conditions. It is the ability to 
pursue one’s will effectively, if necessary by impos- 
ing it on others. As such it is manifested directly 
as pressure against resistance, that is, by contra- 
position and confrontation. Although power may 
be latent and indirectly applied, it may gradually 
assume the character of authority if it is regularly 
effective. There are many historical cases of re- 
gimes based on conquest or usurpation that have 
later acquired authority through general acceptance. 
Power without authority remains uninstitutional- 
ized, labile, and relative. Because it is instrumental 
and conditional, its effectiveness is uncertain. 
Authority, being institutionalized, is fixed in its 
scope, character, and distribution. It enjoins ob- 
servance on obligatory and normative grounds 
rather than instrumental ones. 

Authority is the right to take certain kinds of 
action, including decisions to issue commands ap- 
propriate to the circumstances. Authority thus rep- 
resents the set of rules, procedures, traditions, and 
norms that are regarded as binding when they are 
applied within a given social unit. The rules that 
establish and allocate authority also serve to limit 
the authority that they institutionalize. 

Among the members of a group, the distribution 
of authority may be marked by special observances 
of etiquette or dress or by the use of such symbols 
as the miter, crown, seal, or staff. A person in au- 
thority holds a special position that entitles him 
to take certain types of action not open to every- 
one. The right to act in this way may in fact be 
obligatory or it may be discretionary; but if authori- 
tative, the act imposes an obligation of conformity 
or obedience directly or indirectly on others in the 
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social group. Thus the person holding authority 
enjoys an immunity that extends at least as far as 
the right to take these actions, He may also exer- 
cise sanctions to secure observance and have a 
limited discretion in their use. 

A “structure of authority” is simply a distribu- 
tion of authority among a number of intercon- 
nected social positions. With regard to authority 
of the same kind within a single social unit, this 
Structure is often, but by no means always, hier- 
archic in form. Such a hierarchic authority struc- 
ture operates administratively, that is, its members 
administer the affairs under their control in an 
orderly, coordinated fashion by procedures and 
rules that they and their colleagues regard as cor- 
rect and binding. The principal characteristic of a 
hierarchic authority structure is that the members 
of inferior levels of the hierarchy are responsible 
directly to those above them, including their ulti- 
mate superior, Members of superior levels are au- 
thorized to issue commands, to make decisions 
about the unit's operating procedures, and to super- 
vise the conduct of their subordinates. 

Being in some respects discretionary, although 
bound by precedents and rules, authority may often 
be applied incorrectly or inappropriately. Incor- 
rectly applied, it is not truly authoritative. If it is 
effective, its effectiveness is a function of power. 
The same conditions that create authority ensure 
continuous interest in its application. Ensuing 
efforts to change the rules, conditions, and distri- 
bution of authority require the mobilization of 
opposing interests and alignments of power. 

Political organization consists in the combina- 
tion and interplay of relations of authority and 
power in the regulation of public affairs. Briefly, 
the political organization is the set of arrangements 
by which a public regulates its common affairs. 
Such regulation always integrates two modes of 
public action, the administrative, which consists in 
the authoritative conduct of public affairs, and the 
political, which consists in the exercise and com- 
petitions of power to influence or control the course 
of these affairs. Political organization thus com- 
bines authority and power; it is not independently 
coterminous with either. Without power, authority 
is ineffective; without authority, power may domi- 
nate but remains uninstitutionalized. 

Authority, power, and regulation appear in many 
social contexts that are not strictly political in the 
sense that they do not relate to the regulation of 
the affairs of a polity or its main components. For 
example, economic power is the capacity to pur- 
sue economic goals despite resistance and compe- 
tition. Ritual authority is the right to make ritual 


pronouncements and to carry out certain ritual 
acts. The regulation of a family or a work team 
is a domestic or private affair, of concern only to 
those immediately involved and their circle of close 
kin and friends. Although regulation, authority, 
and power are closely associated empirically, they 
are analytically distinct principles and modes of 
social relation and action. Regulation, which in- 
tegrates authority and power, is correspondingly 
more complex. In much the same way, government, 
as a form of political organization, integrates spe- 
cifically political and specifically administrative 
forms of action. It is thus more complex than 
either. 

The corporate group. To maintain cohesion 
and continuity, all social groups that succeed in 
persisting must have adequate arrangements for 
the regulation of their common affairs. However 
much these affairs may vary from one group to 
another, they always include control over the con- 
ditions of membership and admission and over 
those relations which the members have with one 
another or with outsiders which are directly rele- 
vant to the aggregate as a whole. Beyond this, the 
subject matter of public regulation varies as widely 
as the form, scale, character, and function of the 
groups and their differing social contexts. 

All persistent social units that have distinct 
identities, membership, closure, presumptions of 
indefinite continuity, exclusive common affairs, 
and the autonomy, procedures, and organization 
necessary to regulate them are corporate in nature 
and form. The public, which is the persistent social 
group that forms a unit regulating its common 
affairs, is a corporate group. The comparative study 
of political organization is in large measure a 
study of the correspondences between differing or 
similar modes of corporate organization and differ- 
ing or similar systems of power and authority. 

As regulatory units, corporations provide the 
basic framework for alignments of authority and 
power in the regulation of public affairs. However, 
being themselves very varied in their character and 
form, these units constitute differing frameworks 
for power and authority and distribute them in dif- 
ferent ways and on different principles. Accord- 
ingly, different types and combinations of Bees 
tions establish differing modes and relations © 
power and authority. For example, in a polity based 
on only one or two types of corporate unit, the 
political organization consists of relations of power 
and authority within, between, and among these 
corporations and, collectively or separately, be- 
tween them and external bodies. As their corporate 
composition differs, even those polities which have 
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an identical number of corporate forms will differ 
in their regulatory processes and organization. 

Although all persistent social groups with the 
properties of corporations are units of public regu- 
lation for their own internal affairs, many, such as 
universities, firms, clubs, or occupational associa- 
tions, may have no collective role in the processes 
by which the affairs of the wider unit to which they 
belong are regulated. In this case, the regulatory 
arrangements that constitute the internal self- 
governments of these units are not directly relevant 
to the analysis of the polities to which their mem- 
bers belong. On the other hand, if, as is often the 
case, the members of religious communities, uni- 
versities, and occupational or other groups do take 
collective action through these units to influence or 
control the conduct of affairs in their immediate 
public or some larger aggregate, then these corpo- 
rations enter directly into the political organization 
of the populations concerned. 

Publics, or corporate groups, vary widely in their 
character, form, complexity, size, functions, and 
modes of procedure. They may include village com- 
munities, cities, nations, certain types of descent 
groups, age sets, secret and other closed continu- 
ing associations, cult groups, monasteries and 
ritual orders, primitive bands, regiments, guilds, 
trade unions, political parties, and so on. Despite 
their many differences, all corporate groups have 
a distinct identity, a determinate membership, clo- 
sure, the presumption of indefinite continuity, com- 
mon exclusive affairs, and the autonomy, proce- 
dures, and organization necessary to regulate them. 
However, they differ notably in the bases on which 
they recruit and organize their members. Some 
corporate groups recruit their members by locality, 
others by descent or cult, by occupation, age, Sex, 
joint property interests, ethnicity, or by exclusive 
association for specific or diffuse purposes. Most 
corporate groups recruit their members on two or 
more of these principles simultaneously. They dif- 
fer correspondingly in their organization, pro- 
cedures, interests, and affairs. Similar properties 
found in different types of corporate groups Te- 
flect the common conditions of continuing group 
organization. 

Since it is identified with an exclusive set of 
common affairs, the universitatis juris (Maine 
1861), which includes control of the admission 
and organization of its members, the corporate 
group requires some set of institutional procedures 
to regulate itself. These procedures involve a set 
of rules and practices by which the actions of the 
members or representatives of the unit may be dis- 
tinguished as correct or incorrect and disagree- 
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ments within the collectivity can be contained or 
settled. If the rules and procedures of the corporate 
group are not adequate for these purposes, under 
favorable conditions dissatisfied members may 
withdraw to establish a separate unit of their own. 
An accepted framework of rules and procedures is 
thus indispensable if the members of a group are 
to regulate their mutual relations with sufficient 
harmony to permit joint action in managing their 
essential common affairs. And for this reason, the 
rules of a corporate group will only specify those 
forms of conduct which are relevant to the welfare 
of the collectivity as a whole. 

The institutional prescriptions and proscriptions 
by which a corporate group organizes its members 
and regulates its common, exclusive affairs create 
a field for internal disagreements, dissension, and 
initiative. The organization and processes of cor- 
porate action provide opportunities for the mem- 
bers, individually or in small groups, to try to 
influence the unit’s decisions, to modify or main- 
tain its rules, procedures, and organization or the 
conditions, range, and implications of membership, 
to extend or reduce the scope of its common affairs, 
and to maintain or change its relations with ex- 
ternal bodies. While the unit’s institutional organ- 
ization and rules establish its structure and dis- 
tribution of authority, these rules and precedents 
are made, retained, changed, or broken by autono- 
mous acts which may also decide the distribution 
and tenure of regulatory positions and the char- 
acter of relations within the group and between it 
and other bodies of similar or differing constitu- 
tion. And since no persisting social unit can ever 
have a framework of organization and procedures 
that is fully comprehensive and appropriate for all 
the eventualities that arise in its history, all social 
units must retain sufficient autonomy to enable 
them to modify, supplement, reinterpret, repeal, 
or develop their rules, procedures, and organiza- 
tion as a necessity of successful adaptation to their 
changing external environment and internal dis- 
tributions of power and need. Moreover, as units 
of public regulation, all corporate groups also re- 
quire an internal authority structure by which their 
affairs can be representatively handled; and this 
structure provides a major focus of internal polit- 
ical competition as individual members and seg- 
ments try to influence, limit, evade, or control 
the uses, conditions, objects, and distribution of 
authority. 

Types of corporations. Corporate groups repre- 
sent one type of corporation; others are categories, 
offices, colleges, and commissions. Of these five, 
corporate groups, offices, and colleges possess all 
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those conditions which are necessary to maintain 
their continuity as self-regulatory units. Commis- 
sions and corporate categories lack some of these 
essentials. Some commissions, such as military, 
magisterial, and certain ritual commissions, lack 
uniqueness and determinate spheres of compe- 
tence; others, such as charismatic leadership or ad 
hoc parliamentary commissions, although unique, 
lack the appropriate bases for continuity. Such 
corporate categories as moieties, uncoordinated 
clans, castes, or age sets among the Turkana and 
Karimojong (Gulliver 1958; Dyson-Hudson 1963) 
lack the necessary bases for organization as groups. 
They cannot therefore serve as inclusive units of 
positive regulation. 

The office. The corporation sole (Maine 1861 >a 
or office, has all the properties of the corporate 
group except that at any time only one member, 
the incumbent, fully represents it, Normally the 
office is also represented indirectly and incom- 
pletely through an official staff controlled by its 
holder. In Weber's terminology, this structure con- 
stitutes an “administrative organ” ([1922] 1957, 
pp. 302-303). Offices vary in the complexity and 
size of the staff through which they regulate their 
affairs, 

As a centrally directed organization of roles and 
responsibilities, the office and its staff form a con- 
tinuing aggregate which is in many ways quite 
similar to a corporate group. However, the struc- 
ture is always a specialized segment of a wider 
public or corporate group, some or all of whose 
common affairs it serves to regulate. Those proc- 
esses of political competition and rivalry which 
characterize the corporate group operate also with- 
in the administrative staff attached to an office. 
Members of this staff compete with one another 
for influence, promotion, autonomy in their respec- 
tive spheres, and for such other rewards and ad- 
vantages as the structure provides. Such political 
conditions will modify the formal hierarchy of 
authority within the office if they are not con- 
trolled; and if this formal authority structure is not 
maintained, relations between the staff and the 
wider public will also change as a function of those 
tedistributions of autonomy within the office. 

As a unique perpetual unit with public regu- 
latory roles, office is always distinguished from the 
personality of the individual who holds it. Even in 
those societies, such as the Bemba, Wambugwe, 
Luapula, and Yao, which institutionalize Office as 
hierarchic units linked to one another by ties of 
“perpetual kinship” individuals are totally identi- 
fied with the perpetual statuses to which they are 
recruited (Richards 1940, P. 91 ff.; 1950, p. 224; 


Gray 1953; Cunnison 1956; Mitchell 1956). In its 
more familiar forms, such as presidencies, uni- 
versity chancellorships, managing directorships, 
kingships, or the papacy, office is clearly an ex- 
tremely important organ for the regulation of the 
affairs of the corporate group in which it exists, 
However, offices differ widely in their forms, char- 
acter, and capacities. For some the method of re- 
cruitment is hereditary; for others elective; and 
others have mixed forms of succession. The au- 
thority of office may be narrow or wide in its scope, 
that is, the affairs to which it relates, or in its 
range, the size of the aggregate it regulates. Some 
offices, such as the Shilluk and Ngonde chieftain- 
cies are primarily symbolic and integrative, execu- 
tive authority being dispersed among other units 
(Wilson 1939; Evans-Pritchard 1948). Other of- 
fices, although executive in character, may be ac- 
countable for their conduct to some other body— 
a college, a corporate group, or a superior office. 
The mere existence of an office or hierarchy of 
offices, however exalted, has very indefinite conno- 
tations for the working of the political organization. 
What really matters is the basis of recruitment to 
the office, the scope and range of the office, the 
character of the authority vested in it, the limits 
of its autonomous discretion, the sanctions that it 
controls, the actual distributions of power and 
authority among the office staff, and the relation- 
ships between the office, its staff, and the public 
to which it relates. 

The college. The college is an extremely adapt- 
able type of corporation, possessing certain proper- 
ties common to corporate groups and offices alike. 
The college has two distinguishing characteristics: 
first, it contains a plurality of members who thus 
have equal status; second, the members of a college 
are always a minority of the public with which the 
college is identified. Like offices and corporate 
groups, colleges are presumed to be indefinitely 
continuous, have distinct identities, control the 
admission of members, and have a determinate 
membership, organization, procedures, common af- 
fairs, and autonomy to regulate them. In such cases 
as the College of Cardinals, the U.S. Supreme 
Court, the council of the Iroquois League, and 
modern cabinets, the membership of the college is 
entirely confined to the holders of particular offices; 
in others, the members are persons who hold repre 
sentative commissions and meet certain qualifica- 
tions. The latter would include, for example, the 
Athenian ecclesia and council, the British House of 
Commons, the Yampai grade of the Egbo society 
among the Efik (Jones 1956, p. 138 ff.), Ls 
panchayats in India (Dube 1955; Mayer 1960; 


POLITICAL ANTHROPOLOGY: Political Organization 197 


Marriott 1955), or community councils among the 
Kikuyu, Ibo, and Bini (Meek 1937, p. 148 ff.; Mid- 
dleton 1953; Lambert 1956; Bradbury 1957). 

Some colleges have symbolic or executive heads; 
some do not. Colleges may meet more or less regu- 
larly, frequently, and publicly. The representation 
that they provide the public may be more or less 
exclusive. Admission may be restricted on grounds 
of descent, as with the ruling councils of southern 
Bantu chiefdoms (Schapera 1937), of age, as 
among the Galla (Huntingford 1955), of wealth, 
or of ritual status. Greek thinkers, having distin- 
guished collegial from monarchic forms of govern- 
ment, classified collegial systems as aristocratic, 
oligarchic, or other on grounds of their representa- 
tiveness and composition. They may also be classi- 
fied according to the functions they perform: 
judicial, legislative, executive, ritual, or electoral. 
While some colleges are functionally specific, others 
regulate a variety of functions. 

If the college consists only of officeholders, they 
can execute its decisions. Alternatively, members 
of the college may enforce its decisions personally, 
as in Athens, or through dependents and subordi- 
nates, as in such graded secret societies as the 
Ogboni, Poro, or Egbo (Morton-Williams 1960; 
Little 1949; Jones 1956). Alternatively, the college 
may delegate executive roles to hereditary or elected 
officials, as in Sparta and Rome; or it may dis- 
tribute various tasks among corporate units of dif- 
ferent kinds, as among the Cheyenne or Yak6 
(Hoebel 1960; Forde 1961; 1964). Where a num- 
ber of regulatory colleges coexist, they may be dif- 
ferentiated functionally, by the types of affairs that 
each regulates; or hierarchically, by reference to 
their jurisdictions and the type of unit that each 
controls—age set, lineage, ward, village, and so 
on, Executive responsibilities may rotate among 
the different colleges. 

Colleges and offices are often found together. As 
the preceding remarks indicate, the main strength 
of a collegial system lies in its representative char- 
acter, which, even when sectional, provides oppor- 
tunities for politically active segments of a public 
to deliberate together on matters of common con- 
cern, Although an office is a collective representa- 
tion, it does not provide for adequate participation 
by the public in the formation of official policies. 
Thus, while collegial organization is ill suited to 
continuous administrative action, the office is ex- 
cellently adapted to that end. 

Offices and colleges may be combined within a 
common regulatory system in various ways. Where 
the college retains decisive control of appointments 
and policy formation, the office may be either sym- 


bolic, as among the Yoruba (Lloyd 1960; 1962), 
or, if executive, lacking in any significant dis- 
cretionary power. In the opposite situation, where 
the senior office enjoys dominance, the college 
may be merely a device for mobilizing periodic 
support and an instrument for administrative 
communication. 

Combinations of office and college always lead 
to competition between these units to preserve or 
extend their relative autonomies and spheres of 
competence. Since both the college and the official 
organization will be divided internally by political 
alignments and cleavages, a wide variety of inter- 
locking alliances and oppositions are possible be- 
tween their members. The stability of the political 
organization will then depend on the cohesiveness 
of those underlying structural conditions which 
regulate the distributions of power and authority. 
The corporate groupings in which the members of 
the wider public are organized have certain struc- 
turally requisite conditions and relations, and these 
very conditions and relations serve to limit changes 
in the distribution of power and authority and thus 
to maintain continuity. On the other hand, change 
or lack of change in the power—authority structure 
may limit the capacity of the corporate system to 
persist. An excellent example of such structural 
limits to political change is found among the 
Kachin of highland Burma (Leach 1954). 

The polity. The most inclusive aggregate hav- 
ing a common corporate organization and func- 
tional cohesion is a polity. The polity coincides in 
its form and limits with the maximum spread of 
the institutional framework within which power 
and authority are distributed and public regulation 
proceeds. The term “polity” accordingly is used to 
identify an aggregate with a common set of cor- 
porate forms. As the basic analytic unit for com- 
parative studies, the nature of the polity deserves 
special attention. In one sense, the term denotes a 
distinctive form of political organization; in an- 
other it refers to the society of which this is the 
political form. It also denotes the largest social ag- 
gregates organized separately in this form, How- 
ever, the limits of the aggregate organized as a dis- 
tinct unit do not always coincide with the limits of 
the society of which it is part or with the territorial 
spread of the polity which it represents. 

As the form of political organization, a polity 
represents the political aspect of a society, and 
often the two are confused. As Nadel says, “mostly, 
when we look for a society, we find a political unit, 
and when speaking of the former we mean in effect 
the latter” (1951, p. 187). Analytically, a society 
is a self-sustaining system of social relations and 
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institutions (Levy 1952, p. 111 ff.). Concretely, it 
is the aggregate that shares these institutional 
forms. Comparatively, societies vary in the inclu- 
siveness as well as the forms of their political insti- 
tutions. Some societies, such as France, the United 
States, Tikopia, and the Cheyenne, include all their 
members in a single polity. In these cases the polity 
coincides with the limits of the societal aggregate. 
Other polities, such as the Hausa, Tswana, and 
western Malays (Hogben 1930; Schapera 1938; 
1953; Gullick 1958), do not include their societies 
in a single unit. Although their polities are central- 
ized, the Hausa, Tswana, and western Malays lack 
a single inclusive political organization. Therefore, 
as societies they are “acephalous.” On the other 
hand, the societies of Tikopia, the Cheyenne, and 
the Shilluk, which constitute single polities, are 
clearly uncentralized. 

Societies may be subdivided into a number of 
autonomous, acephalous units of similar form and 
Kind, for example the Nuer and the Gusii (Evans- 
Pritchard 1940; Mayer 1949). Acephalous political 
organization may take two alternative forms, which 
we can illustrate by contrasting the Nuer and the 
Tallensi (Fortes 1940; 1945), The Nuer polity 
consists of aggregates whose members act together 
to regulate common affairs. These are a number 
of socially and territorially discrete tribes, Each 
Nuer tribe can thus be studied separately, as rep- 
resentative of the whole society. The political or- 
ganization of the Tallensi is quite different. Their 
largest publics are local communities. Each com- 
munity is internally divided into lineages and seg- 
ments of clans, which are connected to similar 
units elsewhere by a network of collective ties. 
Since these kinship bonds are coextensive with the 
society, the entire population constitutes a cohesive 
acephalous unit. To study a Tale community in 
isolation, therefore, yields a misleading picture of 
the society and polity alike, since in this case both 
coincide. Thus, acephalous and centralized polities 
alike may embrace entire societies as single units, 
or they may subdivide these societies into a num- 
ber of structurally similar parts. Moreover, a society 
which has an effective centralized organization may 
also extend its rule over other aggregates with dif- 
fering social institutions, thereby increasing the 
complexity of all. 

Instead of looking for discrete political communi- 
ties characterized by authoritative decision making, 
the exclusion of internal force, or inclusive coopera- 
tive action, or of simply classifying polities as uni- 
tary or segmentary states or nonstates, we may 
regard the widest ageregates having a common 
form of political organization as the unit of com- 
parative analysis. Where such aggregates are sub- 


divided into a number of discrete autonomous 
self-regulating publics, we can treat these units as 
representatives of the polity as a whole. This is 
indeed the general practice of anthropologists and 
political scientists alike. In this way the modes of 
organization, degrees and areas of centralization, 
forms of internal order, cooperation, and decision 
making can be analyzed comparatively. 

Autonomy. The autonomy that a public re- 
quires as a condition of its viability may be re- 
served wholly to its membership, as in Nuer com- 
munities, in which case the public is genuinely 
acephalous; or autonomy may be concentrated 
wholly in one or several representative regulatory 
organs, such as colleges or offices; or it may be 
distributed variously among the members of the 
public and its representative regulatory institutions. 
In all cases, the institutionalized distribution of 
such autonomy is a critical dimension of the unit’s 
authority structure. This distribution promotes the 
kinds of continuous political action that maintain 
or change the authority structure; and just as the 
distribution and scope of relative autonomy varies 
with the composition of the public, so do the typical 
forms of political issues and activities. 

The greater the authority allocated to a regu- 
latory office or college, the greater the autonomy 
it requires. But the greater the autonomy this regu- 
latory organ holds, the less easily is it controlled 
by the members of the public whose affairs it 
regulates. 

The autonomous discretion that inheres in regu- 
latory authority is neither arbitrary nor unlimited. 
It represents the minimum range of discretion 
necessary to ensure appropriate action in various 
emergencies. The range may be very narrow, as in 
ancient Athens or modern Britain, or it may be 
very wide, as among the Inca (Moore 1958) and 
in Ottoman Turkey (Gibb & Bowen 1950). It may 
also be wide in some areas of activity and Ely irc 
stricted in others, as for instance in imperial China 
(Sprenkel 1962). The executive autonomy may be 
retained by the senior office of the hierarchy, as in 
the absolute monarchies of eighteenth-century 
Europe or the Roman Catholic church, or it may 
be exercised primarily by a senior subordinate, such 
as the Lozi ngambela (Gluckman 1951), the Otto- 
man grand vizier, the Aztec ciuacoatl (Soustelle 
1955), or the Japanese shoguns (Sansom 1932). 
Executive authority may be vested in a college, 
such as the Inca opocunas, the Yoruba iwarefa 
(Lloyd 1962), the Roman senate, or modem 
cabinets. 

Weber, who seems to have regarded all stable 
concentrations of authority and expansions of po- 
litical aggregates as correlates of “patrimonial,” or 
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chiefly, rule, characterizes the regime in which 
“absolute authority is maximized” by the chief as 
“sultanism” ([1922] 1957, p. 318). But, however 
broad and overwhelming such authority may be, 
it can never be absolute, since its effectiveness 
depends on the exercise of power to the degree 
that its action transgresses the legitimate range of 
permitted discretion. And power, however decisive, 
is always unstable and relative, never absolute or 
fixed. It breeds its own rivals. Few rulers have in 
fact been more uneasy in their rule and more de- 
pendent on their staffs than those sultans who 
sought to maximize an absolute authority, only to 
find that they were increasing their dependence 
on the staff, through which authority was exer- 
cised, and by which it would be appropriated unless 
rigorous discipline was maintained. As we have 
seen, the Lozi, Ottoman, and Aztec monarchs found 
it strategic to delegate executive power to “grand 
viziers,” whose offices then bore the odium for 
actions necessary to maintain the system. But as the 
Ottoman case shows clearly, in transferring this 
autonomy, the sultan effectively placed his regime 
in the hands of his grand vizier. 

Comparative politics, Only in the very simplest 
societies are the publics not divided into a number 
of internally autonomous corporate groups. Exam- 
ples of such societies include the Andamanese 
(Radcliffe-Brown 1922), the Mbuti (Turnbull 
1961), the Bushmen (Marshall 1960; 1961), the 
Shoshoneans (Steward 1938; 1955) or the Nam- 
bikuara (Lévi-Strauss 1955). In all these cases, 
local communities or bands are sections of a wider 
aggregate, the tribe. Yet even among the Nambi- 
kuara and Shoshoneans, the leading members of 
these bands exercised little authority beyond that 
required by relations of ritual, kinship, and sub- 
sistence, 

In the absence of a representative public au- 
thority, autonomy is dispersed among the adult 
male members of the group, who may transfer their 
affiliations when they change residence. There be- 
ing no hierarchic authority structure within such 
groups, there can be none between them. None- 
theless, within each band, positions of authority 
identified primarily by kinship and ritual relations 
do exist: the band as a whole may have a leader 
or a symbolic head, or it may regulate its common 
affairs collectively. Individual misconduct may lead 
to collective reactions; and dissatisfaction with 
such band government may Jead to individual with- 
drawal. Occasionally feuds arise between bands. 
These feuds may actually reinforce internal co- 
hesion. 

The segmentary lineage systems of such soci- 
eties as the Nuer and Tiv exhibit similar conditions 
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and principles of organization. Their most distinc- 
tive feature is a schematic alignment of local 
groups by their closeness or remoteness of kinship 
according to putative unilineal descent. This genea- 
logical classification of autonomous local groups 
serves to guide their political relations of alliance 
and opposition. Internally, each group forms a 
single unit within which authority is distributed 
and collective action handled administratively. Ex- 
ternally, as coordinate units, groups may combine 
at one moment to oppose outsiders and then segre- 
gate to oppose one another, according to the pre- 
vailing scheme of lineal and local alignments. As 
in the case of band-organized units, authority is 
dispersed within each segment, but the genealogi- 
cal connections of these segments cannot constitute 
a hierarchic organization of groups. Unless some 
principle of differential ranking prevails, segments 
are equal, each a distinct autonomous unit with its 
own internal concerns and regulatory arrange- 
ments. Under these circumstances, there may be 
an inclusive college, as among the Kikuyu and Ibo, 
but not necessarily a representative official struc- 
ture, as, for example, in Tikopia (Firth 1949; 1959, 
p. 254 ff.). Political and administrative relations are 
here completely fused and corporately differenti- 
ated from other types of social relations. 

The autonomy of these corporate groups corre- 
sponds in its scope to the jurisdiction they exercise 
on behalf of their members and over them and thus 
to the limits of their collective responsibility. This 
autonomy may either be based on the absence of 
alternative forms of grouping or on the coexistence 
of a plurality of corporate forms that dislocate one 
another by dispersing loyalties and generating 
conflict. There are many differing types of such 
structurally disharmonic polities, for example the 
Murngin (Warner 1937), the Plateau Tonga (Col- 
son 1953), the Siuai (Oliver 1955, p. 361 ff.), the 
Kachin (Leach 1954), the Usurufa, Kamano, and 
Fore of highland New Guinea (Berndt 1962), or 
European feudalism (Bloch 1939-1940). The prev- 
alence of disharmonic systems should be noted. 
Some may perpetuate themselves almost indefi- 
nitely, having institutionalized adequate adapta- 
tions to their basic disharmony; other disharmonic 
systems oscillate between two poles, as among the 
Kachin or in medieval China and Japan; still others 
undergo changes of structural type as a result of 
the conflicts their disharmonies generate. Such 
structural changes occurred, for example, in an- 
cient Athens and Rome and in feudal France and 
Britain. 

In extreme autarchy, relations between corporate 
units are almost entirely political, while those 
within them are primarily administrative. If these 
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units are successively segmented or if they are 
linked serially by ties of increasing inclusiveness, 
then their political and administrative alignments 
alternate. To stabilize the larger aggregates it is 
necessary to find some basis other than mere gene- 
alogy for a more consistent differentiation of politi- 
cal and administrative relations. Wnether such 
differentiations take a collegial, official, or mixed 
form, they all consist in the institutionalization of 
certain types of regulatory roles and responsibilities 
on the basis of some distinctions of seniority and 
precedence, however diffuse or flexible. The only 
alternatives to an acephalous structure of coordi- 
nate corporate groups are a collegial organization, 
a system of central regulatory offices, or both in 
various combinations. The broader the authority 
such representative organs exercise, the more the 
autonomy distributed among other segments or 
members of the group is restricted. The larger the 
aggregate that regulates its public affairs as a unit 
and the greater the functional differentiation of its 
regulatory roles, the more cohesive and integrated 
the unit becomes. 

Under collegial or official direction, the various 
segments of the larger public will continue to ad- 
minister their internal affairs autonomously, al- 
though within the limits required by the unity of 
the entire aggregate and by the authority of its 
regulatory organs. These limits may be extensive 
or limited. Among the Iroquois, Kikuyu, and Ibo, 
whose local communities had a collegial regulatory 
organization, lineage rights of retaliation prevailed, 
and violence was sporadic, although feud was ruled 
out. In such “centralized” systems as Ruanda, 
feudal Europe, and Japan violence was so common 
as to be virtually institutionalized. It is thus not 
essential to the character, cohesion, or continuity 
of a public that violence should be excluded among 
its members; neither is it the case that all publics 
are units of internal cooperation and external inde- 
pendence. What is necessary is that the incidence, 
forms, and occasions of violence should be gov- 
erned by authoritative rules and that responsibili- 
ties for violence should be allocated to corporate 
units. For the unity of the aggregate, a common set 
of external relations is much more essential and 
seems to be decisive. 

It is not always the case that polities differ in 
their scale according to their levels of centraliza- 
tion. Some large populations, such as the Tiv, Gusii, 
Masai, or Ibo, maintain polities of an acephalous 
type, while many small units, such as the Mambwe 
villages (Watson 1958, p. 72 ff.) or some Tswana 
tribes, are relatively centralized, 

Authority may be said to be centralized when 


the autonomy to regulate certain types of affairs by 
positive action has been transferred to some single 
organ or set of organs which then coordinates these 
affairs for all members of the aggregate. This trans- 
fer of autonomy from the members and primary 
corporations of the unit may be voluntary, or it 
may be forced. Once achieved, in order to be instru- 
mentally effective, it requires that the central regu- 
latory organ should control an adequate concentra- 
tion of sanctions and should also be capable of 
employing these autonomously to enforce its orders, 
especially to discourage political aggrandizements 
among its staff. When authority is effectively cen- 
tralized in this way, whether on a collegial or an 
official basis or on both simultaneously—as in mod- 
ern societies—those who hold this authority may 
try to increase its range or scope by pursuing terri- 
torial expansion or by extending the types of affairs 
that fall under their regulation. The latter is the 
course that most consistently requires and pro- 
motes increases in the size, complexity, and func- 
tional differentiation of the administrative structure 
and in its segregation from public political proc- 
esses. This may either lead to the development of 
a strictly bureaucratic order (Weber [1919] 1946, 
p. 196 ff.; [1922] 1957, p. 302 ff., p. 360 ff.) or to 
the domination of an aggregate by a “ruling bureauc- 
racy” (Wittfogel 1953; 1957). 

An essential condition for the effective central- 
ization of the administrative staff within any unit 
of moderate size is the segregation of this admin- 
istrative structure from the context and processes 
of public political competition. This is a more diffi- 
cult achievement than the simple differentiation of 
judicial, military, and civil functions. These func- 
tions may be distributed among different persons 
in both acephalous and centralized collegial sys- 
tems, as among the Kipsigi, Galla, and Kikuyu. Or 
these functions may be combined and exercised 
through a single official or collegial structure in 
various centralized systems, as in the southern 
Bantu chiefdoms (Schapera 1937). However, to 
preclude the appropriation of power by administra- 
tive staff, to maintain disciplined performance, and 
to penalize inefficiency or corruption, it is essential 
that the different regulatory roles should be segre- 
gated and distributed among different units and 
that provisions for supervision and staff control 
should also be adequate. When an administration 
relies for its staff on territorial chiefs, who are 
often recruited hereditarily and may be identified 
politically with their communities, as among the 
Tswana or in feudal Europe, disassociation of the 
regulatory roles and the maintenance of central 
control are both very difficult. The Chinese sought, 
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during the Ch’ing period, to prevent the political 
decentralization of the administration by restricting 
tenure of office to three years, by excluding heredi- 
tary appointments in the bureaucracy, by sending 
officials to communities where they were strangers, 
and by specializing the staff of different offices 
(Sprenkel 1962). The Inca sought to control their 
territorial officials through an elaborate system of 
specialized traveling inspectors. The Zulu, Ruanda, 
and Lozi kings, like the Kamakura shoguns, sought 
simultaneously to subdivide the loyalties of their 
subjects and to set their subordinates against one 
another by differentiating the types and sources of 
control, The Hausa of Zazzau simply concentrated 
their senior territorial officials in the capital and 
thus segregated them from the communities they 
were appointed to rule (Smith 1960). Despite the 
high levels of functional specialization of the ruling 
colleges of Athens and Rome, those colleges sought 
to control their highest officials by institutions of 
indictment, accountability, and time-limited tenure 
of office. In the acephalous collegial polities of the 
Cheyenne, Yaké, and Kipsigi, authority to regu- 
late different affairs was systematically dispersed 
among differing organs of government. One legacy 
of royal absolutism which is quite basic to the 
complex governments of modern society is the seg- 
regation of the bureaucratic apparatus from the 
arena of political competition for control of policy. 

M. G. SMITH 
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The Forest People. New 


POLITICAL BEHAVIOR 


The analysis of political behavior proceeds from 
the assumption that politics as a special form of 
human activity is not, and cannot be, independent 
of what is known or knowable about social be- 
havior generally. The primary objective of such 
analysis, therefore, is to link what is specifically 
political with other aspects of social relations. 


Specification 


In particular, political behavior analysis refers 
to a number of modes and methods of inquiry in 
the discipline of political science that have the 
following four general characteristics. 

(1) Political behavior analysis takes the indi- 
vidual person’s behavior—broadly conceived as in- 
cluding not only his acts but also his orientations 
to action (identifications, demands, expectations, 
evaluations )—as the empirical unit of analysis. 

Most behavioral research in politics is not, in 
fact, concerned with the individual actor as such. 
It more often seeks to describe and explain the 
political behavior of a group, an organization, a 
community, an elite, a mass movement, or a nation, 
but also assumes that such collectivities do not 
exist apart from the conduct of their individual 
members. The interactions and transactions of 
these members make for systemic relationships 
that are structurally discrete and functionally 
specific and can be meaningfully studied. 

Political behavior analysis does not deny the im- 
portance of political institutions; rather it con- 
ceives of them as patterns of individual behavior 
that are more or less uniform and regular and can 
be analyzed in terms of the behavior of their molec- 
ular units. Individual political behavior is seen as 
deriving its meaning and significance from the 
institutional context in which it occurs. 

(2) Political behavior analysis chooses a frame 
of reference that is shared by the behavioral sci- 
ences, notably anthropology, psychology, and soci- 
ology. 

As man’s political behavior is only one aspect of 
his total behavior as a social being, political be- 
havior analysis must be interdisciplinary; it can- 
not neglect the wider context in which political 
action occurs. It is bound, therefore, to consider 
the possible effects of social, cultural, and personal 
factors on political behavior. 

_ The interdisciplinary focus of political behavior 
inquiry sensitizes the observer to the level of analy- 
sis on which research may be conducted most 
appropriately and fruitfully, Some problems of po- 
litical behavior are best formulated and explored 
on the level of social relations; others, on the level 
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of culture; and still others, on the level of person- 
ality. Which level of analysis is chosen depends on 
the problem under inquiry and on the degree to 
which generic rather than particularistic explana- 
tions are sought. 

(3) Political behavior analysis chooses theo- 
retical propositions about politics that lend them- 
selves, in principle at least, although preferably in 
fact, to operational formulation for the purposes 
of empirical research. 

Political behavior analysis seeks to join theory 
and the data of experience in a single operational 
discourse that allows theory to fertilize empirical 
inquiry and that utilizes empirical findings to ad- 
vance the development of theoretical propositions 
about politics. Of course, where a particular study 
will fall on the empirical—theoretical continuum 
depends partly on the inclinations of the researcher 
and partly on his ease of access to a given research 
arena. For instance, access is relatively easy in the 
domestic legislative arena but extremely difficult 
in the field of international relations. Where access 
is easy, theoretical development may lag behind 
data accumulation; where access is difficult, theo- 
retical sophistication may outstrip empirical in- 
vestigation. 

(4) Political behavior analysis chooses methods 
and techniques of inquiry that permit as rigorous 
treatment as possible of theoretical formulations 
and empirical data for the purposes of description 
and the testing of hypotheses. 

Many of the theoretical formulations of political 
behavior research are now posed in terms of no- 
tions derived from game theory, decision theory, 
general systems theory, and communication theory, 
as well as notions derived from political science. 
In its empirical work political behavior analysis 
also utilizes techniques taken over from the other 
social sciences. It is concerned with the problems 
of research design, reliability of data-gathering and 
measuring instruments, criteria of validation, and 
other features of scientific procedure. Techniques 
now employed include participant observation and 
survey research, content and cluster-bloc analysis, 
scalogram and factor analysis, sociometry and 
psychiatric procedures, laboratory and field experi- 
ments, multivariate analysis, and computer pro- 
grams. Application of these methods and tech- 
niques has had the effect of enormously broadening 
the range of political phenomena available to be- 
havioral investigation and of systematizing the 


study of politics. 


Development 
Behavioral analysis in the study of politics stems 
from the confluence of many converging tenden- 
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cies that are isolated here only for the sake of 
identification. It is probably impossible to trace any 
particular approach to a single intellectual source. 
This section, therefore, though organized more or 
less chronologically, does not attempt precise his- 
torical reconstruction. 

In their earliest manifestations, political be- 
havior approaches in political science were dec- 
larations of protest against what was felt to be an 
unduly historical—descriptive, legal—formal, or nor- 
mative orientation in the study of government and 
politics. The protest came, on the one hand, from 
psychology-oriented observers of the political scene, 
perhaps best represented by Graham Wallas’ Hu- 
man Nature in Politics (1908), and on the other, 
from process-oriented students, stimulated by Ar- 
thur F. Bentleys The Process of Government 
(1908), But neither Wallas nor Bentley had an 
immediate impact on the course of academic po- 
litical science. It was the work of a nonacade- 
mician, Walter Lippmann (Public Opinion 1922), 
that, during the 1920s, directed political scientists 
to the study of public opinion and political attitude 
formation. Bentley's “group approach” evidently 
had little influence on the studies of pressure 
groups which, in the late 1920s and early 1930s, 
gradually brought a new realism to the study of 
politics, It was not until after World War 1 that 
Bentley’s recommendations and formulations were 
explicitly acknowledged and developed in David B. 
Truman’s The Governmental Process (1951). Simi- 
larly, another early scholarly document of protest, 
Charles A. Beard’s An Economic Interpretation of 
the Constitution of the United States (1913), came 
to influence the study of politics only in the 1920s. 

The efforts initiated in the early 1920s at the 
University of Chicago by the political scientist 
Charles E. Merriam exerted a more continuous and 
cumulative influence. His New Aspects of Politics 
(1925) was more than a critique of academic po- 
litical science or a mere agenda for behavioral 
research. Men like Harold F. Gosnell and Harold 
D. Lasswell implemented Merriam’s teaching by 
ingenious studies ranging from field experimenta- 
tion (Gosnell 1927) to intensive personality re- 
search (Lasswell 1930). A second generation of 
Chicago political scientists, product of the 1930s, 
provided the leadership of the behavioral revolu- 
tion in political science after World War m (Key 
1949; Simon 1947; Almond 1950; Truman 1951; 
Leiserson 1958; Pool 1952). 

Other efforts in the interwar years to influence 
the course of political theory or research in a more 
behavioral or scientific direction were less success- 
ful. George E. G. Catlin’s The Science and Method 
of Politics (1927) was too far removed from the 


operational capabilities of the time to have much 
influence. On the other hand, Stuart A. Rice’s 
Quantitative Methods in Politics (1928), inventive 
in the application of statistical and quasi-statistical 
techniques to aggregate election data and legisla- 
tive roll-call votes, tended to ignore the theoretical 
relevance of this type of investigation, However, 
his techniques proved viable in the more theoretical 
context of postwar research on electoral and legis- 
lative voting. 

Perhaps even more difficult to identify and evalu- 
ate is the impact on political behavior analysis of 
the group of European or European-trained (Fried- 
rich 1937; Parsons 1937) social scientists who, in 
the 1930s, familiarized American political science 
with new categories of thought rooted in the works 
of Marx, Durkheim, Freud, Pareto, Mosca, Michels, 
and Weber. Stimulated by the crises of democracy 
occasioned by the rise of totalitarianism, most of 
the work of these scholars sought explanations in 
terms of macro units of analysis, Only a few, like 
Paul F. Lazarsfeld or Theodor W. Adorno, were 
concerned with the individual as the unit of in- 
quiry. The former's now classic study of the 1940 
presidential election, The People’s Choice, co- 
authored by Bernard Berelson and Hazel Gaudet, 
was not published until 1944 but came to have an 
enormous influence on political behavior research 
after the war. Similarly, The Authoritarian Person- 
ality (Adorno et al. 1950), though subjected to 
much criticism since its publication, has inspired 
a good deal of political behavior research. 

Although World War 1 undoubtedly made po- 
litical scientists, serving in manifold capacities in 
governmental and military positions, sensitive to 
the inadequacies of traditional political science, the 
postwar “behavioral revolution” in the study of 
politics was probably less a movement of protest 
(Dahl 1961b) than a realization that the problems 
encountered in concrete research required new 
theoretical and methodological departures. 

The tendency toward behavioralism in political 
science was greatly aided by the activities of sev- 
eral groups of political scientists brought together 
under the auspices of the Social Science Research 
Council. In the 1950s and 1960s behavioral re- 
search was stimulated and financed by a committee 
on political behavior and a committee on compara: 
tive politics. A first “reader” on political behavior 
was published (Eulau et al. 1956). In the ae 
year, a series of special panels devoted to politica 
behavior analysis was included in the program 0° 
the annual meeting of the American Political sa 
ence Association. In 1958 a group of political aa 
tists and political sociologists from the Unies 
States, Europe, and Japan sponsored an “Interna 


tional Yearbook of Political Behavior Research,” 
of which five volumes have appeared (Janowitz 
1961; Marvick 1961; Huntington 1962; Schubert 
1963; Apter 1964). Also, in the 1950s, special 
professorships were set up at almost all major 
universities. At some institutions, the University 
of Michigan, for instance, a “doctoral program in 
political behavior” has been established, while at 
others, notably Yale, the Massachusetts Institute 
of Technology, Northwestern, and Stanford, be- 
havioral approaches permeate the offerings of the 
political science departments. In terms of research 
output, if one compares the pages of the learned 
journals in the first five decades of the century 
with the pages since about 1950, the development 
appears dramatic indeed, The establishment in 
1962 of an “Inter-University Consortium for Polit- 
ical Research,” sponsored by almost all major de- 
partments of political science in American uni- 
versities, attests to the pervasive influence that 
political behavior analysis has gained in the dis- 
cipline. 

While the “behavioral revolution” in political sci- 
ence has been primarily an American phenomenon, 
it has had some consequences for research develop- 
ments abroad as well, especially in western Europe, 
India, and Japan. In some countries research in- 
stitutes have undertaken behavioral or quasi- 
behavioral research in politics—such as the studies 
on British elections undertaken under the auspices 
of Nuffield College (among others see Butler & 
Rose 1960; Butler 1962) or the studies on electoral 
behavior conducted in Norway by the Bergen Chr. 
Michelsen Institute and the Oslo Institute for So- 
cial Research (Rokkan & Valen 1960). In France 
individual scholars have been concerned with pub- 
lic opinion, electoral party behavior, and pressure 
groups (Grosser 1960). In Italy the work of Sartori 
and his associates on parliament is an outstanding 
example of the behavioral approach (Il parlamento 
italiano: 1946-1963). In Germany Otto Stammer 
and W. Hirsch-Weber, among others, have been 
influential and successful advocates of an empirical 
political science (Stammer 1960). In the 1960s 
the behavioral penetration of the study of com- 
parative politics and the increasing interaction be- 
tween American, European, and Asian scholars 
trained in behavioral research techniques promises 
to overcome traditional parochialisms in the study 
of politics [see POLITICS, COMPARATIVE]. 


Inventory—areas of research 

_ Political behavior research can be conveniently 
inventoried in terms of the situational or institu- 
tional contexts and arenas in which men act polit- 
ically. We can, therefore, speak of voting behavior 
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research, legislative behavior research, interna- 
tional behavior research, and so on. What recom- 
mends this classification is not simply the conveni- 
ence of tradition but the fact that political behavior 
analysis is pre-eminently interested in determining 
the consequences of individual political behavior 
for the functioning of political institutions. More- 
over, a classification of research by institutional 
categories facilitates linkage of behavioral and 
other modes of analysis, such as legal or historical 
inquiry, and permits appraisal of the cumulative 
contribution made by political behavior analysis to 
political science as a whole. 

Until the middle of the 1950s political behavior 
analysis was predominantly identified with (a) the 
study of political personality and related topics, 
such as the social composition of political elites: 
(b) the study of political attitudes and public 
opinion, including content analysis of the media 
of mass communication; and (c) the study of 
voting behavior and political participation. There 
were a few pioneering studies of legislative be- 
havior, judicial behavior, community politics, and 
sundry other topics, but political behavior research 
was primarily thought of as involving studies of 
either the attitudes and behavior of the mass pub- 
lic or of elites. Moreover, perhaps because social 
psychologists were especially interested in this 
research, political behavior analysis was often 
assumed to represent a species of “political psy- 
chology.” $ 

Actually, the intensive study of political person- 
ality was and has remained a very marginal con- 
cern. Lasswell’s pioneering Psychopathology and 
Politics (1930) has had few imitators. There has 
been some progress in this respect since 1955, but 
it has been slow [see PERSONALITY, POLITICAL]. 
However, studies of larger populations, indebted 
to various theories of personality and using inven- 
tories or scales derived from personality theories, 
concerned with such syndromes as authoritarian- 
ism, alienation, or anomie, have extended the 
range of political behavior research. 

Closely related to this research and undoubtedly 
influenced by it are an increasing number of 
studies of “political socialization” (Hyman 1959; 
Greenstein 1965) and of “political culture” (Al- 
mond & Verba 1963). Political socialization, of 
course, is only a critical first step in a person's 
political development, which must be distinguished 
from his recruitment and career pattern. The sys- 
tematic study of elites, particularly in terms of their 
social backgrounds, has long been a concern of 
political science [see ELITES], but more recently 
research has been directed to recruitment and 
career patterns, much of it conducted in specific 
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institutional settings [see POLITICAL RECRUITMENT 
AND CAREERS]. 

Electoral behavior. 
politics has been given more attention than elec- 
tions and electoral processes. The ready availa- 
bility of aggregate voting statistics made for an 
early interest in elections as behavioral phenomena 
(Gosnell 1930; Tingsten 1937). But whereas elec- 
toral statistics permitted only inferential state- 
ments about individual voting behavior, the inven- 
tion and application of the sample survey made 
possible the direct study of individual voters and 
isolation of the many institutional—structural, 
social-psychological, and directly political variables 
that influence the voting decision [see Vorne]. 
Evidence of the increasing maturity of this re- 
search is the fact that scholars, using both aggre- 
gate statistics and survey data, have come to seek 
insight into, and test propositions about, macro 
aspects of the political process, such as the via- 
bility of political competition (Key 1956), the 
conditions of democratic consent (Janowitz & 
Marvick 1956), the extent of cleavage and con- 
sensus in political systems (Lipset 1960), the 
function of representation (Miller & Stokes 1963), 
and the nature of democracy itself (Key 1961). 

Legislative behavior. As in the case of electoral 
behavior, research on legislative behavior had, in 
the form of roll-call votes and biographical data, an 
easily accessible reservoir of empirical information 
concerning both the backgrounds of legislators, 
from which presumably relevant attitudes were 
inferred, and the decisions of legislativé bodies. 
Perhaps more than was the case with electoral re- 
search, both statistical and case studies of legis- 
latures sought to link behavioral and institutional 
aspects of the legislative process. Recent studies 
of legislative committees as the critical tension 
points of decision making and of interpersonal re- 
lations have served to link even more closely indi- 
vidual and institutional patterns of legislative be- 
havior [see LEGISLATION, article on LEGISLATIVE 
BEHAVIOR]. 

Judicial behavior. In the study of public law 
and judicial decision making the tradition of for- 
mal legal analysis was so Strong that behavioral 
research did not emerge, with one exception 
(Pritchett 1948), until the late 1950s. When it 
came, the “behavioral revolution” in the field of 
public law took three forms. One group of scholars 
developed a largely nonquantitative “political proc- 
ess approach,” locating judicial cases in the broader 
context of social, economic, and political conflicts 
and specifying the judiciary as a party to the con- 
flicts of interests (Peltason 1955; Rosenblum 1955; 
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Shapiro 1964). A second group, using mathe- 
matical models and quantitative techniques, began 
to analyze opinions and decisions in terms of theo- 
ries derived from social psychology and sociology 
for the purpose of making inferences about the mo- 
tivations and attitudes of judges from a political 
perspective (Schubert 1960; 1963; 1964). A third 
approach in the new direction relies on the sys- 
tematic treatment of judicial biographies (Schmid- 
hauser 1960). [See Jupiciary, article on JUDICIAL 
BEHAVIOR. ] 

Administrative behavior. Most of the work in 
administrative behavior research has been done, in 
the 1950s and early 1960s, by psychologists and 
sociologists (March & Simon 1958) who, with a 
few exceptions (Selznick 1949; Lipset et al. 1956; 
Wilensky 1956; Blau 1955; Janowitz 1961), are 
not particularly interested in governmental institu- 
tions or politically relevant organizational be- 
havior. In a pioneering theoretical work by Herbert 
A. Simon, Administrative Behavior (1947), guide- 
lines were made available for empirical research, 
but political scientists failed to follow this lead, 
probably because public administration has been 
dominated by “bureaus” which were primarily 
program-minded and preoccupied with “practical” 
matters. Only a few students of public administra- 
tion have conducted behavioral research (Marvick 
1954; Meyerson & Banfield 1955; Kaufman 1960; 
Peabody 1964), Although there have been recent 
stirrings in the study of metropolitan government 
(Decisions in Syracuse 1961; Sayre & Kaufman 
1960), on the whole, behavioral research in the 
public administrative process lies dormant. Ak 

Community political behavior. Active political 
behavior research in the context of the local com- 
munity revived the old struggle between monistic 
and pluralistic conceptions of power in society and 
government. Stimulated by Floyd Hunter's Com- 
munity Power Structure (1953), political scien- 
tists, equally dissatisfied with his theoretical prem- 
ises and research procedures, descended into the 
local community and made it, more than any other 
political arena, a testing ground of theoretical 
propositions of a generic character about the bases, 
scopes, and relations of power (Banfield 1961; 
Dahl 1961a; Polsby 1963; Agger et al. 1964). The 
pluralists challenged the work of the monists, is 
they did not succeed in explaining how, in a p 
order, diverse sets of power holders are coordinate 
and integrated into a functioning whole. This is 
clearly a next step in behavioral research on the 
local community. ` A 

International political behavior. Behavioral re: 
search in the field of international relations—i-€» 


the behavior of states in interaction rather than 
foreign policy making—presents many technical 
problems. Until the middle 1960s most of the work 
has been theoretical rather than empirical, indica- 
tive of the paradox that there seems to be an 
inverse relationship between theorizing as an inde- 
pendently creative activity and the empirical acces- 
sibility of the phenomena theorized about (Rosenau 
1961; Kelman 1965; Singer 1965). Research along 
behavioral lines continues to rely on indirect evi- 
dence, produced by techniques such as content 
analysis, simulation, and the exploitation of aggre- 
gate data [see INTERNATIONAL RELATIONS]. 


Inventory—ranges of theory 


Institutional classification of political behavior, 
analysis ignores the theoretical concerns at the core 
of scientific inquiry. A second classification would 
locate political behavior research in terms of either 
specific organizing concepts or the broader con- 
ceptual schemes that give it direction. Such a clas- 
sification cuts across specific institutional arenas 
and has the advantage of being immediately useful 
in the further development of political theory. 

Political behavior analysis has produced or used 
a variety of conceptual approaches. While none of 
them has as yet culminated in a general theory of 
politics in a formal sense, all of them provide con- 
ceptual tools out of which more formal theories of 
empirical relevance might be constructed. Present 
approaches seem to fall into two major types. First, 
there are those which seek to build theoretical 
formulations around a single central concept, such 
as group, power, decision, or conflict. Second, there 
are those which proceed from comprehensive for- 
mulations of high generality, such as system, field, 
process, or communication. In these approaches 
the task of theorizing seems to consist in filling a 
formal scheme with conceptual content of political 
relevance. 

Both types of approach involve serious theoret- 
ical difficulties, Given the complexity of political 
phenomena, the efforts to build theories on à single 
central organizing concept are often unsatisfactory 
because they either explain too little or, paradox- 
ically, pretend to explain much more than is actu- 
ally explained. On the other hand, holistic ap- 
proaches are often unsatisfactory because they 
really explain less than should be explained. For- 
tunately, in most empirical research on political 
behavior the dilemmas of theory are overcome by 
the good sense of scholars facing concrete research 
problems, 

Most theorizing about politics 1 
behavioral research proceeds, in rat 


n the context of 
ther catholic and 
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eclectic fashion, from a range of problems where 
self-consciously theoretical formulations, whether 
of the unitary or systematic type, and operational 
necessities can fruitfully meet to extend the mar- 
gins of political knowledge. Theoretical develop- 
ment in political behavior analysis over the next 
few decades is likely to follow the course charted 
in the past, although increased theoretical sophis- 
tication, methodological refinement, and technical 
skills on the part of behavioral practitioners are 
also likely to open up new research directions. 

System analysis. Of the different organizing 
schemes used in political behavior analysis, system 
notions, first promoted in political science by David 
Easton (1953; 1957), are only sparingly used in 
empirical research. While the system model pro- 
vides a conceptual scheme broad enough to sub- 
sume most other formulations, and while, in the 
form of input-conversion-output-feedback analysis, 
it permits meaningful ordering of variables, the 
complexity of the model requires the production of 
so many kinds of data that any one piece of em- 
pirical research is likely to fall short of theoretical 
requirements. While the model has been used in 
macro analyses ( Almond & Coleman 1960; Mitchell 
1962), little empirical work using the model has 
been done on the micro level of analysis. [But see 
the work on legislatures by Wahlke et al. 1962; 
and Fenno 1962; see also SYSTEMS ANALYSIS, Ar- 
ticle on POLITICAL SYSTEMS.] 

Group approach. The older work, stemming 
from Bentley (1908) and summarized by Truman 
(1951), was mainly concerned with interest or 
pressure groups as the effective actors on the polit- 
ical scene, and the interaction of groups was 
thought to constitute the political process. Political 
behavior research of more recent vintage, following 
a suggestive discussion by Garceau (1951 ), is more 
likely to think of the group as an agency that inter- 
venes between the individual and the institutional 
tension points of the political system, where de- 
cisions are made. In this research, the group is 
mainly conceived of as a source of orientations and 
attitudes in individual behavior; it may also be 
treated as a system of action which, on the micro 
level of analysis, is a replica of larger political sys- 
tems. Recent theoretical work on small groups 
(Verba 1961; Golembiewski 1962) shows that little 
empirical work in the field of politics has so far 
been produced [see POLITICAL GROUP ANALYSIS]. 

Decision-making analysis. In spite of a great 
deal of theoretical discussion, especially in the 
arenas of international and private-organization 
decision making, behavioral-empirical research has 
been unable to make much use of decision-making 
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models in a systematic fashion. The notion of 
decision as the problematic focus of analysis has 
directed research attention to decision makers’ 
attributes and the premises underlying their ac- 
tions. But contextual and situational factors have 
eluded systematic analysis. With a few exceptions 
(Snyder & Paige 1958; Dahl 1957a), decision- 
making models in all their complexity have not 
been applied in political behavior research. Never- 
theless, the possibility of breaking down the deci- 
sion-making process into component parts that can 
be examined without necessary reference to the 
total process promises empirically testable research 
propositions [see DECISION MAKING]. 

Communication. Communication models to 
analyze political processes—in contrast to the study 
of communication as a social process in its own 
right—have only recently been used in political 
behavior research, and so far not with satisfactory 
results [see COMMUNICATION, POLITICAL]. That 
communication as a concrete process is a critical 
variable affecting political processes and outcomes 
is clear enough, but as a model, the communica- 
tion process seems more appropriate in analyzing 
macro political phenomena (Deutsch 1953; 1963) 
than individual political behavior (Milbrath 1963). 
One of the difficulties with communication models 
on the level of individual political behavior is that, 
precisely because communication is a universal 
component of all social relations, a communication 
model is likely to be a rather weak conceptual tool 
for explaining highly specific political processes. 

Power analysis. Though no concept has been 
more central in political analysis than power and 
though no concept is more easily apprehended by 
common sense, its utility as a tool of behavioral 
research seems to be limited—not only because of 
definitional difficulties but also because of opera- 
tional obstacles. Students of political behavior find 
it relatively easy to identify the bases of power or 
the scope of power (Lasswell & Kaplan 1950), but 
power relationships are extraordinarily difficult to 
identify and even more difficult to measure (Dahl 
1957b). As a result, power is rapidly losing ground 
from the point of view of its operational, if not 
analytical, utility. Attempts to use the concept have 
been made in community and legislative behavior 
research, but the more research there is, the more 
elusive the concept of power appears to be [see 
Power]. 

Role analysis. An increasing number of polit- 
ical behavior studies utilize concepts borrowed from 
sociological or social-psychological formulations of 
social role [see RoLE]. Of course, role analysis and 
whatever problems it is focused on—role conflict, 


role consensus, role structures, and so on—do not 
stand by themselves. Role analysis is likely to be 
combined with other theoretical notions, such as 
those of system, decision making, or power, and is 
most useful in structural—functional approaches to 
the political process (Almond & Coleman 1960; 
Wahlke et al. 1962; Barber 1965), It promises to 
be especially useful in closing the gap between 
macro analysis and micro analysis of politics (Eu- 
lau 1963). 


Inventory—state of methods and techniques 


Political behavior analysis has made very few 
original and independent contributions to research 
techniques and practically none to methodology. 
Rather, it has borrowed heavily from the other be- 
havioral sciences, with the result that behavioral 
research on politics has generally lagged behind 
substantive discoveries in cognate fields. Interest- 
ingly, some of the most valuable research tech- 
niques that political behavior analysis can claim as 
its own were pioneered in the pre-World War 1 
era, before the impact of the technological revolu- 
tion in the behavioral sciences was felt in political 
science. The analysis of legislative roll calls by way 
of simple measures of likeness and cohesion (Rice 
1928), content analysis of mass communication 
media (Lasswell 1927), and cluster-bloc analysis 
(Beyle 1931) are examples of creatively original 
technical innovations in the study of politics in the 
earlier period. Of course, other techniques were 
used as well by a handful of scholars, such as field 
interviewing, the mail questionnaire, punch-card 
data processing (Merriam & Gosnell 1924), the 
controlled field experiment (Gosnell 1927), and 
the sample survey (C. Robinson 1932), but concern 
with methodology has not been a hallmark of polit- 
ical inquiry until fairly recently. 

It is now recognized that political behavior re- 
search is necessarily facilitated or impeded by the 
state of methodological sophistication and techni- 
cal know-how at a given time. Especially needed 
today is a corps of trained and skilled research 
workers who are able to use, and recognize the 
limitations of, the modern methods. While the 
training of research workers improved in the yeasts) 
after World War n, it is still generally deficient in 
view of the rapidity of technological development, 
especially in comparison with the training given 
in neighboring disciplines. 

It FEF of course, that concern with methods 
tends to direct research attention to areas of in- 
quiry where data can be most readily harvested, 
processed, packaged, and distributed. As already 
mentioned, political behavior was for some time 


primarily identified with voting behavior, largely 
because of the ready availability of aggregated elec- 
tion returns in many jurisdictions. But it was the 
development of the random sample survey and the 
use of panels—especially in the Erie County study 
of 1940 (Lazarsfeld et al. 1944), in the study of 
Elmira, N.Y. (Berelson et al. 1954), and in studies 
of presidential elections in the United States by 
the Survey Research Center, University of Michi- 
gan (Campbell et al. 1954; Michigan, University 
of 1960) that made possible the analysis of mass 
political behavior on the level of the individual per- 
son. The success of systematic interviewing in the 
study of voting and public opinion gradually en- 
couraged researchers to employ these techniques 
in the study of institutionalized populations, like 
legislatures, bureaucracies, or courts, which are of 
special interest to the political scientist. The use 
of the interview in institutionalized groups in turn 
helped close the gap between behavioral and in- 
stitutional analysis and made possible much more 
carefully controlled comparative study of formal 
governmental structures (Wahlke et al. 1962). 
Methodological development as a requisite of em- 
pirical inquiry is especially critical in a field like 
international relations where direct observation of 
political behavior on the level of group or individ- 
ual is extraordinarily difficult. Nevertheless, recent 
efforts suggest future possibilities. Noteworthy are 
the application of the technique of the semantic 
differential and computer programming in the con- 
tent analysis of state papers (North et al. 1963), 
the simulation of international behavior in labora- 
tory decision-making groups and computer gaming 
(Guetzkow et al. 1963; Brody 1963), and the con- 
struction and exploitation of aggregate indicators 
of state behavior for the purpose of linking internal 
and external aspects of politics (Deutsch 1960). 


Problems 


In the third quarter of the twentieth century 
numerous problems of an empirical, theoretical, 
and methodological nature continue to limit the 
promises and prospects of a behavioral science of 
politics. Many of these problems are shared by the 
behavioral sciences generally. But some, it seems, 
stem from the character of politics as a particular 
mode of human action; even if these problems ate 
not peculiar to politics, they are especially trouble- 
some in the study of that particular field. 

The problem of theory. It is an ideal of science 
that theoretical explanation should remain within 
a system of definitions, axioms, theorems, and 
postulates. Yet at the present stage of development 
no single theory of politics seems adequate to cope 
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with all the problems encountered in political re- 
search. In the task of testing theoretically derived 
propositions the researcher is certainly free to ig- 
nore data that do not seem to be immediately 
appropriate. However, if the purpose of research 
is to describe, predict, and explain political behav- 
ior in the real world, there is an irresistible strain 
toward departure from theoretical purity and to- 
ward adoption of concepts and categories of anal- 
ysis alien to initial theoretical formulation. Such 
departure from theoretical consistency tends to 
make disproof of hypotheses impossible or ex- 
tremely difficult. Yet to limit research to only those 
occurrences that can be accounted for by a single 
theory would tend to rigidify political behavior re- 
search and obstruct its usefulness in the creation 
of new knowledge about politics. It is for this rea- 
son, probably, that political behavior analysis will 
continue to be, for some time to come, catholic 
and eclectic. 

The macro-micro problem. Although concrete 
political action is invariably the behavior of indi- 
vidual human actors, the politically significant 
units of action are groups, associations, organiza- 
tions, communities, states, and other collectivities. 
While referring to the “behavior” of such collectivi- 
ties is readily recognized as necessary shorthand, 
the problem of how meaningful statements about 
collectivities can be made on the basis of inquiry 
into the behavior of individual political actors re- 
mains. The problem involves constructing, not only 
theoretically but also empirically, the chain of in- 
teractions and transactions that links the individ- 
ual actor to the collectivities of which he is a part. 
In trying to solve the problem, a number of fal- 
lacies are possible. For instance, there is the fallacy 
of extrapolation from micro to macro phenomena. 
Small units are treated as analogues of large ones, 
and the findings on the micro level are extended 
to the macro level. There is the fallacy of personi- 
fication: large-scale phenomena are reduced to the 
individual level through the use of anthropomor- 
phic categories of analysis, as in the more grotesque 
descriptions of “national character.” Or there is the 
fallacy of misplaced concreteness: interactional 
and relational phenomena are reified and treated 
as if they were physical entities. 

The problem of observation. Even at the least 
complex level, that of the individual, a great deal 
of political behavior is private, and at the most 
complex levels it is deliberately secret. Direct ob- 
servation—whether of the citizen in the voting 
booth or the diplomat in interstate negotiations or 
the top-level decision maker at the apex of a po- 
litical structure—is impossible. Moreover, there is 
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likely to be an inverse relation between an actor's 
importance in the political system and his acces- 
sibility to research. As a result, the data of political 
behavior analysis are to a large degree secondary 
and phenomenological, creating problems of ac- 
curacy and validity that are difficult to handle. 
Even where political behavior is relatively open 
and public, as in deliberative bodies in democ- 
racies, the large number of actors constituting the 
collectivity whose behavior must be observed tends 
to defy direct observation, necessitating reliance on 
interviews. For both of these reasons—the paradox 
of privacy in presumably public action and the 
complexity of the action systems of interest to the 
observer of politics—political behavior analysis 
continues to rely largely either on inferences drawn 
from what political actors are willing to reveal or 
on inferences drawn from observable stimuli or 
from consequences of political actions. 

The problem of aggregation. Problems also 
arise out of the relationship between discrete and 
aggregate data. What may be true of aggregates 
need not be true of the individuals who compose 
them, Generalizations from aggregate data have 
been found untenable when tested against infor- 
mation about the behavior and attitudes of individ- 
uals (Miller 1955/1956). In other words, the use 
of aggregated data is likely to conceal a good deal 
of the variance in the behavior of individuals, 
which the use of discrete data reveals. Yet aggre- 
gate data are often the only kind of behavioral data 
that are available. On the other hand, even if in- 
dividual data are available and aggregated so that 
Statements about collectivities can be made, such 
aggregation may still do violence to findings about 
individual behavior. Aggregation has, of course, the 
advantage of showing what variance remains to 
be explained. But what is meant empirically by a 
group's loyalty, a party’s cohesion, or an organiza- 
tion’s morale has not been answered. It is seldom 
clear whether such conceptions refer to a “group 
property” that is independent of the behavior of 
the individuals composing a group or to the ag- 
gregated characteristics of individuals. 

The intensity-extensity problem. Political be- 
havior research is also challenged by the gap be- 
tween intensive case studies and more extensive, 
systematic analysis. A number of questions arise. 
Although there are many case studies of political 
behavior in all substantive fields of political sci- 
ence, the grounds on which the cases are selected 
are usually obscure. In other words, the assump- 
tions of random sampling in the selection of cases 
are not met, with resultant difficulties for either 
the typicality or significance of findings. The re- 


searcher’s substantive interests or convenience seem 
to be the guiding criteria. Most case studies deal 
with exciting, spectacular, and perhaps critical 
situations rather than with more modal ones. More- 
over, few case studies are cast in a theoretical 
framework that is geared to the problem of cumu- 
lative knowledge. Finally, although case studies 
are said to be rich sources of hypotheses for future 
systematic research, few follow-up studies are ever 
made (for an exception, see J. Robinson 1962). 
How can the relatively few but intensively studied 
cases be integrated into systematic research? That 
question remains on the agenda of future work in 
political behavior analysis. 

The problem of comparability. The accumula- 
tion of political behavior research makes sense 
only if, in due time, different studies of a system- 
atic nature can be compared. Yet such comparabil- 
ity is difficult to achieve because indexes used by 
different researchers are often ambiguous and un- 
stable. Because either conceptual or empirical equiv- 
alence between indicators is lacking, it is ques- 
tionable just what it is that is being compared. 
Comparison, as a result, does not differ very much 
from the common-sense comparison of the layman. 
Comparative analysis has largely failed to remove 
from undisciplined and possibly arbitrary determi- 
nation of similarities and differences those ele- 
ments that are either irrelevant to comparison or 
perhaps even spurious. Moreover, comparison has 
not been systematic in the selection and control of 
the data from which inferences about the existence 
of uniformities and regularities are to be made. 
Agreement among analysts about the indicators 
that are to be used for specific concepts and about 
which controls and what degree of control should 
be minimally required is a desideratum of political 
behavior analysis that is yet to be attained. 

The problem of change. Most political behav- 
ior research is cross-sectional, dealing with the 
actions of individuals, collectivities, or aggregates 
at one point in time or over a relatively short pê- 
riod. As a result, most political behavior research 
has been ahistorical, the justification often being 
that the discovery of functional rather than causal 
relations is the objective of inquiry. But avoidance 
of the causal challenge does not solve the problem 
of studying change through time. It is, of course, 
possible to compare the behavior of cross sections 
or individuals through time and venture inferences 
about change from such comparison. But the meth- 
od is unsatisfactory because changes may be $ 
opposite directions and compensatory, making only 
for marginal results that indicate little or no change. 
Furthermore, if the time span is short, as it often 


is, it is likely to reflect a sequence of possibly 
“unique” events and make for spurious inferences 
about causation. The dilemma of studying change 
through time must ultimately be solved by longi- 
tudinal studies. Longitudinal research on political 
behavior, using the individual as the unit of anal- 
ysis, is probably the most dependable technique 
of studying the process of cause and effect, for it 
permits description of the direction, degree, rate, 
and character of political change. 

These are only some of the problems facing po- 
litical behavior analysis. To point them up is hardly 
a sign of the weakness of the discipline. Rather, it 
is a precondition of scientific growth and develop- 
ment. 


Evaluation 


The emergence of political behavior analysis oc- 
casioned a number of controversies in political 
science as a whole. Some of these disputes stemmed 
from personal misunderstanding or distrust, but 
others centered on fundamental disagreements 
over the nature and functions of a science of poli- 
tics in society. Quite often, controversies deriving 
from irreconcilable premises, on the one hand, and 
those based on misunderstanding, on the other 
hand, were confused, making for uneasy coalitions 
among opponents and proponents of the new 
trends. Charges and countercharges tended to 
deepen rather than resolve often fictitious cleav- 
ages (Charlesworth 1962). 

That political behavior analysts spoke in a lan- 
guage strange to the ears of more traditional schol- 
ars was certainly one source of suspicion. Other 
sources of difficulty were that political behavior 
analysts seemed happy with using in their research 
what traditionalists considered “gadgets” and “gim- 
micks”; that they seemed to be dealing in triviali- 
ties; that they sometimes exhibited a lack of respect 
for conventional scholarship; and that they turned 
their backs on problems of value and prescription. 
In some respects, all of these charges contained 
elements of truth, although from the perspective 
of the behavioralists they were largely made for 
the wrong reasons. 

On the other hand, proponents of behavioral an- 
alysis were critical of the mainstream of political 
science and its major tributaries because the dis- 
cipline did not seem to measure up to the require- 
ments of a genuine science of politics (Easton 
1953). Traditional political scientists were charged 
with hopelessly confusing statements of fact and 
statements of value; grossly neglecting the subtle 
problems that beset the move from evidence to in- 
ference; dealing in global statements for which 
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there was only flimsy proof at best; advocating 
reforms and proposing policies that pretended to 
be based on knowledge where there was little or 
none; using concepts that were immune to research 
operations; or indulging in tautological explana- 
tions or armchair speculations. 

This type of mutual harassment has largely dis- 
appeared in the middle 1960s. Rather than criti- 
cizing traditional scholarship, political behavior 
analysts are devoting their energies to the research 
tasks at hand. In particular, the largely fictitious 
gap between behavioral and institutional analysis 
separating empiricists has been closed (Ranney 
1962). Analysts of both institutions and public 
policies have come to appreciate the utility of be- 
havioral research for their own concerns. One 
could conclude that the mainstream of political 
science has been successfully channeled in a direc- 
tion that accepts different emphases—institutional, 
behavioral, legal, and historical—as mutually com- 
plementary rather than divergent. Increasing so- 
phistication about the relationship between empiri- 
cal research of all kinds and the problems of public 
policy has reconciled both the science-oriented and 
the policy-oriented students of the political process 
[see POLICY SCIENCES]. Behavioral analysis has be- 
come self-consciously theoretical, and theoreticians 
have become increasingly aware of their responsi- 
bility to make their work empirically relevant. 
Resolute resistance to political behavior analysis 
as one mode of inquiry in political science is now 
largely restricted to a small group of theorists who 
believe that all important political questions can 
be answered only by insight into and deduction 
from philosophical premises involving some con- 
ception of “the nature of political man” (Storing 
1962). 

There remain, of course, disagreements within 
the camp of political behavioralists, but these disa- 
greements relate to problems of strategy and tactics 
in theory and research rather than to fundamental 
questions of knowledge. There are those who would 
start with the accumulation of raw facts and let 
the facts speak for themselves. There are those, at 
the other extreme, who believe that political be- 
havior analysis will not get off the ground unless 
it begins with rigorous deductive-logical, if not 
mathematical, models of political systems and proc- 
esses. By far the largest contingent among politi- 
cal behavioralists begins with somewhat less rigor- 
ous theoretical formulations of problems in the real 
world of politics and lets theoretical and empirical 
work fertilize each other, In general, whether fa- 
voring the inductive or the deductive theoretical 
outlook, most political behavior analysts agree that 
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theory is not knowledge but only a tool for gaining 
knowledge, just as facts are not knowledge but 
only raw materials that must be translated into 
statements acceptable as probably true because 
they have been tested in the crucible of empirical 


research. 
Henz EvLau 


[See also POLITICAL SCIENCE.] 
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POLITICAL CLUBS 


A political club is an organization to which 
members are attracted primarily by nonmaterial, 
associational incentives in order to influence the 
outcome of elections, the leadership of political 
parties, or the conduct of public affairs. These asso- 
ciational incentives are nonmaterial rewards accru- 
ing to members because of their sense of belonging 
and their relations with other members. It is this 
incentive system that distinguishes a club from 
other forms of political organization. A political 
organization that relies on specific, material incen- 
tives (usually money or things easily valued in 
money terms) is a machine; one that relies on the 
personal attachment of the members to a particular 
leader is a following, especially if the connection is 
not formalized and club quarters are not provided. 
An interest group is distinguished from a club by 
its relying on a shared interest or purpose (group 
serving or public serving) rather than on the satis- 
factions that arise from solidarity with other mem- 
bers or with the club as an entity. a 

The distinctions among these kinds of political 
organizations are analytical, not concrete; in real- 
ity, many organizations are mixed cases. Where 
the dominant mode òf attachment—the primary 
incentive—is the sociability, camaraderie, and 
prestige of membership, we may use the term 
“club” with assurance. Generally speaking, a club- 
like attachment is evidenced by certain tangible 
criteria: a formal membership roster that distin- 
guishes those “in” from those “out,” a regular 
meeting place where members periodically come 
together, and a series of planned events that serve 
to affirm the members’ solidarity and organize their 
conviviality. x 

Political organizations that have all the charac 


ee 


teristics of clubs—nonmaterial incentives, formal 
membership, regular meetings—except that they 
rely for inducement entirely on common purposes 
(for example, an ideology) and not at all on so- 
ciability are hard to classify. In principle persons 
may associate together for the loftiest political 
purposes even though they dislike each other in- 
tensely; in practice such cases are rare. In general, 
nonideological incentives are important even in 
ideological associations; the more important they 
are, the more appropriate is the term “club.” Or- 
ganizations that rely wholly on ideology as an in- 
ducement rarely, if ever, display all the outward 
and visible signs of a club. 

Functions of clubs. Political clubs perform dif- 
ferent functions, such as the following: (1) Sym- 
bolic—the club may form, manifest, or intensify 
political opinion or loyalty, drawing like-minded 
persons together both to heighten and make artic- 
ulate their commitment and to demonstrate this 
commitment to others. (2) Resource—political re- 
sources, particularly money and prestige, may be 
mobilized by a club and made available to a party 
or to a faction within a party, (3) Leadership— 
efforts to mantain or capture the leadership of a 
political party may be organized and stimulated by 
a political club, the target of such efforts being 
either the party’s general organization (for in- 
stance, the convention or executive committee) or 
the party in the legislature, or both. (4) Electoral 
—the voter, rather than a party, may be the object 
of action, the club seeking to register him, convince 
him, and bring him to the polls. 

A club may—and usually does—perform more 
than one function, but some combinations are 
more likely than others. The symbolic and resource 
functions tend to be associated in one kind of club, 
the leadership and electoral functions in another. 
In the first case the club exchanges associational 
benefits for the member's funds, the member's as- 
sent to a statement of purpose, or the right to use 
the member’s name; in the second case the club’s 
inducements must compensate the member for 
considerable expenditure of time and effort, often 
on tedious and unrewarding tasks (soliciting votes, 
for example), The first kind of club might be called 
a solidary association, the second a “combat” or- 
ganization, Clubs suitable for one role are as a 
result often unsuitable for the other (for example, 
persons who join a club to partake of its status or 
to lend it theirs are not likely to attach a high value 
to routine electoral work among lower-class voters). 

Age and class influence the functions of clubs. 
Men whose success in life has given them the 
prestige and funds valued by a political club are 
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likely also to be men sufficiently advanced in years, 
in career, and in status to have little time or in- 
clination for the routine, conflict-laden aspects of 
politics; on the other hand, men too young or too 
lacking in family connections to have money and 
status are also likely to regard political labors as 
an opportunity rather than a burden. 

History. The earliest political clubs in England 
and the United States tended to be primarily soli- 
dary associations brought into being by political 
notables in order to symbolize (and in part to 
create) an emerging political party and to mobilize 
resources for the conduct of its affairs. In London 
the most influential aristocrats, ministers, and 
members of Parliament had for years gathered 
informally at White’s or Brook’s, private gambling 
and drinking clubs that began as business ventures 
but acquired political overtones owing to their pa- 
tronage by the various pariamentary factions that 
were beginning to coalesce into the early Tory and 
Whig parties. These clubs were essentially social 
organizations (Brook's was made famous by the 
gambling exploits of Charles James Fox and the 
Prince of Wales); although each acquired a parti- 
san flavor, there were some Whig members in the 
predominantly Tory White’s club and some Tories 
at the Whig center, Brook's. 

After passage of the Reform Act of 1832, British 
party managers faced the necessity of registering a 
sizable number of newly enfranchised voters and 
of bringing together into an electoral organization 
the various factions and “connexions” that had 
flourished in the era of “rotten borough” politics. 

The Tories organized the Carlton Club and the 
Whigs the Reform Club, Both were intended to be 
political associations, but neither relied on party 
program to maintain the support of the members; 
rather, elegant dinners, the prestige of member- 
ship, and comfortable quarters kept the subscrip- 
tion lists filled. Because the clubs were almost 
exclusively upper class, many rank-and-file party 
workers and parliamentary backbenchers were not 
members. Status was often more important than 
ideology; several prominent members were allowed 
to remain even though ideologically they were 
backsliders, but not even the most ideologically 
pure candidate was admitted if he was not a 
“gentleman.” The more middle-class clubs were 
often ideologically rigorous (the Conservative Club, 
founded in 1840, was apparently more intolerant 
of deviants than was the Carlton). 

In the United States no great national political 
clubs arose comparable to the Carlton or Reform; 
the localistic nature of American politics led to 
the establishment of local clubs. The Boston 
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Caucus Club met irregularly in the 1760s. In the 
1790s Democratic and Republican societies (often 
modeled after Jacobin clubs) were established in 
most states to challenge the Federalist domination 
of the national government; these short-lived as- 
sociations served, under the leadership of “mother 
societies” in Philadelphia, as intermediate forms 
between the politics of the great family “connex- 
ions” and the politics of the mass parties of later 
years. 

In both England and the United States the clubs 
attempted to unite various wings of the parties, 
to provide central meeting places for activists and 
stopping places for sympathetic travelers, to supply 
funds and candidates for the constituencies, to in- 
fluence newspapers and public opinion, and, in a 
few cases, to direct parliamentary strategy. It is 
easy to exaggerate their success in these matters; 
the clubs were, after all, a transitional political 
form whose functions were in the hands of part- 
time, lay members, many of them more interested 
in conviviality than in conviction. The need for 
full-time, professional party management grew in- 
creasingly evident, particularly as the franchise 
was steadily widened. The clubs began by support- 
ing paid party managers and were ultimately sup- 
planted by them. 

Clubs grew fastest in periods when the franchise 
was being enlarged and in areas where party com- 
petition was keenest. The club was an appropriate 
device for defining the party's identity and mobiliz- 
ing its resources as long as the party consisted to 
some degree of like-minded persons; with the 
growth of a mass party, particularly under condi- 
tions of two-party competition, the party became 
too heterogeneous to be represented by a single 
solidary association and the requirements of party 
combat too demanding to be left to part-time 
amateurs. 

A centralized political structure, as in England, 
vested power increasingly in the parties’ parliamen- 
tary leadership; party managers became the agents 
of this leadership. In the United States a highly 
decentralized political system saw the growth of 
local, not national, clubs (the Tammany Society, 
founded in 1789, was a conspicuous example) that 
either were assimilated into the regular party ap- 
paratus (the “members” becoming in fact paid 
party workers, precinct captains, and ward leaders 
for whom the “club” was a source of tangible more 
than intangible benefits) or became organizations 
through which insurgents challenged regular party 
leadership in hopes of obtaining the perquisites of 
power or of reforming the system of politics. 

Between 1868 and 1885 the number of political 


clubs in London grew from 5 to 14 and the number 
of members from approximately 6,700 to over 
25,000 (Hanham 1959). Most of this growth was 
accounted for by middle-class clubs, many of which 
provided London quarters for visiting politicians 
and dignitaries from the provinces. In addition, 
both parties—from about 1867—sought to extend 
their influence among the lower classes by foster- 
ing the growth of workingmen’s associations and 
clubs in the wards. Such clubs, created from the 
top down, provided, for a small subscription fee, 
“plain and homely facilities” in which the principal 
attraction was the bar. In New York City and other 
large American cities such working-class clubs 
grew up spontaneously; whereas in politically cen- 
tralized England there was little incentive other 
than sociability to bring persons together in such 
clubs, in America there was in addition the lure 
of the power, offices, and jobs available from ward 
and city politics. 

From the nineteenth century on, there were in 
England and America a number of clubs devoted 
as much to political argumentation as to sociabil- 
ity; in England these “local Houses of Commons” 
consisted of zealous young men, eager for a variety 
of causes and candidates, who were debarred by 
poverty or lack of connections from the traditional 
entry route (via the university) into politics. In 
the United States a hundred years later local Dem- 
ocratic clubs sprang up in New York, Illinois, Cali- 
fornia, and several other states where zealous 
young men and women felt themselves debarred 
from the traditional entry route into politics be- 
cause they had a university education and strong 
convictions. These college-educated liberal Dem- 
ocrats sought to use the club as a device for (in 
New York) “reforming” the party leadership from 
within or (in California) committing the partys 
candidate to a particular policy position. The polit- 
ical club, which began as a device for bringing the 
prestige and wealth of aristocrats into a regularized 
association with nascent parties, now exists most 
commonly in the United States as an association 
in which young middle-class amateurs struggle 
against the regular party machinery in order ig 
advance both their principles and their ambition. 
Where the earliest clubs stressed elegance and ex- 
clusiveness, contemporary clubs are likely to stress 
Robert's Rules of Order and community issues. 

The role of clubs. The club, whatever its form, 
has only seldom been successful in dance 
class differences. Naturally the solidary politic i 
club had no interest in becoming a multiclass E 
sociation; its exclusiveness was a principal incen 
tive. The combat club often strove to attract mem- 


bers from a variety of groups, but the middle-class 
and often professional or even intellectual leader- 
ship of the club was typically unable to make club 
life attractive to working-class members. Many of 
the Jacobin clubs in late eighteenth-century France 
achieved a very diverse membership, but usually 
only after the advent of the Terror had transformed 
the club from a middle-class gentlemen’s literary, 
social, or freemason society into a powerful instru- 
ment of ideology and local government. Aaron Burr 
used the Tammany Society as a vehicle for mobiliz- 
ing both wealth and votes, but by 1872 the Irish 
had driven the Anglo-Saxon Protestants out of lead- 
ership, and men of lower-class antecedents had 
displaced the merchants. In England, after 1883, 
the Primrose League attempted, by the use of elab- 
orate fetes, carnivals, and vaudeville entertain- 
ments, to mobilize working-class voters on behalf 
of an ultraconservative position represented by 
Lord Randolph Churchill. The “reform” clubs of 
New York City in the 1950s and 1960s emphasized 
community and neighborhood service projects and 
servicing of tenants’ complaints as a way of at- 
tracting working-class support. Both the Jacobin 
clubs and the Primrose League “habitations” (as 
the local units were called) placed heavy reliance 
on elaborate, quasi-masonic rituals, handclasps, lit- 
anies, titles, and ceremonies (the Tory Primrose 
League pledged its members to secrecy and “the 
Imperial Ascendancy of Great Britain” and con- 
ferred the title of “knight harbinger” and the order 
of the Jubilee Grand Star; the radical Jacobin clubs 
celebrated the Cult of Reason, revered busts of 
Jean Paul Marat and Louis Michel Lepeletier, and 
insisted on the single title of “citizen”). 

In general, ritual, entertainment, and refresh- 
ments have proved more useful than ideology or 
community service in attracting working-class sup- 
port to clubs led by, or under the patronage of, 
middle-class activists. In tsarist Russia during the 
early twentieth century, for example, there were 
a number of intellectual “study circles” (such as 
Martov’s circle in St. Petersburg), which discussed 
and debated the imperatives of history; there were, 
in addition, workers’ circles to which intellectuals 
were invited as teachers. The intellectuals discov- 
ered, however, that the workers were often more 
interested in being taught how to read and write 
Russian than how to appreciate Marx, Georgi Plek- 
hanov, and Pavel Akselrod; instead of arousing a 
revolutionary consciousness, the intellectuals usu- 
ally succeeded only in assisting the workers in their 
efforts to acquire the attributes of bourgeois re- 
spectability. Another kind of semipolitical club 
(there were practically no clear-cut examples, ow- 
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ing to the absence of electoral and party politics) 
were the older middle-class professional societies, 
such as the Moscow Society of Jurisprudence and 
the St. Petersburg Economic Society; discussion at 
these associations was sufficiently political for the 
government to order them either closed or cur- 
tailed. 

The decline of traditional political clubs in the 
United States has been the result of the decay of 
the political machine (of which the clubs were, 
in many cities, a part), the assimilation of the 
lower classes into a style of politics in which news- 
papers and television are more important than 
clubhouse conversation as a source of both enter- 
tainment and information, and the withdrawal of 
the old elites of wealth and prestige from an active 
role in local politics. Perhaps the last major upper- 
class male political club movement was the Union 
League Association. Union League clubs were 
founded in New York and Philadelphia in 1862- 
1863 to strengthen the Federal cause, counteract 
Copperhead influence, and support the war effort; 
after the Civil War they attempted to organize the 
freed slaves for political purposes. Following Re- 
construction the Union League clubs turned their 
attention to political—particularly municipal—re- 
form and to the difficult task of maintaining their 
expensive clubhouses. 

Political clubs today are more likely to be combat 
than solidary associations and to attract the ideo- 
logically militant rather than either the prestigious 
or the needy. The volunteer spirit of the club move- 
ment has brought to it many young women (ladies 
having more time than men for door-to-door poli- 
tics); in this the clubs associated with the Cali- 
fornia Democratic Council or the New York Com- 
mittee for Democratic Voters are like the Primrose 
League and the Jacobin clubs, which gave a major 
political role to women as early as the eighteenth 
and nineteenth centuries. 

Some of the constituency associations of the 
Conservative and Labour parties in England func- 
tion as political clubs and attract the more zealous 
elements of the parties; as a result, constituency 
association views are often more extreme than 
(and at odds with) the views of central party 
managers or parliamentary leadership. Lord Ran- 
dolph Churchill's efforts in 1883 to capture the 
leadership of the National Union of Conservative 
and Constitutional Associations by mobilizing the 
local units against the influence of the party whips 
and professional leaders presaged the efforts, sev- 
enty years later, of the reform-minded clubs of the 
New York Committee for Democratic Voters to cap- 
ture the executive committee of the New York 
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County Democratic party (that is, Tammany Hall). 
Club leaders are almost invariably in competition 
with the party’s elected officials and managers; the 
former are constrained by the need to maintain the 
enthusiasm of like-minded volunteer club mem- 
bers, the latter by the need to win votes from a 
heterogeneous and slightly bored electorate, 


Little comparative research has been done in 
the following areas: (1) the consequences for 
goals and strategies of differing kinds of club in- 
centives; (2) the institutional or socioeconomic 
conditions under which associational incentives 
will be highly valued by political activists and 
hence under which political clubs will flourish; 
(3) the methods by which professional party man- 
agers deal with political clubs when the supply of, 
or demand for, other inducements (such as money ) 
is insufficient to permit leaders to abandon the 
club form altogether; and (4) the relationship be- 
tween ideology and sociability as organizational 
incentives. 

JAMEs Q. WILSON 


[See also INTEREST GROUPS; VOLUNTARY ASSOCIATIONS. ] 
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POLITICAL CULTURE 


Political culture is the set of attitudes, beliefs, 
and sentiments which give order and meaning to 
a political process and which provide the under- 
lying assumptions and rules that govern behavior 
in the political system. It encompasses both the 
political ideals and the operating norms of a polity. 
Political culture is thus the manifestation in ag- 
gregate form of the psychological and subjective 
dimensions of politics. A political culture is the 
product of both the collective history of a political 
system and the life histories of the members of 
that system, and thus it is rooted equally in public 
events and private experiences. 

Political culture is a recent term which seeks to 
make more explicit and systematic much of the 
understanding associated with such long-standing 
concepts as political ideology, national ethos and 
spirit, national political psychology, and the funda- 
mental values of a people. Political culture, by 
embracing the political orientations of both leaders 
and citizens, is more inclusive than such terms as 
political style or operational code, which focus on 
elite behavior. On the other hand, the term is more 
explicitly political and hence more restrictive than 
such concepts as public opinion and national 
character. 

The concept of political culture can be seen as 4 
natural evolution in the growth of the behavioral 
approach in political analysis, for it represents an 
attempt to apply to problems of aggregate or sys- 
temic analysis the kinds of insights and knowl- 
edge which were developed initially by studying 
the political behavior of individuals and small 
groups. [See POLITICAL BEHAVIOR.] 

More specifically, the concept of political culture 
was developed in response to the need to bridge a 
growing gap in the behavioral approach between 
the level of microanalysis, based on the po 
logical interpretations of the individuals politic 
behavior, and the level of macroanalysis, based ort 
the variables common to political sociology. ma 
sense the concept constitutes an attempt to oe 
grate psychology and sociology so as to be able 


apply to dynamic political analysis both the revo- 
lutionary findings of modern depth psychology and 
recent advances in sociological techniques for meas- 
uring attitudes in mass societies. Within the disci- 
pline of political science, the emphasis on political 
culture signals an effort to apply an essentially 
behavioral form of analysis to the study of such 
traditional problems as political ideology, legiti- 
macy, sovereignty, nationhood, and the rule of law. 
(For a theoretical analysis of the concept see Verba 
in Pye & Verba 1965, pp. 512-560.) 


Political culture and socialization 


Intellectual curiosity about the roots of national 
differences in politics dates from the writing of 
Herodotus, and possibly no recent studies have 
achieved the richness of understanding of such 
classic studies of national temperament as those 
by Tocqueville, Bryce, and Emerson. But the dy- 
namic intellectual tradition which inspired political 
culture studies comes almost entirely from the 
studies of national character and the psychocul- 
tural analyses of the 1930s and 1940s. Benedict 
(1934; 1946), Mead (1942; 1953), Gorer (1948; 
1953; 1955), Fromm (1941), and Klineberg 
(1950) all sought to utilize the findings of psycho- 
analysis and cultural anthropology to provide 
deeper understanding of national political be- 
havior, A major objection to these studies was their 
failure to recognize that the political sphere con- 
stitutes a distinct subculture with its own rules of 
conduct and its distinct processes of socialization. 
The practice of moving directly from the stage of 
child training to the level of national decision 
making meant that crucial intervening processes 
were neglected. 

Stages of socialization. The notion of political 
culture seeks to retain the psychological subtleties 
of the earlier national character studies while giv- 
ing appropriate attention to the distinctive features 
of the political sphere and to the intervening stages 
of personality development between childhood and 
induction into adult political life. This is achieved 
by conceiving of two stages of socialization; the 
first is the induction into the general culture, while 
the second is the more particular, and usually more 
explicit, socialization to political life. In some forms 
of analysis it is useful to distinguish an additional 
stage, political recruitment to special roles within 
the political process. These stages are not neces- 
sarily sequential; explicit political socialization can 
occur at a very early point, when the individual is 
still being socialized into his general culture. 

Basic to the analysis of political cultures is the 
investigation of the relationships between the vari- 
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ous stages of socialization and between the final 
political socialization process and the dominant 
patterns of behavior in the political culture. In 
some systems there is a fundamental congruence 
between the content of the various socialization 
processes and the existing political culture. Such 
congruences existed historically in the traditional 
political cultures of Japan, Egypt, Ethiopia, and 
Turkey (see Ward, pp. 27-82; Binder, pp. 396- 
449; Levine, pp. 245-281; Rustow, pp. 171-198 in 
Pye & Verba 1965). In such systems the values 
and attitudes internalized during the general 
socialization process are consistent with and re- 
inforced by the attitudes and values stressed in the 
process of more explicitly political socialization; 
and the combined socialization processes tend in 
turn to support and reinforce the current political 
culture. Under such conditions the prospects are 
for the continued existence of a coherent and rela- 
tively stable political culture. 

It is, however, also possible to distinguish vari- 
ous kinds of tensions and instabilities in political 
cultures according to the types of contradictions 
and inconsistencies in the socialization processes 
and between these processes and the requirements 
of the political system. The most dramatic exam- 
ples of such contradictions are to be found in revo- 
lutionary systems in which the elite political cul- 
ture is either shaped by a highly explicit and 
unculture-bound ideology or is the product of an ex- 
ogenous historical experience such as colonialism. 

In some societies the primary process of social- 
ization tends to provide people with a strongly 
optimistic view of life and a deep sense of basic 
trust in human relations, while the later stages of 
political socialization emphasize cynicism and sus- 
picion of political actors. As a result, the political 
culture is characterized by a critical and con- 
temptuous view of existing political practices but 
is also colored by a strong utopian faith that reform 
can ultimately remedy the existing situation, Thus 
cynicism is balanced by the expectation that re- 
forms are worth seeking. This appears to have 
been the character of the cynicism which inspired 
the muckraking tradition in American politics. The 
same dynamics seem to be at work in the Philip- 
pines political culture (Grossholtz 1964). In other 
societies distrust of contemporary political insti- 
tutions and personages is preceded by an earlier 
socialization process which instills a sense of fun- 
damental distrust and suspicion, with the result 
that people have little faith in reformist solutions 
and feel that political improvement requires cata- 
clysmic changes. Burma provides an example of 
this process (see Conference . . . 1963). 
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Continuity and discontinuity. Problems of con- 
tinuity and discontinuity also require analysis of 
the relations between socialization and the polit- 
ical culture. Historical events within the political 
system may demand changes in the political cul- 
ture which are inconsistent with either past or 
present socialization processes. In all dynamic po- 
litical systems, tensions are possible because the 
socialization process cannot change as rapidly as 
the political process. This problem becomes pro- 
foundly acute when there is a sudden change in 
the international status of a society, for instance, 
when a colony gains independence. One of the 
basic sources of instability and ineffectualness in 
many newly developing countries lies precisely in 
the differences between the emphasis of the social- 
ization processes which produced the various strata 
of the contemporary society and the attitudes 
necessary for operating a national political process. 

Socializing agents. In shaping the political cul- 
ture the political socialization process operates in 
terms of various socializing agents. Some of these 
agents, such as the family, tend to be prominent 
at the early phases of the socialization process, 
and thus their influences are most closely related 
to personality characteristics fundamental to the 
political culture. Other socializing agents, such as 
the mass media and political parties, tend to be- 
come critical at later stages and thus are primarily 
involved in influencing the more cognitive aspects 
of the political culture. 

Much current research on different political cul- 
tures has sought to determine the relative im- 
portance of different kinds of socializing agents in 
shaping different aspects of the political culture 
and, thus, in evaluating the links between the 
sociological structure of the society and the polit- 
ical process. The family, for example, according to 
Hyman (1959), is peculiarly potent in the United 
States in determining party loyalties, while formal 
education, according to Almond and Verba (1963) 
is most vital in producing commitment to demo- 
cratic values. In studies of the transitional political 
systems of the underdeveloped countries, it has 
become apparent that the intensely politicized na- 
ture of these societies is often the result of the 
dominant role of partisan as against nonpartisan 
or constitutional agents of socialization. It is note- 
worthy that the trend toward one-party systems in 
sub-Sahara Africa is closely associated with the 
fact that nationalist parties were the only strong 
agency for socializing most of the newly politically 
conscious masses (Hanna 1964). When nonparti- 
san or politically neutral socializing agents are 


weak, social life tends to become highly politicized, 
and little appreciation is likely to exist for such 
fundamental constitutional institutions as an im- 
partial bureaucracy and the rule of law. Studies of 
the process of nation building in societies in which 
the mass media are weak and cannot provide an 
objective view of national events suggests that con- 
stitutional development cannot become readily in- 
stitutionalized under such conditions (see Confer- 
ence . . . 1963; Schramm 1964). This relationship 
between the socialization process and the ensuing 
political culture explains some basic difficulties in 
creating national institutions in countries where 
popular political consciousness was inspired by 
highly partisan and ideologically oriented inde- 
pendence movements. 

Elite and mass subcultures. In all societies 
there are inevitably some differences between the 
political orientations of those who have responsi- 
bility for decisions and those who are only ob- 
servers or participating citizens, A national political 
culture thus consists of both an elite subculture and 
a mass subculture, and the relationship between 
the two is another critical factor determining the 
performance of the political system. The relation- 
ship determines such crucial matters as the basis of 
legitimacy of government, the freedom and limita- 
tions of leadership, the limits of political mobiliza- 
tion, and the possibilities for orderly transfers of 
power. 

Mass subcultures are rarely homogeneous, for 
there are usually significant differences between 
the politically attentive strata of the society and 
the elements who are little concerned with politics. 
In some cases the mass political culture is highly 
heterogeneous and sharp differences exist accord- 
ing to region, social and economic class, or ethnic 
community. In such cases, the pattern of relation- 
ships among the various subcultures becomes a 
crucial factor in describing the mass political 
culture. 

In analyzing the extent to which the elite and 
mass subcultures contain complementary sets of 
values, it is useful to distinguish between those 
systems in which recruitment into the elite sub- 
culture is generally preceded by socialization into 
the mass subculture and those in which the chan- 
nels of socialization are completely separate. In 
most stable, modern democratic societies the gens 
eral pattern is for individuals to be socialized into 
the mass culture before being recruited to leading 
political roles, and thus the elite, in spite of ood 
ing highly specialized skills and political pels 
edge, can still appreciate the basic values of 


citizenry as a whole. It does not, of course, follow 
that in all cases people who rise out of the mass 
subculture will continue to be sympathetic or re- 
sponsive to their background; indeed, in tran- 
sitional societies leadership elements often have 
deep resentments against what they feel are the 
backward attitudes of those with whom they were 
once associated. 

In most traditional, and many transitional, sys- 
tems those destined for leadership positions tend 
to have quite different career lines, receive quite 
different forms of education, and have quite dif- 
ferent social experiences from the mass of their 
followers. Even in many transitional societies the 
very basis of legitimacy of the leaders rests on the 
popular belief that they are men inherently set 
apart from others at birth. 

A basic problem in the dynamics of political 
cultures relates to uneven changes in the socializa- 
tion patterns of the two subcultures. Serious diffi- 
culties for the political system can arise when 
rulers discover that the mass subculture is no 
longer responsive to traditional leadership patterns 
but that they themselves have little skill in more 
modern ways of ruling. Or the opposite problem 
can arise when the elite subculture has been 
changed significantly by new patterns of elite 
socialization but the mass culture remains largely 
unchanged. Under such conditions leaders may be 
impatient for change, and in displaying little un- 
derstanding and even outright scorn for the essen- 
tial qualities of the mass culture they may create 
resentment in the population, who may feel that 
their leaders have lost their sense of the proprieties 
of ruling. 


The content of political cultures 


The content of political cultures is in large meas- 
ure unique to each particular society. Studies of 
different political cultures therefore tend to empha- 
size different themes, and the ultimate test of the 
utility of a theory of political culture will depend 
upon its value for comparative and generalized 
analysis. Already there have been promising pio- 
neering advances in comparative analysis in which 
similar qualities of political cultures have been re- 
lated to a common type of political system. For 
example, Almond and Verba (1963) have identi- 
fied the “civic culture” which underlies democratic 
political systems. 

It would seem possible also to isolate some uni- 
versal dimensions of political cultures in terms of 
certain inherent qualities of both political systems 
and the processes of personality formation. Nathan 
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Leites (1951; 1953) has demonstrated the value 
of analyzing elite political behavior character- 
ologically. It seems likely that further research will 
reveal that political cultures tend to manifest de- 
finable syndromes that are related either to recog- 
nized patterns of personality development or to 
general patterns of historical development, or to 
both. At this stage of knowledge it is possible only 
to suggest certain universal problems or themes 
with which all political cultures must deal in one 
manner or another. 

Scope and function of politics. Every political 
culture must define for its society the generally 
accepted scope or limits of politics and the legiti- 
mate boundaries between the public and private 
spheres of life. Scope involves definition of the 
accepted participants in the political process, the 
range of permissible issues, and the recognized 
functions of both the political process as a whole 
and the separate agencies or domains of decision 
making which collectively constitute the political 
process. 

The scope of participants is in most systems 
formally defined by the requirements of citizen- 
ship, but in all systems there are usually also 
formal or informal limits relating to age, sex, 
social status, training, family connections, and the 
like which govern the recruitment process. 

Similarly, in most political cultures certain issues 
are recognized as being outside the domain of 
politics or the jurisdiction of particular parts or 
agencies of the political process. The relationship 
of issues and functions can be highly specialized 
in the sense that particular issues are recognized 
as being the special responsibility of special forms 
of decision making, such as electoral, parlia- 
mentary, bureaucratic, juridical, or technocratic 
expertise. 

In democratic political cultures there is usually 
a clear sense of the appropriate boundaries of polit- 
ical life, explicit recognition of new issues as they 
arise, and respect to some degree for functional 
specialization in the handling of issues and for the 
relative autonomy of the different domains of polit- 
ical decision making. In totalitarian cultures there 
are few established boundaries of the political 
sphere of activity, explicit knowledge that all issues 
can become political, and some respect for func- 
tional specializations but little for the autonomy of 
the different domains. In transitional systems there 
are usually no clearly accepted boundaries of polit- 
ical life, but the impotence of politics provides 
actual limits: there is an expectation that all mat- 
ters can become politicized, and there is little func- 
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tional specialization or autonomy in the various 
domains of political decision making. [See DECISION 
MAKING, article on POLITICAL ASPECTS,] 

Concepts of power and authority. Political cul- 
tures, in providing concepts about the nature and 
properties of power and authority, may differ ac- 
cording to (1) the basis for differentiating power 
and authority; (2) the modes by which the one 
may be translated into the other; (3) the assumed 
limits of the efficacy of power; (4) the elements 
or components of legitimate power, e.g., physical 
force, popular support, moral justification, legal 
sanctions, etc.; and (5) the degree of diffusion of 
centralization of power and authority. [See Au- 
THORITY; POWER.] 

The process of legitimizing power has a critical 
bearing on the performance of a political system 
[see Lecirimacy]. Usually legitimization involves 
restraining the uses of potential power and placing 
limits upon the range of actions of particular insti- 
tutions and power holders. This has been particu- 
larly true in Western political cultures and in the 
development of American constitutional theory in 
relation to the division of powers. These restraints 
of legitimacy sometimes take an absolutist form, 
with the result that no single institution or political 
actor can perform decisively and with full effi- 
ciency. In a few political cultures the process of 
legitimizing power proceeds in the opposite direc- 
tion, so that legitimacy is conferred only upon those 
who can and do act decisively and effectively. This 
is particularly true in countries which have experi- 
enced a period of national humiliation as a result 
of weakness in international affairs. For example, 
the very effectiveness of the Chinese communists 
has been one of the most critical factors in giving 
the Peking government a sense of legitimacy în the 
eyes of its subjects. In democratic political cultures 
there are often ambiguous feelings about the need 
to restrain all power and the need for legitimate 
power to be effective. In transitional societies it is 
often difficult for any forms of power to become 
legitimized because all seem to have so much diffi- 
culty in being effective. 

In all political cultures, concepts about power 
and authority have deep psychological dimensions 
because of the fundamental role of parental au- 
thority in the early socialization process. The skills 
that children develop in coping with family author- 
ity tend to provide a lasting basis for adult styles 
in dealing with authority. Thus, in some cultures 
it is widely assumed that authority can best be 
constrained by stressing issues of justice and fair- 
ness in a spirit of friendly informality, while in 


others the style is that of winning favor by display- 
ing complete and abject submission. 

Political integration. In varying ways and in 
differing degrees, political cultures provide people 
with a sense of national identity and a feeling of 
belonging to particular political systems. Basic to 
the problems of the integration of the political sys- 
tem is that of establishing a sense of national iden- 
tity, and the problem of national identity is in turn 
a function of the process by which individuals 
realize their own separate senses of identity. This 
basic relationship between national identity and 
personal identity provides a fundamental link 
between the socialization process and the inte- 
gration of the political process [see IDENTIFICATION, 
POLITICAL], 

Integration also involves the relationships of the 
various structures involved in the political process, 
and hence is related to the problems of specializa- 
tion of function among decision-making groups 
discussed above, 

A third aspect of integration concerns the man- 
ner in which various subcommunities, ethnic or 
regional groups, or subcultures are related to each 
other. Political cultures differ according to the ex- 
tent to which they permit such minorities to pre- 
serve their separate identities while meeting the 
expected standards of integration. [See INTEGRA- 
TION.] 

Status of politics and politicians. In traditional 
societies, religion, war, and government provided 
the elite, and the art of ruling was seen as having 
a sacred origin. Leadership carried high visibility, 
and those who shared in decision making could 
claim glory and greatness. Modern political cul- 
tures, reflecting an increased division of labor and 
the rise of secular considerations, tend to accept 
politics as only one of many professions and to 
debase the role of politician, even while still ex- 
tolling the supreme importance of state and nation. 

A political culture must establish the generally 
acceptable rewards and penalties for active politi- 
cal participation. In traditional societies the high 
status of leaders also meant that those with power 
could legitimately expect high material rewards. 
With the emergence of other professions and the 
contraction of the political sphere, the material 
rewards of those who enter public life decreased, 
and they were increasingly expected to make per- 
sonal sacrifices for performing public services. The 
political culture, in controlling the accepted balance 
between rewards and penalties for those entering 
public life, also tends to control the quality of ee 
ple recruited. In democratic political cultures t 


desire to shackle power produces the requirement 
that those who seek power should have no self- 
interests but only serve the interest of others; and 
the suspicion that this is not always the case lowers 
popular esteem for politicians as a class. Political 
cultures, in creating the distinctions between states- 
men and politicians, provide another basis for 
rewarding and controlling those who seek power 
[see POLITICAL RECRUITMENT AND CAREERS]. 

Evaluating performance. All political cultures 
contain standards for evaluating the effectiveness 
and competence of those performing specialized 
roles in the political system. Such standards gen- 
erally depend upon popular views as to how national 
and community-wide problems should best be 
solved. In traditional cultures, problem solving was 
usually associated with the correct performance of 
rituals, and hence evaluation of performance was 
strongly influenced by skills displayed in cere- 
monies, Although modern political cultures recog- 
nize the central place of rationality in problem 
solving, there tend to be great differences among 
cultures in what is accepted as being rational. Judg- 
ment about skill in leadership is also influenced by 
the extent to which a society values the personal 
magnetism of leadership or the abilities of tech- 
nical specialists and experts. Changes in the eval- 
uative dimension of political cultures occur as new 
skills and professions are recognized as being rel- 
evant for solving national problems. 

The evaluative aspect of political cultures must 
also reflect the inescapable fact that politics deals 
with future contingencies which lie beyond the 
range of ready prediction. Each political culture 
must provide some basis of faith in the forecasting 
powers of acceptable leaders. Traditionally, this 
faith was usually placed in the mystical and charis- 
matic powers of personal leadership. In other cul- 
tures either divine or secularly inspired doctrines 
are presumed to be endowed with all necessary 
predictive powers. In still other cultures the very 
massiveness and essentially esoteric operation of 
bureaucracies and the complex machineries of gov- 
ernment are enough to generate a popular faith 
that those in power have a grasp of the future. The 
ultimate test of leadership in all cases is skill in 
maintaining popular faith in the leader's capacity 
to deal with all possible contingencies. [See LEAD- 
ERSHIP, article on POLITICAL ASPECTS.] 

The affective dimension of politics. Possibly no 
other social activity touches upon such a wide range 
of emotions as politics, and every political culture 
seeks to regulate the expression of acceptable pub- 
lic passions and to deny legitimacy to others. Above 
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all, since politics invariably involves struggles over 
power, personal aggression is a basic emotion that 
all political cultures must deal with by making 
some forms of aggression legitimate and by defin- 
ing areas and times in which its expression is 
permissible. 

This function of political cultures is related to 
but goes beyond the need of providing integration 
to the system and a spirit of collective identity. 
It involves the degree to which the inherent drama 
of power and decision is either accentuated or 
muted. Essentially, the affective dimension of the 
political culture is determined by the ways in which 
people are legitimately permitted to realize psychic 
satisfaction from active participation in politics. 

The coherence and stability of political cultures 
are constantly threatened by the fact that people 
may turn to political action for intensely private 
and psychologically personal reasons, and thus seek 
satisfactions which are completely unrelated to the 
social or collective functions of politics. Such peo- 
ple may have little interest in the public goals or 
purposes of the movements they support, since 
their satisfaction comes mainly from the spirit of 
involvement and the drama of participation. Harold 
Lasswell first pointed out this phenomenon (1930; 
1948), which was also observed by Almond in com- 
munist movements (1954). 

Balance between cooperation and competition. 
Politics rests upon collective actions which in turn 
depend upon a basic spirit of trust and a capacity 
for cooperation. At the same time politics involves 
conflict and competition, Cultures must therefore 
strike an acceptable balance between cooperation 
and competition, and the capacity of political cul- 
tures to manage this problem usually depends on 
how the basic socialization process handles the 
problems of mutual trust and distrust in person- 
ality development. [See CONFLICT; CooPERATION.] 

A necessary prerequisite for the building of com- 
plex human organizations is a strong sense of 
human trust. Where the basic culture instills in 
people a profound sense of distrust and suspicion, 
collective action becomes difficult, and competition 
tends to get out of hand and become profoundly 
disruptive. On the other hand, general cultures 
which do emphasize the building of personal trust 
may have to be balanced by political cultures which 
emphasize the need for suspicion in the manage- 
ment of public institutions. For example, it has 
been suggested that in the United States the basic 
socialization process does emphasize to a pecul- 
iarly high degree basic trust in human relations, 
but that American political culture stresses the 
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need to distrust institutions, to check their powers, 
and to demand strict accountability of all public 
officials, In many transitional societies we find the 
opposite pattern, in that the socialization process 
instills deep distrust of human relations while at 
the same time people are asked to have complete 
and uncritical faith in their public institutions. 
The pattern has been observed in India (Carstairs 
1957), Ceylon (Wriggins 1960), Burma (see Con- 
ference . . . 1963), and Italy (Banfield 1958 ). 


The future of a theory of political culture 


As the foregoing discussion shows, there is an 
increasing body of propositions that seeks to relate 
aggregate and individual behavior in different polit- 
ical systems, so that it is now possible to talk of 
the growth of a theory of political culture. How- 
ever, it is also appropriate to note several criticisms 
of this theory which reflect its current early stage 
of development. 

It has been suggested that the concept repre- 
sents little more than a new label for old ideas. To 
a degree, this is a valid observation but one that 
ignores the central purpose of the theory, which is 
to search for a new way of connecting psychologi- 
cal theory to the performance of the total political 
system. 

At present the mere term “political culture” is 
capable of evoking quick intuitive understanding, 
so that people often feel that without further and 
explicit definition they can appreciate its meanings 
and freely use it. The very ease with which the 
term can be used, however, means that there is 
considerable danger that it will be employed as a 
“missing link” to fill in anything that cannot be 
explained in political analysis. 

This danger of tautology is particularly great in 
precisely the area which is now the most important 
for the future development of the theory—the rela- 
tionship between political culture and political 
structures or institutions. If the concept of political 
culture is to be effectively utilized, it needs to be 
supplemented with structural analysis, but the dif- 
ficulty is that political structures can be seen on 
the one hand as products reflecting the political 
culture, while on the other hand they are also im- 
portant “givens” which shape the political culture. 

These are problems which must be surmounted 
if the theory of political culture is to realize its 
early promise. The Prospect is excellent that cur- 
rent research is going to set aside most of these 
objections and greatly advance the utility of a polit- 
ical culture theory. Recent systematic comparative 
research, based on survey methods, promises to 
clarify further the relationship between the politi- 


cal socialization processes and numerous dimen- 
sions of the political culture. The work of Gabriel A. 
Almond and Sidney Verba in identifying the com- 
ponents of the democratic political culture has 
already stimulated new attempts to evaluate the 
factors affecting democratic development through- 
out the world. In the 1960s Verba was directing a 
study applying some of the hypotheses of The 
Civic Culture to India, Japan, Nigeria, and Mexico. 
The basic concepts of The Civic Culture have been 
utilized by Ward in analyzing Japanese develop- 
ments, by Scott for Mexico, by Rose for England, 
and by Barghoorn for the Soviet Union (see 
pp. 83-129; 330-395; 450-511 in Pye & Verba 
1965). Other research on the political and psycho- 
logical inhibitions to economic growth is suggest- 
ing further critical dimensions to the modern polit- 
ical culture, whether democratic or not (McClelland 
1961; Hagen 1962). 

Even in its current state, the theory of political 
culture represents a significant advance in the di- 
rection of integrating psychology and sociology with 
political science to produce a richer and fuller 


understanding of politics. 
Lucian W. PYE 


[Directly related are the entries CULTURE; GOVERN- 
MENT; SOCIALIZATION, especially the article on Po- 
LITICAL SOCIALIZATION. Other relevant material may 
be found in MODERNIZATION; NATIONAL CHARACTER; 
POLITICAL ANTHROPOLOGY; POLITICAL SOCIOLOGY; 
SOCIETAL ANALYSIS. ] 
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POLITICAL EFFICACY 


The concept of political efficacy is used by stu- 
dents of political behavior to identify a citizen’s 
feelings about the effects of his action on political 
events. It refers to the person's belief that political 
and social change can be effected or retarded and 
that his efforts, alone or in concert with others, 
can produce desired behavior on the part of politi- 
cal authorities. Efficacy has its origins in social 
psychology and is closely related to “ego strength,” 
“subjective competence,” “self-confidence,” and 
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“personal effectiveness.” The concept has particular 
relevance for assessing behavior in democratic sys- 
tems, where a premium is placed on citizen partici- 
pation and where there are accessible channels for 
expressing political needs. 

The efficacious person views his political self 
with respect. He feels powerful, competent, and 
important. He holds a corollary set of expectations 
with respect to political officials; they are con- 
cerned about his vote and heed his demands. These 
self-evaluations and orientations toward political 
authorities are related to a generalized ‘set of atti- 
tudes about the political system—for example, that 
elections matter or that leadership circles can be 
influenced and even penetrated. 

Efficacy is not the same as a sense of civic obli- 
gation. The latter can motivate political activity 
whether or not the citizen feels that his action 
matters. Involvement, interest, and concern also 
tap dimensions different from efficacy. They are 
likely to be specialized or temporary, whereas effi- 
cacy involves a generalized orientation, toward the 
self and toward political objects, which remains 
more or less stable over time. 

Efficacy refers to the individual's perceptions of 
his effectiveness, not his actual influence, It is 
possible for a citizen to make a mistake in evaluat- 
ing his political importance by either underesti- 
mating or overestimating the extent to which offi- 
cials are sensitive to his demands. However, while 
evaluations of influence may not mirror reality, 
they are probably not unrelated to objective politi- 
cal conditions. It is quite likely that feelings of 
efficaciousness are nurtured and reinforced in a 
context in which one witnesses the translation of 
one’s wishes into realities. This, in large part, €x- 
plains why the concept has been used in studying 
democratic systems. The citizen's feelings of effi- 
cacy are conditioned by the availability of institu- 
tionalized channels for expressing demands, as well 
as by the creed requiring leaders to be responsible 
and responsive to nonleaders. 

Studies focusing on efficacy at the citizen level 
raised three questions: (1) What are the ante- 
cedent conditions accounting for distributions of 
efficacy? (2) What do efficacy data explain about 
other political attitudes and behavior? (3) What 
are the consequences of efficacy distributions for 
the functioning of the general political system? 

Antecedents of efficacy. In studies of American 
voting behavior, the Survey Research Center at the 
University of Michigan has contributed the most 
systematic evidence tracing the antecedents of 
efficacy (Campbell et al. 1954; Michigan, Univer- 
sity of 1960). A person who feels he can cope 
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with the complexities of politics and believes that 
his participation carries weight in the political 
process is generally better educated, has a higher 
socioeconomic standing, and is more likely to be 
a member of a majority ethnic and religious group 
than the less efficacious citizen. These antecedent 
social conditions are to be expected; dominance in 
one aspect of social life produces a sense of control 
and effectiveness, which can be generalized to the 
political sphere. 

Cross-cultural measurings of efficacy have pro- 
duced support for the findings generated by Ameri- 
can data. In five democratic nations, educational 
attainment, highly correlated with socioeconomic 
status, substantially predicted efficacy rates. Differ- 
ent levels of political confidence are more depend- 
ent on educational differences than on cultural 
variations between societies, In addition to provid- 
ing the citizen with the necessary skills and tools 
for exercising political prerogatives, education evi- 
dently contributes to a general sense of security 
conducive to interacting with forces beyond the 
comfortable sphere of the familiar. In fact, there is 
support for the proposition that advanced education 
is the single most important factor contributing to 
high efficacy (assuming an institutional structure 
that permits citizen participation). The citizen 
whose family and school experiences have encour- 
aged expression of preferences is likely to feel effi- 
cacious in his adult political life. Irrespective of 
autocratic or democratic socialization experiences, 
the well-educated person feels more politically com- 
petent than the less-educated individual (Almond 
& Verba 1963, pp. 204-209), 

Psychological antecedents to political efficacy 
have been less clearly established. Current theoriz- 
ing suggests that the capacity to assert oneself in 
politics is related to ego strength—the ability to 
adjust to external events and to control disruptive 
impulses (Lane 1959, p. 147), It is further argued 
that psychological structures which are compatible 
with high evaluations of the self and are conducive 
to feelings of effectiveness in the political world 
are suitable to activity in democratic systems. On 
the other hand, power-driven or authoritarian indi- 
viduals are not expected to find satisfactions in 
political systems that place a premium on bargain- 
ing and compromise (Lasswell 1954). 

In the identification of antecedent conditions of 
efficacy little attention has been given to structural 
or institutional factors. In part this is due to the 
lack of comparable data from different institutional 
settings. Persons do tend to feel more efficacious 
with respect to personnel and policies of their 
local government than to those of their national 


government (Almond & Verba 1963). Since local 
government is more immediate, accessible, and 
familiar to the citizen, it might be inferred that 
institutional availability plays some part in retard- 
ing or promoting efficaciousness. 

Explanatory uses. The use of efficacy informa- 
tion as an independent variable for analyzing po- 
litical behavior has been particularly fruitful in 
explaining rates of political participation and in 
assessing the intensity of involvement in electoral 
politics. Measures of political efficacy predict not 
only voter turnout (the stronger the feeling of 
efficacy, the more likely the person is to vote) but 
also more demanding forms of activity in demo- 
cratic politics—letter writing, discussing issues, 
contributing money, and running for office. It is of 
course true that participation and efficacy are 
mutually supportive, The more one participates, 
the more likely one is to feel confident and vice 
versa. Efforts to isolate either efficacy or participa- 
tion as the crucial antecedent variable would pro- 
duce an inadequate theory of political involvement. 

Although well-educated persons tend to score 
higher on efficacy measures than do the less- 
educated, there is a residue among the college- 
educated who take a cynical stand toward the 
effectiveness of their vote. Nevertheless, they vote. 
This suggests that education leads to political par- 
ticipation even when the feeling of efficacy normally 
associated with educational status is absent. 

Voters may be classified on a scale of “political 
relatedness” (Eulau & Schneider 1956), a concept 
combining efficacy with citizen duty. By examining 
the intensity as well as the rate of political activity, 
this measure helps to explain several dimensions 
of involvement in electoral politics. For example, 
citizens more “highly related” to the political proc- 
ess (i.e., those who score high on the scale of po- 
litical relatedness ) are more sensitive to differences 
between the parties, more issue oriented, and 
more concerned about the election outcome, have 
stronger partisan identifications, have greater in- 
terest in, and knowledge about, candidates and 
campaigns, and expose themselves to the media 
more often than persons less related. 

Consequences for the political system. 
ing the quality of citizen involvement in the polit- 
ical culture, observers stress that the more an indi- 
vidual considers himself capable of influencing 
political decisions, the higher is his satisfaction 
with the general political system and the E 
likely he is to evaluate positively the performance 
of the authorities. This close relationship between 
efficacy and general feelings of satisfaction has im: 
portant implications for problems of consensus an 


In assess- 


support in democratic nations. There is evidence 
that a sizable gap exists between citizens’ percep- 
tions of their capacity to influence decisions and 
their actual exercise of prerogatives (Almond & 
Verba 1963, pp. 473-505). This gap may be 
healthy for democratic systems. While the demo- 
cratic creed prescribes that leaders be responsive 
to nonleaders, the practical needs of government 
require that officials be relatively free of hamper- 
ing restrictions when initiating and carrying out 
decisions. Stability is viewed as a workable balance 
between governmental responsiveness and govern- 
mental power. One important factor contributing 
to this balance is an appropriate set of political 
attitudes held by the citizenry. A wide distribution 
of efficaciousness implies that citizens feel they 
have a reserve of influence, whether they exert any 
influence or not, The citizen is satisfied and sup- 
ports governmental authorities. Within broad limits 
the reserve of potential influence operates to check 
the leadership. However, since political activity is 
low on the scale of salient activities for a large 
proportion of the population, the leadership re- 
mains unhampered by constant citizen attentive- 
ness. The gap between felt influence and used 
influence contributes to a political culture consist- 
ent with both democratic norms and the realities 
of governing. 

The distribution of efficacy may also affect the 
quality of an election as a process of consent ( Jano- 
witz & Marvick 1956). A person lacking political 
self-confidence is less likely than others to con- 
form to the politics of his socioeconomic group. 
There is also some evidence that the authoritarian 
syndrome is related to low efficacy. Quite possibly, 
an individuals sense of impotence or confusion 
about his political role predisposes him to seek less 
rational and more absolutistic answers to political 
problems, Moreover, the citizen lacking political 
confidence is more subject to manipulation than 
the efficacious voter, The inference is that political 
participation unaccompanied by political confi- 
dence may turn a democratic election into a process 
of manipulation instead of a process of consent. 
Efforts to relate findings about the correlates of 
efficacy to evaluations of the election function in 
democracies are just beginning, but it is already 
clear that such attempts are important steps in 
closing the theoretical gap between data on indi- 
vidual attitudes and explanations of the political 
system. 

Specialized uses of political efficacy. The concept 
of efficacy has proven useful in examining certain 
orientations of political officials, For purposes of 
analyzing legislative behavior the concept is rede- 
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fined to mean a legislator’s sense of effectiveness in 
his political roles (Wahlke et al. 1962), The focus is 
less on the feeling of self-confidence, its anteced- 
ents, and implications than on the question of 
whether the legislator feels he has the abilities, 
skills, and knowledge to meet the demands of his 
office. The legislator confident of his political skills 
is more innovative and more often mediates be- 
tween conflicting interests in the political arena 
than his less efficacious colleague. 

Efforts to trace the antecedents of legislative 
efficacy suggest that membership in the majority 
party produces a greater sense of confidence toward 
one’s political role. Information about careers of 
elected officials indicates that greater efficacy 
characterizes the incumbent who has been nur- 
tured in politics since youth, who was self-recruited 
into active political life, and who has political 
ambitions, 

Though studies to date have closed certain em- 
pirical gaps in knowledge of antecedent conditions 
of efficacy, major theoretical problems remain un- 
solved, The concentration on democratic systems 
has hampered an understanding of how structures 
and ideologies affect efficacy patterns. Little atten- 
tion has been given to the relationship between 
feelings of competence and the actual availa- 
bility of channels for expressing political feelings. 
Until comparisons between different institutional- 
ideological contexts are made, any explanations of 
efficacy distributions will remain parochial. Other 
unanswered theoretical questions concern the con- 
sequences of efficacy patterns for the political sys- 
tem. For example, in what manner do particular 
distributions of efficacy restrain political leader- 
ship, affect policy outputs, or cause change in in- 
stitutional structures? Important theoretical po- 
tentials of the efficacy concept remain unexploited. 
It may well be that the understanding of such 
phenomena as stability and change in political 
systems will turn in part on careful attention to the 
degree of confidence a population feels about its 


political worth. 
KENNETH PREWITT 
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POLITICAL EXECUTIVE 


The political executive consists of those institu- 
tions formally responsible for governing a political 
community—that is, for applying its binding de- 
cisions, which may be formulated, to a greater or 
lesser extent, by the executive institutions them- 
selves. The structure, function, and character of 
the executive have varied widely over time, and no 
single conceptual framework can disclose all these 
variations and their consequences. Yet, certain 
fundamentals are clear, and on these we can con- 
centrate our attention. The two most prevalent 
structural forms of the executive are the presiden- 
tial and the cabinet systems. The source of execu- 
tive power has shifted from hereditary right, co- 
optation, and the use of force to election, either 
direct or indirect. The legitimacy of executive power 
has become secularized; “tradition,” divine origin, 
and other forms of appeal to occult forces have 
given way to rational criteria of achievement and 
popular approbation. The principal functions of the 
contemporary political executive are increasingly 
being carried out by specialized structures; these 
functions are representation and integration, lead- 
ership, deliberation and decision making, control 
and supervision of subordinate decision-making 
and enforcement organs. Finally, executive respon- 
sibility and accountability have become institution- 
alized through the acceptance and use of regular 
procedures. 

The above generalizations may be valid for ma- 
ture and developed political systems, but they 
hardly apply to the new nations, which are in a 
period of transition. Force continues to play an 
important role in their political life, and the legiti- 
mization of mechanisms of nomination, selection, 
and decision making has yet to develop. Lack of a 
sense of community, a low level of industrial de- 
velopment and modernization, and a confused 
party configuration continue to characterize the 


new nations: charisma and personal relations are 
more important than institutions and legitimized 
procedures; governmental roles and functions and 
the corresponding structures, including the exec- 
utive, are undifferentiated; factions often play the 
game of politics without accepting general rules, 
and executive leadership is unstable because of the 
prevalence of factional strife. There is a gap be- 
tween form and substance in the new nations, so 
that while most of them have copied the formal 
and written constitutional arrangements of devel- 
oped societies, particularly the organization of the 
executive, they have failed thus far to endow these 
arrangements with corresponding political roles 
and functions. With rare exceptions, the executive 
is the political leader who has attained power and 
the executive structures are only a temporary and 
fluid machinery that serves his rule. It is only in 
a few of the developing societies—notably in India, 
Ceylon, and in some of the stable democracies of 
Latin America—that executive structures and cor- 
responding roles are beginning to be clearly 
differentiated. 


The office 


The executive office consists of a number of 
elected and appointed officials responsible for the 
over-all performances of the functions associated 
with it. The number of officials is generally small 
—twenty or, at most, thirty. If we include top civil 
service personnel, heads of planning agencies and 
nationalized industries, top scientists, and defense 
officials who participate in deliberation and deci- 
sion making, the number rarely goes beyond a 
hundred. While the executive is a collective entity, 
ultimate responsibility for decision making is some- 
times lodged in the hands of one man. This i 
notably the case in presidential systems and in 
many of the cabinet systems in which a well- 
disciplined party controls a majority in the legis- 
lature. In totalitarian one-party systems the leader 
of the party is in law or in fact—and often in both 
—in charge of the executive. On the other hand, 
in a number of multiparty parliamentary demoni 
racies, notably in the Scandinavian countries, the 
coalition cabinets account for a genuine collec) 
of decision making and responsibility. Thus, viewec 
from the point of decision making and repona 
bility and depending upon the formal constitutiona 
arrangements, the prevalent norms in the penne 
and such adventitious factors as personality an 
circumstance, there is a continuum between gen- 
uine one-man rule and genuine collegiality. The 
structures are generally flexible enough to fon 
for movement from one form to the other, Wi 


the same political system. This, incidentally, is 
true both for cabinet systems and for presidential 
systems. 

Under United States and French influence, the 
presidential system has been adopted in various 
forms and with varying degrees of effectiveness in 
Latin America and French-speaking Africa. The 
cabinet system, which originated in England, has 
been adopted in most of the countries of the Euro- 
pean continent, in all the English-speaking domin- 
ions, in many of the former British colonies, and 
in many of the totalitarian systems—notably in the 
Soviet Union and eastern Europe. The differences 
between the two systems are formal rather than 
substantive. In the presidential system, the presi- 
dent is elected by the people. He holds the highest 
executive office over a given period of time, during 
which he is not politically accountable to the legis- 
lature. All top executive officials are nominated by 
him and can be removed by him. In the cabinet 
system, on the other hand, the prime minister and 
his cabinet are responsible to the legislature. They 
are formally invested with executive power by a 
vote of the legislature, and their term can be sus- 
pended at any time by an adverse vote. 

Despite these formal differences, universal suf- 
frage, the growth of national parties, and the pro- 
gressive adoption of a majority electoral system 
account for striking similarities. In presidential or 
cabinet systems, the immediate source of executive 
power is the election and the party. In the cabinet 
system, the leader of the majority party becomes 
prime minister. If the party is disciplined, it is 
unlikely that it will overthrow the prime minister. 
Thus, while the prime minister is technically re- 
sponsible to the legislature, he is just as immune 
to it as the president. In the one-party totalitarian 
systems that have adopted the cabinet system— 
and we use the Soviet Union as the prototype—it 
is the party that sustains executive leadership. As 
long as the leader controls the party (as Stalin did) 
or is accepted by the higher party echelons (the 
Central Committee and the Politburo), he also 
controls the legislature and is technically immune 
to legislative scrutiny. If there are dissensions 
within the party or if the leader loses his support 
in the Politburo and the Central Committee, he can 
no longer hold his position. 

Many trends since World War 1 account for the 
reinforcement of the political executive. Among 
them, the most significant ones are the widespread 
adoption of the majority electoral system and con- 
stitutional reforms establishing the ascendancy of 
the executive over the legislature. The two most 
notable cases are West Germany and France. In 
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West Germany proportional representation was 
greatly modified by the requirement that half of 
the members of the legislature be elected by major- 
ity vote. Executive stability and independence were 
strengthened by the provision that the chancellor 
cannot be removed from office by a vote of the 
legislature unless it is accompanied by a vote in 
favor of a successor. 

In France proportional representation in legis- 
lative elections was abandoned in 1958. In the pres- 
idential election, if none of the candidates receives 
an absolute majority on the first ballot, the contest 
is narrowed to only two candidates on the second. 
As for executive-—legislative relations, parliament 
was “rationalized”; stringent restrictions were placed 
on the vote of censure by the national assembly, 
the executive was given control over legislative 
business and legislation; the committee system was 
simplified and the powers of the committees dras- 
tically reduced; and the president was given the 
right to dissolve the National Assembly and call 
for an election. The trend has been the same, sig- 
nificantly, in Great Britain, where the control of 
the executive over Parliament is made effective 
through the control of the majority party, and in 
the United States, where legislative initiative has 
passed into the hands of the president and party 
discipline in Congress has been tightened. 

From an organizational viewpoint, all executives 
display striking similarities. The prime minister or 
president is surrounded by concentric circles of 
advisers and staff and line agencies: the first is 
the immediate circle of personal advisers and liaison 
agents; the second consists of specialized coordi- 
nating agencies with functions that cut across de- 
partmental or ministerial responsibilities (economic 
planning, national security, atomic energy, space 
programs, administrative reorganization, etc.); the 
third is the cabinet, consisting of top officials re- 
sponsible for policy making and administration of 
functionally defined governmental activities (for- 
eign affairs, trade, labor, welfare, defense, etc. ); 
a fourth circle consists of an increasing number 
of independent or semi-independent agencies with 
regulatory and supervisory responsibilities, some 
of which operate or control economic services. To 
gain a comprehensive view of the political execu- 
tive, however, we must further expand our circles 
to include other forces in the political system. 
There are the political parties and the party leaders, 
who must be persuaded or placated; the legislature, 
whose compliance to policy measures is indispen- 
sable; interest groups that must be taken into 
account; the press and the public at large, from 
which support is ultimately to be derived; and, 
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finally, other nations, various international organi- 
zations and the international community at large. 

In the new nations the lack of stable structures 
sharply contrasts with the leadership and decision- 
making tasks imposed upon the executive. Difficul- 
ties in creating a viable political community, pro- 
viding unity where there was factionalism and 
tribalism, are augmented by the immediate and 
urgent needs of a rapid social and economic devel- 
opment through planning. This may also be one 
reason for the rapid turnover of executives and, 
in general, for executive instability. The loads upon 
executive management decision making and en- 
forcement have been too heavy, and the correspond- 
ing supports, in the form of party systems or 
interest-group organizations, too weak. There is 
also a contradiction between the roles associated 
with political unification and community building, 
on the one hand, and those related to policy making 
and enforcement, on the other. The first tasks re- 
quire personal leadership, the appeal to national 
symbols, and ideology, while the second call for 
rational skills, organizational talents, and a prag- 
matic assessment of goals and available means. 


Functions of the political executive 


The political system can be seen as a mechanism 
through which interests and demands are trans- 
lated into decisions. The decision-making mech- 
anism must be widely accepted by the community 
—that is, it must have legitimacy. Since the politi- 
cal executive plays an important role in transform- 
ing interests and demands into decisions, it has, 
first and foremost, output functions. However, 
since it also represents and accommodates major 
social and interest groupings, it plays an integra- 
tive role as well. In addition to being the central 
policy-making organ, it also supervises and con- 
trols all the subordinate deliberative and enforce- 
ment organs. [See SYSTEMS ANALYSIS, article on 
POLITICAL SYSTEMS. ] 

Integration and representation. The integrative 
and representative function of the political execu- 
tive is both ceremonial and efficient. The executive 
embodies the political community and represents 
it internally and externally. It provides effective 
links between the members of the community and 
the state. In some political systems the ceremonial 
and symbolic role is played by a Politically irre- 
sponsible head—the monarch or the president of 
the republic. In other instances, mostly in presi- 
dential systems, the ceremonial and efficient func- 
tions are combined in the same office and man. 
It is, of course, the efficient functions of integra- 
tion on which depends the durability of the effec- 


tive ties that link the community with its ceremonial 
head. By integrative functions, we mean represent- 
ing demands and interests and making decisions 
accordingly. This is the ultimate test of perform- 
ance. It links the executive with the party, the leg- 
islature, and the public. It is the “hyphen which 
joins, a buckle which fastens” the community to 
the executive. To play this integrative role, the 
executive must provide for leadership, decision 
making, and enforcement. 

Leadership. The crux of executive power is 
leadership. Yet, leadership is a phenomenon not 
well understood. It varies from one system to an- 
other and from one period to another. A leader 
must have the ability to organize, deliberate, decide, 
and execute and the capacity to arouse trust and 
affection and to gain support. Charisma—a per- 
sonal magic of leadership arousing special popular 
loyalty or enthusiasm—is an indispensable ingredi- 
ent of leadership [see CHARISMA]. This aspect of 
leadership is closely related to the ability to take a 
determined stand and to express a belief without 
equivocation. As John Stuart Mill remarked, “a 
great part of political power consists in will... . 
One person with a belief is a social power equal to 
ninety-nine who have interests.” If we combine the 
rational and charismatic traits, leadership must: 
(a) provide for representation and identification 
within the community; (b) foresee common prob- 
lems and suggest policy solutions; (c) provide, at 
the executive level, a strong following of loyal offi- 
cials; and (d) receive popular support. Thus, the 
political executive has significant input functions 
to perform. By injecting new demands and expecta- 
tions or by proposing solutions, it can manufacture 
supports. While it acts within a given structure of 
political forces, it can rearrange these forces and 
create new reference groups to gain support. [See 
LEADERSHIP, article on POLITICAL ASPECTS.] s 

Decision making and deliberation. Political 
theory has long distinguished beween legislative 
and executive acts. This formal distinction is no 
longer adequate. First, the political executive can 
perform the leadership functions traditionally as- 
signed to the legislature. Second, it possesses inde- 
pendent powers—for instance, in foreign policy 
and defense. Third, the practice of delegated legis- 
lation has given vast, albeit subordinate, legislative 
powers to the executive. Finally, the law e 
by the legislature is initiated, prepared, and drafte 
by the political executive—policy initiation has 
become an executive prerogative. The legislative 
process tends to be increasingly one-sided. In the 
totalitarian systems that have adopted cabinet g0V- 
ernment, legislative scrutiny is of no significance, 


and this is virtually the case in all cabinet systems 
that have a strong, disciplined party system. Only 
in the United States does the legislature continue 
to have genuinely independent power of legislative 
scrutiny and initiative. Even so, Congressional 
initiative tends to be limited to scrutiny and amend- 
ment of proposals already submitted by the execu- 
tive. [See LEGISLATION, article on LEGISLATURES.] 

In order to initiate, decide, and act, the executive 
must deliberate. As a result, intelligence, fact find- 
ing, liaison, and staff agencies have mushroomed. 
This has particularly been the case with new gov- 
ernmental activities that do not fit the traditional 
organization of the executive into departments and 
ministries: economic planning and supervision; 
coordinating the preparation of budgetary policy; 
providing administrative reform to create new 
structures that can cope with new functions; and 
considering national security matters from a vari- 
ety of governmental points of view. Thus, new lay- 
ers of agencies and offices have developed to 
comprise what may be called the office of the 
president or the office of the prime minister. 

The trend is universal, but it is especially clear 
in developed societies, in which industrialization 
has created the imperative of regulation and inter- 
national conflict has emphasized coordination and 
preparedness for quick action. It also reflects a 
concomitant trend in favor of developing new pro- 
cedures to provide for deliberation prior to a deci- 
sion, Given the sheer bulk of matters that call for 
decisions and the need of specific knowledge and 
information in order to make them, decision mak- 
ing becomes a matter of following deliberative and 
consultative processes. Increasingly, it becomes a 
matter of process rather than substance. In turn, 
the basis of the authority of the executive shifts 
from the personal to the procedural. The appeal of 
charisma becomes limited, and tradition is no 
longer invoked. The basis of executive authority 
begins to coincide with its ability to act on the 
basis of rationalized procedures. It is only in the 
new, underdeveloped nations that the charismatic 
leader has played, and may continue to play, an 
important role. Crisis situations, however, often 
account for the emergence of personal government 
in developed societies. The emerging leader either 
bypasses the existing procedures or sets them aside 
in an effort to establish new ones. In all cases the 
effective invocation of personal government and 
the successful appeal to new political institutions 
are closely related to the legitimacy of the political 
system. [See CRISIS GOVERNMENT. ] 

Supervision and enforcement. While the classic 
distinction between deliberation (legislature) and 
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execution (executive) is no longer tenable, it is 
still true that the political executive is the agency 
of execution in the narrow sense. It supervises and 
controls all subordinate organs. However, execu- 
tion, properly speaking, is within the purview of 
the bureaucracy. Executive decisions are general 
and comprehensive in scope, and their detailed 
implementation is in the hands of the civil service 
and the various subordinate agencies. The political 
executive, however, remains responsible for the 
organization and reorganization of the machinery 
of government. It can create and reorganize de- 
partments and agencies, establish the rules of 
advancement and recruitment within the civil 

service, and set down procedures for making sub- 

ordinate decisions. Ultimate responsibility for lack 

of efficient execution will be focused on the politi- 

cal executive. But, it is increasingly understood 

that the political executive establishes the proce- 

dures surrounding bureaucratic decisions without 

assuming direct responsibility for implementing or 

failing to implement them. A division of labor be- 

tween the political executive and the bureaucracy 

as a whole has developed and is reflected in dif- 

ferent evaluations of roles and functions and 

differing standards of responsibility. [See BUREAUC- 

RACY; CIVIL SERVICE.] 


Executive restraints and responsibility 


The growth of executive power and the increas- 
ing scope of initiative and decision making call 
for a discussion of the existing restraints and of 
the manner in which responsibility is institution- 
alized. The restraints appear to be relatively few. 
First, there is the burden, common to all political 
executives, including totalitarian ones, of persuad- 
ing the elite. Second, there are systemic restraints 
—no leader can attempt a synthesis of policy ob- 
jectives and goals that does not reflect, up to a 
certain point, the existing demands and aspirations 
of the community. To go too far in suggesting 
policy goals is to become separated from support- 
ers; to stand still is to alienate the interests and 
the demands that could provide support. To be 
effective and to gain approbation and support, the 
executive must gear its actions to the interests and 
demands within the system, As was pointed out 
earlier, the limits of initiative and freedom of 
action may be wide and it is the task of leadership 
to discover them. But, failure to do so or miscal- 
culation may lead to disapproval and rejection. 
Third, there are various types of constitutional 
and procedural limitations that trace the contours 
of executive power and provide for executive re- 
sponsibility. We already mentioned the formal 
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responsibility of the cabinet to the legislature. In 
presidential systems in which the chief executive 
is not directly responsible to the legislature—for 
example, in the United States and, to a certain 
degree, in France—separation of powers, judicial 
review, legislative scrutiny, and, in some instances, 
the direct association of one or the other of the 
houses of legislature in the exercise of executive 
prerogatives impose restraint. [See CONSTITUTIONS 
AND CONSTITUTIONALISM. | 

The substance of executive responsibility, how- 
ever, lies in the party system and in periodic elec- 
tions. The party is both an instrument at the dis- 
posal of the leader to attain power and carry out 
policies and a device that controls him, since with- 
out the support of the party, he is invariably help- 
less. As long as the party acquiesces or agrees, 
the political executive is omnipotent in virtually all 
political systems, despite procedural limitations. 
To lose the support of the party is to lose power. 
The political executive is, in fact, both the leader 
and the prisoner of his party. The only way to 
overcome party control is to appeal to the people. 
However, in totalitarian systems, there is no mech- 
anism for this, and even in democratic systems, 
where the mechanism is readily available, such an 
occurrence is rare. Ramsay MacDonald did it in 
1931, and Charles de Gaulle, in 1962. But the first 
was supported by a party other than his own, and 
the second managed to overcome a badly divided 
and fragmented party system. Theodore Roosevelt’s 
failure demonstrated the difficulty of this proce- 
dure in the United States. The appeal to the peo- 
ple through election provides the second type of 
responsibility. In democratic societies, where basic 
freedoms are respected, an election is the most 
effective instrument of control and, ultimately, of 
executive responsibility. It gives the electorate the 
Opportunity to approve or disapprove of policies 
and to choose between competing parties and lead- 
erships. In totalitarian systems the executive is 
responsible to the party only. In all cases, however, 
we must consider the degree of freedom that sur- 
rounds an election and the role and structure of 
political parties in democratic societies as opposed 
to the internal organization and the degree of dis- 
cipline within the party in a one-party totalitarian 
system. We must look at the substance and not the 
form. This is abundantly clear in the new nations, 
in which institutionalized forms—either through 
the party system, through open competitive elec- 
tions, or through well-articulated social grouping 
—have yet to develop. As a result, no matter what 
the constitutional prescriptions, the top executive 
and his associates are often removed from office 


because of mob violence, factional uprisings, or 
military coups. It should be recalled, however, that 
executive responsibility and the manner in which 
top political leadership is replaced or called to 
office did not become institutionalized in most 
countries until the end of the nineteenth century 
and, in some European systems, not until the 
twentieth. [See ELECTIONS; PARTIES, POLITICAL; 
RESPONSIBILITY.] 


A typology of political executives 


A typology of the political executive requires a 
prior typology of political systems. The two types 
that we discussed—cabinet and presidential—rep- 
resent descriptive, rather than analytical, cate- 
gories. The same is true of the widely used dis- 
tinction between “weak” and “strong” executives. 
A strong executive has been defined in terms of 
stability in office and ability to make and enforce 
decisions. This is the case when party support is 
forthcoming or when specific constitutional pro- 
visions secure the stability and the independence 
of the executive. Weak executives, on the other 
hand, obtain wherever the parties are many or 
lack internal discipline, so that the executive can- 
not rely upon a disciplined majority in the legis- 
lature—a situation often prevalent among multi- 
party cabinet systems. Thus, weak executives 
generally accompany multiparty or weak-party 
configurations, while strong executives can accom- 
pany either two-party or disciplined-party systems. 
The key variable is the party, not the constitutional 
and procedural arrangements. In fact, it may be 
argued that “executive leadership” and “strength 
in West Germany since World War 11 and in France 
in the Fifth Republic derive from the existence of 
a majority party rather than from constitutional 
arrangements. 

To provide for an analytic typology, we shall 
use the concepts of power, legitimacy, and decision 
making. In terms of power, absolute and qualified 
executives can be distinguished: the first is en- 
dowed with unlimited powers, and the second oper- 
ates within accepted and formulated restraints. 
Unless we refer to extinct satrapies and tyrannies, 
it will be difficult to identify absolute executives 
today. Absolute power is qualified in several MRE 
by constitutional limitations (i.e. separation © 
powers), by shared executive power ( collegiality ), 
and by clear procedures for assuring the wee 
sibility of the political executive to the party i 3 
the people. The combination of constitutional T 
itations, collegiality, and responsibility cn 
izes the generic category of democratic executiv i 
The absence of such characteristics or the absene 


of their explicit formulation and effective imple- 
mentation typifies the executives in closed polities, 
the most prevalent of which are the modern totali- 
tarian systems. [See TOTALITARIANISM. ] 

Following Max Weber, the basis of executive 
legitimacy may be classified as rational, traditional, 
and charismatic [see Lecrtrmacy]. Rational legiti- 
macy—the development of institutions and pro- 
cedures that channel executive decision making 
and, hence, legitimize it—is prevalent in both total- 
itarian and democratic systems. Traditional legiti- 
macy is generally associated with static and 
primitive societies and is, therefore, characteristic 
of very few contemporary executives. Charismatic 
legitimacy is most prevalent in intermediate stages 
of development—where tradition has been ex- 
ploded, but authority has not been institutionalized 
and legitimized. This type includes many of the 
new Afro—Asian nations. 

Finally, in terms of decision making, executives 
can be classified as monocratic or pluralistic, per- 
sonal or collegial, arbitrary or limited. The obvious 
“syndromes” here are the “monocratic—personal— 
arbitrary” and the “pluralistic—collegial—limited.” 
But, these syndromes do not fit neatly in empirical 
situations—unless we limit our comparisons arbi- 
trarily (e.g., Great Britain versus a Latin American 
dictatorship). Generally speaking, one can equate 
the “monocratic—personal-arbitrary” syndrome with 
totalitarian systems and the “pluralistic—collegial— 
limited” one with democratic systems if it is re- 
membered that ingredients of all six can be found 
in all contemporary political systems. The types 
suggested are only analytical types; they must be 
carefully related to a generalized theory of political 
systems in order to study the empirical forms they 
take and the conditions under which they are likely 
to develop and change. 


Trends and prospects 

As we have seen, the burden of decision making 
thrust upon the political executive has grown im- 
mensely. The effective performance of executive 
leadership calls for an unprecedented balance be- 
tween leadership and technical know-how, infor- 
mation and evaluation, specialization and coordi- 
nation. Political structures must adjust to new 
environmental demands. The adjustive process of 
the executive branch over the years, particularly 
since World War 1, has involved devolution, depo- 
liticization, and the creation of new administrative 
organs. Devolution is the delegation of the power 
to make decisions to subordinate organs or agen- 
cies. Every effort has been made to decongest the 
political executive by explicitly authorizing subor- 
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dinates or subordinate agencies and boards to make 
decisions. Depoliticization is the process whereby 
certain issues are removed from the realm of polit- 
ical controversy and executive discretion. Collec- 
tive bargaining; wage fixing on the basis of a pre- 
agreed device that pegs wages on the cost of living 
index; and the establishment of public corporations, 
boards for nationalized industries, and regulatory 
commissions are some of the most usual practices. 
Finally, an inevitable solution to the problem of 
handling the new burdens is to create new agen- 
cies and offices and lodge them within the office of 
the president or the prime minister, [See CENTRAL- 
IZATION AND DECENTRALIZATION; DELEGATION OF 
POWERS. | 

The multiplicity of the new functions and the 
growth of new administrative structures to cope 
with them account for the emergence of two op- 
posed developmental trends: concentration and 
diffusion. Concentration is a constitutional and 
political requirement. Even in collegial executives, 
one man increasingly bears the responsibility for 
decision making. The growth of powers, the mul- 
tiplicity of offices, and the division of tasks within 
the executive have only underlined this. It has 
been heightened by the considerations of defense 
and strategy. Institutionalized forms such as the 
inner cabinet or the National Security Council are 
likely to give place to restricted committees of very 
few officials. Crisis government and corresponding 
flexible structures to cope with crises have now 
become permanent aspects of executive decision 
making. Yet, at the same time, the more important 
a decision, the more it is contingent upon a com- 
plex process of deliberation and consultation in 
which offices and men with specialized tasks par- 
ticipate. It is increasingly diffuse and bureauc- 
ratized, while it appears to be highly concentrated 
and often personal. A separation of powers that 
limits the ability of the top executive to reach a 
decision and act is informally institutionalized 
within the political executive. Divergent points of 
view, varying evaluations, and conflicting informa- 
tion about the same situation confront the top 
political leader. As decision making becomes rou- 
tinized and bureaucratized, the top political exec- 
utive often becomes limited to ad hoc compromises 
and may be reduced to immobility. This conflict, 
between the concentration of decision making and 
the diffusion and impersonality that go with the 
routinization of decision making, must be resolved 
if the political executive is to play its role of leader- 
ship effectively. 

The trends we noted above apply primarily to 
the developed political systems. In the new nations 
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the executive, along with virtually all other gov- 
ernmental structures, is still in the process of 
establishing itself as a legitimized instrument of 
decision making. The problem ahead is not so 
much finding a proper balance between personal 
leadership and bureaucratic mechanisms for deci- 
sion making, but rather establishing viable and 
stable structures with specialized roles and effec- 
tive capabilities to suggest policies and make deci- 
sions. Whether this will be accomplished or not 
depends upon a great number of factors—the de- 
velopment of a sense of community, agreement on 
the basic rules of polity, the development of insti- 
tutions with specific tasks, and, above all, the 
routinization and institutionalization of the per- 
sonal government which continues to be the rule 
today. 

Roy C. MAcripis 


[See also LEADERSHIP, article on POLITICAL ASPECTS. 
Directly related are the entries CAUDILLISMO; DIC- 
TATORSHIP; GOVERNMENT; MONARCHY; OLIGARCHY; 
PARLIAMENTARY GOVERNMENT; PRESIDENTIAL GOV- 
ERNMENT. Other relevant material may be found 
in ADMINISTRATION; DECISION MAKING; LEGISLA- 
TION; PARTIES, POLITICAL; REPRESENTATION.] 
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POLITICAL FINANCING 


The acquisition and retention of political power 
require material resources and human energies. 
Money is the most important medium for trans- 
ferring command over material resources. Its com- 
mand over human energies varies among societies 
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at different stages of development but is of great 
importance in all political systems with competitive 
popular elections, Political financing, therefore, 
comprises the processes through which money is 
used to channel resources and energies for political 
purposes. 

Modern studies of political financing have been 
concentrated in systems where important social de- 
cisions are made in competitive elections. They 
have been little concerned with how money is used 
as a source of influence in authoritarian regimes. 
In the democracies, observation and analysis of 
political financing have focused principally on the 
activities of candidates, political parties, and other 
participants in election campaigns, including or- 
ganized interest groups and parallel-action organi- 
zations. Lobbying activities have usually been 
treated separately. The financial aspects of the 
political process have been given greatest attention 
in the United States. In other nations, however, 
especially in western Europe but not only there, 
certain phases of political financing have increas- 
ingly become objects of study. 


The study of political financing 


Characteristics. In systems where political fi- 
nancing has been studied, money flows through 
many reaches of the political process, Focus upon 
money per se involves, directly or by implication, 
all aspects of the political system, Features of the 
conversion of economic power into political power 
are revealed, for example, by analysis of the sources 
of political funds, General questions of voting be- 
havior are encountered by analyses of the uses to 
which funds are put in election campaigns. Tracing 
the flow of funds projects the student into analyses 
of human motivation, political participation, the 
character of influence, and the structure and oper- 
ation of political party systems. The widespread 
attempts to regulate political financing lead to 
questions of constitutional authority, of the social 
requisites for enforceable law, and of public under- 
standing and attitudes. Historical topics of political 
philosophy and complex questions of explanatory 
theory pervade the scrutiny of political financing, 

Since money is an agent, standing in place of 
the goods and services into which it can be con- 
verted, arbitrary boundaries on its definition are 
introduced to make its study feasible. These bound- 
aries can be strategically altered as the data avail- 
able and the character of the political system being 
studied permit. In past research, distinctions have 
been made between contributions of cash and con- 
tributions of goods or services. The latter have 
often been excluded from analysis. Also usually 
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separated from analysis have been the general 
effects of economic power, as felt through many 
channels in the society—control over credit sources, 
impact on the mass media of communications, 
influence emanating from the hierarchical struc- 
ture of segments of the economic system, and 
such. Moreover, the personal wealth of candidates 
is frequently difficult to isolate from the superior 
development of personal political abilities that 
those resources may have helped to develop; hence 
it, too, has often been excluded from the analysis. 

The quality of available data has profoundly 
affected the study of political financing. Much 
more is known in some nations than in others 
about the grand architecture and operational de- 
tails of political financing. In all systems, to vary- 
ing extents, empirical knowledge of desired scope 
and reliability is lacking. Absence of records and 
deficiencies in such records as exist partially ac- 
count for this lack. Inducements to secrecy, because 
of the character of transactions in political financ- 
ing and the public suspicion aroused by them, 
obstruct impressionistic as well as hard, empirical 
analysis. The complexity of the relevant data, given 
the implications of political financing for entire 
political systems, further compounds the difficul- 
ties of interpretation. The subject has at times 
suffered from both sweeping, oversimplified impu- 
tations and excessively narrow attempts to assem- 
ble dependable data for analysis. 

The study of political financing has also been 
shaped by the different incentives of the students. 
Reform sentiment seeks to alter practices, often in 
accordance with ideological assumptions about the 
proper functioning of the political process, Such 
sentiments have originated both with practitioners 
and with others. Political interests directly engaged 
in competitive politics concern themselves with 
the subject in order to gain or maintain conditions 
they consider advantageous to themselves. Some 
study of the subject is politically neutral, directed 
simply toward better understanding of political 
behavior. Thus, the study of political financing 
may serve as a tool of political analysis, a means 
to political objectives, or a combination of both. 

Evolution of the study. The broad development 
of political finance research is revealed by the 
changes that have occurred over the years in the 
primary purpose of study, in the scope of concern, 
in the kinds of data and how they were employed, 
and in the intensity of comparative analysis. After 
an almost exclusive concern with public policy, 
students of political finance have increasingly been 

examining the subject simply for what could be 
revealed about the behavior of political systems. 


From a narrow focus on formal campaign com- 
mittees, attention has widened to embrace many 
of the other aspects of the political process affected 
by political finance. From a heavy reliance on quan- 
titative data submitted in official reports, analysts 
have made more and more use of information 
gathered by depth interview, survey research, direct 
observation, and other empirical means. After a 
long period of exclusive concern with political fi- 
nancing in single nations, comparative analysis 
across the boundaries of political systems has 
emerged. In the United States, where over the 
years most political finance research has been pro- 
duced, this mounting sophistication can be traced 
through four chronological phases. 

(1) Until the 1920s data and commentary were 
largely found in public documents and in the writ- 
ings of journalists such as the muckrakers of the 
progressive era. Congressional testimony, state and 
federal reports, and the discussion they inspired 
constituted the bulk of inquiry and interpretation. 

The first effective concern was felt for public 
employees who were subject to financial assess- 
ment by political leaders responsible for their jobs. 
Some evidence of existing practices was assembled 
in the move for protective legislation. Attempts 
were later made to determine total sums spent in 
certain political campaigns and the amounts and 
sources of campaign gifts. Interest in the subject 
increased with the belief that large donations to 
political parties gave unwarranted influence in 
government to the individuals and corporations 
supplying the funds. State and federal legislation 
was directed not only at age-old abuses, many of 
them (such as bribery) long contrary to the com- 
mon law, but also toward limiting the sources of 
funds and the amounts and kinds of expenditures. 
It was directed too toward providing publicity tg 
campaign accounts, on the theory that public 
knowledge of political financing would have a put- 
ifying effect on the practices followed and would 
give voters important information to assist them 
in deciding which parties and candidates to sup- 
port. Proposals for public subsidies to help meet 
campaign costs were generally ignored. 

In 1926 James K. Pollock published the a 
scholarly treatment of political financing in ae 
form, Party Campaign Funds. He was concerne 
with the formulation of wise public policies for the 
regulation of campaign funds and placed we 
faith in the efficacy of forced publicity. He focuses a 
on favoritism in government and emphasized E 
conception of selfish interests in conflict with pu £ 
lic interests, The analysis was confined to pada 
of party campaign committees and used data draw. 


chiefly from official party financial statements and 
the findings of Congressional committees. Pollock 
cited the shortage of dependable data as a serious 
obstacle to the creation of effective public policy. 
The categories of description included the volume 
of expenditures, the fund-raising methods, the 
sources of funds, the uses of funds, and the state 
and federal legal controls. 

(2) In 1932 a second phase of study opened. 
Pollock published what long remained the only 
book on campaign finance outside the United 
States, Money and Politics Abroad. Practices in 
Great Britain, Northern Ireland, France, and Ger- 
many were examined. Pioneering attention was 
given to auxiliary and nonparty organizations, and 
the main sources of data were direct observation 
and interviews. New categories of analysis were 
added: e.g., the effect of expenditures on the out- 
come of elections, the handling of funds within 
parties, the impact of money on a total political 
system, the general identification of controlling in- 
terests in a state. The main purpose was still to 
discern effective controls of political finance, but 
broader understanding of the whole party system 
was sought as a means to that end. Each nation 
was treated descriptively, separately, and differ- 
ently, without attempt to develop concepts for com- 
parative analysis. 

Also in 1932, the first of a series of writings by 
Louise Overacker appeared, Money in Elections, 
While essentially policy connected, it professed to 
formulate a “theory of parties” in order to state, 
although not necessarily solve, problems posed by 
political financing practices in the United States. 
Overacker accepted political pressures as inherent 
and was chiefly concerned with whether prevailing 
practices prevented voters from adequately express- 
ing their wants and protecting their interests, She 
viewed the use of money in elections as a by- 
product of popular government and reached the 
strong conclusion that legislative efforts to limit 
the volume of election expenditures should be 
abandoned. Overacker put her faith in publicity of 
campaign accounts. She included the financing of 
nomination campaigns in her inquiry, analyzed 
the sizes of contributions, identified economic and 
family connections of contributors, set down cate- 
gories of motivation underlying campaign giving, 
made certain analyses of trends over time, along 
with certain comparisons between nations, and 
otherwise expanded the focus and depth of anal- 
ysis. Overacker’s writings on political finance, cul- 
minating in 1946 in Presidential Campaign Funds, 
especially illuminated the sources of political con- 
tributions, including the newly important labor 
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movement, and exposed the defects of the Hatch 
acts, adopted in 1939 and 1940, in their attempts 
to regulate political finance, She relied mostly on 
data from official reports and public investigations. 

(3) The difficulties of obtaining usable empir- 
ical data have always limited the number of schol- 
ars willing to study political financing. In the 
United States usually no more than one major 
worker has been active at a time. In 1953, at the 
University of North Carolina, a program of re- 
search was initiated designed to engage the ener- 
gies of many persons, expand the range of inquiry, 
make larger amounts of varied, dependable data 
available, and use the scrutiny of political finance 
as an avenue to study of the general political sys- 
tem. The chief resulting publication was The Costs 
of Democracy (1960), by Alexander Heard, The 
study viewed solicitation, donation, transmission, 
and use of political funds as forms of political par- 
ticipation, to be evaluated with other forms of po- 
litical participation—e.g., campaigning and voting 
—as parts of the intricate network of relationships 
that constitute government and politics in the 
United States. The complex of activities involving 
political financing was viewed as one set of the 
evolving mechanisms by which representation of 
interests in the politics of the United States is 
achieved. 

In this framework financial and nonfinancial in- 
fluences in the outcome of elections were deline- 
ated; a typology of motivations for financial par- 
ticipation in politics was presented; and measures 
of the financial involvement of various types of 
groups and associations, including the underworld, 
were offered. The significance of fund solicitors as 
a type of political activist was evaluated. The flow 
of funds through and between political organiza- 
tions was traced and analyzed, with special atten- 
tion to the character of party cohesion revealed, 
The significance of political financing in the re- 
cruitment and winnowing of candidates for nomi- 
nation was appraised, Throughout, efforts were 
made to bring a maximum of qualified data into 
use, to identify meaningful trends in political prac- 
tices, to detect causal relationships, and to inter- 
pret for its public policy significance the experience 
and changing context of public regulation of po- 
litical finance. This work and the studies associated 
with it were national in focus, although Heard 
drew on foreign experiences to give perspective to 
United States practices. 

In 1958 an organization devoted exclusively to 
the study of campaign finance was established in 
Princeton, New Jersey. The Citizens’ Research 
Foundation, with a small staff under the direction 
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of Herbert E. Alexander, extended some of Heard’s 
approaches and initiated new ones of their own. 
Through its publications series, its own research, 
its assistance to anyone studying the subject, and 
the accumulation and dissemination of informa- 
tion, the Foundation kept a steady focus of atten- 
tion on political finance in the United States and 
contributed in many significant ways to the con- 
tinuity and development of its study. 

While scholarly concern with political finance 
was evolving in the United States, western Europe 
for a long time was the only other major area in 
which research into the subject was seriously pur- 
sued, and much of the work there was isolated and 
unrelated. Earlier hypotheses, by theorists like 
Max Weber, were never seriously tested, and re- 
search has been focused quite differently in the 
several countries. In Britain the most extensive in- 
vestigations of recent years were historically ori- 
ented, analyzing changes in the techniques of 
political finance as related to the widening of the 
franchise, changes in elite recruitment, and the 
development of party organizations in the nine- 
teenth century. In Germany a large part of the 
extensive literature was addressed to legalistic 
problems of constitutional interpretation and regu- 
lations, although numerous monographs and ar- 
ticles also used empirical data to evaluate political 
finance practices in the German Federal Republic. 
Some research endeavors were offshoots of interest- 
group studies. The greater disclosure of financial 
data by European trade unions and socialist polit- 
ical parties made possible the analysis of political 
financing activities of various labor movements. 
Certain highly controversial political questions, like 
the relationship of German business to the Nazis 
in the early 1930s, stimulated other islands of re- 
search. Some information, as well as normative 
evaluation, was also developed by a number of 
official commissions set up to advise governments 
on possible regulatory legislation. Such commis- 
sions submitted reports in the 1950s in Sweden, 
Norway, and Germany. 

(4) In the early 1960s an attempt was made to 
prepare for more-broadly comparative analyses of 
political finance. Efforts were begun to develop 
common frameworks for studies of political finan- 
cing in the United States, Europe, and non-Western 
countries. A number of studies were commissioned 
that for the first time presented system-wide de- 
scriptions of the processes of political finance in a 
number of Asian democracies, particularly the 
Philippines and India. The range of nations for 
which minimal information became available was 
enlarged through article-length studies of political 


financing in Italy, Israel, and Australia. These over- 
view studies, constructed within the same basic 
framework of inquiry, together with similar ar- 
ticles on Great Britain, Japan, and West Germany, 
were published under the editorship of Richard 
Rose and Arnold J. Heidenheimer (1963). In his 
accompanying “Comparative Party Finance: Notes 
on Practices and Toward a Theory,” Heidenheimer 
(1963) developed concepts that could be used in 
measuring and explaining differences in political 
financing between political systems and argued 
the relationships between levels of expenditures 
and stages of economic, social, and political devel- 
opment. The emergence of comparative studies 
made clear the need for more generalized con- 
ceptual terms. Moreover, the introduction of non- 
Western data and the use of developmental con- 
cepts widened the horizons of investigators. The 
move toward comparative studies spanning con- 
tinents, and perhaps ultimately spanning centuries 
also, forced recognition of the need for greater 
stress on the general theoretical significance of 
analytical and descriptive studies. 

Research outside the United States. In 1966 
the “research map” of political financing was un- 
evenly charted. Apart from the United States there 
were three major nations in which the principal 
dimensions of political financing had been sub- 
stantially explored. These were West Germany, 
Great Britain, and Japan, although in the latter 
two cases analyses were based mainly on informa- 
tion from official reports filed by the participants 
themselves. In an intermediary group of countries, 
some Western, some Asian (including Israel, Italy, 
Australia, Canada, the Philippines, and India), the 
search for hard data and functional analysis had 
begun with some promise. In certain groups of 
countries, like the small democracies of western 
Europe and the more stable Latin American coun- 
tries, with only a few exceptions little attempt had 
yet been made to assay and develop political finan- 
cing data. The least amount of inquiry had been 
attempted in the new democracies of Africa and 
Asia. There the study of political finance, alie 
necessarily focused on different structures an 
techniques, may eventually prove fruitful. 

With regard to contemporary practices, : 
had become known about political financing a 
West Germany than in any other country He 
the United States. Research was stimulated po: 
only by scholarly curiosity but also by partisan in: 
terests. The political parties had stakes ng 
patterns of financial support and in conflicting 1° 
terpretations of constitutional requirements. see 
gles over enactment of disclosure requirements, 


more 


deductibility provisions, and the role of “conveyer” 
and “sponsor” associations helped to keep the sub- 
ject timely, throwing up much raw information in 
the process. The Germans developed a compre- 
hensive system for public financing of political 
parties, under which in 1964 some 50 million 
marks were paid annually to the three major par- 
ties from the budgets of federal, Land, and local 
governments. This development not only led schol- 
ars to important reinterpretations of the German 
party system but also provided a new model of 
political financing practice that itself stimulated 
further study. 

In Great Britain following World War m, the 
conditions of political finance appeared satisfac- 
tory to most, except some followers of the Labour 
party, and were given relatively little study. Many 
scholars and politicians felt that few improprieties 
were committed and that few distortions of repre- 
sentative government resulted, a radical change 
from practices that prevailed in the nineteenth 
century. British law imposed a rigid limit on ex- 
penditures made on behalf of a candidate in his 
constituency during an election campaign. The 
effectiveness of the limit bred satisfaction with the 
system, and most research in political finance was 
based on official reports of these constituency ex- 
penditures. The growth of central-organization 
campaign activities, however, not limited by these 
controls and not covered by official reports, led 
eventually to a mounting desire for more extensive 
knowledge of British campaign finance. In Great 
Britain, as in other places, the increased impor- 
tance of mass communications and the increased 
role of organized interests in general efforts to 
shape public opinion brought realization that the 
British controls over candidate expenditures dur- 
ing the limited campaign period constituted regu- 
lation of but a small share of significant political 
expenditures. 


The state of knowledge 


Knowledge of political finance can be viewed at 
three levels: descriptive, interpretive, and theo- 
retical. After four decades of scholarly study, 
knowledge of political finance at all three levels 
had advanced substantially in the United States 
and was appreciably greater than corresponding 
knowledge in other nations. The greatest progress 
had been made in assembling and ordering de- 
scriptive data. These led to new interpretive in- 
sights into particular segments of political finance, 
and the broadened range of inquiry during the 
1950s and 1960s made possible the development 
of typologies and of limited, empirically based gen- 
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eralizations about some aspects of the subject that 
had not been offered previously. No comprehen- 
sive, abstract theory seeking explicitly to account 
for all phenomena of the subject appeared, but a 
number of unifying themes emerged from the 
diverse inquiries of Heard and his associates, e.g., 
the ultimate primacy of votes in a society with a 
free election system, the multiple sources of influ- 
ence over votes, the limits imposed by a pluralistic 
political system on any single source of power, and 
the multiple manifestations of social and economic 
power. 

Information about the processes of political 
financing was detailed and well documented in 
many particulars, especially the volume and sources 
of contributions and the volume and purposes of 
expenditures in certain kinds of elections. Finan- 
cial relationships between many party and cam- 
paign units had been traced and understood. The 
differential roles of certain significant participants 
in the processes of campaign finance had been 
identified. Political giving, as part of the syndrome 
of political involvement of individuals and groups, 
had been explored. The roots of effective and 
ineffective legal controls had been probed, and ex- 
perimentation with new forms of regulation oc- 
curred. Expertise of certain kinds had been devel- 
oped sufficiently to make possible the evaluation of 
innovations in soliciting procedures and in other 
campaign practices. And, over-all, many financial 
linkages between social structure and political 
structure had been delineated. 

Election expenditures. Data and understanding 
were technically more complete in the United 
States at national political levels than at state and 
local levels, an inevitable concomitant of the fed- 
eral governmental system and the decentralized 
party structures of the United States. For the presi- 
dential election year 1964, for example, over $26 
million in expenditures through national-level cam- 
paign organizations could be clearly identified, 
made up of cash outlays plus debts. These ex- 
penditures were substantially higher than in previ- 
ous recent presidential election years. The division 
of expenditures between the parties has generally 
favored the Republicans by a ratio of about 60 to 
40, although in 1960 the division was about equal. 
Fluctuations in the purposes for which expendi- 
tures were made by national-level organizations 
could be charted, revealing, for illustration, not 
only the relatively stable emphasis given to both 
organizational and communications activities over 
the decades but also the displacement of the com- 
munications medium dominant at a particular time 
by another, i.e., newspaper advertising by radio, 
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radio by television. The relative importance of 
donations of certain sizes could be traced: usually 
about two-thirds of the amount received by both 
parties in individual donations at the national level 
is in sums of $500 or more. 

The $26 million accounted for in 1964 by na- 
tional groups constituted, however, only a small 
portion of the total cash spent in all nomination 
and election campaigns that year, probably $200 
million in all. In the large states millions of dollars 
were spent in the presidential race. Where compe- 
tition was heated, a candidate for governor or 
United States senator might find over a million 
dollars spent on his behalf. But below the national 
campaign level the reliability of data varied greatly 
from one state and locality to another, so that in 
only a few specific places could trends be followed 
from election to election. Generally, at state and 
local levels the number of reporting and receiving 
officers, the frequency of reports, and their variable 
accuracy and completeness made the summary and 
analysis of data too laborious for regular use. 
Many funds employed in political finance, more- 
over, went unreported in official accounts, thus 
requiring the use of much supplementary informa- 
tion from other sources. 

Party structures. The effects of political finance 
practices on internal party structures, and the re- 
verse, had been increasingly studied. For example, 
the system of special finance committees organ- 
ized openly and solely for solicitation purposes by 
the Republican party in the United States for years 
contrasted sharply with traditional practices in 
most units of the Democratic party. In the latter, 
fund raising was a function of regular campaign 
committees or of special small cadres of indi- 
viduals, well placed to give or solicit funds, who 
operated largely out of public view. The Repub- 
lican finance system consistently displayed greater 
technical proficiency and less secrecy in its opera- 
tions than did Democratic practices. In both parties 
political solicitors became recognized as critically 
important communications links, representing the 
views and needs of candidates to donors and re- 
flecting the interests of donors to parties and pub- 
lic officials. Techniques of fund raising changed 
rapidly in the United States in response to changes 
in income tax and gift tax laws, in per capita dis- 
posable income, in the tools of solicitation, in pub- 
lic attitudes, and in other conditions that affect 
who gives and how much. 

Legal limitations. The futility of the legal ceil- 
ings imposed on individual political gifts and on 
expenditures by individual campaign committees 
had been demonstrated. The federal and state gov- 


ernments in the United States had generally pur- 
sued repressive and negative approaches in seeking 
to regulate political financing, prohibiting or limit- 
ing various types or amounts of contributions and 
expenditures. Reliance on public disclosure of cam- 
paign accounts as a disciplining influence on can- 
didates and parties had proved disappointing to 
most of its advocates. Direct and indirect govern- 
mental subsidies had increasingly found favor with 
students and practitioners as a way of helping to 
meet campaign costs while avoiding some of the 
pressures of private financing. Four states adopted 
statutes permitting limited deductions of political 
contributions in computing personal income tax- 
able under state law. The U.S. Presidents Com- 
mission on Campaign Costs, a bipartisan group 
appointed by John F. Kennedy in 1961, recom- 
mended tax concessions to encourage more-widely- 
dispersed popular financing of presidential cam- 
paigns. 


At all three levels of knowledge postulated 
above—descriptive, interpretive, theoretical—great- 
est future progress in understanding political 
finance lies through systematic comparative study. 
This progress will require the development of many 
standard categories of information about political 
finance in diverse institutional and cultural set- 
tings. It will require sophisticated concepts and 
units of measurement of a type that have not char- 
acterized study of the subject in the past. The mere 
acquisition of certain types of desired data may in 
many places prove impossible. The type of material 
that constitutes relevant data varies from one set- 
ting to another. Common functions appear in dif- 
ferent guises, and seemingly similar phenomena 
may, in their particular contexts, have quite dif- 
ferent significances. The advances already made 
at a conceptual and theoretical level by students 
of comparative political systems offer stimulus 
and promise to students of comparative political 
finance. 

The effect of the processes of political finance 
on the recruitment of political leadership requires 
special attention. The financial aspects of nomina- 
tion and prenomination processes have been ex- 
amined in the United States, for example, yet they 
remain relatively obscure and their revision seems 
beyond the purview of most proposals for change. 
The character of commitments that result from 
acceptance of financial support requires cae 
ization, and the conditions which prompt financi 
support instead of other forms of support n! 4 
delineation. The illumination of these two feani 
of political finance is essential to an understan' 


ing of the sources and exercise of political power. 
Inevitably, the study of political finance will be 
affected—sometimes encouraged, sometimes ob- 
structed—by its implications for political con- 
troversy. 

ALEXANDER HEARD 


[See also BUDGETING; CONFLICT OF INTEREST; PARTIES, 
POLITICAL; POLITICAL PARTICIPATION, ] 
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POLITICAL GROUP ANALYSIS 


The analysis of political groups covers a wide 
range of viewpoints, conceptual schemes, and 
structures of preference. Although the study of 
nongovernmental groups is currently a widespread 
academic concern and although some attempts 
have been made to construct conceptions of the 
structure of political power in which groups, vari- 
ously defined, are central elements, no genuine 
consensus of a theoretical sort has emerged from 
these investigations. To speak of the group ap- 
proach to political studies, of the group theory, or 
of the group model is both inaccurate and mis- 
leading. Such terms are at best labels applied to a 
diverse and frequently conflicting collection of 
analytical efforts in the study of politics. 

Despite the absence of a consensus on what 
group analysis can do for the understanding of 
politics and power, one can identify two tendencies 
in the contemporary literature that provide a 
means of classifying the vast bulk of such writing. 
The first of these is the position that, especially 
in relatively developed societies, it is not possible 
to achieve an adequate understanding of the more 
visible, constitutionally recognized governmental 
institutions and their interconnections without tak- 
ing into account the attributes and activities of 
nongovernmental or unofficial groups of various 
sorts. This position has produced a large and grow- 
ing body of essentially descriptive research on the 
organization and activities of what are usually 
called pressure groups or interest groups. This type 
of investigation developed first in the United States, 
where the issues of domestic politics and the 
peculiarities of the constitutional structure com- 
bined to make such groups relatively visible and 
accessible to inquiry. Since shortly after the close 
of World War u, however, studies of interest 
groups have been conducted in most of the devel- 
oped parts of the globe, especially in countries 
whose systems permit relatively free political in- 
quiry and at least tolerate the formation of extra- 
governmental associations, The contribution of 
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these studies to the description of political systems 
often has not gone beyond adding another set of 
institutions to those customarily examined—execu- 
tives, legislatures, courts, bureaucracies, and po- 
litical parties. 

A second tendency, much less common, is rep- 
resented by a limited set of attempts to develop, 
directly from research or by synthesis from de- 
scriptive studies, something approaching a general 
theory of politics in which the group, variously 
conceived, occupies a central position. These ef- 
forts show differing degrees of sophistication and 
complexity. None is satisfactory as a general the- 
ory, but collectively they have helped to raise or 
revive, in the analysis of political systems, ques- 
tions and problems of a fundamental sort that 
otherwise were likely to be neglected or to remain 
unidentified. Among these are questions concern- 
ing the nature of political leadership, the processes 
of electoral choice, the structure of official decision 
making, the character of linkages between con- 
stituents and officials, and the consequences of 
alternative forms of governmental organization. 
The convergences among these attempts are far 
from complete. They differ chiefly in the degree to 
which they allow for an independent initiative in 
public policy by elements of the official govern- 
ment, in the extent to which they see influence 
relations between groups and governmental units 
as unidirectional or reciprocal, and in the room 
they allow for short-run consequences of idiosyn- 
cratic behavior by those in positions of formal 
leadership, 


History 


The recognition of groups, broadly conceived, as 
significant actors in politics is scarcely a novelty 
in Western thought. The essential notion of the 
group, even as a claimant on the choices of gov- 
ernmental decision makers, is to be found in Plato 
and Aristotle. It appears and reappears in succes- 
sive periods, especially when attention is focused 
on the classic problem of tensions between rulers 
and the ruled, the problem of the center versus the 
circumference, as Charles E. Merriam phrased it, 
whether in the restriction of the claims of the uni- 
versal church, in the formation of the nation-state, 
or in the turbulence associated with the com- 
mercial and industrial revolutions. 

In contemporary terms the analysis of political 
groups has claimed attention as groups have be- 
come highly differentiated and numerous under 
social systems characterized by a complex division 
of labor, interdependence, a resulting emphasis 
upon bargaining relations, and efficient means of 


communication, almost regardless of whether the 
political culture supports the open expression of 
dissent and the legitimacy of nongovernmental as- 
sociations. In addition, the concept has been taken 
up by investigators attempting to give some sort 
of order to the confusing crosscurrents of polit- 
ical influence in semideveloped or underdeveloped 
countries exhibiting rapid change in various societal 
sectors. 

The intellectual paths leading to. contemporary 
concern with the analysis of political groups are, 
however, diverse. At least four can be discerned. 

The first path is that of the English and Con- 
tinental, especially German, pluralists. Essen- 
tially a reformist reaction, especially among its 
later English adherents, against the inclusive 
claims on behalf of the modern state to a virtual 
monopoly of legitimate authority, this doctrine’s 
most influential origins are to be found in the his- 
torical jurisprudence of Gierke [see GIERKE]. Clearly 
not hostile to the emergent German nationalism 
of Bismarck, Gierke saw danger or at least error 
in the concept of the state as the exclusive source 
of law. Accordingly, he emphasized the similarities 
between the state and other social groups and em- 
ployed his impressive historical scholarship in an 
attempt to revive the German Genossenschafts- 
recht. Gierke’s work was introduced to English 
thought primarily by Maitland, who was skeptical 
about the implications of Austin’s monistic theories 
of state power and was consequently concerned to 
emphasize the importance of nongovernmental as- 
sociations to an understanding of contemporary 
politics [see AUSTIN; MAITLAND]. Following Mait- 
land and Figgis, who confined his attention largely 
to the relations between church and state, the later 
pluralists, G. D. H. Cole and especially Harold 
Laski, used these insights for an attack on existing 
state power and on the capitalism with which they 
saw it inextricably allied [see COLE, G. D. Hs; 
Fıccıs; Lasxk1]. Paradoxically, their writings 1m- 
plicitly assumed the relative stability of pon 
Victorian British politics in their neglect, at leas 
until the 1930s, of the problem of how the au- 
thority of an ultimate power, whether as dires 
or as umpire, is to be maintained. A useful mgt 
ment for socialist criticisms in Britain and for a 
demands of British labor, the doctrines of Ene: 
pluralism were influential in the United shee 
a convenient descriptive approximation Bete of 
increasingly familiar features of the Cel 
power on the American scene. [See PLURALIS. Sa 

A second path to contemporary Conan 
opened up by a set of highly influential ro atl 
tal sociologists, notably Gumplowicz, Simmel, 


Ratzenhofer, in the closing years of the last cen- 
tury [see GUMPLOWICZ; RATZENHOFER; SIMMEL]. 
Differing widely among themselves and pursuing 
sharply differing goals, they had in common an 
effort to use notions of the group as a means of 
comprehending the process of social and political 
development. They were among the contributors 
to the stimulating atmosphere of the Continental, 
especially the German, universities, into which 
entered a generation of young American scholars 
who were to be important in the emerging Ameri- 
can graduate schools, in the social sciences as well 
as in other fields. 

The third path to the contemporary concern with 
the analysis of political groups is best exemplified 
by the work of one man, Arthur F. Bentley, al- 
though he was not wholly alone in his efforts. His 
name must be linked with that of John Dewey, who 
was an influence upon him as early as 1895 and 
with whom an increasingly intimate association 
and collaboration developed in the 1930s and 
1940s [see Dewey]. Bentley's classic, The Process 
of Government (1908), was his only systematic 
excursion into the explicitly political realm. 

The book, which was in his words an “attempt 
to fashion a tool,” is in part critical and in part 
constructive, Bentley participated in “the revolt 
against formalism” in this period of American 
thought. He rejected the reification of conventional 
categories, whether legal, political, or psycholog- 
ical, and repudiated a fixed and simple classifica- 
tion of social groups, whether that of Marx or 
those of the German sociologists. He insisted that 
a political science, or any social science, could be 
based only upon observable human behavior, al- 
though he left room for a “potential” stage of 
activity which he called “tendencies of activity.” 
Group activity, which he saw as a means of stating 
all the phenomena of government, was to be known 
and to be stated on this basis. He began to see the 
process composed of such activity as sets of “trans- 
actions,” though he did not use the term or elabo- 
rate the view until much later. [See BENTLEY.] 

The Process of Government is thus an inquiry 
much broader in scope than a study of “pressure 
groups.” For more than thirty years after its pub- 
lication, however, the broader relevance of the 
book was almost wholly ignored. Instead, Ameri- 
can scholars produced a series of studies of pres- 
sure groups and “lobbies” which collectively con- 
stitute the fourth discernible path to contemporary 
concern for the analysis of political groups. Fre- 
quently owing their inspiration to materials gen- 
erated in the course of investigations conducted by 
committees of the Congress, these studies varied 
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in scope and level of sophistication, but they built 
up a body of comment on a hitherto neglected type 
of political institution. The best of them focused 
on a particular arena (Herring 1929; 1936), on 
a type of policy choice (Schattschneider 1935), on 
the actions of a single group (Odegard 1928), or 
on the connections between a group’s internal 
processes and its actions (Garceau 1941). Follow- 
ing World War u, as previously noted, the dis- 
covery that such groups were not restricted to the 
American political scene led to a large number of 
comparable investigations in a wide variety of 
countries, “Pressure-group” studies have since be- 
come a staple in the research activities of political 
scientists in almost every corner of the globe. 


Current trends 


No genuine convergence of these four paths has 
occurred, but collectively they have led to a num- 
ber of uneven attempts to produce some limited 
theories or at least to alter prevailing modes of 
exposition. One of the first areas for this sort of 
effort was the American political party. The works 
of Herring (1940), Schattschneider (1942), and 
especially Key (1942) differed in their analytical 
and prescriptive positions, but each helped to estab- 
lish the view that the political party, in the United 
States at least, could not be understood apart from 
its relations with constellations of nonparty groups 
[see Key]. Key took the additional step of examin- 
ing the party organization itself as one type of 
interest group, functionally indistinguishable in 
some of its features from nonparty groupings. 

Paralleling such attempts have been a number 
of efforts at theorizing about nonparty groups in 
relation to the whole spectrum of the political sys- 
tem. Although these efforts cannot yet be properly 
assessed, it seems likely that those which avoid 
too narrow a conception of the term “group” will 
prove most acceptable. To limit the reference of 
this term to the “pressure group” of popular par- 
lance is dangerous because, on the one hand, there 
is the risk of oversimplification and, on the other, 
the likelihood of adding another piece of mislead- 
ing formalism to those erected by the constitutional 
structure. 

Groups thus narrowly conceived are often as- 
signed a monopoly of initiative in the area of pub- 
lic policy, no nongovernmental actors except for- 
mally organized groups of this sort are considered, 
and, consequently, governmental actors are treated 
as mere referees of group conflict or recorders and 
ratifiers of the outcomes of intergroup contests. 

This oversimplification leads to the error of treat- 
ing a limiting case as if it were the norm. This 
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obvious error occurs with great frequency in the 
literature, despite the fact that even our rough 
knowledge of the distribution of cases of the rela- 
tion between interest group and government dem- 
onstrates that this oversimplification fails to square 
with the evidence concerning a vast range of in- 
stances. A proper understanding of Bentley's con- 
ception would help to avoid the constricting con- 
sequences of these oversimplified formulations. 
Bentley does not use the word “group” altogether 
unambiguously, but he does use it broadly, so that 
it becomes almost the equivalent of any pattern 
of activity. The gap between his own and conven- 
tional denotations may have been too wide, but the 
whole thrust of his theory was away from preas- 
signed weights and roles. Even his distinction be- 
tween underlying groups and representative groups, 
a variant of Marx’s sub- and super-structure, did 
not confine the representative groups to being 
referees and recorders. 

Those who merely add the nongovernmental 
group to the collection of formal structures to be 
explained by analysis may do a different sort of 
disservice. By concentrating on a single institu- 
tional pattern in a descriptive way, they often lose 
sight of the total political process in all its com- 
plexities. In particular, they may ignore the ques- 
tion of variations in power resources, the question 
of differences in the degree of commitment and of 
efficient use of such resources, and the matter of 
the emergence of new groups and the recession 
of established ones. They may also neglect the 
functional similarities, as claimants upon legiti- 
mate authority, among a wide variety of groupings 
extending well beyond those conventionally ac- 
cepted as “pressure groups” or interest groups. If 
the group notion is to prove useful in a theory of 
the political system, it is not likely to come about 
through such a reversion to formalism, 

Among the more promising recent efforts in the 
analysis of political groups have been those which 
have focused on a broad segment of a political sys- 
tem or on the whole of a single system and have 
attempted to include political groups in the larger 
analytical enterprise. The best examples of the 
segmental sort are to be found in recent voting 
studies based on sample surveys, especially The 
American Voter (Michigan, University of ... 1960). 
As part of an extensive examination of the electoral 
decision in the United States, this study includes 
an analysis of the influence of group activity and 
identification on the choices of voters. The imme- 
diate significance of this work is its effectiveness in 
coming to grips with variations in group influence 


within and between elections. These the authors 
trace to a series of factors relating the individual 
to the group and the group to the political world, 
They show that some of these factors are relatively 
stable and some susceptible to considerable short- 
term fluctuation through circumstance or design. 
This work has important implications for a wider 
range of questions. Exploration of the factors af- 
fecting group influence on voters’ choices suggests 
and strengthens hypotheses and intimates lines of 
inquiry concerning the relations between groups 
and points of decision within the formal govern- 
mental apparatus. In particular, the variability of 
electoral influence casts further doubt on those 
conceptions of the relations between group and 
government which give the group a monopoly of 
initiative in all circumstances. 

A second and closely related example that deals 
with groups in the context of one broad segment 
of a political system is Key’s work on public opinion 
in the United States (1961). Working primarily 
from survey data, in this instance data on the struc- 
ture and properties of mass opinion, Key selects 
as a basic objective an analysis of the system of 
“linkages” that may make opinion governmentally 
relevant, including nongovernmental “pressure 
groups.” His findings are largely inferential, but 
they serve to undermine simplistic conceptions that 
categorically assign to groups a coercive monopoly 
in the formation of policy. He shows that such 
power is exercised at most in marginal cases of low 
frequency, and he makes a persuasive case for more 
sophisticated conceptions involving a greater num- 
ber of variables. 

A revived interest in the analysis of political 
systems as such has been a marked feature of the 
period following World War 11. This revival of a 
classic concern is traceable in large measure to a 
high rate of change in numerous political systems, 
which has compelled analysts to re-examine and 
in some instances drastically to recast the organiz- 
ing conceptions with which they previously worked. 
As a result, the activities and functions of govern- 
mentally relevant groups have been placed in ee 
text. The work, though not uniform, has produce 
more general and perhaps more satisfactory con- 
ceptions of governmental systems and at the same 
time has helped to refine views of group activity 
within the systems. The strongest thrust in this 
revival has come from the necessity to come ig 
grips with the developing political systems in the 
newer nations, to which the categories conven an 
ally used in discussing states of the western ai 
pean type were patently inadequate either 


analyzing these less familiar systems or for com- 
paring them with the more familiar types. 

The most influential example of this kind of 
effort is the work of Almond and his associates 
(Almond & Coleman 1960). Building on the accu- 
mulated evidence of the multifunctionality of 
political structures and working on the assumption 
that conceptions of Western systems have over- 
stated the functional specificity of structure, while 
those of the primitive and traditional systems have 
overemphasized diffuseness and lack of differentia- 
tion, Almond develops a set of structures and of 
functions and proposes a comparison of political 
systems in terms of the probabilities of perform- 
ance of specified functions by specified structures 
in specified styles. In discussing the function of 
interest articulation, he identifies four main types 
of interest groups, conceived almost as broadly as 
Bentley's, and suggests a variety of styles in which 
these may perform this function. Although its suc- 
cess has not been definitely established, the scheme 
is useful because it places the analysis of political 
groups in a context which could allow for effective 
comparison of structures, styles, and systems. 

A more concrete example of the renewed inter- 
est in systems as it bears upon the analysis of 
political groups is provided by a recent study of the 
process of governing New York City (Sayre & 
Kaufman 1960). Examining this system as a con- 
test among individuals and groups, including both 
governmental and nongovernmental groups, the 
authors deal comparatively as well as descriptively 
with variations in the degree of involvement, in the 
resources, and in the goals of the group contestants. 
They introduce further and significant probabilistic 
properties into their scheme by emphasizing the 
effects of uncertainty, including imperfect informa- 
tion, upon the actions of participants. The resulting 
conception has considerable analytical power. Al- 
though it may not, without considerable modifica- 
tion, be appropriate to a wide range of cases, it 
bears at least a familial resemblance to other lead- 
ing efforts to deal with groups in the context of an 
analysis of systems. 


In one form or another the analysis of political 
groups has become fixed as part of the study of 
governmental systems and processes. Whether this 
analysis is pursued at the level of the group or at 
the level of the system, the current trends suggest 
that it is unlikely to prove fruitful unless it avoids 
a reversion to formalism and rejects simplistic, 
single-factor schemes of explanation, The promis- 
ing revival of interest in the political system and 
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the associated tendencies toward probabilistic con- 
ceptions point more constructively toward conceiv- 
ing political groups in context and toward develop- 
ing multivariate schemes in which such groups are 
taken into account as factors of varying weight 
and consequence. 

Davin B. TRUMAN 


[See also INTEREST GROUPS; PLURALISM; PUBLIC POL- 
icy; SYSTEMS ANALYSIS, article on POLITICAL SYS- 
TEMS. Other relevant material may be found in 
POLITICAL SCIENCE; and in the biographies of BENT- 
LEY; FOLLETT; MERRIAM.| 


BIBLIOGRAPHY 

ALMOND, GABRIEL A.; and CoLEMaN, James S. (editors) 
1960 The Politics of the Developing Areas. Prince- 
ton Univ. Press. 

BENTLEY, ARTHUR F. (1908) 1949 The Process of Gov- 
ernment: A Study of Social Pressures. Introduction 
by H. T. Davis. Bloomington, Ind.: Principia Press. 

Garceav, OLiver (1941) 1961 The Political Life of 
the American Medical Association. Hamden, Conn.: 
Shoe String Press. 

HERRING, E. PENDLETON 1929 Group Representation Be- 
fore Congress. Baltimore: Johns Hopkins Press. 
Herrinc, E. PENDLETON 1936 Public Administration and 

the Public Interest. New York: McGraw-Hill. 

Herrinc, E. PENDLETON 1940 The Politics of Democ- 
racy. New York: Norton, 

Himscuman, ALBERT O. 1963 Journeys Toward Prog- 
ress: Studies of Economic Policy-making in Latin 
America. New York: Twentieth Century Fund. 

Key, VALDIMER O. Jn. (1942) 1964 Politics, Parties, and 
Pressure Groups. 5th ed, New York: Crowell. 

Key, Vatprmer O, Jn, 1961 Public Opinion and Ameri 
can Democracy. New York: Knopf. 

LaPaLomBana, JosepHG. 1964 Interest Groups in Italian 
Politics. Princeton Univ, Press. 

Lovepay, Peren 1962 Group Theory and Its Critics. 
Pages 3-44 in Peter Loveday and Ian Campbell, 
Groups in Theory and Practice, Melbourne: Cheshire. 

Macnivis, Roy C. 1961 Interest Groups in Comparative 
Analysis. Journal of Politics 23:25-45. 

Meynaup, Jean 1958 Les groupes de pression en 
France. Paris: Colin. 

MICHIGAN, UNIVERSITY oF, SURVEY RESEARCH CENTER 
1960 The American Voter, by Angus Campbell et al, 
New York: Wiley. 

Opecann, Perer H. 1928 Pressure Politics: The Story of 
the Anti-saloon League. New York: Columbia Univ. 
Press. 

Sayre, WALLACE S.; and KaUrMAN, Herpert 1960 Gov- 
erning New York City: Politics in the Metropolis, New 
York: Russell Sage Foundation, 

Scuattscunewen, ELMER E. 1935 Politics, Pressures, 
and the Tariff: A Study of Free Private Enterprise in 
Pressure Politics, As Shown in the 1929-1930 Revi- 
sion of the Tariff. New York; Prentice-Hall. 

ScHATTSCHNEIDER, ELMER E. (1942) 1960 Party Gov- 
ernment. New York: Holt. 

Truman, Davin B. (1951) 1962 The Governmental 
Process: Political Interests and Public Opinion. New 
York: Knopf. 


246 POLITICAL IDENTIFICATION 


POLITICAL IDENTIFICATION 
See IDENTIFICATION, POLITICAL. 


POLITICAL JUSTICE 


Political justice refers to the use of the judicial 
process for the purpose of gaining (or upholding or 
enlarging) or limiting (or destroying) political 
power or influence. It may accompany or confirm 
political or military action, or it may be a substitute 
for such action. Political justice usually involves 
the courts, which may be invoked either by public 
officials or, in those societies which permit open 
competition for political power, by private indi- 
viduals. The party invoking the judicial arm must 
present its demands in a form susceptible of legal 
determination. This party’s allegations in regard to 
facts must be open to incrimination and proof 
under the specific legal system involved. These 
allegations may relate to intrinsically political acts 
or to common crimes in which the criminal is 
charged with political motivation, for example, 
bank robbery to finance revolutionary activity. 
Those bringing the case may reap propaganda 
benefits from the political stature of a person im- 
plicated in offenses of a nonpolitical and even tech- 
nical nature, for example, violation of foreign 
exchange regulations. 

The classical political offense is that of attempt- 
ing to obtain or to damage a position of political 
power by actions incompatible with the rules of 
the existing political system. The state may define 
the offense in such all-embracing terms as those 
defining the early Roman perduellio—“animated 
by a spirit hostile to the commonweal” (Ulpian, 
Digest 48, 4, 11). This offense simply consisted in 
whatever the tribuni plebis were able to put over 
on the people as perduellio (Brecht 1938). Con- 
temporary equivalents are various forms of the 
offense of “endangering the security of the state.” 
This type of protective grant may, of course, be 
Tefined into any number of ad hoc enactments. 

The modern concept of uniform citizenship has 
led toward a more clear-cut definition of civic obli- 
gations, although the extension of the individual’s 
political rights has created many possibilities of 
contest over the boundary between political free- 
dom and political crime, between permissible dis- 
sent and punishable disloyalty, Until the beginning 
of the twentieth century both legislation and judi- 
cial practice in Western Society showed a general 
line of development that increasingly excluded 
peaceful speech and agitation by individuals or 


groups from the category of crime against the state 
or its rulers. 

The political and intellectual climate deteriorated 
in the interwar period. The concept of the inde- 
pendent and unified national state was challenged 
by technological changes, by the increasing in- 
transigence of domestic political deviations, and 
by external threats to the autonomy of the nation- 
state. Furthermore, a number of states were re- 
organized on totalitarian lines far more restrictive 
than had been the case with old-fashioned autoc- 
racies. Consequently, twentieth-century statutes 
have tended to narrow the perimeters of permitted 
political activities, facilitating the prevention or 
detection by the state apparatus of even the most 
incipient stages of hostile organization or propa- 
ganda. Internal and external dangers have given 
rise to concepts such as “demoralization” of the 
army or nation and “danger to the existence or 
independence” of the state. 

Even in fairly open societies, where traditional 
categories of political offenses provide fewer means 
for reproving real or fancied dissent, the state may 
redefine “perjury” or “contempt of the legislature” 
so that the area of legal control extends over politi- 
cal attitudes or actions. Individual and group com- 
petitors in politics may invoke the concept of 
“defamation of character” to intimidate their op- 
ponents or score propaganda advantages. The utili- 
zation of such concepts allows the state or the 
political contestants to compress the most complex 
processes and correspondingly complex attitudes 
into stereotypes of black-or-white, yes-or-no alter- 
natives. The contest between alternatives seems to 
lend itself to court determination and helps to cre- 
ate public images of defense of, or attack upon, 
traditional values. In the liberal constitutional 
framework the value of the political trial depends 
on the psychological and political pressures of the 
moment. Because of the meaningfulness of the iat 
ditional legal institutions and processes, the trial is 
marked by a creative element of risk and unpre- 
dictability, which distinguishes it from an admin- 
istrative command performance. 3 

Outside the realm of liberal constitutionalism 
this creative element of uncertainty is reduced, 
sometimes to the vanishing point. The judiciary is 
likely not only to be connected with the state au- 
thorities of the day by common social premises (a8 
is usual in all societies) but to be directly p l 
ent upon them. The element of uncertainty ae 
appeared altogether in the Stalinist trials of si 
1930s and 1940s. They differed not only foe 
liberal usage but from old-style authoritarian tri 


by integrating allegations and evidence almost com- 
pletely with changing political needs. They pre- 
fabricated the incriminating facts and tried to fit 
them tightly into the hypothesis of treasonable de- 
velopment, which the defendant, now publicly 
cooperating with the prosecution, was assigned to 
represent. The Stalinist trial formula was the ex- 
treme and partly self-defeating culmination of 
political justice, 

In political trials, the decision is officially sought 
in terms of the alleged criminal past behavior of 
the real or fancied political foe. Yet the judgment 
frequently rests on a projection of the defendant's 
alleged past behavior into the future. Criminal re- 
sponsibility for past action is transformed into 
responsibility for hypothetical future happenings. 
The criminal-conspiracy thesis is manipulated so as 
to converge with the conspiracy thesis of the his- 
torical process. 

Political trials of leaders and prominent mem- 
bers of fallen political regimes are a special, though 
by no means novel, category of political justice 
(e.g., Charles 1, Louis xvi, Robespierre). Frequent 
changes of political regime have brought an in- 
creasing number of trials by fiat of political suc- 
cessors. Beginning with the abortive Riom trials of 
the leaders of the French Third Republic and 
reaching a peak in the Nuremberg and Tokyo war- 
crimes trials, judicial arraignments of predecessor 
regimes have become a regular feature of sudden 
political change. In the late 1950s and early 1960s 
such trials took place in Turkey, South Korea, Cuba, 
and elsewhere. They frequently lump together re- 
sponsibility for unsuccessful political action with 
charges of violation of constitutional norms, illegal 
forms of personal enrichment, and inhuman and 
brutal acts, termed “crimes against humanity” in 
the war-crimes language. 

The war-crimes trials were an intensive form of 
predecessor-regime trial, although in their case the 
successor regime was a composite of foreign mili- 
tary victors. These trials added a new and, judging 
by the results, problematic dimension to interna- 
tional political justice: the charge of aggressive 
warfare. This charge raised legal doubts because it 
was newly created after the commission of the 
incriminated acts. Moreover, this charge was some- 
what tainted by hypocrisy, coming as it did from 
states that had either tolerated many of the acts of 
the now fallen culprits or even become their 
accomplices. 

Predecessor-regime trials combine the most prob- 
lematic features of political trials. They invariably 
raise the problems of the ad hoc court (judges 
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selected for the specific purpose ) and the tu quoque 
rejoinder (discussed above). They also suffer from 
procedural shortcomings, restrictions on the free- 
dom of defense, and the lack of any appeals juris- 
diction. They are largely directed toward the 
creation of politically useful imagery. Nevertheless, 
the trials of predecessor regimes carry their own 
justification to the extent that the procedural ir- 
regularities are outbalanced by the weight of the 
intrinsic evils that they bring before the bar. 

The frequency and severity of political trials 
depend on many variables of time and place. Few 
societies have been entirely free of political justice, 
but the continuum extends from the halcyon days 
of Victorian England to the delirium of Stalinist 
Russia. Acute physical threat to the existing order 
—social, political, colonial, or racial—is most likely 
to lead the authorities to invoke the judicial ap- 
paratus frequently; sometimes their foes are in a 
position to invoke it too, and occasionally political 
prosecutions may result from the local interplay 
between informal police violence and judicial pro- 
cedure. Where access to courts is fairly free and 
where witnesses are protected against the cruder 
forms of intimidation and violence, courts exercise 
a dual function. They legitimize official reaction 
against open challenge to constituted authority. 
Also, however, by submitting official action to some 
form of court scrutiny, however mild, they act as 
brakes upon the spread of official violence. 

In the absence of acute physical challenge to 
established authority the likelihood of political 
prosecution is linked to tactical and propagandistic 
considerations. It also depends on the strength or 
weakness of liberal traditions, on ideological judg- 
ments, on the prospects of continued erosion of 
existing authority—as well as on officeholders’ 
calm or fear in the face of events beyond their 
control. 

Any court performing an act of political justice 
rejects the contentions of illegality and prejudice 
and insists on the strict legality of its bases of 
action. But whether courts apply an existing norm 
to appropriately ascertained events or apply a spuri- 
ous norm to “events” perhaps artificially concocted, 
they are influenced by the actions of political 
agencies. Similarly, the actions of the courts can 
support, mitigate, or check those of the political 
bodies. 

The utilization of the apparatus of justice to 
attain, and at the same time to legitimize, political 
goals raises questions of appropriateness, necessity, 
and usefulness. The justification of both goals pur- 
sued and methods applied will remain endlessly in 
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dispute, in general and in each individual case. 
Meanwhile, resort to court proceedings for political 
ends, especially by the state, will be subject to its 
own inherent limitations. 

It is one paradox of political justice that repres- 
sion is most effective when least necessary—that 
is, when the regime attacks small, unimportant, or 
transitory minorities—and least likely to be effec- 
tive when most attractive to the regime, that is, in 
the face of strong, persistent minority opinion and 
organization. It is another basic paradox that the 
value of legal procedure in conveying or enhancing 
the legitimacy of a regime depends largely on the 
degree to which that procedure respects the limita- 
tions on political prosecution, that is, the degree of 
insistence upon evidence of concrete past action, 
the untrammeled introduction and challenge of 
testimony, the freedom of the defense, and the or- 
ganizational and intellectual distance between the 
prosecution and the court. 

OTTO KiRCHHEIMER 


[See also INTERNATIONAL CRIMES; JUDICIAL PROCESS; 
JustIce.] 
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POLITICAL MACHINES 


The term “machine,” when used in a social con- 
text, describes a hierarchic organization whose 
members perform different functions, play various 
roles, and occupy various statuses. It has a leader- 
ship which defines and pursues goals and policies 
with sufficient effectiveness and regularity so that 
it can be compared to a mechanical device. The 
term also suggests efficiency, impersonality, and 
ruthlessness in operation and in the pursuit of 
goals. The analogy between the human organiza- 
tion and the mechanical organization can be elabo- 
rated by pointing to the need of both for fuel, 
lubricant, and an operator, or headman, or boss. 

A political machine is a stable, well-functioning 
party organization headed by a boss or leader or 
by a small leadership group. It may be rural or 
urban. It usually operates on a city or county basis, 
although state-wide machines can also be found. 
It is a group composed in the main of professional 
politicians whose principal objective is the acquisi- 
tion, maintenance, and enlargement of political 
power. 

Development of the concept. In the past (see 
Sait 1933) it was usual to distinguish between the 
political party, the political organization, and the 
political club on the one hand, and the machine 
on the other. The machine was wicked. It sought 
power secretly and through the use of corrupt and 
sometimes criminal methods. Its objectives were 
not the public or party good but private and per- 
sonal gain. Ostrogorskii (1902) promulgated this 
view. For him, machines were legitimate party 
organizations which had been “captured” by mer- 
cenaries. He believed that honest machines were 
not natural. Such views are less common today 
but are still widely held, especially outside scholarly 
circles. They emphasize “ruthless efficiency, bri- 
bery, patronage, graft, and rigging of elections. 

In the past and certainly currently, the term has 
also been applied without such sinister connota- 
tions. This was true in such cases as the “La Fol- 
lette machine” or the “Taft machine.” The neutral 
connotation is more useful, since it makes a greater 
contribution to an understanding of the phenom- 
enon. Thus, any stable, effective, political orga 
zation which has a leadership, a hierarchy, ani 
disciplined members should be considered a politi- 
cal machine. Theodore Roosevelt was correct when 
he observed that the opposition’s orearen a 
habitually called a machine, and its leader 4 we 
or a politician. One’s own is a club or party, sae 
even a faction or a caucus, and its head is invar 


ably a political leader, Sait insisted that “the boss 
generally claims the title of leader and the machine, 
usurping the name as well as the thing, calls itself 
the party organization” (1933, p. 657). Roosevelt 
and other commentators recognized that both lead- 
ers and machines were necessary as well as inevi- 
table. However, machine and boss are generally 
considered by press and people as evil both in 
purpose and in method. 

Bryce considered “machine” as simply a synonym 
for “party organization.” His chapters entitled “The 
Machine” dealt with the hierarchy of party com- 
mittees and conventions (1888, vol. 2, chapters 
60-62). Most contemporary scholarship follows 
this lead. For example, according to V. O. Key, Jr., 
the party consists of all those who consider them- 
selves members, while the “inner core” of the party 
is the “party machine or organization’—“the more 
or less cohesive group held together by the ambition 
to gain power” (1942, see p. 337 in the 1953 edi- 
tion). What the machine does with the power if 
and when it gains it is much less important in 
Key’s approach. The machine may be corrupt and 
vicious, or clean as a hound’s tooth and public- 
spirited to boot. 

Avery Leiserson cogently sums up these attempts 

to distinguish between “political” bosses and ma- 
chines, without principles or ideology, etc., and 
the presumably principled, nonpolitical organiza- 
tions and leaders. With respect to these legitimate 
as against nonlegitimate forms, he concludes: 
As a conception of political organization, the picture 
of party organizations as being necessarily dominated 
more by the motives of private, personal gain than are 
so-called non-political organizations suffers from two 
defects; (1) substitution of a logical or ethical cri- 
terion for empirical, functional analysis, and (2) fail- 
ure to realize that any kind of organization that enters 
the political process commits the “sin” of mixing ideal 
with material motives. If there is a distinctive motiva- 
tion and attitude characteristic of political parties, it 
is the survival and the power of the organization, but 
in this the party is no worse than the church, the 
army, the corporation, or the trade union, (1958, 
pp. 199-200) 


The history of machines. Machines and bosses 
are as old as politics. They have existed in many 
parts of the world. Pericles was a boss; so were 
Caesar, Pompey, and Crassus—and the last three 
constituted the leadership of a “machine.” So was 
Robert Walpole in eighteenth-century England. 
Today Latin American dictators, General Franco of 
Spain, Premier Ky of South Vietnam, and the heads 
of most of the emerging countries could be con- 
sidered bosses. Michels’ “iron law of oligarchy” dis- 
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covered bosses in European socialist parties. Every- 
one is familiar with the description of Nikita 
Khrushchev as a boss. 

Hamilton, Jefferson, and Burr were bosses; and 
their “factions” or “caucuses” were “machines” com- 
posed of members of the gentry. Jacksonian democ- 
racy ushered in an era which required the ordering 
and mobilization of large numbers of newly enfran- 
chised frontiersmen. This was a great opportunity 
for the creators of effective political organization. 
The “gentlemen” were pushed aside by this new 
breed, who brought new skills to the political arena. 
They cared less for niceties and manners than for 
effectiveness and private pelf. The spoils had been 
increased by the multiplication of offices and the 
destruction of the civil service. 

The Jacksonians succeeded in this enterprise 
because of widespread antipathy toward the gentry, 
because their tactics were rationalized by a primi- 
tive democratic theory, and because the attention 
of most of the population was on the conquest of 
the continent. A division of labor was made be- 
tween “the people” and the political machine. By 
mid-nineteenth century, effective, disciplined ma- 
chines were active in most cities. Because of their 
near monopoly of nominations, they became in a 
real sense the governments of the cities. Highly 
organized local politics became the rule, while 
national politics remained chaotic. 

Local machines continued to be characteristic 
in both urban and rural areas at least until after 
World War 11. Since then there has been a decline. 
But in the 1950s Philadelphia, which had formerly 
been a Republican fief, created an old-style Demo- 
cratic organization. 

Why were many of these machines so corrupt? 
Tocqueville noted that Americans believed that 
politics was disreputable. It was considered suitable 
only for those who had failed in private business. 
In those early days the public business was be- 
lieved to offer fewer challenges than private realms. 

American party conflict has been unusually non- 
ideological in character as compared with that of 
parties elsewhere. Issues between parties, especially 
local parties, have tended to be nonexistent or dull. 
Gifted or imaginative men have not been attracted 
to party life, especially if they are genteel. This 
continued to be the case at least until the time of 
Theodore Roosevelt, and even then friends tried to 
cajole him away from dealing with the low fellows 
in politics. 

The apathy of the “decent” person, that is, one 
who shunned politics, was not the only reason for 
abdication of the responsibilities of republican citi- 
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zenship. The machines succeeded to complicate 
government machinery and the electoral process 
to such an extent that many amateurs became 
frustrated. This was even more true of the growing 
number of poorly educated, sometimes illiterate, 
usually non-English-speaking voters. 

The functions of machines. Machines devel- 
oped in part to compensate for the fractionalization 
of governmental authority decreed by the national 
and state constitutions. The wide dispersal of power 
made effective and responsible government diffi- 
cult, if not impossible. The inability of the national 
government to act with energy was a serious matter. 
A similar situation existed at the state and local 
levels. In state governments, long lists of adminis- 
trative officials were elected. In cities and counties 
it was worse. There was confusion, overlapping, 
and duplication between cities and counties, and 
also among the myriad school districts and special- 
purpose governments. 

Thus, frequently when important problems oc- 
curred and the people or some interests demanded 
firm, quick action, no one had adequate authority 
to act. The machine filled this lack. It was the only 
available means of redressing the rigidities of for- 
mal government structure. Structure, in the form 
of law, was stifling necessary function. The ma- 
chine volunteered to redress the balance. 

This process was accelerated by the rapid growth 
of cities as the industrial revolution transformed 
an agrarian land into an embryonic urban society. 
By 1890 the population of urban centers had in- 
creased sixfold while the rural population had 
only doubled. The crazy-quilt system of local gov- 
ernment which had already demonstrated its inade- 
quacy could not begin to meet the grave new 
challenges. Then, as now, the new urbanites were 
not prepared for their changed life—nor were their 
governments and their public officials, 

City life of those days was more harsh and more 
cruel than it is today. Many immigrants were terri- 
fied. Even native-born, rural-reared Americans 
found adjustment to the impersonal and dangerous 
city difficult. Both needed and wanted help. 

Middle-class people, especially businessmen, also 
had needs: streets, sewers, police and fire protec- 
tion. Building permits and other licenses were re- 
quired. Protection was wanted from sometimes 
debilitatingly rigorous administration of regulatory 
ordinances. Valuable franchises for public utilities 
and mass transportation were desired, Many fled 
to the suburbs in order to escape corruption, boss 
rule, and high taxes, surrendering the city to the 
machine and the poor. 

Criminal and twilight enterprises require per- 


mission and protection: bookmaking, gambling, 
prostitution, horse and dog races. The best available 
broker for the criminal and the racketeer was the 
corrupt machine and its captive police force. The 
quid pro quo was not only money but also votes 
and manpower on election day. Chicago bosses 
“Hinky-Dink” Kenna and “Bathhouse” John Coughlin 
had a horde of pimps, prostitutes, and their cohorts 
regularly available on election day. 

Mass immigration was the source of an impor- 
tant part of the population growth. The majority 
of immigrants settled in the cities, which became 
ethnically heterogeneous, so that the machines 
were able to appeal to national pride and prejudice. 
Often the immigrants received nearly instant citi- 
zenship and franchise. Because of their inability 
to speak English, because of their poverty and 
illiteracy and lack of experience with voting, their 
votes were often purchasable. The machines had 
the manpower, experience, incentive, and effron- 
tery to manipulate the new citizens. 

The task of the machines was made easier by 
the legacy of Jacksonian democracy and populism. 
Aided by the loose national governmental structure, 
the spoils system had become institutionalized. So 
had the long ballot. Only a few people had the 
energy, time, and knowledge to comprehend the 
complexities of the electoral process and the unrea- 
sonable burden of seriously evaluating dozens of 
candidates in every election. Being unable to do 
this, many voters were prepared to be assisted by 
the willing machines. 

In a society in which there is rapid and dramatic 
concentration of wealth coupled with a developing 
tradition of an ever-widening franchise, it is inevi- 
table that the wealthy attempt to control the grow- 
ing number of enfranchised poor. Ethical restric- 
tions could not stand in the way of furthering this 
goal. Machines were used because they possessed 
the skills and resources to control the multitudes. 

In part, machine politics developed for these 
reasons. In a deeper sense, however, the growth 
of machines was inevitable. Organization is Te 
quired in all complex undertakings, and in any 
organization there is a tendency for a few to dom- 
inate, no matter how many may seem to ae 
pate. This tendency is exacerbated by the lack o! 
effective, integrated leadership which can anus 
coherent policies and viable government and Mie 
the people can hold accountable. Because this wt 
of leadership was absent and unavailable in ie 
formal structure, it had to be supplied from 7 
outside as the only alternative to chaos. Leaders} A 
thus went to the party organization, the politic 
machine, and the boss. 


Corruption and reform. The political, social, 
economic, and financial costs of corrupt machines 
are incalculable but enormous, whatever their in- 
evitability or functional utility. The cost in terms 
of the “degradation of the democratic dogma” can- 
not be measured. Tens of millions of Americans 
have developed cynical attitudes toward the myths 
of our political system, expressed by “You can’t 
fight city hall.” The low prestige of politics and 
politicians is another result, since it has served to 
keep many potentially able people away from the 
public arena. The civil service has been degraded. 
The United States lagged fifty years behind Euro- 
pean countries in adopting public welfare and so- 
cial insurance programs. The physical city and the 
quality of life therein have deteriorated. Few cities 
have been specifically planned, and thus lack parks, 
beaches, and other public amenities. Inadequate 
schools, inadequate fire protection, inadequate 
and/or brutal police service, the development of 
slums, lack of minimum standards for housing, 
neglect of tenements, rent gouging, lack of con- 
sumer protection, wholly inadequate hospital and 
public health services, an almost total lack of 
protective labor legislation, and long delays in legis- 
lation for the protection of women and children— 
all have been caused by rapacious machines. 

From the early days, attempts have been made 
to bring about structural changes to improve the 
quality, tone, and probity of government. Most of 
these have not succeeded. There is evidence that 
good government can come into being without 
structural changes and that bad government can 
continue despite great changes in form. On the one 
hand, high-quality leadership by the mayor and 
city council can rectify some systems considered 
wretched; on the other hand, wily bosses have suc- 
cessfully circumvented reform. 

The major reforms attempted included the fol- 
lowing: the nonpartisan election; the commission 
form of government; the council-manager form of 
government; the election for councilmen-at-large; 
the proliferation of the merit system at all levels; 
the initiative, referendum, and recall; the Austral- 
ian secret ballot; the short ballot; centralizing 
power and responsibility at state and local levels, 
including the state government reorganization 
movement and the public administration move- 
ment. [See LOCAL POLITICS.] 

The decline of the machine. In recent years, 
machine politics has been in decline. Local sources 
of machine power have been significantly eroded. 
A study of the 1956 national elections in the United 
States reported that only 10 per cent of its respond- 
ents had been contacted by party workers. If only 
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residents of nonsouthern cities of over 100,000 
population are considered, this proportion increases 
to nearly 20 per cent. But even this figure is a 
modest one for an effective machine (Greenstein 
1963, p. 40). 

The reasons for this decay are in part conjec- 
tural. There has been a great reduction in tradi- 
tional rewards of money, perquisites, and jobs of- 
fered by the machine to the poor and the ethnic 
minorities. Some of this is doubtless due to accre- 
tions of various reforms and the loss of patronage. 
There has also been a vast increase in govern- 
mental social welfare programs. And the dramatic 
increase in the general standard of living has re- 
duced the value of the rewards that the machine 
is still able to offer. The number of persons so 
attracted has declined as the poor have decreased. 
Material security has altered attitudes toward the 
self, government, and society. “Recognition” and 
other emoluments no longer seduce blocs of voters 
as they once did. Boss control was further weak- 
ened as the intensity of group identification dimin- 
ished and as many old ethnic neighborhoods 
changed. 

The rise in strength and influence of trade 
unions, with their strong pension, health, and wel- 
fare plans, has further reduced the value of boss 
rewards to yet another clientele. Moreover, union 
members receive political directions from union 
political action committees which may rival polit- 
ical machines in resources and manpower. Gen- 
erally the electorate is now better educated and 
the mass media offer much political information, 
increasing the sophistication of the electorate. 

In some places machines have been brought 
down from the outside. In New York, Carmine 
DiSapio was unseated in this way, and the forces 
of Eisenhower and Goldwater each defeated the 
“regular” Republican machines in enough states 
and localities to win presidential nominations by 
landslides. An important element in these defeats 
of the machine was the availability of what the 
constituents accepted as attractive candidates. In 
addition, especially in the Goldwater campaign, 
control of party machinery was wrested from the 
regulars. 

Reformers in New York and California have dis- 
played more political insight and greater willing- 
ness to “play” organizational politics than their 
predecessors did, Although the new reformers 
abhor bosses and machine politics, they have 
worked within the parties and have challenged the 
machine at its own game of face-to-face voter per- 
suasion. 

Machine politics still survives. There remain large 


252 POLITICAL MOVEMENTS 


pockets of poverty amid American affluence where 
conditions are similar to those which accompanied 
the rise of the machines. By various estimates 17 
per cent or 20 per cent or 25 per cent of the popu- 
lation is poor. Negroes and Puerto Ricans make 
up a significant portion of the poor, and their 
problems are somewhat similar to those of the 
earlier waves of immigrants. Both groups have 
made important, though insufficient, gains in polit- 
ical representation. The power that Congressman 
Adam Clayton Powell had in New York and the 
increasing number of Negroes in Congress testify 
to this. In California and the Southwest, citizens of 
Mexican origin are similarly increasing their polit- 
ical activities. 

Machine politics is not yet dead, even in the 
invidious sense. There have been major transfor- 
mations; there will be more. But the need for organ- 
izations, for leadership, and for political responsi- 
bility has increased in the contemporary world. 
Some promising new organizational forms are de- 
veloping. They coexist side by side with the remain- 
ing weakened and modified older forms and with 
the still developing structures in the troubled 
Negro, Puerto Rican, and Mexican-American ghet- 
tos. Perhaps we are now wiser than we were fifty 
years ago. Perhaps we can devise structures that 
will permit access and integration for those groups 
which are still dispossessed, without paying the 
enormous price we have paid for ineffective and 
often venal local governments. 

ALEX GOTTFRIED 


[Directly related are the entries Crry, especially the 
article on METROPOLITAN GOVERNMENT; LOCAL POL- 
ITICS; POLITICAL CLUBS. Other relevant material may 
be found in Community; LOCAL GOVERNMENT; 
NONPARTISANSHIP; OFFICE, MISUSE OF; PARTIES, 
POLITICAL. ] 
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POLITICAL MOVEMENTS 
See the guide under SOCIAL MOVEMENTS. 


POLITICAL ORGANIZATION 
See under POLITICAL ANTHROPOLOGY. 


POLITICAL PARTICIPATION 


In this article the term “political participation” 
will refer to those voluntary activities by which 
members of a society share in the selection of 
rulers and, directly or indirectly, in the formation 
of public policy. The term “apathy” will refer to 
a state of withdrawal from, or indifference to, 
such activities. These activities typically include 
voting, seeking information, discussing and prose 
lytizing, attending meetings, contributing finan- 
cially, and communicating with representatives. 
The more “active” forms of participation include 
formal enrollment in a party, canvassing and ee 
tering voters, speech writing and speechmak n 
working in campaigns, and competing for public 


and party office. We shall exclude from this dis- 
cussion such involuntary activities as paying taxes, 
serving in the armed forces, and performing jury 
duty. 

Why do social scientists study political partici- 
pation? To begin with, participation is an ingred- 
ent of every polity, large or small. Whether the 
society is an oligarchy or a democracy, someone 
must make political decisions and appoint, uphold, 
and remove leaders. Those who fail to participate, 
whether out of neglect or exclusion, are likely to 
enjoy less power than other men. Although not 
all who participate possess effective power, those 
who do not participate cannot exercise or share 
power. 

As these observations imply, the right to partici- 
pate is an essential element of democratic govern- 
ment, inseparable from such other attributes of 
democracy as consent, accountability, majority 
rule, equality, and popular sovereignty. Indeed, the 
growth of democratic government is in part meas- 
ured by the extension of the suffrage and the 
correlative rights to hold office and to associate for 
political purposes. Whereas traditional monarchies 
restrict power and participation largely to the no- 
bility and their agents, democracies have in prin- 
ciple transformed these prerogatives into rights 
enjoyed by everyone. 

This expansion of participation was partly stim- 
ulated by the desire to give meaning and force to 
the principles of consent, accountability, and po- 
litical opposition. Participation is the principal 
means by which consent is granted or withdrawn 
in a democracy and rulers are made accountable 
to the ruled. Since men can be equal and free only 
if they share in the determination of their own 
affairs, participation has been viewed as a means 
for realizing these democratic objectives as well. 
[See Democracy.] 

From Aristotle to John Dewey, political philos- 
ophers have extolled popular participation as a 
source of vitality and creative energy, a5 4 defense 
against tyranny, and as a means of enacting the 
collective wisdom. By involving the many in the 
affairs of the state, participation should promote 
stability and order; and by giving everyone the 
opportunity to express his own interests, it should 
secure the greatest good for the greatest number. 
The community should gain, furthermore, by 
drawing upon the talents and skills of the largest 
possible number of people. Some philosophers 
have claimed, in addition, that participation bene- 
fits the participants as well as the larger com- 
munity. It ennobles men by giving them a sense 
of their own dignity and value, alerts both rulers 
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and ruled to their duties and responsibilities, and 
broadens political understanding. 

Ought participation, however, to be open to all, 
or restricted to those who know how to use it 
wisely? On this question, political philosophers 
and statesmen have often disagreed. Some have 
maintained that men are not equally worthy of 
being consulted about their opinions. Some have 
held that men should be excluded because of caste, 
race, religion, poverty, or other presumed marks 
of irresponsibility and moral deficiency. Some have 
argued that men who lack property or education 
have no “stake in society,” are likely to be swayed 
by demagogues, and will use the opportunity to 
participate merely to register their envy and re- 
calcitrance. 

Arguments like these lay behind the exclusion 
of slaves and aliens from Athenian democratic 
processes; of commoners and Jews and other non- 
Catholics from participation in medieval princi- 
palities; and of Catholics from governments formed 
in the wake of the Reformation. Even the Ameri- 
can constitution, as originally framed, implicitly 
sanctioned prohibitions against voting or holding 
office for reasons of race, property, religion, or sex. 
At the present time, the suffrage in some parts of 
the United States is, in law or in fact, denied to 
aliens, illiterates, paupers, criminals, Indians, and 
Negroes. Nevertheless, as democratic institutions 
have advanced, the trend has been for such bar- 
riers to be dropped. 

Despite their historical association, widespread 
participation is not peculiar to democracy. Even 
greater emphasis is placed upon participation by 
the modern mass dictatorships, both communist 
and fascist. Their desire to involve every citizen in 
political affairs is evident not only in their efforts 
to achieve unanimous voting in elections but also 
in their organizaton of the masses into an elaborate 
network of youth groups, mass parties, trade 
unions, people’s councils, cooperatives, recreational 
and cultural societies, study circles, conferences, 
rallies, parades, demonstrations, and staged mass 
demonstrations. 

Whether these efforts represent participation in 
the sense in which we have been using that term 
is open to question. Much of this activity is invol- 
untary, and none of it is designed to allow the 
masses to wield influence over policy or the selec- 
tion of rulers. On the contrary, it is manipulated 
by the self-appointed political elite so as to exploit 
the democratic mystique, make the masses acces- 
sible to the regime’s propaganda, harness their 
energies in the building of the state, and lend the 
regime an appearance of legitimacy. Thus, the 
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attempt to involve the people in the achievement 
of the state’s programs is calculated to strengthen 
the dictatorship rather than to bring it under popu- 
lar control. [See DicTATORSHIP.] 


Nature and frequency of participation 


Despite its importance to democracy, the right 
to participate is not exercised by all who possess 
it. The number of nonparticipants varies with 
time, place, and circumstance, and also with the 
type of participation. 

More people discuss politics than vote, and many 
more vote than join parties or work in campaigns. 
Approximately three-fourths of the Americans sur- 
veyed during recent national elections reported that 
they had discussed the campaign with other people. 
The number of eligible Americans who have ac- 
tually voted in presidential elections since World 
War u has ranged from 51.5 per cent in 1948 to 
63.8 per cent in 1960 (U.S. Bureau of the Census 
1965, p. 384). Turnout in off-year elections to the 
House of Representatives during the same period 
has ranged from 37.6 per cent in 1946 to 48.9 
per cent in 1962 (U.S. President’s Commission . . . 
1963, p. 66). Approximately one-fourth of the elec- 
torate professes to have proselytized others on be- 
half of one of the parties or candidates (Michigan 
. .. 1960, p. 91). Some 15 per cent say they have 
worn a campaign button or displayed a sticker 
(Milbrath 1965a, p. 19), and almost the same 
number claim to have written or wired their con- 
gressmen or senators on some occasion (Public 
Opinion: 1935-1946 1951, p. 703). An even smaller 
number (roughly 10 per cent) claim to have con- 
tributed campaign funds (Alexander 1966, p. 69), 
while 7 per cent assert that they have attended po- 
litical meetings, rallies, or dinners (Michigan... 
1960, p. 91). From 3 to 5 per cent maintain that 
they participated actively in recent national cam- 
paigns (Lane 1959, p. 53), while only 2 or 3 per 
cent claim membership in political clubs or organ- 
izations (Michigan . . . 1960, p. 91). The number 
who run for political office or hold influential posts 
in the parties is, of course, even smaller—only a 
fraction of 1 per cent. 

Even these figures are probably inflated, for 
more people tend to report political activity than 
actually engage in it. The figures cited may also 
vary, depending upon the time of polling (e.g., 
whether during or between campaigns) and the 
type of election; more people vote in national than 
in local elections, in presidential than in off-year 
elections, in partisan elections than in primaries, 
and in elections involving candidates than in ref- 
erenda involving issues, 


Political interest and awareness are related var- 
iables that are even more difficult to assess than 
participation, because their manifestations are less 
overt. Roughly one-third to one-half of the respond- 
ents surveyed during the 1956 election had no 
opinion or information on 16 well-known political 
issues (ibid., p. 174). Approximately one-third of 
the adult public expressed little or no interest in 
the 1948 and 1952 elections, while another third 
were only “moderately” or “somewhat” interested 
(Berelson et al, 1954; Campbell et al. 1954). Even 
fewer have the interest or capacity to arrive at a 
coherent set of opinions (Converse 1964), 

The attentive public, thus, is distinctly a mi- 
nority. As Bryce observed ([1921] 1931, vol. 2, 
pp. 157 ff.), only a small group gives constant at- 
tention to politics, a slightly larger group is in- 
terested but comparatively passive, while the mass 
of men are largely indifferent. A contemporary 
writer, Milbrath (1965a, pp. 16-22), employing a 
similar classification, stratified the electorate into 
“gladiators” (the small number of party actives 
and officeholders), “spectators” (who seek informa- 
tion and vote), and “apathetics” (who participate 
only passively, if at all), Although all such classi- 
fications are arbitrary, they point up the tendency 
for various forms of political involvement and in- 
difference to cluster: people who engage in one 
of the more “active” forms of participation (say, 
canvassing ) are inclined to be active in other ways 
as well (say, the collection of money); whereas 
people who habitually fail to vote will usually avoid 
such other minimal activities as reading the politi- 
cal news. Milbrath suggests that the various forms 
of political involvement fall into a hierarchy or 
continuum according to the cost in time and effort 
that each demands. But other variables, such as 
political articulateness, saliency, and interest— 
which can only partially be reckoned into the 
“costs”—also affect the frequency of the various 
forms of participation. 

These patterns of political activity are also sub- 
ject to considerable variation. Many people vote 
habitually, but others vote only intermittently. 
Although reliable figures are not available, there 
is considerable fluctuation from one election to 
another among those who participate actively, as 
candidates, issues, and political conditions change. 
A party that is out of power or that has little Kora 
of winning elections is particularly vulnerable a 
such shifts. Turnover seems to be especially pest 
at the lower party levels, among political amateurs 
and volunteers, who have neither office nor emolu: 
ments to sustain them. 
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Variations are also to be found in the rates an 


patterns of participation from one era to another 
and from one nation to another. In the United 
States, participation appears to have declined 
sharply at the end of the nineteenth century. Al- 
though turnout for presidential elections in the 
decades following the Civil War had averaged more 
than 75 per cent, the rate fell steadily during the 
early years of the present century to an average of 
51.7 per cent in the 1920s, rose again in the 1930s, 
and in the 1960s averaged slightly above 60 per 
cent (Burnham 1965; U.S. Bureau of the Census 
1965), Even within the United States, regional 
variations in participation are great in any given 
year: in 1960, for example, the turnout ranged 
from 30.4 per cent in Georgia to 80.7 per cent in 
Idaho (U.S. President's Commission . . . 1963, 
p. 67). 

Equally striking are the differences in the rates 
of political activity from one country to another. 
Despite the greater incidence of education, urban- 
ization, affluence, and mass communication, turn- 
out in the United States has generally been lower 
than in other modern democracies. Turnout in 
Italy and Belgium in the years since World War 11 
has approximated 90 per cent; in Denmark, West 
Germany, and Great Britain, 80-85 per cent; and 
in Canada, Norway, Finland, and Japan, 70-80 
per cent. The few cross-national studies conducted 
so far indicate, however, that despite the low turn- 
out, other indexes of participation—political in- 
terest and awareness, expressed party affiliation, 
sense of political competence, etc.—tend to be 
higher in the United States than in many other 
countries, such as France and Italy (U.S. Presi- 
dent’s Commission . . . 1963; Almond & Verba 
1963; Lipset 1960). 

These findings suggest that the interrelations 
among the several forms of participation found in 
the United States are not universal. No one has 
yet discovered any overarching principle to explain 
the varying ways in which these phenomena are 
ordered in different countries. Some of this varia- 
tion is surely due to the familiar determinants of 
participation (education, access to information, 
etc.). But important weight must also be given to 
factors peculiar to individual countries—traditions, 
history, access to the governing institutions, the 
particular forms of political competition, or, as In 
Italy, special inducements to vote. The systematic 
investigation and weighting of these factors in 
various countries remains one of the urgent re- 
search tasks. 

So far we have treated participation as though 
it were a unidimensional variable. This may be a 
questionable assumption, however, for although all 
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instances of nonvoting are overtly the same, they 
may spring from different motivations ard may 
represent quite different acts. Thus, in general, 
two classes of apathetic individuals can be dis- 
tinguished: those who fail to participate out of 
political indifference, exclusion, or incapacity; and 
those who consciously choose not to participate. 
Although no precise information is available as to 
their frequency, the first is plainly the larger of 
the two classes. It includes the habitual nonvoters 
who have little knowledge of issues or candidates 
and are mystified by political events. Rarely are 
they able to connect what happens “out there” with 
the events of their own lives. Apathy of this type 
abounds among the uneducated, the inarticulate, 
the parochial, the isolated, and those who occupy 
roles in which political passivity is perceived as the 
norm, e.g, women in political systems heavily 
dominated by men (Bell et al. 1961; Lipset 1960). 
As these findings signify, political participation is 
not “natural,” but must be learned; and for learn- 
ing to occur, one must have capacity, motivation, 
and opportunity. In some strata of the society, all 
three preconditions are missing. 

The second class of apathetics, though small, 
is far more diverse. It includes those who disdain 
politics because it seems to them self-serving and 
corrupt. Some adopt this view as a projection of 
their own hostility or dissatisfaction with their own 
lives; others, out of misplaced idealism and the in- 
evitable disappointment with human imperfection; 
some, out of a generalized cynicism toward man- 
kind and all his arrangements; and some, merely 
because they are prey to prevailing stereotypes. 
Still others are disenchanted either because the sys- 
tem serves them badly or because politics does not 
seem to them sufficiently “meaningful.” Some— 
the “realists”—have concluded that their chances 
of influencing the gigantic and remote political 
system are too slight to warrant the investment of 
time and energy. Others believe that the system 
offers no genuine alternatives and that all efforts 
to change the outcomes are idle and self-deluding 
(Almond & Verba 1963; Campbell in Rokkan 
1962a; Erbe 1964; Rosenberg 1954). Still others, 
while aware of politics and convinced of its im- 
portance, simply find the entire subject dull. 

Nor are the forces that lead to withdrawal in 
all respects “negative.” In certain subgroups of the 
society, apathy is positively reinforced. Among 
those who have attempted to set up their com- 
munities outside the prevailing culture—for ex- 
ample, certain “bohemian” subcultures—conven- 
tional political activities are frequently regarded as 
foolish and unbecoming, while political indiffer- 
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ence is esteemed. The same holds, though less 
consciously and articulately, for certain deprived 
minorities, who perceive participation as useless, 
dangerous, or as an affectation. For them, the 
“affirmative” act is to express one’s contempt by 
withdrawing. [See ALIENATION; RADICALISM.] 


Explaining participation 

Social scientists aim to develop general theories 
of human behavior that will account for as many 
relevant facts as possible with the smallest number 
of assumptions and explanatory variables. So far 
no general theory of participation even approaches 
this ideal. Participation appears to be a complex 
phenomenon that depends on a great many vari- 
ables of different relative weights. This does not 
necessarily mean that no conceptual model can 
be employed to explain it. For one can at least 
group the relevant independent variables into those 
influences which are essentially internal (psycho- 
logical and cognitive) and those which derive from 
the individual's external environment, social and 
political. Any model employing these broad cate- 
gories is bound to be oversimplified. One may 
nevertheless reason that (a) individuals are em- 
bedded in a matrix of social forces (status, educa- 
tion, religion, etc.) that orients them toward or 
away from political participation; (b) in addition, 
characteristic differences in drives and capacities 
will cause individuals to vary in their readiness 
to respond to political stimuli; and (c) the degree 
to which these social and psychological predis- 
positions find expression as political activity de- 
pends partly on the nature of the political environ- 
ment itself—including the political structure and 
institutions, the party system, and the pattern 
of political values and beliefs. The three sets of 
variables are closely linked and intermingled. A 
change in any of them can, therefore, increase or 
decrease participation, and an analysis based on 
only one of them is bound to be misleading and 
incomplete. 

The social environment. The e!2ments that 
compose the social environment include education, 
occupation, income, age, race, religion, sex, mo- 
bility, and residence. Research in the United States 
and elsewhere shows that most of these variables 
correlate to some degree with participation. In 
general, participation tends to be higher among 
the better-educated, members of the higher occu- 
pational and income groups, the middle-aged, the 
dominant ethnic and religious groups, men (as op- 
posed to women), settled residents, urban dwellers, 
and members of voluntary associations. 


It should be emphasized, however, that the cor- 
relations between participation and some of these 
variables are low and unstable and that they may 
vary from one cultural—political context to another, 
Thus, education and socioeconomic status and par- 
ticipation correlate strongly in the United States 
but weakly in Norway. Urban-rural differences in 
participation occur in some elections but not in 
others. City dwellers, with their increased exposure 
to mass media, higher education, and greater pre- 
disposition to form voluntary associations, charac- 
teristically participate more than those who live 
in rural communities in the United States; yet- 
some farm states, such as Idaho, Utah, and South 
Dakota, have significantly higher turnouts than 
some industrial states, such as California, New 
York, and New Jersey. Likewise, in some countries 
that have long traditions of communal leadership 
or cooperative forms of agricultural organization, 
participation is greater in rural than in urban 
areas—e.g., Japan, France, Arab villages in Israel, 
and parts of Scandinavia (Lipset 1960; Milbrath 
1965a). Church attendance correlates positively 
with political participation in the United States, 
scarcely at all in Great Britain, and negatively in 
Germany and Italy. Some ethnic minorities in the 
United States (e.g., Negroes) have very low turn- 
out rates, while others (e.g., Jews) have among 
the very highest. y 

These deviations suggest several observations 
about the relation of the social environment to 
participation. First, the variables in this category 
are so broad as to be fairly limited in their ex- 
planatory power. Often they represent configura- 
tions of more exact and dynamic variables, but 
the configurations are not always identical. Po- 
litical apathy of Negroes in the United States re- 
flects in large measure not just their position as 4 
minority but also their status as a deprived minor- 
ity—poor, uneducated, rural, parochial, e 
this, in turn, has gradually led to the widesprea 
acceptance by both whites and Negroes onthe 
Negro’s role as that of nonparticipant. At the other, 
end, the greater participation of high-status groups 
in most societies is due partly to their superior 
education and partly to their enhanced oppor 
ties to acquire firsthand knowledge of and ae 
fluence politics by greater access to political lea 5 
and to the sources of political decision making. r 

Since the relevant variables are subject to m- 
teraction effects, the same demographic pa 
may have dramatically different conse ee 
different political—cultural contexts. Social fal 
differences, for example, may signify power 


inequalities in one culture and trivial differences 
in another. The disparity in the correlations be- 
tween occupational level and participation in Nor- 
way and the United States, for instance, results 
partly from the greater political and ideological 
organization of the Norwegian working class than 
the American working class (Campbell & Valen 
1961). The inverse relationship between correla- 
tions for church attendance and voting turnout for 
the United States and Italy reflects primarily the 
differences in education and economic status of 
churchgoers in the two countries. 

Despite these qualifications, categoric variables 
are not only heuristically useful but sometimes can 
actually influence participation. Thus, education 
offers high and reliable correlations with participa- 
tion, partly because it helps to develop a sense of 
civic duty, political competence, interest, and re- 
sponsibility, as well as personality characteristics 
of self-confidence, dominance, and articulateness. 
Furthermore, the schools themselves serve as set- 
tings in which the skills of participation are ac- 
quired: one learns to join organizations, fulfill 
duties, participate in meetings, discuss broad social 
questions, and organize to achieve group goals. 
Finally, the more educated are better able to trans- 
mit their political interest and knowledge to their 
children and, hence, to perpetuate the relation- 
ship between education and participation. 

In the example of education we thus see at work 
three of the most powerful influences affecting 
participation: articulateness, sensitivity to one’s 
self-interest, and effective socialization by the po- 
litical culture. While these influences bear even on 
differences in voting frequency, they are espe- 
cially significant in differentiating the politically 
active from ordinary citizens. These same factors 
also help to explain the findings on participation 
yielded by research on other categoric variables. 
The higher turnout among, say, Congregationalists 
than Baptists is a function, in part, of their greater 
articulateness. The greater activity and ideological 
solidarity of higher-status occupations can be 
traced in part to their superior capacity for per- 
ceiving the relation of their own interests to gOov- 
ernmental decisions. The higher frequency of vot- 
ing among the middle-aged compared with young 
adults reflects the need for time and experience in 
order for politicization to take effect; and the same 
principle applies to the greater turnout and ac- 
tivity of settled residents as against transients. The 
greater political activity in the cities, compared 
with rural areas, testifies to a higher level of artic- 
ulateness there. The greater political activity of 
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members of voluntary associations reflects in some 
measure the effects of all three influences: articu- 
lateness, awareness of self-interest, and greater 
exposure to the agencies of socialization. 

Through such variables as these, and through 
other general constructs of a more dynamic nature 
(e.g., power, influence, motives, pressures, drives ), 
we may eventually succeed in developing models 
that refine demographic variables into their appro- 
priate units, equivalent over time and across cul- 
tures. When these components are isolated and 
appropriately weighted, predictions about participa- 
tion under varying conditions should become much 
more accurate. [See SOCIALIZATION, article on PO- 
LITICAL SOCIALIZATION, | 

Psychological variables. Participation survives 
by virtue of its capacity to provide rewards for 
those who engage in it. Political observers through- 
out the ages have variously attributed man’s po- 
litical activity to his need for power, competition, 
achievement, affiliation, aggression, money, pres- 
tige, status, recognition, approval, manipulation, 
sympathy, responsibility—in short, to virtually 
every need that impels human behavior. Unfortu- 
nately, systematic data on the relative frequency 
and influence of any of these motives are extremely 
sparse. Nor do we know whether political participa- 
tion gratifies certain needs that are not satisfied by 
other kinds of endeavor, or whether the motives 
that induce active or psychologically costly forms 
of participation (such as managing a campaign) 
differ from those which prompt simpler, less ardu- 
ous activities like voting. 

In addition to the active versus passive distinc- 
tion, one may classify participation in terms of its 
goals: i.e., instrumental as opposed to consumma- 
tory or expressive (Davies 1963; Milbrath 1965b). 
Instrumental political activities are primarily ori- 
ented toward concrete goals, such as party victory, 
the passage of a bill, or the enhancement of one’s 
own status, influence, or income. Consummatory 
or expressive activities are aimed at more imme- 
diate satisfaction or release of feeling: thus, voting 
may be a consummatory rather than an instrumen- 
tal act for those who care less about the outcome 
than about the positive feeling they get from cast- 
ing their ballots; like parading or saluting the flag, 
it may become a ritualistic form of rewarding be- 
havior. For other people who are mainly concerned 
with achieving certain goals, the same act of voting 
would be instrumental. Most behaviors embody 
both instrumental and consummatory purposes to 
some degree. 

In general, psychological variables may be 
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thought of as those stemming from individual per- 
sonality traits (whether primarily constitutional- 
genetic or primarily learned) and from cognitive 
structures, which in this case represent certain 
characteristic ways of conceptualizing the self and 
the surrounding sociopolitical world. [See PERSON- 
ALITY, POLITICAL.] 

Available evidence suggests that a number of 
the more basic or genotypic personality traits— 
rigidity, guilt neurasthenia, intolerance of ambig- 
uity, manic-depressive tendencies, manifest anx- 
iety—do not correlate highly with political par- 
ticipation. The weakness of these relationships is 
particularly evident at the more passive end of the 
participation continuum. Thus, voters scarcely dif- 
fer from nonvoters on the traits mentioned above. 
However, apathetic individuals tend to be slightly 
more aggressive and paranoid than voters. In gen- 
eral, the more active participants exhibit less hos- 
tility than the general population, except in the case 
of activists who belong to extreme or messianic 
movements. Participants in such movements, which 
aim at quick and drastic refashioning of the world, 
are frequently motivated by rage and paranoia and 
find that participation gives them a legitimized con- 
text for discharging their aggression (Almond 
1954), 

Although basic personality dimensions such as 
guilt and rigidity do not adequately distinguish par- 
ticipants from nonparticipants, they do differen- 
tiate somewhat the less active from the more active, 
the inactives exhibiting these traits in greater meas- 
ure. The correlations are not high, but one would 
not expect them to be. Participation is so complex 
a phenomenon that the connection between any 
particular activity (e.g., voting) and any source 
trait (e.g., rigidity) is bound to be extremely tenu- 
ous. Then, too, for many people—especially the 
more passive participants—the psychological in- 
vestment in politics is so slight that one would be 
surprised to discover that deep-seated motives were 
attached to a given activity. Furthermore, the “dis- 
tance” between a basic personality trait and a 
specific manifestation of political activity is too 
great and the route between them too circuitous 
for the one to be directly engaged by the other. 
Nevertheless, correlations do turn up between cer- 
tain personality traits and participation that appear 
to be due mainly to the impairment of social func- 
tioning induced by personality disturbances, An 
individual who scores high on measures of para- 
noia, inflexibility, guilt, hostility, and so on will 
ipso facto function less effectively in many social 
contexts. He will be less able to perform tasks that 


require accurate appraisals of reality and may fnd 
threatening such political activities as organizing, 
deciding, bargaining, interacting, cooperating, de- 
bating, and proselytizing. 

Similarly, research has shown (Gough et al. 
1951; Milbrath 1965a) that personality traits 
which are particularly influenced by social learn- 
ing—such as dominance, social responsibility, and 
self-confidence—are positively associated with po- 
litical participation. It is not so much that these 
signs of ego strength are sufficient to inspire polit- 
ical participation, but that individuals who lack 
them are more likely to avoid active involvement. 

Although the correlations between: psychological 
traits and participation are modest, their direction 
largely refutes much of the folklore about political 
practitioners as highly ambitious, exhibitionistic, 
“folksy,” narcissistic, driven, enthusiastic, material- 
istic, authoritarian, and power-hungry. Research in 
the United States (very little has been conducted on 
the subject elsewhere) indicates that politicians 
and other active participants do not possess these 
traits in greater measure than do nonparticipants 
or members of other professions (Hennessy 1959; 
McClosky & Schaar 1965). American data on social 
responsibility, authoritarianism, and related meas- 
ures suggest that participants are more likely than 
nonparticipants to show social conscience and con- 
cern and affirmative attitudes toward mankind. 

Participants are also distinguished from nonpar- 
ticipants by such cognitive variables as belief in 
one’s own adequacy and in the amenability of the 
social order to change. Even the elementary forms 
of participation, such as voting, may present some 
people with threatening questions about their 
ability to understand and affect external institu- 
tions that strike them as bewildering or remote. An 
individual's sense of his own personal competence 
tends to color his judgment of his political effec- 
tiveness, which in turn strengthens his motita 
to participate. Confronted with the challenge 0 
trying to change political and social con 
lower-status groups and the psychologically handi- 
capped are prone to feel bewildered and helpless; 
they are, in general, more susceptible to feelings 
of alienation, anomie, and pessimism—both Po 
sonal and political (McClosky & Schaar 1965). : 
their view, the social—political system is hostile ae 
inaccessible. They find few of the personal esr 
received by the politically active (approva. 
friends, being “on the inside,” the “excitement us 
politics, and so on). Accordingly, they not on yi ts 
less frequently but also are less interested a m 
sonally involved in politics, have fewer an 


coherent opinions, and are less concerned with 
issues and with the outcome of elections (Michigan 
... 1960; Eulau & Schneider 1956). 

While the relations between the cognitive states 
of low self-esteem and feelings of pessimism and 
alienation from society are strongly correlated with 
political apathy, one cannot be certain which way 
the causal arrow is pointing: i.e., alienation may 
lead to nonparticipation; nonparticipation may en- 
courage alienation; or both may be correlates of a 
more basic variable, such as ignorance, which 
breeds fear, mistrust, and avoidance of situations 
that might turn out to be threatening. Similarly, 
the relationship of participation to actual or imag- 
ined efficacy can be demonstrated, but here, too, 
the influence patterns are somewhat circular: one 
who participates has a greater incentive to learn 
how the system works and how to function effec- 
tively within it; a sense of efficacy, in turn, may 
predispose one toward further participation. [See 
PoLITICAL EFFICACY.] 

These psychological variables, and especially the 
degree to which political participation is felt to be 
rewarding or nonrewarding, are powerfully medi- 
ated by the individual's reference groups. Many of 
the values and habits of participation are instilled 
by the family and sustained by peers and other 
primary groups (Almond & Verba 1963; McClosky 
& Dahlgren 1959). Any such group for which pol- 
itics is highly salient will reward its members for 
participation or punish them for nonparticipation 
by granting or withholding approval and affection. 
Intermediate groups as well as primary groups may 
help prepare their members for citizenship by alert- 
ing them to their own interests, developing their 
social skills, and instructing them in the techniques 
of public activity. Membership in trade unions, 
service clubs, and other voluntary associations has 
repeatedly been found to correlate significantly 
with political participation. 

The influences exerted on a given individual by 
his various reference groups may be aggregative 
or may cancel each other out, depending on how 
consonant the groups are. Conflict or cross pressure 
among one’s membership groups may dilute the 
reinforcement pattern or cause the individual so 
much pain that he refuses to vote or avoids partici- 
pating in other ways. Cross pressures, of course, 
may also result from holding conflicting beliefs, 
from attempting to serve conflicting interests, Or 
from harboring opinions that are manifestly dis- 
crepant with reality. [See Cross PRESSURE.| 

The political environment. Much is asserted 
but little is reliably known about the political cor- 
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relates of participation. Political apathy is alleged 
to be affected by the size, remoteness, and complex- 
ity of modern political systems and, more specifi- 
cally, in the United States by the frequency of 
elections, the number of offices to be filled, the 
length of the ballot, and the necessity for observ- 
ing and making decisions about four levels (mu- 
nicipal, county, state, federal) and three branches 
(executive, legislative, judicial) of government. 
Widespread participation is presumably further dis- 
couraged when the major parties are large, loosely 
knit brokerage agencies which lack enrolled mem- 
berships, effective discipline, or vital centers for 
adopting party policy. In the light of these condi- 
tions, many observers have suggested that large- 
scale political participation may be an unrealistic 
goal. A few studies have indicated that such insti- 
tutional barriers as complicated election codes and 
nonpartisan forms of the ballot somewhat reduce 
turnout (Michigan ... 1960; Tingsten 1937). On 
the other hand, participation is highest among the 
very individuals who are most articulate and most 
capable of perceiving the ambiguities and complex- 
ities in the system, Would the inactives be less 
apathetic if government were simpler and more 
comprehensible? We cannot tell. One difficulty in 
resolving this question is that those who have little 
initial interest in politics often rationalize their 
inactivity by seizing upon any potential obstacle, 
while the highly motivated have relatively little 
difficulty in surmounting the informal barriers to 
participation thrown up by the system (Michigan 
.. . 1960; Almond & Verba 1963). 

Obstacles to participation may also take more 
narrow and prescriptive forms: cumbersome regis- 
tration procedures, literacy tests, poll taxes, resi- 
dence requirements, inadequate provision for ab- 
sentee voting, inaccessibility of polling places, and 
so on. Residence requirements alone may disen- 
franchise as much as 5 to 10 per cent of the eligible 
electorate (U.S, President's Commission . . . 1963, 
chapter 1). Except for a few restrictions designed 
to disqualify specific groups (e.g. literacy tests and 
poll taxes in the South), these obstacles chiefly 
impede those who are least motivated to vote in 
any case. 

In the United States, only a modest effort is 
made by government or the parties to register and 
instruct potential voters. While voting is widely 
encouraged, mere exhortation appears to have little 
effect, except perhaps on professed attitudes toward 
voting. Thus many more people—nonvoters as well 
as voters—believe that everyone ought to vote or 
participate than actually do so. Conversely, the cit- 
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izens of some nations with typically higher turn- 
outs than the United States nevertheless admit to 
a weaker sense of civic duty (Almond & Verba 
1963). Like feelings of political efficacy, professed 
belief in civic duty is significantly influenced by 
education, status, and general articulateness, as 
well as by the cultural norms, 

The situational factors that predispose people 
toward political activity are no better understood 
than are the legal and institutional barriers to par- 
ticipation. For example, the commonly held belief 
that great national or international crises awaken 
the impulse to participate has not been systemati- 
cally tested. Scholars favoring this hypothesis can 
point to the increase in election turnout during the 
last years of the Weimar Republic, the early days of 
the New Deal, and similar isolated examples; but 
one can find as many instances that support a con- 
trary interpretation; Turnout did not increase dur- 
ing the depression of 1932 or the recession of 1958. 
In wartime, voting tends to decline rather than 
to rise—partly, however, because many young 
voters are away from home. Thus, turnout in- 
creased slightly in 1940 over 1936, but dropped in 
1944, In 1948—a time of momentous decisions 
and cold war tensions—the proportion of eligible 
voters who cast their ballots fell to 51.3 per cent, 
the smallest percentage for any presidential elec- 
tion since 1924. The relative unpopularity of the 
candidates seems to have had far greater weight 
than the urgency of international events. In 1956, 
a comparatively placid year, the presidential vote 
represented 60.1 per cent of those eligible—a rela- 
tively high proportion for the United States. It may 
even be that unusually grave or ambiguous prob- 
lems tend to paralyze rather than to activate voters. 
Such anecdotal data, unfortunately, do not tell us 
much. What is needed is a series of cross-cultural 
investigations in which the meaning of such terms 
as “crisis” and “important” are operationally defined 
a priori, rather than rationalized in post hoc inter- 
pretations by either the respondents or the inves- 
tigators. 

In addition to the general variables touched on, 
the party system, the nature of the campaign, and 
issues and ideology are three areas that shape par- 
ticipation in modern societies, 

The party system. Of all political influences on 
participation, the party is probably the most potent. 
Its role is partly expressive and partly instrumental. 
The party resembles the nation or the church both 
in its symbolic force and in its capacity for arous- 
ing affection, devotion, and sacrifice on the part 
of its loyal members. The vast Majority of the party 
faithful would no more think of switching parties 


than of changing their nationality or religion. The 
party inspires in its members feelings of belonging 
and, equally, of opposition to those in other parties, 
While membership in cognate social groups may 
strengthen party influence on participation, the 
party is a powerful reference group in its own right. 
Indeed, it may help to solidify attachments to other 
social groups. This mystique keeps large numbers 
of people persistently active even though they have 
only a slim chance of affecting the outcome of 
important public events. 

The parties also perform a number of instrumen- 
tal functions. Despite their many derelictions, the 
American parties contact and register voters, select 
candidates, organize the campaign, and tell sup- 
porters what to believe on issues and how to vote. 
Accordingly, people who affiliate with a party vote 
more often than those who do not; and those who 
are strongly attached are more active in discus- 
sions, listen to more speeches, and respond more 
positively to their party's views than do those who 
are weakly attached. Again, these are correlates, 
and one cannot always be certain whether party 
affiliation causes participation or the reverse. Fur- 
thermore, even these correlations are far from per- 
fect. Many voters have only a marginal preference 
for one party over the other, while many nonvoters 
report strong party loyalties (Michigan . . . 1960, 
p. 97). In France and some other European coun- 
tries, party affiliation is less common than in the 
United States, but electoral turnout is higher (Con- 
verse & Dupeux 1962). ve 

Many students of politics believe that participa- 
tion in the United States would be greater if the 
competition between parties were more intense 
that is, if they were more equally matched in the 
number of their adherents or more sharply divided 
in ideology. There is some empirical support for 
the belief that owing to greater incentives, turnout 
increases as the number of supporters of the ord 
peting parties becomes more equal (Milbra 
1965b). But closeness of competition appears to 
exercise most of its effect on those with gee 
party identifications (Michigan .. . 1960, P. 99). 

More debatable is the claim that greater ideolog- 
ical cleavage between parties increases participa 
tion. The argument rests principally on the assump- 
tion that those who see the parties as diraeh 
are more likely to find the election important T 
will therefore be more strongly motivated to WO 
and vote for their party. Support for the hy ei 
can be gleaned from several sets of findings: ie. 
out in regular elections is almost invariably Se 
than turnout in primaries; voters in some a ed 
with typically heavier turnouts than the Un 


pothesis 


States (e.g., Norway) see their parties as more 
divergent ideologically than do Americans (Camp- 
bell & Valen 1961); and within the United States, 
active party workers are more likely than ordinary 
citizens to regard the parties as differing sharply 
on issues (McClosky 1964). But another set of ob- 
servations can be adduced to support the opposite 
view: Many voters support their party without ref- 
erence to the stand it takes on issues (McClosky 
et al. 1960), and only a minority accurately per- 
ceive the degree of intellectual cleavage that 
already exists between the major American parties. 
In some countries with high turnout, such as 
France and Italy, voters do not see the parties as 
very divergent ideologically. Nor has participation 
declined in countries that are alleged to have ex- 
perienced “depoliticization,” or a so-called “end of 
ideology” (Himmelstrand in Rokkan 1962a; Lipset 
1960), even though they may have undergone a 
reduction in the amount of political controversy 
and agitation among intellectuals. It is also possible 
that when party positions become polarized, some 
people will shrink from having to choose between 
extreme and unpalatable alternatives. [See PARTIES, 
POLITICAL.] 

The campaign. The effort made by the parties 
to involve the electorate in the political contest is 
concentrated in the campaign itself. Something has 
been learned about the effects of campaigns on 
polarizing party attachments, reinforcing candidate 
preferences, and switching votes (Berelson et al. 
1954). But little is reliably known about the effects 
of different kinds of campaign techniques on par- 
ticipation. One can assume that even the most list- 
less campaign will succeed in arousing some people 
who might not otherwise think of attending a po- 
litical meeting, listening to a political speech, or 
carrying a banner. Such findings as we do have 
suggest that the campaign chiefly reaches the faith- 
ful, crystallizing partisanship and reinforcing the 
intention of committed party adherents to vote and 
to persuade others to vote (ibid.). When the cam- 
paign ends, most of the participants revert to their 
relatively passive roles. 

Which campaign techniques are most effective in 
stimulating citizen participation? All forms of per- 
suasion and publicity probably have some effect, 
however minuscule. The most dramatic results, 
however, appear to be achieved through face-to- 
face communication with potential voters. This 
contact can be made formally, through designated 
party canvassers, or informally, through politically 
interested friends and opinion leaders (Lazarsfeld 
et al. 1944; Cutright & Rossi 1958; Katz & Lazars- 
feld 1955). The relative effectiveness of these com- 
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municators depends on their ability to command 
the attention of the people they seek to contact, to 
represent themselves as trustworthy sources of in- 
formation, to enforce moral or psychological pres- 
sures, and to convey campaign messages in mean- 
ingful language (Cohen 1964). 

The effect of the campaign on participation de- 
pends not only on campaign techniques but also 
on the popularity or charisma of the candidates. 
Intuitive or anecdotal data suggest that candidates 
with strong personal appeal can significantly in- 
crease the interest of typical nonparticipants and 
quicken the fervor and activity of party regulars. 
But in the few elections on which we have data, 
most voters were unable to discriminate clearly 
the personality attributes of the candidates (Pool 
1959; Davies 1954). 

Issues and ideology. A sizable body of research 
has shown that participation is associated with po- 
litical awareness, that is, actual knowledge of polit- 
ical affairs. (Awareness, of course, is in turn highly 
correlated with interest.) As we have seen, the 
number of citizens who can be described as “aware” 
in any sophisticated sense is extremely small. As 
many as half or three-fourths of the electorate are 
unable to define terms common to ordinary polit- 
ical discourse—e.g., “monopoly,” “plurality,” “left,” 
“right,” “balanced budget” (see Key 1961). Many 
cannot identify the reference groups that speak for 
their interests, cannot classify themselves accu- 
rately as liberal or conservative, and cannot de- 
scribe the differences between their party and that 
of the opposition. Striking as these findings are for 
the United States, they are even clearer for other 
countries where education and dissemination of 
public information are less widespread. 

Awareness affects both the amount and the 
quality of participation. If the unaware participate 
at all, they tend to do so ina random, inconsistent 
way that may actually work against their own 
stated aims. Data indicate that the politically aware 
are usually better able to relate their social values 
to their political opinions; to achieve stable, inter- 
nally consistent belief systems; and to comprehend 
and act upon the constitutional “rules of the game” 
(McClosky 1964; Prothro & Grigg 1960; Stouffer 
1955). 

Issues also play a role in participation. Although 
voters may lack knowledge of the array of issues 
being contested in a given election, some are 
strongly motivated by a single issue or class of 
issues that are for them particularly salient. For 
some purposes, then, the electorate can be thought 
of as constituting “issue publics’—e.g., Negroes 
(civil rights), the elderly (Medicare), trade union- 
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ists (the closed shop), young men of draft age (the 
war in Vietnam), and so on. The activity generated 
within such publics by the surfacing of the appro- 
priate issue may be critical both for the outcome 
of the election and for the adoption of government 
policies. But self-testimony about the important 
issues can be deceptive: Although some people may 
actually be moved to vote by the issues they name 
as decisive—e.g., the Korean conflict in 1952— 
(Campbell et al. 1954), others may be motivated 
largely by their preference for a candidate and, 
when asked, merely name the issues stressed by 
him. 

Evidence suggests that the party actives are 
more likely than ordinary voters to be aware of 
and motivated by issues. In the United States the 
active members of the two parties differ sharply 
on a wide range of issues, while their respective 
followers tend to agree on all but a few issues 
(McClosky et al. 1960). These disagreements in 
the issue outlook of party elites furnish motive 
force, help to define the parties’ images, and deter- 
mine whom the parties recruit and activate. 

In principle, any issue can be a powerful stimu- 
lus to participation; in practice, some issues are so 
narrow, technical, or esoteric that they have little 
chance of capturing the interest of a large public. 
“Position” issues (the so-called bread-and-butter 
issues) are generally thought to have a better 
chance of interesting would-be voters than “style,” 
or symbolic, issues (Berelson et al. 1954). Such 
position issues as social security, minimum wages, 
and Medicare are presumably easier to understand, 
even for the uneducated, and promise more tan- 
gible rewards. Style issues, such as civil liberties and 
many foreign policy questions, are presumably more 
complex and abstract and, hence, less compelling. 
Obviously there are important exceptions to these 
generalizations: for example, civil rights is for 
most Americans a style rather than a position issue, 
but during the 1960s it has probably inspired more 
activity than any other domestic issue, 

There are severe limitations on the degree to 
which issues of any type can stimulate interest 
and participation among large segments of the 
population. The distance between the individual’s 
behavior and the eventual reward (i.e., effective 
government action on the issue) is typically very 
great, and the reinforcement pattern sporadic and 
uncertain. The wonder, then, is not that People do 
Not participate, but that they do. 

Political factors, thus, may cause participation 
rates to vary, but one must keep in mind that the 
over-all rate of participation for a given country in 
a given era tends to remain fairly stable and that 


changes in the rates of participation from one elec- 
tion to another are usually small. This suggests 
that the broad social and psychological predisposi- 
tions earlier discussed set severe limitations on the 
play of political and situational elements. 


Participation and democracy 


What do the findings on participation mean for 
the theory and practice of democracy? According 
to the textbook model of democracy, an alert, in- 
formed, and wise citizenry rationally assesses the 
men who offer themselves for election, chooses the 
best, and removes or reappoints them after care- 
fully weighing their performance. As we have seen, 
the portrait bears little resemblance to reality. How 
should this picture be revised in the light of the 
findings on participation? Does the evidence render 
traditional notions of democracy untenable, requir- 
ing a reformulation of its basic assumptions and 
claims? These questions have stirred considerable 
controversy (Dahl 1966; Walker 1966). 

Much of the argument has centered on what the 
widespread failure to participate means for demo- 
cratic government. While some observers have felt 
that universal participation is neither possible nor 
especially desirable, none has proposed restricting 
the practice of universal suffrage. The issue, rather, 
is the assessment of the consequences of political 
inactivity among certain sections of the electorate 
and the amount of participation that reasonably 
can or should be expected of citizens in a democ- 
racy. The answers to these questions, many believe, 
will shape the nature and quality of the democracy 
a society enjoys. 

The arguments of those who are not severely 
troubled about the dangers of political apathy may 
be summarized as follows: 

(1) Little is gained, and something may be lost, 
by encouraging the involvement of men ani 
women who are politically uninformed and unin: 
terested. Such people are likely to misperceive their 
own and society’s best interests. They are likely te 
have the poorest understanding of the requiren 
of a democratic system (civil liberties, tolerance o 
nonconformity, etc.) and are most ouscer ee 
misleading propaganda and the appeals of popu ie 
but inappropriate leaders. Encouraging them i 
participate may actually cause harm to democral 

overnment. 
: (2) To insist that all must participate bas 
all are affected by politics is to substitute piety he 
judgment. Little is gained merely by increasing a 
number of voters. Political activity may be i 
dressed to undesirable as well as to desirable en 2 
A vote may be used to elect a Hitler as well as 


Roosevelt or a Churchill. In a democracy, more- 
over, a citizen has the right to disdain politics if 
he chooses. Better apathy than heedless participa- 
tion. 

(3) Since even under optimal conditions the 
great mass of the electorate can never possess the 
awareness that complex political judgments now 
demand, the business of politics might better be 
turned over to those active minorities who, by vir- 
tue of their interest, knowledge, and judgment, 
have shown that they are capable of governing in 
a democracy. As a safeguard in this kind of gov- 
ernment-by-minority, the would-be rulers must, of 
course, be drawn from all segments of the society, 
must compete among themselves for office, and 
must account to the voters for their actions. 

(4) Widespread political activity, while desir- 
able in some respects, also carries disadvantages. 
A “too active” electorate may impede those who 
rule from making the decisions they are best qual- 
ified to make. A highly politicized electorate may 
lead to excessive controversy, fragmentation, and 
instability, The existence of a large number of 
“indifferents” among the electorate lends flexibility 
to the system by permitting power to shift from 
one administration (or party) to another without 
generating unusual tension or anger; the decisions 
of the new officeholders are thus more easily ac- 
cepted and accommodated. Any sudden upswing 
of political interest and activity is likely to herald 
a condition of disturbance or crisis in the system 
and the emergence of new and profound cleavages. 

While some observers remain fairly sanguine 
about the incidence of nonparticipation, others are 
deeply troubled (Walker 1966; Lane 1959). Typ- 
ical of their arguments are the following: 

(1) Those who fail to participate are not prop- 
erly represented. Government is thereby deprived 
of its broadest possible assessment and of the bene- 
fit of whatever these nonparticipants have learned 
from their experience. In a democracy, participa- 
tion is power. Rulers can therefore afford to ignore 
the needs and interests of nonparticipants. By neg- 
lecting to avail themselves of the reinforcements 
contingent upon participation, the apathetic are 
further discouraged from bothering to formulate 
political opinions and demands. Ignorance thus 
accumulates, and the general level of political 
vitality and vigilance declines. In practice, it will 
typically be the poor and the socially deprived who 
are most likely to be unrepresented—those who 
most need to be represented. 

(2) Widespread apathy increases the chances 
that government will be dominated by men who are 
unresponsive, self-aggrandizing, and unscrupulous; 
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participation, on the other hand, reminds those 
who govern that they must attend to their duties 
and serve the electorate. Whenever apathy prevails, 
it becomes more difficult to organize and maintain 
a political opposition—an essential ingredient of 
the defense against tyranny and the abuse of po- 
litical power. 

(3) Even if the opinions of the nonparticipants 
are presently ill-informed, there is no better way 
to improve the quality of their judgment than by 
the experience of participation. In the course of 
participating, one is impelled to acquire the knowl- 
edge needed for sound judgment, to become aware 
of one’s best interests, to learn how the system 
works and what principles and beliefs it values. 
Voters looking for guidance are prompted to seek 
out information, to discuss politics with others, 
and so on. Therefore, participation not only stim- 
ulates political learning but also heightens respon- 
sibility, deepens awareness, and increases one’s 
sense of political effectiveness. 

(4) Apathy is a symptom as well as a cause of 
weakness in the system. It signifies a failure to 
involve all members of the society in their own 
governance, a failure to inspire interest and loyalty. 
Such failures may be dangerous to democracy, for 
whenever a large number of people exist outside 
the normal channels of politics and are unable to 
share in the decisions that shape their lives, the 
political atmosphere becomes potentially explosive. 


The cogency of some of the foregoing arguments 
on both sides of the issue indicates that the rela- 
tionship of participation to democracy cannot be 
understood simplistically. To claim that more par- 
ticipation is always preferable is to blind oneself 
to the possible disadvantages of enlarging partici- 
pation under certain circumstances. To contend, 
on the other side, that any increase in participation 
will invariably serve to enshrine mediocrity and to 
debase the quality of political life is to ignore the 
powerful considerations for giving everyone who 
wants it a role in the collective decision process. 
Most of the disputants, fortunately, would be un- 
willing to press their arguments to these extremes. 
Nothing is to be gained by converting participation 
into either a fetish or a taboo. It seems plain enough 
that in itself it is neither good nor bad, but that it 
takes its character from the social and political con- 
texts in which it occurs, as well as from the motiva- 
tions of the participants. 

Both participation and apathy, then, are com- 


i plex phenomena which resist easy characterization 


and analysis, not only for the reasons just given 
but also because many of their correlates are still 
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unknown. Although much has been learned in a 
relatively short time, the relationship between par- 
ticipation and its social, psychological, and polit- 
ical correlates is far from being understood in a 
systematic way. The explorations that need to be 
undertaken might be much more useful if both 
participation and its correlates were broken down 
into their principal components, so that the influ- 
ence process—whatever the direction of its flow— 
might be understood more dynamically. Once iden- 
tified, these components need to be systematically 
explored in different social contexts and across 
cultures to determine their relative explanatory 
power under varying combinations of forces. 


HERBERT McCLosky 


[See also AccEss TO POLITICS; ELECTIONS; REPRESEN- 
TATION, especially the article on REPRESENTATIONAL 
BEHAVIOR; VOTING. Directly related are the entries 
Democracy; ELITES; INTEREST GROUPS; MAJORITY 
RULE; OLIGARCHY; PARTIES, POLITICAL; POLITICAL 
CLUBS; POLITICAL EFFICACY; POLITICAL FINANCING; 
TOTALITARIANISM. Other relevant material may be 
found in POLITICAL BEHAVIOR; POLITICAL SOCIOL- 
OGY; PUBLIC OPINION; SOCIAL MOVEMENTS; SOCIALI- 
ZATION, article on POLITICAL SOCIALIZATION; VOL- 
UNTARY ASSOCIATIONS, article On SOCIOLOGICAL AS- 
PECTS.) 
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POLITICAL PARTIES 
See PARTIES, POLITICAL. 


POLITICAL PHILOSOPHY 
See POLITICAL THEORY. 


POLITICAL PROCESS 


“Political process” refers to the activities of peo- 
ple in various groups as they struggle for—and 
use—power to achieve personal and group pur- 
poses. Its most carefully studied forms have been 
the efforts of conflicting political parties, factions, 
cliques, and leaders to attain formal positions of 
legitimate authority in the central organs of public 
government—unitary and federal, national and 
local. The process also operates in the endless con- 
flicts between nations and between international 
blocs. It may be found in the internal governance 
of corporations, government agencies, trade unions, 
churches, and universities (Merriam 1944). In all 
these forms of the political process, actual power 
may be far different from formal authority. Any 
power wielder’s objectives, whether manifest or 
or latent, may fall anywhere on the broad spectrum 
between good and evil. The means may include not 
only violence, bribery, and character assassination 
but also persuasion, mutual understanding, recip- 
rocal adjustment, and the slow building of consen- 
sus as well. 

The political process is probably a more difficult 
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object of study than most other aspects of human 
behavior. One difficulty, as with many forms of 
social action, is the tremendous number of vari- 
ables that interact simultaneously in many subtle 
ways. This produces an “organized complexity,” 
something much more difficult to understand than 
the “disorganized complexity” of random phenom- 
ena and random variations, which is susceptible to 
statistical analysis based on probability theory 
(Rockefeller Foundation 1959, pp. 7-15). With po- 

. litical events, moreover, the distortion and suppres- 
sion of important information—whether through 
ignorance, bluff and maneuver, taboos, or self- 
censorship—are inevitable aspects of the process 
itself. Indeed, participation in the political process 
is frequently characterized by the vigorous denial 
of participation. The posture of being nonpolitical, 
nonpartisan, bipartisan, or “above politics” is a 
popular style of playing politics. 

To some extent these difficulties have been coun- 
terbalanced by progress in formulating basic con- 
cepts, developing currently useful generalizations, 
and linking concepts and generalizations to ob- 
served data, both qualitative and quantitative. 
Some of the elements in this forward progress may 
be reviewed in terms of (1) the process approach 
to politics; (2) the actors in the political process; 
(3) their purposes; (4) their methods of utiliz- 
ing power; (5) their development of power; and 
(6) the art-science relation in the political process. 


The process approach 


The process approach is far broader than poli- 
tics. Indeed, the process concept has been a major 
unifying influence in modern thought and science. 
Although its roots go back to Heraclitus and Hegel, 
among many others, it has been set forth in both 
general and explicit terms by twentieth-century so- 
ciologists, such as Ross (1897-1904) and Cooley 
(1918), and philosophers, such as Dewey (1925), 
Whitehead (1929), and Bentley (see Dewey & 
Bentley 1949). By the process approach, the world 
or any part of it is seen as an ongoing stream of 
events in time, as becoming, rather than merely 
being. “Facts do not remain stationary, .., The 
value of every fact depends on its position in the 
whole world-process, is bound up in its multitu- 
dinous relations” (Follett 1924, pp. 9, 12). Facts 
and process are separated into discrete elements 
only by human analysis, which is itself a contin- 
uing process of interrelating between knowers and 
the known. Change—whether slow or rapid, hidden 
or open—is continuous. It is also complex. Single 
causation is impossible; multiple causation and 
multiple effects are inevitable. Structures are not 


ruled out by this approach; they are included as 
the more stable aspects of action and interrelation- 
ship. “To designate the slower and the regular 
rhythmic events structure, and more rapid and ir- 
regular ones process, is sound practical sense. It 
expresses the function of one in respect to the 
other” (Dewey [1925] 1958, p. 62). 

More concretely, although often less explicitly, 
the process approach has entered each of the nat- 
ural and social sciences. In his efforts to under- 
stand the ongoing flow of events, man tries to 
slice up his universe into various specific processes: 
physical, chemical, physiological, psychological, 
sociological, political, economic, administrative, 
and many others, Each of these represents a dif- 
ferent aspect of or a different way of thinking 
about the same ongoing stream of action and be- 
coming, The process approach to politics developed 
as a reaction against viewing politics in the terms 
of presumably static and depersonalized structures, 
such as “the state,” individual nations, branches of 
governments, and political parties, As Easton has 
pointed out (1953, pp. 160-170), the first steps 
toward a process approach were taken by those 
who, like Bagehot (1865-1867), Wilson (1885), 
and Bryce (1888) examined the genuine loci of 
power, as distinguished from mere formal author- 
ity, in governments and parties. The next step, as 
illustrated by the work of Bentley (1908) and his 
many followers, was to look outside government 
and view the political process “principally as the 
interaction among governmental institutions and 
social groups” (Easton 1953, p. 172). This has led 
to a pluralistic concern with the activities of many 
kinds of organized groups inside and outside of 
government, but with much more attention than 
Bentley was willing to give to individuals, common 
interests, and normative values. ‘ 

Within any area of inquiry, in turn, theorists 
and researchers slice up activity into many sub- 
processes and types. Thus, the political process m 
been persistently studied in terms of the elector i 
legislative, judicial, executive, and internationa 
processes, with occasional attention to the more 
violent processes of revolution and war. The deeper 
any such study goes, the more it involves attention 
to still-more-specific processes, such as decision 
making, planning, budgeting, conflict resolution, 
socialization, communication and information proc- 
essing, adaptation and learning, organization Fi 
reorganization, growth and decay. The list is = 
less— particularly since each is, in turn, often Su 
divided still further. ra 

As the theorist or researcher penetrates any i 
these many aspects of the political process, he so! 


finds that what he is dealing with is “only the 
political aspect of the social process in its entirety” 
(Lasswell & Kaplan 1950, p. xvii). Political deci- 
sion making involves processes of problem posing 
and problem solving that are also studied by psy- 
chologists, social psychologists, economists, logi- 
cians, and mathematicians [see DECISION MAKING]. 

The process approach has thus led specialists 
from these fields to apply their skills to political 
decision making. Similarly, it has led political sci- 
entists into one or another of these fields, either 
to appropriate ideas for quick application to polit- 
ical decision making or, more boldly, to formulate 
concepts or theories that cannot be obtained pre- 
fabricated from those less familiar with political 
data, Indeed, any political process is closely inter- 
related with other social processes. An integrated 
view of these interrelations would require some- 
thing more than isolated ventures across discipli- 
nary boundaries, namely, a comprehensive view of 
the “social process in its entirety.” This may be 
done through social systems models dealing with 
the full range of systems from the nation-state 
down to formal organizations and small groups 
(Kuhn 1963; Miller 1965; Gross 1966). Such 
models provide the analytical tools for describing 
and explaining the changing structure and per- 
formance of concrete systems in specific environ- 
ments composed of other systems. Unless great 
care is taken, however, such a view may be so 
abstract as to lose contact with the major realities 
of political action (as in Parsons & Shils 1951). 
This contact may be partially regained by focusing 
on the “political system” (as suggested in Easton 
1953; 1965a; 1965b; and attempted in Almond & 
Coleman 1960). Here the risk becomes one of 
losing contact with the political processes in non- 
government organizations. This contact may be 
preserved by viewing the political system as a sub- 
system involved in continuous transactions with 
other crisscrossing subsystems [see SYSTEMS 
ANALYSIS]. 


The actors 


Although everyone plays some small part in the 
political process, the majority of people are rela- 
tively passive. It is the activist minority of elites, 
leaders, rulers, or managers who play the major 
roles, exercise the greater power, receive the 
greater rewards and recriminations, and become 
the heroes and villains of history. In many socie- 
ties these activist roles have been reserved mainly 
for men who have been born into the proper fam- 
ilies, castes, or classes, have been blessed with 
inherited wealth, or have demonstrated successful 
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achievement in battle, business, or trial by compet- 
itive test. In all societies many are relegated to 
subordinate roles by discrimination based on sex, 
religion, ethnic origin, race, color, or caste. And 
many others, not barred by such obstacles, may be 
effectively restricted by poverty, disease, oppres- 
sion, inadequate education, and an atmosphere of 
hopelessness. 

Yet, the passive masses are always part of the 
act. Their style of “followship” affects the pattern 
of leadership. Their interests and norms, even ` 
when unorganized, provide the potentials for future 
organization (Truman 1951, pp. 34-35, 51-52). 
They may even determine where and how the 
leaders end up—as suggested by the age-old de- 
scription of the ruler as a man riding a tiger. From 
them may come builders of new organizations, 
challengers of old regimes, and successors to old- 
time leaders. Moreover, there is rarely a sharp and 
single dividing line between the major and minor 
actors. Hierarchical relations in societies and large 
organizations provide finely graded scales of formal 
authority and crisscrossing spheres of legitimate 
influence, In industrialized and industrializing so- 
cieties, with increasing differentiation between and 
within organizations, the number of such spheres 
grows apace. All of them may be sources of the 
indirect and reciprocal influences that lead toward 
pluralistic dispersions of power (Dahl 1961, pp. 
89-103). 

The identity of all major actors is never readily 
apparent. Behind the “great man” there often 
stands the éminence grise and the anonymous as- 
sistants; behind the directorate, the “kitchen cab- 
inet”; behind the patriarch, the influential wife, 
mother, or mistress; behind the “boss,” the sprawl- 
ing bureaucracy, and behind it, a network of alli- 
ances with supporting groups and individuals. The 
illusion of the central omnipotence of a single per- 
son or group is often fostered, both by top leaders 
seeking to expand their power by exaggerating it 
and by behind-the-scenes operators seeking to pro- 
tect their power by hiding it. In the electoral proc- 
ess the people most active in picking candidates 
and supporting their campaigns are seldom as vis- 
ible as the candidates or the electors. In the daily 
business of government and nongovernment opera- 
tions, the major contenders in the power struggle 
are usually known only to the contestants them- 
selves and a handful of close observers. 

The social roles of the major actors are always 
complex. In part they are formally determined by 
constitutions, laws, charters, regulations, and cus- 
toms. In part they are, also, the product of informal 
arrangements and understandings. On the one 
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hand, they embody the expectations of others. On 
the other hand, they are shaped by the personality 
and capacities of the incumbents themselves. In 
some respects the role determines the actor’s major 
movements. In some it leaves him free to depart 
from the script, to extemporize, or to prepare his 
own lines in advance. In some it saddles him with 
conflicting and partly irreconcilable requirements. 
These role conflicts are magnified when people play 
different roles in different organizations or, as fre- 
quently happens at higher levels of authority, “wear 
many hats” in the same organization. 

All the actors in the political process are involved 
in conflicts with opposing groups and individuals. 
At the lower and middle ranks of single organiza- 
tions these conflicts may center largely on the 
“internal politics” of budgets, reorganization, jobs 
to be filled, and tasks to be grabbed or evaded. At 
the higher levels they tend to reflect divergent in- 
terests in the environment and the divergent pres- 
sures of suppliers, and clients, associates and 
adversaries, controllers and controllees. In “public 
politics” the executive, legislative, and judicial 
organizations of government are even more inti- 
mately embroiled in structuring or resolving the 
major conflicts of the community or nation. 


Purposes of the actors 


A major objective of all actors in the political 
process is the use and development of power or 
influence—that is, the capacity to influence events. 
The extent of their power distinguishes the major 
from the minor actors. 

Because of its importance, the role of power in 
the political process may easily be exaggerated. It 
makes sense to regard politics as a “power strug- 
gle.” But viewing it as a struggle for nothing but 
power means disregarding other objectives and 
misunderstanding the nature of power itself. The 
modern tendency is to view power as one objective 
within a complex network of purposes. Power's 
critical importance derives from its essential con- 
tribution toward achieving other purposes. Without 
power no leader or administrator can serve his own 
interests, the interests of a small minority, or the 
interests of a great majority, 

Power, however, is rarely an undifferentiated 
form of currency. It is usually the more specialized 
capacity to influence certain events under certain 
conditions. The extent of power can be appraised 
only in terms of the achievement—actual or poten- 
tial—of specific objectives. The sources and forms 
of power are adapted to and constrained by the 
objectives sought. An association of doctors, with 
considerable influence on health matters, will not 


exert much influence on foreign policy, The power 
to install a person in office or remove him from it 
is not equivalent to the power to control his deci- 
sions in office. A president usually finds that the 
power to do one thing may interfere with the power 
to do another: “The move that gains him ground 
on some particular may scar his general Washing- 
ton reputation” (Neustadt 1960, p, 192), 

The purposes served by power may be viewed 
both in terms of group and of individual interests, 
In organized groups people work together to pro- 
vide outputs that satisfy various interests of mem- 
bers and outsiders, To do this, they try to obtain 
certain resources from the environment and—with 
some degree of economizing on resources perceived 
as scarce—engage in activities that produce the 
services or goods needed to satisfy interests. They 
seek to use some resources to strengthen the capac- 
ity of the organization itself. They try to do all 
these things rationally and in accordance with ac- 
ceptable codes of behavior (Gross 1964, part 5). 
Their successes and failures in trying to achieve 
these various objectives are the best measures of 
their power. The difficulty in making such meas- 
urements derives in part from the great complexity 
—and in some cases the impossibility—of apprais- 
ing these various dimensions of performance. The 
greatest difficulty lies in appraising the satisfaction 
of human interests. It is easier to identify the 
parties at interest—the “interesteds’—than to ana- 
lyze their interests. It is easier to identify their 
interests than to appraise the fluid and evanescent 
combinations of satisfaction and frustration. 

Power is also an instrument for the satisfaction 
of individual interests. People need power to pro- 
vide for their basic physiological needs and to help 
assure their security and survival. They seek power 
as a means of winning acceptance by others an 
becoming part of a group or community. Power E 
needed to satisfy the “desire for a stable, firmly 
based . . . evaluation of themselves, for self-respect, 
or self-esteem, and for the esteem of others (Mas- 
low 1954, p. 90). It is a means toward ge 
autonomy, creativity, and self-fulfillment. oe 
may also be used for self-aggrandizement at a 
expense of others and as a way of venting aggr i 
sions upon those too weak to resist. It may is 
sought as a path to higher status without eee 2 
to performance. These are the kinds of things 


ferred to by the idea of power sought “for its ow? 


sake.” On the other hand, power may also be ie 
by individuals and groups sincerely inta = 
serving interests beyond their own. Withou = 
use of power, altruism and idealism become €: 


ts 
cises in pathos and futility. Only through the effor 


of enough people struggling for more power has it 
been possible to achieve those dispersions of power 
that make an organization, a polity, or a society 
more democratic. Only through the struggle for 
power may significant steps be taken toward serv- 
ing the higher interests of human beings and the 
common interests of humanity. Neustadt (1960) 
has pointed out that presidents of the United States 
can provide adequate national leadership only 
through conscious and expert efforts to build pres- 
idential power. A similar comment may be made 
concerning administrators in any organization and 
leaders in any nation. 


Using power 


Power in use invariably involves a mixture of 
many different forms—sometimes mutually rein- 
forcing—of persuasion and pressure (Gross 1964, 
pp. 781-794). 

Persuasion. Persuasion takes place when A in- 
fluences B to adopt a course of action without A’s 
promising or threatening any reward or punish- 
ment. It may take the form of example, expecta- 
tion, proposals, information, education, or prop- 
aganda. 

Many actors in the political process influence 
others by the sheer power of example, Where 
leaders go, others may follow on the basis of 
loyalty, devotion, or mere imitativeness. People and 
groups may also be influenced by mere expecta- 
tions—particularly when their expectations of the 
power wielders conform to established customs or 
codes, Expectations, even when unspoken, tend to 
communicate themselves in effective ways; when 
communicated more openly, they become proposals 
and advice. These, too, may be extremely influen- 
tial—particularly when there is confidence in the 
wisdom or disinterestedness of the proposers or ad- 
visers. Even without specific proposals or advice, 
great influence may be exercised merely by provid- 
ing—or withholding—certain types of information. 

When the premises on which people act are 
known, the mere structuring of the information at 
their disposal may shape their future behavior 
(March & Simon 1958, pp. 162-169). But educa- 
tion is a long-range and indirect means of influ- 
encing behavior, and although it has great poten- 
tialities for the development of learning abilities 
and adaptability, it is often used more vigorously 
to promote conformity [see SOCIALIZATION]. In 
some countries it has been “the most powerful 
means of social control to which individuals must 
submit” (Crozier [1963] 1964, p. 238). Propaganda 
is a more immediate form of persuasion, in which 
Powerful symbols are used to appeal to people's 
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interests through their emotions and feelings. 
Propaganda “can strengthen predispositions and, 
in particular, give them specific direction; and it 
can contribute to weakening them if this process 
is supported by direct experience, or by nonpropa- 
ganda symbols” (Lasswell & Kaplan 1950, pp. 113- 
114). It is particularly powerful when supported 
by other forms of persuasion and by various forms 
of pressure—that is, by the “propaganda of the 
deed.” Sustained propaganda and demonstration 
by the mass media of communication can have 
profound “characterological effects” on the values 
and attitudes of many people (Pool 1963). [See 
PROPAGANDA.] 

Pressure. Pressure is applied by A upon B 
whenever A tries to make a course of action more 
desirable by promising or threatening contingent 
rewards or punishments. It may take the form of 
force, commands, manipulation, or bargaining. 

Force is the most extreme form of power—one 
that has made the very term “power” obnoxious to 
many. Its application through imprisonment, tor- 
ture, or destruction of life and resources will re- 
duce or eliminate the recipient's capacity to act in 
an undesirable way. Even its threat may serve as 
an important deterrent. In the international arena 
nations have traditionally used military power in 
the pursuit of both offensive and defensive ends, 
Within nations, governmental monopoly of the 
legitimate use of organized violence provides the 
ultimate sanctions behind the enforcement of law 
and order. Beyond this, physical force is usually 
used as a method of influence only by revolution- 
ary or separatist groups, criminals, the parents of 
young children, and in some old-fashioned schools, 
teachers. 

Physical force is a blunt instrument which, even 
when used as a last resort, may have little positive 
value. It may unleash counterforces capable of un- 
doing what has been done, if not of seriously dam- 
aging its users. Besides, more flexible and reliable 
modes of pressure are available. Rewards, in the 
form of monetary payments, new positions, higher 
status, support, favorable votes, cooperation, ap- 
proval, or the withdrawal of any anticipated pun- 
ishment, may be bestowed or promised. Punish- 
ment, in the form of fines, firing, reduction in 
status, unfavorable votes, noncooperation, rejec- 
tion, disapproval, or withdrawal of any anticipated 
reward, may be given or threatened. It is such in- 
centives and sanctions that provide the backing for 
command—that is, for the use of power through 
orders, directions, regulations, or instructions. 

Command also suffers from many limitations. 
It can be used only by those whose authority to 
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give orders is accepted. It may undermine the 
initiative and self-respect of those receiving the 
orders. “Because command is a more ostensible, 
direct, person-to-person relationship, perhaps it can 
offend status, self-respect, and dignity of individ- 
uals more than does manipulation of operating 
penalties. Manipulation of the field seems to be 
more suitable for creating ‘permissive’ situations, 
i.e., situations in which a particular response is 
attained although the individual can choose among 
a variety of rewards and penalties” (Dahl & Lind- 
blom 1953, p. 121). In manipulation, rewards and 
punishments are used without an open statement 
concerning the kinds of action desired. 

Bargaining is a still more fluid—and far more 
pervasive—form of using pressure. In bargaining, 
all sides exercise power upon each other through 
reciprocal promises or threats. The objectives, the 
rewards, and the punishments are all subject to 
negotiation. Indeed, force, command, and manip- 
ulation tend to become enveloped in the broader 
and more subtle processes of bargaining. The great 
contribution of these cruder methods of power is 
not in supplanting bargaining but in strengthening 
the hands of the actors at the bargaining table. 
[See NEGoTIATION.] 

Overcoming resistance. No matter what the 
combination of persuasion and pressure, the users 
of power always encounter inertia or resistance. 
“Yet, when the order is given, obedience is reluc- 
tant, partial; resistance widens: and as penalties 
are made heavier, opposition becomes stronger. . . . 
Naked hands and empty pockets may obstruct and 
antagonize the action of authority and with means 
which can scarcely be successfully opposed without 
destroying the basis of human association itself” 
(Merriam 1934, pp. 156-157, 183). Other forms of 
power are met by other forms of resistance. For 
example, expectation and proposals may be coun- 
tered by indifference; information by inattention 
or misinterpretation; propaganda by cynicism. The 
more effective any form of influence is, the more 
it may encourage, under certain circumstances, 
the organizers of countervailing power. 

In using power, therefore, the actors in the po- 
litical process develop strategies and tactics for 
dealing with inertia and resistance. Head-on re- 
sistance can often be avoided by the “strategy of 
the indirect approach,” that is, by flexible shifting 
from points of greater to points of lesser resistance 
(Liddell Hart 1929). Action campaigns may be 
carefully timed and designed. This often means 
taking the initiative, remaining on the offensive, 
avoiding overdefensive positions, and, where nec- 
essary, making sudden changes or retreats. It often 


requires a quick response to the challenge of an 
unexpected crisis or even the purposeful creation 
of crisis conditions. “Crisis may make it possible 
to take required actions against powerful groups 
which are normally well entrenched and invulner- 
able . . . [and] may stimulate action and hence 
learning on a problem on which insight has been 
low and for that very reason has not been tackled 
as long as it was in a quiescent state” (Hirschman 
1963, p. 261). Above all, the various forms of per- 
suasion and pressure must be continuously alter- 
nated and intermingled—with bargaining backed 
up by potential force; propaganda, by example; 
and manipulation, by information or by the iron 
fist suitably encased in the velvet glove. 

No matter how excellent the leadership, the 
fully successful use of power is extremely rare. Un- 
seen or unpredicted influences usually bear upon 
a situation. Desired consequences are often accom- 
panied by unforeseen or unforeseeable ones, either 
immediate or subsequent. On the other hand, what 
appears to be total failure will usually be counter- 
balanced by mitigating circumstances and favor- 
able, albeit unforeseen, aftereffects. Many sharp 
conflicts lead only to deadlock or—when either 
side fears the outcome—avoidance. Some con- 
flicts may be resolved by an integration of oppos- 
ing interests, that is, a creative readjustment of 
interests so that all contenders may gain and none 
lose (Follett 1926-1930, pp. 31-36). Most power 
struggles seem to end in mutual concessions. From 
this point of view the political process may be 
characterized as a stream of successive compro- 
mises punctuated by frequent episodes of deadlock 
and avoidance and by occasional victories, defeats, 
and integrations. Under such circumstances, the 
measurement of actual or potential power is indeed 
a baffling task. [See PowER.] 


Developing power 

The actual use of power requires the develop- 
ment of a “power base,” which in turn usually ie 
quires the actual use of power. Indeed, any ae 
of power may help strengthen the power base, $ T 
as a muscle may be strengthened by exercise. a 
the constant use of power to achieve an 
objectives will, by itself, usually lead to the Be : 
tion of power. The actors in the political proc on 
must conserve their power for the occasions A 
which it is most needed. Over the long ra R 
pletion is inevitable unless some power 1S diver E 
from other purposes and invested in the are 
nance or strengthening of the power Hake Bare 

Formal authority. Formal authority, i one 
legitimate right to engage in certain actions, 1$ 


of the most obvious sources of power. It is con- 
ferred upon individuals, groups, or organizations 
by constitutions, charters, statutes, regulations, 
delegations, or official decisions. It becomes en- 
trenched and protected by habit, custom, tradition, 
and the ritual or ceremonial “trappings” of au- 
thority (Merriam 1934, pp. 102-132). It is often 
derived from a person's lineage, wealth, profes- 
sional status, knowledge, abilities, or sheer en- 
ergy. Whether at the level of a nation-state (where 
it is called sovereignty) or of a single organization, 
it becomes meaningful only if accepted by those 
subject to the influence of the authority holders 
(Michels 1930; Barnard 1938, pp. 161-165). 

To strengthen their power base, the actors in the 
political process usually spend considerable time 
in efforts to win or expand authority. Indeed, the 
terms “struggle for power” and “seizure of power” 
often refer specifically to a struggle for or seizure 
of positions of formal authority. This struggle in- 
volves the creation of new positions, the adjust- 
ment of old ones, and the installation of people in 
positions by appointment, election, or military 
force. It involves the interpretation (or even the 
change) of customs (Merriam 1945, pp. 74-76) 
and the use of conspicuous symbolism, in the form 
of flags, uniforms, buildings, office furnishings, 
titles, insignias, degrees, or ceremonies (Merriam 
1945, pp. 81-93; Barnard [1935-1946] 1956, pp. 
207-244). At times it involves “usurping authority” 
by engaging in unauthorized acts and subsequently 
seeking legitimation. 

Wealth. Wealth is another source of power. 
Almost any form of wealth—natural resources, 
man-made goods, or monetary claims against re- 
sources—may be used as a reward. Some forms 
of wealth, such as the media of communication, 
are effective instruments of persuasion. Others, 
such as armaments, are widely used by nations as 
instruments of pressure. Money is an all-around 
instrument for mobilizing other sources of power. 
Indeed, in many societies the wealthy person or 
organization acquires a certain aura of authority 
on the basis of wealth alone. Ownership is not 
essential. It is control of wealth that is important. 
This may be exercised by the managers of corpora- 
tions, as distinguished from stockholders, and by 
the top officials of government agencies and pri- 
vate associations. 

Human resources. People—or human resources 
—are a still more fundamental source of power. It 
is people who convert physical resources to wealth 
by developing them and finding them useful. Both 
authority and organization are rooted in the inter- 
relations of people. The sheer quantity of people 
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is an important factor, The huge population of 
China constitutes a tremendous power potential; 
the small population of Cyprus, a great constraint. 
The economic power of producers often depends 
on the availability of mass markets. The power of 
political parties and factions is affected by the 
number of their supporters; that of trade unions 
and other associations by the number of members. 
Where democratic values and customs obtain (and 
particularly where conflicts are settled by voting), 
the number of people committed to or supporting 
a given course of action is a significant factor. 

Yet, in any society or situation the quality of 
people is also important. In the modern world 
“quality” refers to much more than the aristocratic 
birth symbolized by the old phrase “people of qual- 
ity.” It refers to a person's potential for perform- 
ance, a potential based on education, knowledge, 
intelligence, skill, interests, age, health, and en- 
ergy level. Science and technology are of steadily 
increasing significance in the power base of any 
nation or organization. Although they are some- 
times discussed as if they were abstract, disem- 
bodied forces, science and technology are mean- 
ingful only insofar as they are embodied in the 
capacities of specific scientists and technologists, 
capable of drawing upon or adding to the stock of 
human knowledge and know-how. Of still greater 
significance are the qualities of leadership exer- 
cised by the major actors in the political process. 
It is this critical minority of people who exploit 
or bring to fruition the power potential in larger 
numbers of people. 

Organization. Organization is an indispensable 
source of power. Individuals themselves have only 
limited influence upon others, but by joining to- 
gether in some sort of organization, they can de- 
velop the power to do things that could not be 
done at all, or at least not as well, alone. Through 
the specialization of labor and the cooperation be- 
tween its component parts, an organization can 
exploit the special capacities of better-qualified 
people. It can bring people, resources, and au- 
thority positions together in a far-flung and flexible 
system of power. Accordingly, the major actors in 
the political process invest considerable time and 
other resources in maintaining, expanding, and 
reorganizing existing organizations or in building 
new ones. Those who free themselves from this 
task can do so only because others in the organiza- 
tion devote themselves to it. 

Since no human organization is a closed sys- 
tem, the power developed through internal rela- 
tionships must usually be sustained—and can in- 
variably be extended—by various coalitions with 
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external organizations and individuals. Supply 
lines must be protected, clients pacified, control- 
lers mollified, controllees kept in line, associates 
given assistance and—to the extent feasible— 
adversaries split or outwitted. This involves the 
organization of a complex network of support, 
composed both of active allies and of passive col- 
laborators. This network may include a large va- 
riety of shifting and overlapping coalitions, or it 
may be crystallized into one pattern of alliances. 
In many situations considerable effort is spent on 
building unorganized opinions and attitudes into 
the support network. One of the most important 
functions of political campaign organizations is to 
win or maintain the support of unorganized indi- 
viduals. 


The art-science relationship 


The most successful actors in the political proc- 
ess, it has often been said, are those who master 
the “art of the possible.” This phrase emphasizes 
the fact that the users and developers of power 
usually must be prepared to compromise on certain 
principles. Otherwise, the desirable may be im- 
possible. 

The possible, however, covers a broad spectrum, 
from the inevitable to the improbable. Some actors 
in the political process concentrate only upon the 
most feasible of objectives and the lowest of as- 
pirations, Here the major skill lies in taking credit 
for events that would probably have happened any- 
way, as in “epiphenomenal planning” (Wiles 1962, 
pp. 72-74). Much greater artistry is involved in 
overcoming inertia and resistance that seem in- 
vincible. The masters of the political process are 
those who from time to time transcend the limits 
of immediate feasibility and shape sequences of 
events previously regarded as highly improbable. 
To satisfy higher aspiration levels, politics must 
become the “art of the improbable.” 

At any level of aspiration, however, the actors 
in the political process face indeterminate situa- 
tions, Except for certain highly routinized behav- 
iors, they can never be sure what will happen if 
they do not act or what will be the effects of alter- 
native courses of action. They must make their 
decisions in the face of risk and uncertainty. 
This calls for considerable artistry in developing a 
power base, in using power in the form most suit- 
able to their purposes, and in adapting purposes to 
the exigencies of power development and use. 

Although ancient, the political art is not static. 
It changes with and adapts to changing cultures, 
life styles, forms of organization, and scientific and 


technological advances. It has unquestionably been 
affected in many ways by the slow accumulation 
of scientific knowledge and theory on the political 
process. For one thing, under the influence of 
modern technique it has become highly differen- 
tiated. The art of budgetary politics is not the same 
as the art of using mass media of communication 
or public opinion polls. It is now appropriate to 
talk about “the arts,” rather than “the art,” of the 
improbable and the possible. 

Moreover, the expansion of scientific knowledge 
of the political process places a higher premium 
on these arts. It is not knowledge by itself but 
rather its skillful use that replaces hunch with fact, 
intuition with principle, and fuzziness with an 
operational definition (Polanyi 1958). Such use re- 
quires greater judgment in evaluating and coordi- 
nating the contributions of expert advisers. It calls 
for greater wisdom in selecting those types of 
knowledge and techniques that are most relevant 
to any particular problem. It demands more train- 
ing and sustained learning, Although scientific in- 
quiry, in turn, may even focus upon the acquisition 
and growth of these arts, it cannot hope to replace 
them. The most it can do is provide a stronger 
foundation for the art-science relationship in the 
political process. 

BERTRAM M. Gross 


[Other relevant material may be found in GOVERN- 
MENT; POLITICAL BEHAVIOR; POLITICAL SCIENCE.] 
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POLITICAL RECRUIT 
AND CAREERS 


The social sciences are only now beginning the 
systematic quantitative study of the social bases 
for political recruitment and career patterns. By 
contrast, politicians and their biographers, as well 
as political philosophers, historians, and legal 
specialists, have long given attention to the cre- 
dentials and careers of men in public life—how 
political recruits used or improved upon their birth 
chances and what they contributed to political 
affairs. 


The range of relevant studies 

Belatedly but energetically, early lines of schol- 
arly inquiry into political career lines and selection 
mechanisms are being retraced and extended. In 
1919 Max Weber's influential lecture “Politics as 
a Vocation” was published; in America bench-mark 
studies such as those of Gosnell (1935; 1937), 
Salter (1938), and Zink (1930) followed, using 
vignettes to show how the careers of ethnic-group 
leaders, legislators, or city bosses could be made 
or wrecked by turns of luck quite as much as by 
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either merit or maneuver. In the 1950s and 1960s 
studies like those of Schlesinger (1957), Eulau 
and Sprague (1964), and Milbrath (1963) used 
more rigorous measuring instruments and more 
extensive samples of evidence in order to identify 
changing skill patterns, organizational screening 
devices, and shifting power bases as aspects of 
political recruitment. 

A similar contrast in technique may be seen in 
work linking psychological needs to the kinds of 
gratifications, the intensity of commitment, or the 
style of performance manifested by a public per- 
sonality. In 1930 Lasswell’s clinical case studies 
of the motivational factors underlying self-recruit- 
ment into politics were formulated in terms of how 
“private motives” could be displaced onto “public 
objects” and later endowed with “public purposes.” 
Thirty years later in work like that by Gabriel A. 
Almond, Robert E. Lane, and David C. McClelland, 
predispositional categories were being differenti- 
ated and methods devised for measuring the com- 
plex psychological dynamics that underlie political 
career aspirations. 

Fundamental social trends of the past half cen- 
tury, as well as advances in technique, have trans- 
formed the functional analysis of elite recruitment. 
In the modern comparative work of scholars like 
Edward Shils, Lucian Pye, and Morris Janowitz, 
the changing social origins and activation patterns 
of plural “elites” in politics are in focus. In the 
early elite formulations of Mosca, Pareto, and 
Michels eligibility was presumed to be sharply re- 
stricted, and the emphasis tended to be upon the 
stultifying effects of incumbency. 

Institutional processes, rather than functions, 
are the focus of other current studies. Sustained 
attention is being given to how men are selected 
and groomed for careers involving community de- 
cision making, party activities, legislative work, or 
the policy-making orbits of officialdom. In contexts 
where political induction and advancement are 
increasingly bureaucratized, sample-interviewing 
techniques have facilitated systematic inquiry. 

Rigorous marshaling of quantitative empirical 
evidence, however, is only beginning to charac- 
terize studies of political recruitment. The relevant 
literature is vast only if one includes philosophical 
speculations about who should rule, plus the work 
of modern historians concerned with elite trans- 
formations, and—in every society featuring a ju- 
ridical code—the disquisitions by legal specialists 
on the formalities of investiture, the mechanics of 
election, the conditions of examination, the pre- 
rogatives of office, and the rules of accountability. 
Of course, source materials linked to unique public 


lives and occasions—memoirs and biographies— 
are also voluminous. From them a diversity of keen 
impressions and some shrewd aphorisms on how 
to succeed can be gleaned. But they constitute an 
unsystematic commentary on political recruitment 
and career patterns. 

Modern historians have been somewhat more 
explicit about patterns, trends, and research meth- 
ods. They have tried to identify the canalizing and 
formative structures typical of various regimes. 
Elites are sometimes treated as vehicles of change; 
sometimes historical forces are seen creating or 
destroying elites. There has been recurrent empha- 
sis on the lag between elite habits and preoccupa- 
tions and the emerging historical challenges to 
their tenure. Distinctive configurations of rearing 
and apprenticeship, stamping political elites with 
inimitable styles, are sketched. Careful work, like 
that of Namier, has traced the social origins and 
family connections of those in “constituted bodies” 
—assemblies, councils, and state bureaucracies. 
New techniques exploit genealogical records, wills, 
and tax rolls. Documentary evidence remains un- 
avoidably incomplete, however; historians com- 
monly use it to illustrate, rather than to measure, 
phenomena. 

Merit selection and vocational opportunities ate 
shown to have produced effective administrative 
elites, whose efforts sustained great systems of 
public order in ancient China, India, the Near 
East, and pre-Columbian America, as well as in 
ancient Rome. Under Islamic regimes slaves of 
the sultan could be groomed and elevated to the 
imperial administrative elite; the selection criteria 
were not seriously hampered by considerations of 
birth or family status. By contrast, nearly every- 
where in Europe birthright fixed one’s life status 
and made smaller the talent reservoir. In the 
Orient the Confucian principles that men Mee 
unequal and that merit should determine socia 
status helped to legitimize the competitive erar 
nation system, which kept the bureaucratic elite 
from becoming self-perpetuating. However, E 
recruits in each generation did come from g d 
official families, whose traditions of service ee 
give the Chinese bureaucracy its continuity, s 
bility, and assimilative capacities. 

In the West historians have noted the T 
proliferation of nonhereditary elites eer a th 
field of endeavor. Those with advantages ° aaa 
increasingly must validate their ascriptive ©” ite 
tials with achievements of their own. Nor is S a 
hood in one domain a warrant for capacity ae se 
shape public policy. The special skills and ot ies 
tives needed to negotiate, legitimize, and imp 


modern 
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ment collective community efforts—the distinc- 
tively political skills—increasingly require special 
grooming. With the development of political vo- 
cations careers for poor men were opened in public 
life, as Weber noted. Success in politics, both as 
a chosen lifework and as a secondary career linked 
to success in civilian life, has become possible 
through a combination of merit, luck, tactics, 
sponsorship, and perseverance. 


Perspectives for inquiry and evaluation 


In the long history of theorizing about politics, 
“who should govern” is a familiar query. To Plato 
and Aristotle no real distinction between public 
service and private lives could be sustained. A po- 
litical community, by its machinery for inducting 
men into offices, inevitably allocated rewards and 
opportunities, integrated resources and perspec- 
tives in some degree, and affirmed the ethical “mix” 
of its political life by the conduct of typical public 
men as well as stellar figures. 

Modern formulations like those of Dahl (1961) 
or Lasswell are preoccupied with pluralism, mul- 
tiple power centers, semiautonomous localities, in- 
trabureaucratic satrapies. Accordingly, they stress 
procedural consensus—the rules of the game, for 
co-optation as well as election—rather than sub- 
stantive norms, People who would fight over ul- 
timate principles may agree on intermediate ob- 
jectives. Modern issues pose problem complexes 
that call for political brokers and administrative 
expediters. Highly divergent career patterns must 
be supported. Men may enter politics for reasons 
only remotely linked to shared ideals or public pur- 
poses, The attitude of modern civility expects per- 
sonal career calculations and concern for the 
common weal to exist simultaneously in the minds 
of fellow citizens. 

Systematic comparative research into political 
manpower problems has substantially appeared 
only since World War 11. Two major approaches, 
which differ markedly in the treatment of substan- 
tive and procedural norms, exist: the functional 
systems approach and the conventional organiza- 
tions approach. The former deals in elites; the 
latter, in personnel. 

The systems approach provides both an idiom 
and an agenda for comparative research. As yet, 
it has not solved the problems of marshaling em- 
Pirical evidence to “test” ideas, Functional systems 
analysis looks at all processes and structures in 
terms of their instrumental contribution to the vi- 
tality of a larger political system. Attention centers 
on the performance of functions by structures. 
Success is inferred from the continued viability of 
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the political order in question. Put another way, 
the key structures are elites who collectively per- 
form the functional necessities, elites whose per- 
spectives are undergoing often profound cultural 
metamorphosis, elites who are in the process of 
learning how to operate new machinery of political 
life. In Almond’s formulation, that machinery in- 
cludes not only the essential means of governing 
but also the means to enlarge the system’s develop- 
mental capabilities—to integrate perspectives, ac- 
commodate interests, sustain a sense of participa- 
tion, and distribute values equitably. 

The organizations approach is usually confined 
to a single sector or segment of the conventional 
political order—a legislative process, a policy- 
making sequence of executive meetings, a party 
organization selecting its legislative candidates. 
Attention focuses on the performance of tasks by 
personnel, and success is inferred when actual 
achievements conform to the idealized patterns 
aimed at by participants—e.g., democratic selec- 
tion, representative action, statutory policy, or a 
constitutional standard. 

Men are born with certain life chances by virtue 
of their social circumstances and personal endow- 
ments. Modern elite recruitment studies have con- 
centrated on social origins, activation patterns, and 
the changing kinds of political jobs available as 
major dimensions for analysis. For example, by 
studying recruitment patterns during the last 
stages of colonialism and the early stages of na- 
tional autonomy, political scientists have plausibly 
inferred the activation motives and identified the 
social bases of those involved in periods of accom- 
modation with the alien elite, agitation for self- 
government and modernity, maneuver for place 
and protection during the tutelary transition, and 
consolidation after independence. In virtually all 
“new states” political recruitment patterns since 
independence have been shifting substantially, in 
terms of class and educational levels and in terms 
of urban-rural and center-hinterland derivations. 
Always the shifts have been downward and out, 
that is, in the directions that broaden and diversify 
the pool of aspirants. 

What have not been examined carefully in elite 
studies are the effects of grooming, and career 
commitments, or the selective screening criteria by 
which institutions safeguard their key roles from 
“misfits.” These are the very dimensions, on the 
other hand, that have begun to receive systematic 
attention in studies of organizational personnel se- 
lection in politics. Instituions do not recruit cross- 
section samples from the social pools of eligible 
raw material. Even if they did—or where they 
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seemed to—the formative influence of career ex- 
perience may be expected to attenuate the primor- 
dial loyalties in some unestablished degree. As 
Weber has noted: “According to all experience 
there is no stronger means of breeding traits than 
through the necessity of holding one’s own in the 
circle of one’s associates.” 

To do any firsthand investigation one has to 
specify the jobs, identify the incumbents, consider 
both formal and informal screening arrangements 
used to fill the positions, and then compare the 
“fit” between what is wanted and what is obtained. 
Organizations do not regularly get what they want 
by way of manpower. Nor do organizations neces- 
sarily seek what they functionally need. The fit 
between the functional role structures and the or- 
ganizational position structures in a political sys- 
tem is rarely, if ever, one-to-one. Only with great 
caution is it possible to treat a job incumbent as 
a functional role incumbent. Findings about or- 
ganizational grooming and screening patterns are 
thus not easily articulated in terms of functional 
systems theories as presently formulated. 


Organizational positions and functional roles 


The study of political recruitment must begin 
with some consideration of what the jobs in poli- 
tics are, First come the legally defined positions. 
Clearly included are legislators, chief executives, 
high civilian and military officials, judges, and 
diplomats, Their counterparts in local government 
are mayors, councilmen, board members, and 
agency chiefs. How should these jobs be grouped? 
Classifying them by reference to their legal powers 
and duties quickly proliferates the categories; 
grouping them according to statutory terms of 
eligibility or formal method of selection obfuscates 
nonlegal comparisons by treating legally recog- 
nized criteria such as age, residence, citizenship, 
literacy, or nominal group affiliation as unambig- 
uous attributes or by suggesting that the same 
functional considerations should be applied to all 
elected recruits or to all appointed—or all co-opted 
—public officials. Legal prescriptions and similar 
formalized distinctions are useful but inconclusive 
evidence about political jobs. 

Second, the inventory of positions includes the 
institutionalized roles distinguished by participants 
in the political arena. Terms like lobbyist, propa- 
gandist, staff man, reporter, commentator, party 
cadre, agitator, anchor man, and trouble shooter 
only begin to suggest the kinds of behavioral 
patterns—the orbits of action as well as the skills 
and resources involved—that may be familiar in 


a given milieu. Even in studying a conventional 
political organization, it becomes apparent that 
ad hoc functional equivalents must be used to 
subsume the enormous variety of role patterns. 
Clearly, too, roles may exist but not be filled, either 
because no qualified candidate can be found or 
because it is preferred to keep a post vacant. 

Whatever the institutional boundaries and orbits 
of politics under investigation, the tasks which 
apply to a given job are partly shaped—preshaped 
in binding ways even—by virtue of constitutional 
and organizational forms. In part they depend 
upon the expectations of other participants—out- 
siders as well as insiders, subordinates and peers 
as well as superiors—and those of the incumbent 
himself, concerning both the ordinary tasks to be 
done and the marginal behavior that is acceptable. 
Circumscriptions on political jobs arise also be- 
cause of the resources, both technological and hu- 
man, directed and controlled differentially through 
organizational processes. A further circumscription 
is imposed by the political culture. Roles that are 
equivalent in form, in norms of performance, and 
in resources may call for a distinctive style in one 
milieu but not in another. 

The position structure, both formal and infor- 
mal, within a legislature, party unit, secretariat, 
or any other political entity is presumably meant 
to secure a minimum level of functional output of 
the organization’s product, whether that be statutes, 
precinct work, briefing materials, or something 
else. To understand how the organization's modal 
product is generated, and thereby have a basis for 
suggesting improvements, it is evidently necessary 
to undertake a detailed analysis of the intramural 
role structure both as the participants see it (ien 
conventionally) and in terms of a functional 
model, 

For many kinds of research, however, it is not 
necessary to study the organization's internal dy- 
namics at all. Its membership can be studied in 
terms of the social origins, skills, and moral quali- 
ties demonstrated by modal and deviant types. In 
this kind of inventory the qualities of key actors 
are in some degree confounded with those of lesser 
significance. But it can be argued that the test of 
an organization’s health is the caliber of its average 
decision maker, not the brilliance of its stellar 
figures. Moreover, in functional systems terms, 
what is important is how adequately the structure 
viewed as a collectivity is equipped and staffed to 
perform the desired functions. A knowledge of the 
inputs, in other words, is a reasonably adequate 
basis for making inferences about the outputs. A 
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better basis, but not an essential one, would be 
provided by a full analysis of the organization’s 
functional dynamics. 


Political socialization and recruitment 


The induction of manpower into political roles 
is more specialized and focused than the induction 
of people into a political culture, Recruitment is 
concerned with special political role socializations 
which occur “on top of” general political socializa- 
tion, However the political system is defined, re- 
cruitment to its posts taps people from various 
subcultures. Presumably, by giving recruits an in- 
side view of a political process, it trains them in 
appropriate skills and furnishes them with political 
cognitive maps and perspectives. 

The prevailing expectation among most scholars 
is that primordial perspectives and psychic man- 
agement patterns acquired early in life remain 
operative in later years and are overlaid by atti- 
tudes, values, and conduct patterns more or less 
acceptable and appropriate to one’s career aspira- 
tions—i.e., what one wants to make of one’s life. 
Whether these perspectives and patterns govern 
all basic decisions in later life, including one’s 
official acts, or whether they only color one’s po- 
litical judgment on certain problems or sharpen 
one’s political sensitivities in specific ways are un- 
resolved questions. 

Hard as it is to draw a line, early political so- 
cialization is relevant to the study of political re- 
cruitment only insofar as it affects one’s chances 
of being recruited or of desiring to enter into spe- 
cific roles in political life. Therefore, much current 
academic work about politicians is formulated in 
terms of their early socialization. Inferences are 
often uncritically drawn as though a person’s sub- 
sequent career experiences had only minor weight 
in determining his adult perspective on essentials. 
Yet the individual's life path through the corporate 
and primary group “infrastructures” of his com- 
munity provides crucial skills and attitudes as well 
as sponsors and material resources. Especially in 
poorly developed countries with weak infrastruc- 
tures and poor consensus, early political socializa- 
tion and later experience seem to be of roughly 
equal importance in determining the viability of 
the political order. Apart from the development of 
suitable career perspectives by in-service grooming, 
it is important to note that selective recruitment 
from the pool of eligibles may minimize the effects 
of imperfect early socialization on leadership in a 
modern civil society. 

Ordinarily, performing the role of voter or pur- 
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suing a citizen’s career are not included in dis- 
cussions of recruitment and career patterns in 
politics. More specifically, political recruitment 
refers to institutional processes by which political 
jobs beyond the citizenship level are filled. Political 
careers are patterns of incumbency in these politi- 
cal offices and roles. Subjectively, career perspec- 
tives are moving vantage points from which men 
in politics, whether treating public affairs as a 
calling or as a vocation, appraise their duties and 
opportunities. 


Selection devices and screening criteria 


Institutions use a variety of devices to fill all 
jobs. Through co-optation, equals augment their 
ranks or sustain their numbers. By mobilization 
supporters are rallied to perform basic tasks in 
organized efforts or to “act out” their contributions. 
Through appointment a key figure designates sub- 
ordinates to hold office or discharge tasks. Election 
elevates a man to act as spokesman, leader, or 
representative for the legally defined electorate 
that chooses him. Lot and rotation are deliberately 
arbitrary methods for securing incumbents for 
specialized roles from among a group of presumed 
peers. Apprenticeship and examinations, both for- 
mal and informal, often precede incumbency in 
political jobs. 

Formal recruitment processes in politics are typi- 
cally extramural or organizational threshold proc- 
esses, that is, they are designed to secure man- 
power from an outside pool or public. They give 
title to political offices and some claim to institu- 
tional resources, but they seldom bestow distinctive 
influence within the organization. Such influence 
comes, instead, when one can rally a following, 
claim to be an insider, show special prowess, or 
otherwise become distinguishable to colleagues. 
Intramural recruitment mechanisms are basic to 
most analyses of the organization's performance; 
these mechanisms select individuals to fill not only 
the organization’s formal offices and recognized 
positions but also optional informal roles. In the 
extreme sense of hiring and firing personnel, the 
formal processes can override the informal ones; 
extramural controls can prevail over intramural 
ones. In the ordinary course of political business, 
however, the intramural screening and reception 
system for the neophyte legislator, official, or party 
functionary is a more searching and more un- 
nerving process than any formal, or extramural, 
hurdle. Moreover, apart from official prerogatives 
that come with formal induction, the influence 
wielded inside a political organization is largely 
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determined by the intramural role patterns that a 
member finds himself conforming to, or—more 
rarely—carving out for himself. 

Of the formal induction modes, special atten- 
tion has been given to elections. Public machinery, 
including such devices as nominating conventions, 
primary elections and runoffs, various proportional 
representation (PR) formulas, and such contriv- 
ances as nonpartisan elections, is unlikely to be 
very efficient in picking the candidate with the 
best credentials of skill and ability for the job; 
style and native character are perhaps more likely 
to be discerned and appreciated by electorates; 
personality, one’s stand on current issues, and 
party sponsorship are widely felt to be the most 
important determinants of popular support. 

In the activities of candidate-sponsoring groups 
prior to elections, the considerations felt to be most 
relevant in the search for candidates do not always 
provide voters with a serious election day choice. 
The qualities that make a candidate “available,” 
i.e., that are felt to maximize his chances to win, 
are largely irrelevant when the election is not close. 

Certain recurrent processes of informal selection 
should be noted. One concerns the emergence of 
natural leaders, lieutenants, followers, and devi- 
ants. From political biographies and memoirs it 
seems clear that nuances of appraisal and advice 
occur in face-to-face situations that count heavily 
in processes of co-optation into inner circles. Pat- 
terns such as these are presumably part of the 
informal weeding-out or letting-in of political talent 
by those already in a legislature, a party organiza- 
tion, a bureaucratic clique, or a community power 
structure. 

People are sometimes accepted—and may feel 
themselves destined—as naturally possessing an 
office or rank in and through the established order 
of things, as in monarchical succession or the place 
conceded to a peer’s eldest son. Sometimes people 
are revealed during some critical occasion as hav- 
ing inimitable or indelible qualities previously 
hidden (e.g., charisma, strength, indecision), qual- 
ities held to be peculiarly appropriate (or inappro- 
priate) to a key office or rank. 

Examination and apprenticeship, as prelimi- 
naries to formal selection, raise interesting prob- 
lems. People are sometimes certified as eligible 
because they have indicated by appropriate exami- 
nation that they have the requisite range of skills, 
knowledge, or characteristics for a position or role. 
Somewhat different is the claim that past perform- 
ance and experience have demonstrated not only 
that one is competent but also that one is able to 
put skill, knowledge, and character to actual use. 
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The importance of sponsors—“who you know” 
rather than “what you know’—cannot be dis- 
counted in political recruitment. Who is to read 
the tests and letters, vouch for their authenticity 
or candor, and convince the selecting unit, whether 
it be electorate, executive, or conciliar body? 

Ascriptive criteria of recruitment in some meas- 
ure obviate the need for sponsors by making selec- 
tion automatic. Where achievement considerations 
are in point, commonly the electorate tends to con- 
sider which political groups are underwriting a 
candidate, the executive tends to heed a letter of 
reference from a well-placed source, and the con- 
ciliar body is deferential toward one of its own 
acting as a neophyte’s sponsor. A perfectly objec- 
tive test of skills or knowledge is hard to conceive, 
but using it would make selection as automatic as 
any hereditary claim in former centuries. Fears of 
“meritocracy” stress this eventuality. 

Mobilization is a distinctive kind of recruitment 
process. The tasks may be fairly complex; the re- 
cruits are often much on their own, The commit- 
ment is typically brief but strenuous; the experi- 
ence probably has important consensus-building 
functions. Activating an effective volunteer force 
in an election campaign is one example; coordi- 
nating a civil-disobedience demonstration is an- 
other; organizing a legislative coalition is yet a 
third variety; developing an effective subversive 
network is a fourth. All are garden-variety recruit- 
ment phenomena built around the competence of 
a cadre to train and deploy raw personnel. 

Two other mechanisms of recruitment need 
mention: seizure of office, by which it is suddenly 
and forcibly (or with a threat of force) taken, as 
in palace revolts, conquests, and coups d'état; and 
purchase of office, by which it is turned into a com- 
modity or some aspects of its performance are con- 
trolled as investments. Both mechanisms have 
been widely used in the world’s history, sometimes 
to the virtual exclusion of other methods but more 
commonly in tandem with more conventionally 
acceptable practices. Purchase has special signifi- 
cance as a plutocratic mode of influence. Montes- 
quieu argued in favor of selling all public employ- 
ments in monarchies, on the ground that, in most 
cases, “chance will furnish better subjects than the 
prince’s choice” (Spirit of Laws, V, 12, 19). Venal- 
ity in offices is thus seen as “functional” across the 
board in a monarchy, although Montesquieu agrees 
with Plato that the same practice would undermine 
virtue in a republic. Seizure is especially in point 
when top-elite succession patterns are poorly rou- 
tinized—that is, when key figures hold multiple or 
vaguely defined offices for indefinite terms and are 
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not accountable to any definite constituency and 
when “juridical defense” (in Mosca’s terms) is 
grossly inadequate. 


Career lines and career perspectives 


One of the basic questions about any system of 
public order is how smoothly it provides career 
opportunities to each successive generation. Armies 
get top-heavy with old-fashioned men; civil serv- 
ants of the “old school” are inevitably blocking 
someone else’s progress, In former colonial do- 
mains the agitational elite, whose political skills 
and beliefs often were acquired in prisons, in crowd 
situations, and in conspiratorial circles, has inher- 
ited the modern apparatus of nationhood and 
power while still young in years. As a group, how- 
ever, its members lack the executive skills and 
educational credentials needed for policy making 
when government is seen as the programmatic 
vehicle for rapid modernization and social trans- 
formation. The next generation, not much younger, 
feels itself better qualified. Some join radical and 
subversive movements; many follow career paths 
that siphon their energies into professional and 
corporate roles as policy implementers. The formal 
educational system, and its typical apprenticeship 
scheme for postgraduate service, has been crucial 
in furthering political integration. One common pat- 
tern has brought young people to the central insti- 
tutional complex of their society, provided them 
with a collegial setting for higher education, im- 
bued many with an elite-corps image of themselves, 
and then deployed them to the peripheral localities 
of the nation once again. 

The ways in which men move from nonpolitical 
career lines, either bureaucratic or civilian, into 
careers in politics and public affairs have begun to 
receive special attention. Thus, the “birth elite” and 
the “mobile elite” may be distinguished in any field 
of endeavor: they are, respectively, those whose 
level of success approximates that of their fathers 
and those who have risen above the family status 
into which they were born. Research suggests that 
those embarking upon political careers after up- 
wardly mobile success in private vocations are dif- 
ferently motivated, follow different codes of con- 
duct, and display different sensitivities than those 
whose private careers began from more advanta- 
geous parental platforms. To what extent does the 
second-career phenomenon reflect premature dis- 
enchantment with the original choice? Durkheim 
held that economic careers in Western nations in- 
volve no coherent system of public service or cul- 
tural transmission; rather, the emphasis has been 
on denying public purposes, resisting public ac- 
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countability, minimizing public responsibility as 
aspects of economic life, To the extent that this is 
so, entering politics and abandoning his first career 
may have the symbolic meaning to a businessman 
of rejecting a selfish life in favor of work that tran- 
scends his personal needs. Quite different is the 
second career in politics for a man coming from 
a career as a journalist, professor, minister, mili- 
tary man, or top civil servant. Each of those careers 
explicitly generates a rationale of cultural trans- 
mission or public service. These differentials are 
reflected in recent attitude studies of local officials, 
legislative candidates, lobbyists, and administrators. 

In many emergent countries with weak infra- 
structures and small, educated elites, lateral access 
to political roles is easier, Men of high status in 
any walk of life are so scarce and so conspicuous 
that they receive generalized deference. They are 
expected to be actively concerned with public af- 
fairs, By contrast, in societies with more elaborate 
infrastructures, lateral access is difficult and often 
short-lived. Men of high status in nonpolitical fields 
of endeavor are common; outside their specialty 
they are not well known; outsiders, including poli- 
ticians, are likely to give deference only to their 
professional or corporate status and not to them 
personally. 

Lawyers in American politics illustrate many of 
these points. They are trained as advocates in a 
profession that does not have to be practiced con- 
tinuously, that is convenient to fall back upon, and 
that rests on transferable skill and discipline of a 
kind needed to analyze the formal bases of govern- 
mental policies and bureaucratic procedures, how- 
ever these may vary historically. With legal work 
to return to, one could risk entering politics at a 
younger age. With legal discipline to help in grasp- 
ing policy issues and institutional procedures, one 
could hope to go further in politics, shifting from 
local to regional to national arenas of power, mov- 
ing from executive posts to legislative circles. Evi- 
dence shows, however, that law-enforcement posts 
—from police chief and prosecuting attorney to 
attorney general and trial or appellate judge—are 
becoming the only kind for which legal credentials 
and training give one a discernible career edge. 
Lawyers have distinctive technical roles to play in 
policy-making processes, both legislative and bu- 
reaucratic. But versatile politicians are nonlawyers 
in background more frequently than was true in 
a simpler age, when both law and politics were less 
complicated by bureaucratic structures. 

Political recruitment studies thus far have placed 
little emphasis on what happens en route or what 
hurdles and screens are deliberately erected by 
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organizations. Disjointed career orbits, haphazard 
talent searches, and imprecise skill requirements 
complicate inquiry. 

Two basic institutional disjunctions commonly 
occur: one concerns the level of government, the 
other concerns the kind of institutional stage of 
politics in which one’s career orbit lies. In general, 
the longer an individual remains at any one level, 
the harder it is to move; the more habituated he 
becomes to one kind of platform, the harder it is 
to make a transition. The career orbits of local and 
provincial politics overlap in discernible but atten- 
uated patterns with the orbits of national political 
careers. At all levels, too, there are persistent tend- 
encies for political decision-making processes to 
diverge from bureaucratic ones. The political proc- 
esses, with their legitimatizing and informing func- 
tions, tend to develop skills in symbol manipula- 
tion; the bureaucratic processes, with their planning 
and allocation functions, tend to stress skills in 
resource manipulation. 

The political realm is fragmented everywhere, 
with provincial and central institutional complexes 
creating career orbits that interpenetrate at only 
a limited number of points. Men serve one term in 
local or provincial legislative office, find the work 
onerous or dull and the prospects of advancement 
discouraging, and quit. Both in America and Eu- 
rope, evidence suggests that about half the incum- 
bents in public offices at the lower and middle 
reaches of power and governance are serving for 
the first—and last—time. Even for those who per- 
sist, lateral movement and occasional retirement 
into civilian pursuits are likely. Moving to the level 
of national politics is more remote and more diffi- 
cult to prepare for; often it depends on political 
windfalls and personal friendships. 

The contrast in grooming processes between the 
American and Soviet systems is suggestive. Democ- 
ratization of government—including spoils poli- 
tics—brought, in the late nineteenth century, 
vocational careers to American electoral politics. 
Since then, a typical career has involved a sequence 
of campaigns for elective offices, hopefully with 
bigger constituencies and broader responsibilities 
each time. The key problem is when to seek which 
office; the crucial skill is the ability to: persuade 
equals and appeal to publics. In the Soviet Union 
an early twentieth-century revolutionary cadre had 
to be transformed into a political control apparatus. 
A typical career now starts at the bottom of the party 
bureaucracy; grooming involves a standardized 
educational sequence plus tours of duty in special- 
ized policy fields and at provincial levels; advance- 
ment is based on “pull” and support, as well as on 


tested abilities. The key problems are how to move 
bureaucratic gears and secure substantive policy 
results; the crucial skill is the ability to prod sub- 
ordinates and please superiors. In short, it is an 
intramural, rather than an extramural, recruitment 
system. 

In many systems those who control political ad- 
vancement want to make firsthand appraisals of 
the most promising eligibles. At the same time they 
find it hard to say just what qualities they are 
looking for. The criteria kept in mind when co-opt- 
ing a member or sponsoring an aspirant for a 
relatively high political job presumably include an 
estimate of what is being risked on an unknown 
quantity. When a recruit is rated for advanced 
work, the situation is often quite different before 
and after he has been tested in the relevant con- 
text. Beforehand, those who control the screening 
must be content to evaluate his potential by refer- 
ring to allegedly parallel or analogous experiences 
in his record. Often these must be vouchsafed by 
a sponsor; often, too, the parallels do not quite fit. 
The sense of confidence that a recruit has the de- 
sired qualities is markedly different after he has 
served a probationary period within his screening 
committee’s view, even if his period of duty has 
been uneventful. 

Some evidence suggests that political appoint- 
ment processes depend far more on the aspirant’s 
initiative and on haphazard searches for talent than 
is true in other fields, Even in the case of chief 
executives, whose appointive power is very wide 
and whose talent pool in theory could be society- 
wide, seldom is there more than a casual effort to 
compile rosters of potentially recruitable appointees 
for high positions. It is not enough to explain this 
political lethargy by arguing that only by screen- 
ing political appointees through party-controlled 
machinery can men be secured with an appropriate 
political outlook. Rather it seems that political elites 
are content to co-opt from their “doorstep,” so to 
speak, partly because only in that way can they 
apply their own judgment of the man or use the 
judgment of familiar and trusted associates. 

Analysis of political recruitment practices is just 
beginning to disclose their effects on the skills and 
attitudes acquired during one’s passage through in- 
termediate levels of political life. A number of 
convenient and plausible notions turn out to be 
doubtful. Not everyone entering at the bottom 
wants to get to the top—or even, for that matter, 
to the upper levels of the immediate political ap- 
paratus in which he works. It is seldom true that 
the manpower needs, either for advanced and pub- 
lic positions in the political hierarchy or for the 
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episodically used machinery of election campaigns, 
are filled by any systematic process of search and 
appraisal. Nor can it be demonstrated that when a 
certain kind of political job increases in functional 
significance—e.g., when historical circumstances 
make specialists in symbol manipulation less vital 
to political stability than, say, specialists in vio- 
lence or specialists in resource allocation—there 
is a corresponding increase in either material re- 
wards or deference to those filling the position. It 
is not commonly true, either, that political careers 
regularly follow a simple ladder from less exacting, 
less significant, or poorly paid posts to more de- 
manding, more important, and well-paid positions. 


There are persistent motives and purposes en- 
gendered and rekindled in a modern pluralist 
society that cause perhaps every twentieth person 
to enter politics actively, work at it, accept its 
terms, and sometimes wish to rise in it (see Lane 
1959 for U.S, estimates and Almond & Verba 1963 
for European figures). Moreover, there are both 
recurrent and emergent manpower needs in the 
institutional processes ancillary to control of gov- 
ernment; these manpower needs prompt those in 
politics to recruit others, offer them appropriate 
rewards and gratifications, and in some cases help 
them to rise politically. 

The criteria for political advancement are con- 
stantly changing. The ways by which younger and 
more vigorous men learn the trade of politics from 
those ahead of them cannot be studied without 
attention also to the conditions that cause new 
generations in politics to perform in a different 
style and to live by different standards of respon- 
sible conduct when in office. 

Research is needed that will identify the inim- 
itable characteristics of those who rise to the 
highest rungs of the political ladder—what distin- 
guishes them from the second echelon, whose qual- 
ities are more easily imitated. Much contemporary 
analysis is concerned with the lag between the 
“functional needs” of a political system and the 
“institutional recognition” of priorities in meeting 
such needs. Political leaders of the first order in- 
novate in ways requiring skills and resources not 
widely available; yet, they set styles and impose 
performance standards that, once demonstrated, 
can be imitated by others. In this sense, the career 
perspectives of the next generation are molded by 
contemporary examples and by contemporary in- 
adequacies. Future political leaders still at the on- 
set of their careers, then, must always find them- 
selves being screened for capacities their elders 
never had to have, trained to cope with develop- 
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ments only gropingly dealt with by the current 
incumbents, and equipped with an awareness and 
sensitivity to new problems in political decision 
making, posed by impending breakthroughs in sci- 
ence and technology. 

DWAINE MARVICK 


[See also CIvIL SERVICE and OCCUPATIONS AND CA- 
REERS. Directly related are the entries ADMINISTRA- 
TION; LEADERSHIP, article on POLITICAL ASPECTS; 
LEGISLATION, article on LEGISLATIVE BEHAVIOR; 
PARTIES, POLITICAL; POLITICAL EXECUTIVE; POLITI- 
CAL PARTICIPATION; REPRESENTATION. Other rele- 
vant material may be found in POLITICAL CULTURE, 
PROFESSIONS. ] 
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POLITICAL SCIENCE 


This article provides a broad introduction to the 
discipline of political science. The major subfields 
are reviewed in the entries INTERNATIONAL LAW; 
INTERNATIONAL RELATIONS; POLITICAL BEHAVIOR; 
POLITICAL THEORY; POLITICS, COMPARATIVE; PUB- 
LIC ADMINISTRATION; PUBLIC LAW. For individual 
contributions to the development of the discipline 
see the biographies of BAGEHOT; BARNARD; BEARD; 
BENTHAM; BENTLEY; BRECHT; BRYCE; COKER; 
CONDORCET; FOLLETT; GoopNow; HELLER; KEY; 
LINDSAY; LIPPMANN; LOWELL; MAINE; MARX; 
MERRIAM; MICHELS; MILL; Mosca; OSTROGORSKII; 
PARETO; RICE; RICHARDSON; SCHMITT; TOCQUE- 
VILLE; WALLAS; WEBER, MAX; WILLOUGHBY; WIL- 
son. Related material from other disciplines may 
be found in POLITICAL ANTHROPOLOGY; POLITICAL 
SOCIOLOGY. 


Political science in mid-twentieth century is a 
discipline in search of its identity. Through the 
efforts to solve this identity crisis it has begun to 
show evidence of emerging as an autonomous and 
independent discipline with a systematic theoret- 
ical structure of its own. The factor that has con- 
tributed most to this end has been the reception 
and integration of the methods of science into the 
core of the discipline. 

The long failure of political science to assert a 
fundamental coherence of subject matter led some 
scholars to deny that it could ever form an auton- 
omous field of research coordinate with other 
social sciences, such as sociology, anthropology, 
and psychology. They would rather assign it to the 
category of an applied field, one in which the the- 


oretical concepts formed in the other social sci- 
ences are applied in the study of political institu- 
tions. But this assessment of the theoretical status 
of political science is largely a result of the failure 
to perceive the profound revolution that has taken 
place in the discipline, especially since World War 
1. In these decades political science has taken some 
firm and well-articulated steps toward its own re- 
construction as a theoretical discipline. 

During the many centuries from classical antiq- 
uity almost to the end of the nineteenth century, 
the study of political life remained not a discipline 
in the strict sense but a congeries of inherited in- 
terests. Only in retrospect, when modern criteria 
are imposed upon the thought of past social philos- 
ophers, is it possible to identify their intellectual 
concerns as indeed part of what today we choose 
to call political science. As a result, by the time po- 
litical science took shape as an independent aca- 
demic discipline, it had assumed a thoroughly 
synthetic character; its subject matter appeared to 
consist largely of a collection of loosely related 
topics handed down and modified through the ages. 
On the surface, all that seemed to draw these in- 
terests together was their relationship to some 
vaguely defined political institutions and practices. 

If we examine the history of political inquiry in 
the last 2,500 years, we discover that, for the most 
part, the questions which achieved prominence in 
the thought of those social philosophers who dealt 
with political matters not unnaturally reflected the 
major issues of the day, Over the course of time 
these topics had accumulated, so that the older 
political science grew as an intellectual pursuit, the 
greater was the volume and variety of subjects it 
embraced. By mid-twentieth century the discipline 
as a whole threatened to collapse under the strain 
of absorbing and imposing a logical and consistent 
order on a staggering load of knowledge about 
highly diverse subjects. f 

By then, however, there also were clear signs 
that the traditional way of selecting problems for 
research was threatening to change fundamentally. 
A nagging question had begun to press in earnest 
upon the members of what by that time had be- 
come a highly specialized discipline. Is political 
science indeed only a synthetic discipline, coms 
posed of a mélange of topics as suggested by his- 
torical concerns? Is political science little more 
than a historical accident that has brought together 
all thinking in the general neighborhood of govern- 
mental or political institutions—and no more pre- 
cisely or profoundly defined than that? Or is it pos- 
sible to say that it is in any sense a theoretical 
discipline with a definable intellectual unity? 


The basic subject matter 


Two widely differing sets of criteria have 
emerged, in the last hundred years or so, for dif- 
ferentiating political life from all other aspects of 
society and, thereby, for isolating the subject mat- 
ter of political science. The one has sought to define 
political life in terms of the institutions through 
which it is expressed; the other has turned to the 
activities or behavior of which institutions are the 
particular historical forms. Under the first set, po- 
litical science has been described, not very pro- 
foundly, as the study either of governmental (or 
political) institutions or of the state, Under the 
second set, which did not gain widespread accept- 
ance until well into the twentieth century, it has 
been characterized as the study either of power 
or of decision making. 

Institutional criteria. Two institutional ap- 
proaches may be distinguished. 

Governmental institutions. To this day, the 
most frequent and most popular way of describing 
the field of political science is as the study of gov- 
ernmental or political institutions (Bentley 1908; 
Truman 1951). Yet it is the least useful, since it 
leaves almost entirely to intuition the sorting of 
political institutions from all other kinds. [See 
GOVERNMENT.] 

As the basic conceptualization of the field, it 
begs the vital question. It does not help us to dis- 
tinguish governmental or political institutions from 
other kinds. We are left as much in the dark about 
the subject matter of political science as ever, that 
is, at the mercy of our intuitions. This way of 
orienting oneself to political research virtually 
abdicates all responsibility for elevating the theo- 
retical level of the discipline, Just what is to be 
included within it must rest on the emergent but 
undefined consensus of each generation of political 
scientists. 

The state. No conceptualization of the subject 
matter of political science has had a longer history 
than that of the “state.” Its origins as a way of 
orienting political inquiry lie buried in the sixteenth 
and seventeenth centuries. Machiavelli is often 
cited as one of the first known users of the term, 
but his invention of it is disputable. However, dur- 
ing these centuries it slowly emerged as a substitute 
for earlier terms to refer to important political en- 
tities, such as kingdom, land, principality, com- 
monwealth, republic, dominion, and empire (Mac- 
Iver 1926). 2 

The state’s long tradition as a central political 
concept testifies more to the persistence of practical 
political interests in setting research perspectives 
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than to the utility of the idea for understanding 
political life. In fact, its shortcomings for theoret- 
ical purposes have become so widely recognized 
since World War m that its professional use has 
dwindled to a small fraction of its previous fre- 
quency (Easton 1953). For research and analytic 
purposes, scholars have denuded the term of most 
of its implication; it has been reduced simply to a 
neutral and empty conceptual shell for identifying 
the actors in the international sphere. In its place 
is appearing “political system,” a concept that is 
burdened with few practical political overtones. 
[See STATE, article on THE CONCEPT.] 

Functional criteria. In the face of the theoret- 
ical limitations inherent in institutional concepts, 
it is scarcely surprising that efforts were made to 
develop alternative first approximations to a gross 
description of the major variables of political sci- 
ence, Dissatisfaction with an institutional approach 
has given birth to a variety of interpretations that 
have at least one quality in common: they all iden- 
tify the subject matter of political science as a 
kind of activity, behavior, or, in a loose sense, func- 
tion. Although some definitions of this kind orig- 
inated in the nineteenth century, it is only since 
the middle of the twentieth century that they have 
finally come to be accepted as an approach superior 
to an institutional one, The specification of the 
political function in a society permits political sci- 
entists to generalize their subject matter. It is no 
longer limited in any way by the variable historical 
structures and institutions through which political 
activities may express themselves, whether they be 
in the form of highly centralized “states,” undiffer- 
entiated tribal systems, or diffusely organized in- 
ternational systems. 

Power. During the nineteenth century, concep- 
tualization of political science as the study of the 
state had reached its zenith in the Staatslehre 
(state theory) school of political inquiry in the 
German-speaking countries. Its major character- 
istic was that it reduced political inquiry to an 
idea of the state interpreted as a body of formal 
constitutional norms. Ultimately this converted 
political science to an arid legal formalism that 
turned away entirely from social reality and at 
times even seemed to escape contact with legal 
reality itself. 

The opponents of this school saw the state not 
as a body of legal norms but as a set of social 
groups in eternal competition for power over its 
instrumentalities. For example, Marx, Treitschke 
(1897-1898), and the early political sociologists, 
such as Gumplowicz (1885), Ratzenhofer (1893), 
and Oppenheimer (1907), saw force and power, 
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especially in the struggle and conflict among 
groups or classes, as an inherent aspect of political 
relationships. In the United States it took somewhat 
longer for this change to gain acceptance, if only 
because it was frequently associated with unaccept- 
able European social philosophies. But by the 1930s 
Catlin (1930) and Merriam (1934) were arguing 
for an interpretation of politics as a set of power 
relationships, and they were joined in short order 
by many others (Lasswell 1936; Lasswell & Kaplan 
1950; Key 1942). 

As a means of approaching the analysis of polit- 
ical phenomena, power has been dramatically suc- 
cessful in breaching the walls of an institutional 
approach and in blazing a new pathway, one that 
has led toward a functional conceptualization of 
political science. An enormous amount of time 
and energy has gone into describing and defining 
power relationships, between individuals, groups, 
and nations and within national political systems, 
local communities, and organizations. It has com- 
manded the attention of all disciplines. Yet as a 
central theme, power has one overwhelming draw- 
back, Despite all efforts, the idea of power remains 
buried under a heavy cloud of ambiguity. It has 
been suggested that the time may be long overdue 
for giving serious consideration to the question of 
whether the social sciences ought not to abandon 
the idea entirely as a useful concept for the direct 
purposes of analysis and research (March 1966). 

But we may wish to adopt a more hopeful atti- 
tude. We may assume that the inability to achieve 
a clear understanding of the referents of power 
stems from insufficient research or from inade- 
quate but improvable tools of analysis, rather than 
from the excessively global character of the term 
itself. In that event we would be left with other 
equally insurmountable conceptual barriers to the 
use of power as an orienting concept. Even with 
a stable, unambiguous meaning, power would still 
be at one and the same time too narrow and too 
broad to describe even grossly the boundaries of 
political research. 

It is too narrow because political interaction, 
within any normal use of the term, involves more 
than the control of one person or group by another 
or efforts at reciprocal influence. Power, it is true, 
appears in all political interactions. However, each 
political relationship may include other aspects as 
well, and we can neglect these only at the peril of 
failing to achieve an adequate understanding of the 
situation. 

The description of politics as the study of power 
also casts the definitional net far too broadly. Con- 
flicts over control take place in all areas of life, 


not just the political. A parent has power over a 
child, a priest over his parishioner, a friend over 
a friend, a corporation over its employees, If we 
wished, we might choose to designate all of these 
kinds of power relationships as political. In this 
event, by fiat we could convert the study of politics 
into the search for a general theory of power ap- 
plicable to all social relationships. 

But if we were to undertake to do this, it would 
still leave an unresolved question on our hands. 
Normally many of these kinds of power relation- 
ships seem to fall far outside even the broadest 
conception of the frame of reference of political 
science. The contro] of a parent over a child is 
seldom considered to be political, except in an ana- 
logical sense. But if this is granted and it is accept- 
ed that there are some power situations that are not 
political, we would then be left with the task of 
devising criteria for distinguishing political power 
from all other types, such as parental or, for that 
matter, economic, religious, and the like. In this 
event we would be right back where we started, 
at the point of searching for a workable criterion 
for identifying the “political.” 

At the very most, a general theory of social 
power would be extremely helpful for shedding 
light on the properties of power relationships in 
a political setting. In itself this would be no mean 
accomplishment. But beyond that, it would not 
help us to attain a conceptualization of political re- 
lationships as a whole, [See PowER.] 

Decision making. Toward the middle of the 
twentieth century there appeared an important and 
popular variation on the theme of power as the 
central subject of political science. In this view, 
power attains significance because it leads to con- 
trol over the processes through which public de- 
cisions are made and put into effect. This interpre- 
tation was quickly absorbed into the discipline. It 
has led to the direct interpretation of political life 
as a set of relationships through which public de- 
cisions or policies are formulated and put into 
effect. Power as a component recedes into the back- 
ground, as just one possible determinant of de- 
cisions. 

The description of political science as the study 
of the making of public policy has become so wide- 
spread, particularly in American political science, 
and has become so much a part of the normal and 
even unstated intellectual apparatus of most su 
dents of politics, that its elaboration as a major 
concept can no longer be associated with any single 
person or group. Its beginnings may be found in 
the work of Carl Schmitt in Germany, where it 
arose as a response to the years of indecision fol- 


lowing World War 1 [see ScuirT]. At that time, 
however, the idea left little impact on political 
science as a discipline, and it was only through the 
efforts of organizational theorists in the United 
States in the 1940s that its general significance for 
political analysis and research became apparent. 

From its beginnings in the study of organiza- 
tional behavior, the study of policy making has 
spread into virtually every sector of political re- 
search. Indeed, its use by political science reflected 
the even broader penetration of the idea of decision 
making into all of the social sciences. Decision 
making has proved to be one of the truly seminal 
ideas of the decades following World War 1, and 
few conceptualizations of political science can 
afford to ignore it entirely. [See DECISION MAKING, 
article on POLITICAL ASPECTS.] 

But however transparently relevant a decisional 
approach is to understanding at least one sompo- 
nent of political processes, decisions, like power, 
are characteristic of all other spheres of life as 
well. Similar decisional behavior occurs in such 
organizations as trade unions, corporations, 
churches, and families, as well as in the political 
system. Thus the mere definition of political sci- 
ence as the study of decision making provides little 
guidance in differentiating political decisions from 
other kinds of decisions. The adoption of decision 
making as the major focus would, to be sure, pro- 
vide us with a general theory of social decision 
making, which would undoubtedly throw some light 
on our understanding of decision processes in a 
political context. But we would still need criteria 
by which to isolate political decisions from other 
kinds of decisions. The concept in and of itself is 
insufficient to delineate, even at the grossest level, 
the range of data that any initial description of 
political science would have to include. 

The political system. On the whole, the efforts 
at describing the broad range of interests of polit- 
ical science have been less than successful in the 
decades following the turn of the twentieth cen- 
tury. Institutional definitions, such as government 
and the state, merely serve to define one unknown, 
political science, by other unknowns. Functional 
conceptualizations based on power and decision 
making go well beyond any range of interests that 
political scientists would at least intuitively seek 
to embrace within their discipline. 

Each description of the subject matter of polit- 
ical science has something to commend it, if only 
because no one way of accounting for the intrinsic 
cohesion of any field is either the only or the neces- 
sarily correct one, Each mode of conceptualization 
creates its own blind spots and opens its own 
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unique windows on political reality. The adequacy 
of any formal definition will depend on the extent 
to which it gives a sufficiently general description 
of the subject matter so that past and current con- 
tent areas accepted by most scholars are not exclud- 
ed—or if they are, it is with convincing theoretical 
justification. 

No more than for sociologists, economists, an- 
thropologists, or psychologists is it appropriate for 
political scientists to presume to adopt the total 
behavior of a society as their special concern. 
Rather, from all social interactions, political science 
can abstract only those kinds which its theoretical 
perspectives dictate to be political in nature. In 
this sense, politics as a field of study is analytically 
distinct from the other disciplines. By the same 
reasoning, since each discipline directs its special 
attention to only one aspect of the total aggregate 
of interactions called society, no one discipline is 
“more basic” than or prior to the others. In its 
theoretical status each represents an equivalent 
level of abstraction from the totality of interac- 
tions in which the whole biological person engages. 
[See SYSTEMS ANALYSIS.] 

We may usefully identify the political interac- 
tions in a society as its political system, rather than 
as government, the state, power, or a set of deci- 
sion-making processes. How are we to distinguish 
this system from other systems of behavior, such 
as the religious, economic, psychological, and cul- 
tural? In answering this question, we shall simul- 
taneously obtain an initial, gross conceptualization 
of political science, one that can serve as a point of 
departure for distinguishing political science as a 
theoretically separate and autonomous discipline. 

We may describe the political system as that 
behavior or set of interactions through which au- 
thoritative allocations (or binding decisions) are 
made and implemented for a society ( Easton 1953; 
1965a; 1965b). The implications of this thumbnail 
definition are enormous, and here we can examine 
only the most salient ones. 

The authoritative allocation of values. Scarcity 
prevails in all societies. This is a fundamental start- 
ing point for political analysis. There are not 
enough of the values (valued things) to meet the 
wants of the members of a society. This is as vital 
a political assumption as it is a sociological, anthro- 
pological, and economic one. Differences and con- 
flicts over values in scarce supply cannot be avoid- 
ed. In most controversies, members in all societies 
are able to negotiate settlements independently, 
without the intervention of any special agency 
speaking in the name of the society. Integration of 
social behavior is in large part a consequence of 
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the independent interaction of members as dictated 
by their personalities, the social structure, and the 
culture. Through such autonomous behavior they 
are able to adjust their differences, however accept- 
able or distasteful any resolution may be to the 
participants. 

But in all societies there are always some valued 
things over which differences cannot be easily ne- 
gotiated by the members themselves. In these cases, 
empirically we always find that special institutions 
or processes have emerged as mechanisms for help- 
ing to impose a settlement. Without such special 
means at the disposal of a society, its integration 
might be threatened. In addition, collective pur- 
posive action would be impaired, if not eliminated. 
If the society sought to adopt any objectives requir- 
ing the combined effort or resources of part or all 
of its members, some means would have to be 
found to organize and direct their energies. Differ- 
ences would have to be reconciled or regulated so 
that purposive action in the name of the society 
could become possible. This involves more than 
just the establishment and maintenance of order, 
although that may be and typically has been a 
primary objective. But regardless of the goals, each 
society has in fact devised some means for regulat- 
ing differences, so as to be able to coordinate the 
efforts of its members. 

One way of achieving this result is by invoking 
force in the name of the society. Others are through 
the use of persuasion, manipulation, ad hoc media- 
tion, and the like, The weakness of these methods 
is that they do not offer stable and regularized pro- 
cedures through which conflicts over valued things 
may be negotiated when the members are not able 
to arrive at an autonomous settlement. 

Regular and stable means for meeting this situa- 
tion require two things at the very least: first, 
structures and procedures for undertaking deci- 
sions and related actions—what we may call allo- 
cations—that will help reduce or regulate differ- 
ences, and second, some guarantee that there will 
be a greater rather than a lesser likelihood that 
these decisions and actions (allocations) will be 
accepted as authoritative. That is, the outcomes 
of efforts to regulate the differences must have a 
relatively high probability of being accepted as 
binding. 

In structurally differentiated societies, the mak- 
ing and implementing of decisions with which the 
members normally comply are assigned to special 
institutions that have come to be known as “govern- 
mental.” But in small, nonliterate societies, where 
structures are combined and unspecialized, such 


tasks will usually be undertaken by persons in 
social roles that cannot be described as either 
purely or largely governmental. A lineage head in 
a tribal society may be chief religious leader, regu- 
lator of hunting (economic director), and familial 
chief, as well as the paramount negotiator of dis- 
putes not regulated by the members themselves. 

In every society, therefore, we can expect to find 
those kinds of interactions which have to do pri- 
marily with influencing and shaping the way in 
which authoritative allocations of values (decisions 
and actions) are made for the society. It is these 
interactions taken collectively which constitute the 
behavior to which the term “political system” re- 
fers; it is the study of these interactions that pro- 
vides the basic subject matter of political science. 
We must examine each of the component terms 
of the phrase “authoritative allocations of values 
for a society” if we are to appreciate its full impli- 
cations. 

Politics revolves around allocations. An alloca- 
tion distributes valued things among the members 
of a society. In providing safety, a policeman helps 
to allocate this value differently from the way it 
would have been allocated without his presence; 
in building roads, a government offers a benefit to 
its users and a deprivation to other taxpayers, for 
whom it may have no importance at all. An alloca- 
tion may occur in three significantly different 
ways: when a decision or action prevents a mem- 
ber from retaining a value that he already posses- 
ses; when it prevents him from obtaining one he 
would like to possess; when it gives him access to a 
value that he might not otherwise have obtained. 
Generally an allocation apportions benefits or dep- 
rivations in such a way that they are different 
from what they would have been but for the allo- 
cative activities. 

An allocation of values may be formal. In mod- 
ernized political systems, allocations take such 
forms as legislation, laws, judicial decisions, and 
administrative decrees. They may be informal in 
these systems, as when administrative action sub- 
stantially modifies or subverts a law in the proc- 
ess of applying it. They may also be informal, as 
in nonliterate societies, where a council of elders 
may meet and achieve an unarticulated consensus 
on what ought to be done, and the responsible mem- 
bers of the lineage or clan may then feel impelled 
through custom to undertake the necessary actions. 
Control over allocations may be diffused through- 
out a society, as in a direct democracy, Or it may 
be confined to the hands of a few, as in an autoe- 
racy. The allocations may benefit all members of 


a system or only a powerful few. In each case some 
redistribution of values has taken place in the 
sense we are here discussing. 

But allocations occur in all spheres of life. The 
distinctive property about an allocation in the po- 
litical system is that it is usually highly probable 
that the decisions and actions will be accepted as 
authoritative. If this is not so, either the system is 
in the throes of collapse or the members are no 
better off than they would be without a political 
system. In that event, there would be no way of 
fulfilling what we may assume to be a requirement 
in all societies for political kinds of settlements. 

To say that an allocation is authoritative does 
not necessarily imply that it will be accepted as 
legitimate. This conclusion is made possible only 
if we equate authority with legitimacy—a possible, 
but not a necessary, identity, Thus, a totalitarian 
usurper may be able to allocate values through the 
political processes even though a majority of the 
members in the system consider his power to be 
illegitimate. Yet out of fear of the consequences 
they may accept his decisions and actions as bind- 
ing. As long as there is a greater rather than a 
lesser probability that most of the members will 
accept a decision, and its implementing actions, as 
binding most of the time, it is authoritative. 

There are numerous reasons why members ac- 
cept. allocations as authoritative. They may do so 
because of tradition and inertia, love and affection 
for the rulers, fear of violence for failure to com- 
ply, self-interest, or loyalty. But as is often the case, 
compliance may result from a strong conviction 
that it is right and proper to obey those who make 
the decisions and put them into effect, that is, that 
they are legitimate. In this use of the concept 
“authoritative,” legitimacy is only one ground for 
the acceptance of an act as having this quality. 
But regardless of the grounds for considering a 
decision and related actions as authoritative, what 
separates political allocations from other kinds is 
the fact that this compelling quality is associated 
with it. [See Lecrrrmacy.] 

But political science is not centrally interested 
in every allocation, even when it is accepted as 
authoritative. In every organization, aside from the 
political system, there normally are those who en- 
gage in making and implementing decisions that 
are considered binding by the members of the 
organization. If we wished, we could so broaden 
and redefine the scope of political science that it 
embraced the study of authoritative allocations 
wherever they are found. In this event, the making 
of binding decisions in a family, church, trade 
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union, fraternal club, or corporation would become 
phenomena that lie at the heart of political re- 
search. But there is a more useful and theoreti- 
cally more economical way of handling this situa- 
tion. It will also enable us to obtain the full benefits 
of research into authoritative allocations wherever 
they may occur, and yet it will clearly distinguish 
political allocation as an analytic type different 
from others, 

Theoretically we would be adhering more faith- 
fully to the long traditions of political inquiry if 
we confined the concept “political” to those author- 
itative allocations which apply to a society rather 
than only to an organization within that society. 
That is to say, we would devote our attention to 
those allocations which are normally accepted as 
binding by most members of the society, whether 
or not the members themselves are actually af- 
fected by them. This contrasts with the binding 
decisions and actions effected by organizations 
within the society, Such allocations need be accept- 
ed as authoritative only by the constituent members 
of the organization; other members of society need 
not consider themselves bound in any way. Thus, 
political allocations in the inclusive sense proposed 
here are societal in their scope and implications. 
It is because of this and the functions they fulfill 
for a society that formal and special sanctions, 
such as the use of force, are often associated with 
them. But this is only a typical accompaniment, 
not a necessary one, in political systems (Schapera 
1956). 

This conceptualization of political science need 
not neglect the obvious fact that in other kinds of 
organizations, authoritative allocations are also 
undertaken and that the investigation of the proc- 
esses surrounding them will aid immeasurably in 
understanding similar processes within the socie- 
tal political system. If we wished, we could describe 
those aspects of voluntary organizations, families, 
lineage segments, or interest groups which deter- 
mine the way in which binding decisions are made 
and implemented for these groups as their political 
systems, We could then distinguish organizational 
political systems from the societal political system. 

To be sure, political scientists have typically 
been interested in the internal processes of such 
groups and organizations, for at least two reasons. 
First, in most societies such groups help to influ- 
ence the way in which binding decisions are for- 
mulated, their content, and their implementation. 
But this is only a derived interest. It flows from 
the hypothesis that we cannot understand the au- 
thoritative allocations for a society without becoming 
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thoroughly familiar with the internal operations of 
groups that influence these allocations. But a sec- 
ond possible reason is that these organizations and 
social units look very much like political systems, 
at least in microcosm. We might even designate 
them as parapolitical systems, the comparative 
study of which might help shed light on the polit- 
ical processes of the broader society (Easton 
1965a). But recognition of their relevance for polit- 
ical research is entirely different from equating 
them with the object at the center of attention of a 
science of politics, i.e., the political system. 

Resulting areas of research. The areas of re- 
search for political science that flow from this de- 
scription cannot be itemized here. In general, they 
will encompass all the structures, processes, and 
activities more or less directly related to the making 
and implementing of authoritative allocations for 
a society. But we cannot know in advance what 
these are under specified historical conditions. They 
will vary with the kind of political system under 
scrutiny and the historical period in which it is 
examined. This conclusion in itself reveals the 
folly of seeking to describe the subject matter of 
political science in the specific terms of the kind 
of institutions that exist at any historical moment. 

It is instructive, however, to recognize that none 
of the variable structures and processes of interest 
to political science in mid-twentieth century need 
be neglected in the conceptualization of the disci- 
pline suggested here. Thus, in nonliterate systems, 
where little differentiation of political structure 
prevails, such general social structures as the kin- 
ship groups, lineage councils, village headmen, 
paramount chiefs, friendship cliques, and war 
parties may be most relevant for revealing the way 
in which binding decisions are made and put into 
effect. [See POLITICAL ANTHROPOLOGY.] 

In industrialized, structurally differentiated so- 
cieties the fields of research assume an equally 
specialized character. Through the examination of 
such variable differentiated structures as legisla- 
tures, executives, administrative organizations, 
parties, and interest groups, political science has 
sought to explore those elements of a political sys- 
tem which are influential in determining who 
makes the allocations, the nature of the ones that 
are undertaken, and the way in which they are 
implemented. 

Through the study of electoral, or voting, behav- 
ior, political scientists seek to identify the kinds 
of issues that become subjects of controversy as 
possible authoritative allocations and try to explain 
the recruitment of those responsible for the day-to- 
day tasks related to making and effectuating such 


allocations [see Vorne]. Public law examines the 
way in which a system validates binding decisions 
according to legal criteria, thereby contributing to 
their acceptability as authoritative in a system [see 
PuBLic LAw]. Comparative politics casts its net 
over similar aspects of political life, but in cultural 
and social settings alien to the national origin of 
the research worker himself [see POLITICS, COM- 
PARATIVE]. International relations turns the atten- 
tion of political science to those institutions and 
structures through which binding decisions are 
made and implemented in the relationship among 
individual political systems. Here it is helpful to 
conceive of the interaction among political systems 
as constituting a kind of political system itself, at 
a higher level of generality. In this view, the par- 
ticipating political systems are just subsystems of 
the international system, in much the same way 
that states or provinces may be subsystems of so- 
called national political systems [see INTERNA- 
TIONAL RELATIONS], 

The other subject matters standard in political 
science at mid-twentieth century also fall com- 
fortably into place when we conceptualize political 
science as the study of those actions more or less 
directly related to the making of authoritative allo- 
cations. Thus, political philosophy represents that 
area devoted to the assessment of allocations and 
related structures according to ethical criteria. It 
also examines and critically evaluates the way in 
which philosophers have attempted this in the past. 
Empirical theory, however, seeks to systematize 
the research processes themselves and to bring con- 
ceptual order and consistency to the discipline as 
a whole or in its various parts. To it belongs the 
task of developing, in the form of general theories, 
comprehensive explanations of how authoritative 
allocations for a society are made and put into 
effect; and in the form of partial theories, inter- 
pretations of the operations of aspects or segments 
of political systems. [See POLITICAL THEORY.] 

Brief and formal as this listing of the range of 
interests for political science is, it does demon- 
strate that the description of the discipline that has 
been suggested is consistent with all the research 
that political scientists characteristically under- 
take. Not that the subfields just mentioned are 
fixed in any sense: historically the very opposite 
is the case. They represent only a convenient and 
changing way of dividing the whole field for pur- 
poses of specialized research. But the point is that 
the new conceptualization of the key problems with 
which political science deals does not in itself pie 
clude the continuation of past concerns in political 
research, It includes them and unites them analyti- 


cally, but it leaves the door open to, and indeed 
invites, their reformulation and extension. 


Content 


The content of political science as a discipline 
also reveals a long-range trend toward greater 
analytic unity. We can see this best if we trace 
the various substantive areas into which political 
science has come to divide itself. In describing the 
evolving relationships among these subdivisions, 
we shall find that increasingly they have come to 
depend less upon practical issues and problems for 
setting their main areas of interest—although the 
hold of a problem orientation is still strong—and 
more upon theoretical criteria. 

Historically the successive patterns of the sub- 
fields that political science has found it useful to 
adopt fall into four major categories: universalism 
(moral philosophy), legalism (Staatslehre), real- 
ism (political process), and behavioralism. Meth- 
odological considerations have some important 
bearing on the movement toward the latest pattern- 
ing of the subfields, but we can leave discussion 
of this aspect for the following section. Although 
the actual changes from one pattern to the next 
occurred in a relatively slow and imperceptible 
way, they brought about major upheavals in the 
kinds of data to which political science has di- 
rected its attention. 

Universalism. The period of universalism was 
by far the longest and least homogeneous or dis- 
tinctive. It encompasses the time from the founding 
of conscious political reflection in Greek antiquity 
to somewhere in the nineteenth century. The sub- 
ject matter of politics was fully integrated with 
the general study of society, that is, with universal 
moral philosophy. Each scholar was what we might 
today call a general or universal social scientist. 
Without specialization in the study of political 
matters, there was no reason for special subdivi- 
sions of inquiry to arise. The social philosopher 
was free to follow his own political interests, as 
dictated by the problems of the day. The subject 
matter, therefore, was as varied as the history of 
Western political thought. 

Legalism. After the centuries in which a polit- 
ical interest was indistinguishable from universal 
philosophy, the emergence of the legalistic Staats- 
lehre school in the nineteenth century opened up 
a new era. Imported from Germany into the United 
States by J. W. Burgess and others, it was rein- 
forced by the positivistic utilitarianism of Bentham 
and Austin. For a brief time it sounded a note 
harmonious with scholarly opinion in the United 
States, in tune as this opinion already was with 
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specialism. The Staatslehre school provided a body 
of knowledge devoted to the study of the state. It 
is true that the state was narrowly defined as a 
collection of legal norms and empirically confined 
to formal legal structures. Nevertheless, concentra- 
tion on the state, even in this way, did enable those 
in the United States who later in the nineteenth 
century began to call themselves political scientists 
to appropriate, albeit unobtrusively, this body of 
knowledge as their own. 

At the turn of the twentieth century, the flow 
of texts in the United States—from J. W. Burgess, 
W. W. Willoughby, R. G. Gettell, J. W. Garner, and 
others—clearly revealed that as a specialized dis- 
cipline of political science began to take shape for 
the first time, it absorbed and adapted the formal 
legalism of the Staatslehre school as the core of 
its concerns. Thus, political science turned to the 
investigation of the nature and origin of the legally 
conceived state, particularly with respect to its 
sovereign properties and the growth of law. To 
these were added detailed descriptions of the legal 
provisions for the forms of governments and for 
the formal powers of the electorates, the judiciary, 
the administrative services, and the executive de- 
partment. This objective formal description was 
fortified with traditional philosophical inquiry into 
the ends of government and the state. 

Realism. Although in Europe political research 
remained confined to legal forms well into the 
twentieth century, the American environment 
could not tolerate such restriction for long. The 
social problems of a rapidly growing industrial so- 
ciety and the political complexities accompanying 
the large-scale immigration of varied ethnic groups 
clearly called for more accurate and increased 
knowledge about the realities of political life, Be- 
hind these social and political needs, in the intel- 
lectual sphere, pragmatism as a social philosophy 
pressed for contact with and interpretation of 
social experiences, 

However we may account for the early demise 
of the Staatslehre approach in the United States, 
its conscious rejection ultimately led political in- 
vestigation into entirely new substantive paths. In 
this third phase, political science began to pene- 
trate beneath the legal forms to the political reali- 
ties, identified in due course as the underlying po- 
litical process, To legal form and structure were 
added nonlegal and informal processes. 

The shift toward these new kinds of data came 
about in two distinctive but overlapping stages. In 
the first, the tendency was to denounce arid legal- 
ism in favor of a new realism in political research. 
Following the approach of Bagehot in The English 
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Constitution (1865-1867), Woodrow Wilson in his 
Congressional Government (1885) condemned the 
“literary theory” of the constitution and in its place 
discovered that the true source of legislative con- 
trol lies in Congressional committees. Other real- 
ists followed in his footsteps. James Bryce, in his 
American Commonwealth (1888), added another 
dimension to the new reality by stressing the role 
of parties and unearthing their inner groups—very 
much like James Fitzjames Stephen's wire pullers 
—in whom the main power was presumed to rest. 

Realism quickly opened up a second stage of in- 
quiry. In it the generalized search for the political 
realities came to a sharper focus on the basic 
groups that seemed to be increasing their power 
on the American scene. In this discovery of groups 
as the major vehicle through which the political 
struggle worked itself out, American political 
science found a new direction for research, the 
implications of which were not exhausted until 
after World War 11. Politics came to be interpreted 
as a process through which group activities, out- 
side the formal political structure but acting upon 
and through it, managed to influence all phases 
of governmental activity. 

Charles Beard exposed the impact of extralegal 
economic groups on the course of historical events. 
Although Arthur Bentley was not influential until 
almost a half century later, at the beginning of 
the twentieth century his insistence upon conceiv- 
ing of all political activities as group phenomena 
(and upon the need to abandon the habit of attrib- 
uting causation to ideas and legal norms) reflected 
the latent sentiments about political research that 
others were coming to adopt. A group orientation 
among European sociologists, imported by sociolo- 
gist Albion Small, together with philosophic plural- 
ism, reinforced this interest among American po- 
litical scientists. With the appearance of Herring's 
Group Representation Before Congress (1929) the 
basic pattern of group research for the first half 
of the twentieth century had been set. It was left 
only to Truman, in The Governmental Process 
(1951), to formulate systematically the whole con- 
ception of political life as conflict among groups. 
[See POLITICAL GROUP ANALYsIS.] 

With this penetration to groups as a new kind 
of data to explain political events, the twentieth 
century witnessed a great proliferation of subfields 
in political science. In the 1960s most scholars still 
functioned more or less with some dependence on 
the topical delineation of the field as it was laid 
down during this third stage in the substantive 
development of the discipline. 

For the first time political science broke up into 


a large number of specialized areas. The distribu- 
tion of these subfields unmistakably reveals their 
origins in nineteenth-century legalism. In addition, 
they testify to the continuation of the historic habit 
of choosing a research topic by reference to the 
varied practical problems for which it might offer 
an immediate and direct solution. Thus, the main 
divisions of political science as a discipline are 
defined institutionally, a heritage from political 
legalism; and the institutions are those peculiar to 
Western political systems and are linked, intui- 
tively, to the solution of the changing issues of 
politics, especially in democratic systems. 

Until World War u, political science consisted 
of four main fields with names that clearly mirror 
these legalistic origins and practical interests: po- 
litical philosophy or theory, national government, 
comparative government, and international rela- 
tions. Political philosophy showed the strongest 
ties with the past (we shall later return to this 
fully). In the area of national government, the 
practical criteria of the legalistic phase led political 
science to open up all institutional aspects of 
Western democracy for major exploration: the 
executive, legislature, judiciary, and administra- 
tion. The basic organization here was regional: at 
the national, state (provincial, department), and 
local (municipal, township, county) level. “How 
to improve them” was the immediate, implicit, and 
persistent theme. 

With the inroads made by a‘concern for the so- 
called realities of politics, the area of national 
government was vastly extended to include the 
underlying and informal social and political group 
structure. New special fields of research arose to 
meet these new interests: political parties, interest 
or pressure groups, and the public or electorate as 
an influential aggregate and an ethically important 
element in a democracy. 

Comparative government added little to the cri- 
teria for selecting topics already visible in the area 
of national government. Research in the compara- 
tive field required attention only to the same kinds 
of institutions, but in political systems other than 
the one in which the investigator happened to live, 
a simplistic but nonetheless close-to-accurate SA 
scription of the differentiating characteristics ©: 
this field. Because empirical political research was 
less advanced in countries other than the United 
States, however, comparative government in those 
countries tended to confine its attention to the 
governmental and party level, with groups and the 
electorate receiving little attention until after 
World War 1. 

In the comparative field, strangely enough, sys 
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tematic comparison received little serious atten- 
tion. Research either took the form of configurative 
descriptions of the complex of political institutions 
in each system considered, or where comparisons 
were essayed, they consisted only of taking insti- 
tutions with the same name from one system, 
setting them beside those of another, and com- 
menting on the apparent similarity and differ- 
ences. Herman Finer and C. J. Friedrich were vir- 
tually alone in breaking from this pattern and, 
somewhat ahead of their time, pointed the way to 
an analytic approach. 

Finally, the establishment of the League of Na- 
tions virtually gave birth to the field of interna- 
tional relations. But in this area the legal tradition 
died hard. International law and the formal de- 
scription of various international organizations 
and treaty relationships, joined with the traditional 
study of foreign policy as the history of diplomacy, 
fairly exhausted the research interests. 

In all areas, prior to World War 11, some early 
efforts at reconceptualization had begun to assert 
themselves. In particular, the view of political life 
as a struggle among groups for some measure of 
influence over government helped to draw atten- 
tion to the fact that there might be a single basic 
kind of activity in the light of which all politics 
could be explained: the struggle for power over 
policy. This shift in focus away from legal norms, 
first to groups and then to power (whatever the 
over-all shortcomings of power as an introduction 
to the basic subject matter of political science), 
pointed to the arrival of a fourth phase in the or- 
ganization of fields, that of behavioralism. In this 
phase theoretical criteria begin to crop up beside 
the traditional, practical ones and to play some 
part in the selection of the basic areas of inquiry 
in political science. 

Behavioralism. The behavioral movement in 
political science came to full bloom after World 
War 1m. In common with a tendency that pervaded 
all of the social sciences, political science began 
to probe in earnest for the concrete behavior that 
goes to make up the activities broadly described 
both as legal structures and as informal groups. 
It turned to the individual—his attitudes, motiva- 
tions, values, and cognitions. The new level of 
reality in the political process that was thereby un- 
covered was very much a product of the discovery 
of new techniques for the study of human behavior 
in an interactive situation; both the methods for 
the exploration of a new level of subject matter 
and the new kinds of data arose simultaneously 
and as different aspects of the same general re- 
search trends. [See BEHAVIORAL SCIENCES.] 
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Although the methodological implications of 
political behavioralism normally command most 
attention (we shall turn to them in the following 
section), what has been largely neglected is the 
equally important impact it has had upon the 
organization of the subject matter of political 
science. It has been propelling the discipline to- 
ward a fundamental rethinking of the way in 
which it subdivides itself for research. 

This restructuring of the field was still very 
much in progress in the 1960s, and the ultimate 
point of stabilization is by no means clear or easy 
to predict. Nevertheless, one conclusion cannot be 
avoided. As a result of this revolution in methods 
and data, political science has begun to break away 
in earnest from an institutional orientation. The 
formulation of research tasks has also begun to 
move a considerable distance away from the im- 
mediacy of practical political issues as understood 
by common sense, toward a subtle, if at times in- 
articulate, search for basic theoretical criteria to 
guide the selection of research topics. Processes, 
rather than structures and institutions, have begun 
to appear as the major guidelines for political re- 
search. This both mirrors and reinforces a slow 
trend away from the older synthetic discipline 
toward a newer analytic one. In the process, the 
very organization of political science is undergoing 
profound changes. 

Long before World War 11, Grałam Wallas, in 
Human Nature in Politics (1908), had turned to 
political motivations as a significantly new and 
noninstitutional dimension in understanding po- 
litical life. Over the decades, a sprinkling of others 
had joined him. Walter Lippmann, in Public 
Opinion (1922), had signalized the relevance of 
stereotyped opinions in molding the behavior of 
individuals. In Psychopathology and Politics (1930), 
Harold Lasswell had introduced psychoanalysis as 
a method for penetrating to the impact of latent 
motivations on political activity. The whole of the 
Chicago school in the 1930s had exercised itself 
about the importance of psychology in understand- 
ing participation in politics., In isolating varying 
kinds of behavior that cut across existing institu- 
tions known through common-sense observation, 
these new approaches began the process of chip- 
ping away the foundations of the prevailing insti- 
tutional subdivision of political science. 

The subfields of political science. Even though 
the discipline has not yet settled on a new, Co- 
herent, and viable structure, it is clear that the 
old subfields have begun to acquire new names; 
at the same time they have been joined by a va- 
riety of new subspecialities. 
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National politics. In particular, national gov- 
ernment as a field has shown signs of disintegrat- 
ing. In its place there is arising a vast array of 
cross-sectional or functional topics. For example, 
side by side with the continuing study of the execu- 
tive, political leadership and elitism as general 
phenomena to be found in many different settings 
have come to provide a new focus for research. 
Similarly, judicial, administrative, and legislative 
behavior have become substitute or parallel com- 
panion fields for the pre-existing institutional areas 
of the judiciary, administration, and legislature. 
The conviction has grown that the specific institu- 
tional setting is of less importance than are the 
generic forms of behavior that appear in the vari- 
ous institutions, at least as a starting point for 
better understanding the way in which the institu- 
tions themselves function. 

To the older institutional interests that have 
been decimated in this way, new major subfields 
have been added in increasing numbers. Out of the 
earlier concern for public opinion as an influential 
force, at least in democratic systems, there has 
blossomed the vast field of voting, or electoral, be- 
havior (Berelson et al. 1954; Michigan, University 
of . . . 1960). Through it, efforts have been made to 
document the precise way in which public opinion 
operates upon political institutions in the selection 
of leadership in a democratic system, in the impact 
on parties, and in the shaping of public policies. 
The spread of ideas and influence has been traced 
as a pattern in the flow of political communica- 
tions (Deutsch 1953; Bauer et al. 1963). Studies 
have revealed the role of the personality in de- 
termining the nature of political participation in 
varying institutional settings (Lasswell 1948; Lane 
1962). Research about political socialization has 
begun to explore the effects of early attitudes, 
values, and beliefs on attachments to the political 
system and on later political participation. New 
approaches have characterized such problems as po- 
litical recruitment, leadership, representation, and 
ideology. We have here but a rough indication of 
the degree of proliferation of new fields relevant 
to national government but cutting through it from 
entirely new, noninstitutional directions. [See Po- 
LITICAL BEHAVIOR.] 

Comparative politics. By the 1960s compara- 
tive government had also been shaken to the roots 
by the general behavioral upheaval in political re- 
search. Policy interests that had expressed them- 
selves in configurative and formal description of 
structure now began to give way to the search for 
theoretical criteria as a basis for organizing re- 
search. Themes in the comparative field have 


begun to look very much like those which had 
emerged in the behavioral study of national poli- 
tics. Form and structure have yielded to the study 
of behavior as it appears in all structural settings. 
Decision making, socialization, leadership, political 
motivation and personality, communications, po- 
litical participation, ideology, and the like have be- 
come equally significant topics of comparative 
research. 

In addition, comparative research itself has 
been able to make a signal contribution to the 
theoretical analysis of research on domestic sys- 
tems. As a result of World War u and the subse- 
quent colonial revolutions, comparative govern- 
ment has extended the range of its interests in two 
new major directions. Static description has given 
way to the pursuit of an understanding of the 
conditions of political change. It has thereby pro- 
vided political analysis with a new and profound 
dimension in which theoretical thought has be- 
gun to crystallize. Furthermore, comparative gov- 
ernment has also moved beyond the narrow con- 
fines of European governments to include the 
political systems of exotic cultures. It has brought 
in its train a recognition of the special function 
of cultural variations and of the need for rigorous 
concepts to ensure the adequate isolation of cul- 
tural determinants. Both of these innovations—the 
identification of change as a focus and of culture 
as a vital variable—show promise of raising the 
study of politics in the comparative field to the 
same level of sophistication as that to which the 
behavioral movement has been raising the study 
of national political systems. In recognition of this 
profound transformation, by the 1960s the name 
of the subfield had changed from that of compara- 
tive government to comparative politics. [See PoLt- 
TICS, COMPARATIVE. ] 

The most significant outcome of the new content 
being poured into this old field, as a result of the 
pervasive behavioral movement, is that it has be- 
come increasingly difficult to distinguish research 
in the comparative area from that in the domestic. 
The discovery of the importance of change and 
culture has fed back upon the main body of po- 
litical research and has sensitized students of na- 
tional politics to the significance of their own 
culture patterns in the behavior of the members of 
their own systems. As a result, it is becoming 1n- 
creasingly meaningless to attempt to distinguish 
comparative research from national research, es 
pecially since all rigorous scientific analysis by 
definition must be comparative, as J. s. Mill in 
Book 6 of his System of Logic long ago insisted. 

These mutual effects of the national and com 


parative subfields have helped to bring about a 
more uniform and consistent blend of the whole 
discipline. They have strengthened the impulse 
to search for a theory that unifies the study of all 
political systems, and thereby may ultimately force 
upon the discipline a major reorganization of its 
subfields. Since all scientific research is by nature 
comparative, subfields may no longer rest on geo- 
graphic differences between domestic and alien 
systems, the existing principle of division, but must 
rest upon major theoretical discriminations in 
subject matter, regardless of spatial location and 
political jurisdiction. 

International politics. The behavioral movement 
has brought about similar modifications in the field 
of international research, In the past it bore the 
abel “international relations and organization”; 
by mid-twentieth century its internal transforma- 
tions, under the impact of behavioralism, had be- 
gun to give it the new name “international politics,” 
a slight but highly meaningful shift in emphasis. 
t indicates that political science no longer con- 
siders that because the actors in the international 
sphere are nations, the field is therefore sui generis. 
See INTERNATIONAL POLITICS.] 

In Politics Among Nations (1948), Morgenthau 
demonstrated that, as in the group-process phase 
of domestic research, power was also needed as a 
concept to help cut through the variety of insti- 
tutions in which international politics manifested 
itself. As was seen earlier, the decision-making 
concept was also found to be a useful tool for ana- 
lyzing the behavioral aspects of international poli- 
tics, thereby freeing the subfield from an exclusive 
preoccupation with those institutions and struc- 
tures peculiar to a given stage in history. Others 
have devised logical models for analyzing the be- 
havior of international actors based upon strategies 
of choice under varying kinds of political and social 
relationships (Kaplan 1957). Deutsch (1953) has 
sought to explain international political interac- 
tions as a complex process of cleavage and adjust- 
ment dependent upon the patterns of communica- 
tion among the relevant units. As in the other 
subfields of political science, the practical problems 
of the day are receding in importance in setting 
the priorities for research, [See INTERNATIONAL 
RELATIONS. ] 

Political theory. In no area is the behavioral 
shift from an institutional and practical problem 
orientation more sharply revealed than in the sub- 
field of political theory itself. In a very real sense 
the changes within it sum up the whole pattern 
of development in political science toward an ana- 
lytic discipline. At the same time, these changes 
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have propelled political science even more force- 
fully in the same direction. For this reason the 
subfield of political theory merits special attention. 
The further restructuring of subfields in the dis- 
cipline may depend upon the conscious leadership 
offered by political theory, since it alone accepts 
an over-all responsibility for the coherence and di- 
rection of the whole discipline. 

Contemporary political theory had its origins in 
general political philosophy. During the long proc- 
ess of separation from the main body of social 
knowledge, political philosophy, as noted earlier, 
was scarcely distinguishable from all reflection 
about human institutions. Indeed, it was the cap- 
stone of such thought. The intention of most social 
philosophers was to shape moral criteria for as- 
sessing current trends and for constructing images 
of the good society. Political philosophy was there- 
fore ethically creative in its intentions. From 
Aristotle to John Stuart Mill, it sought to create 
new goals and social arrangements that would 
serve to guide personal behavior and social policy. 

By the turn of the twentieth century, however, 
as political science blossomed into a separate area 
of specialization, it had lost this ethically creative 
quality. If moral constructiveness endured any- 
where, it remained with the parent stem, philoso- 
phy itself. In the trend toward a science of politics, 
political philosophy limited itself to the history of 
the ideas of others, such as the great political phi- 
losophers. It not only recounted the progression 
of such ideas—as in the evolution of ideas about 
freedom, constitutionalism, democracy, or equality 
—but it also sought to understand them better by 
analyzing them for their clarity, consistency, and 
implications. In addition, it sought to explain their 
growth, persistence, and diffusion as historical 
phenomena. In effect, political philosophy thereby 
became an unarticulated but nonetheless real 
amalgam of the history of ideas, logical analysis, 
and the sociology of knowledge (Sabine 1937; 
Easton 1953). Reflection on and speculation about 
the desired state of affairs—ethically creative anal- 
ysis—found its way only incidentally into the field 
of political philosophy. 

But as a part of the advance of behavioral re- 
search, a latent aspect of political philosophy for 
the first time emerged to the forefront for con- 
sidered attention. In some limited measure, if only 
inadvertently, all political philosophy in the past 
had sought to explain how and why political sys- 
tems function as they do. It was this aspect that 
expanded after World War u to become empiri- 
cally oriented, or behavioral, theory. [See POLITICAL 


THEORY.| 
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Empirical theory has been growing on a number 
of levels, thereby demonstrating the way in which 
trends in subject matter have since World War 1 
been converting political science from a topical 
discipline to an analytic one. In its most limited 
scope, the new theory requires the phrasing of 
propositions as singular generalizations if they are 
intended to describe the relationship between two 
or a few phenomena. At a broader level, statements 
that embrace a segment of political life but some- 
thing considerably less than the whole may be 
designated as partial theories. Thus, there have 
been efforts at formulating theories of parties 
(Duverger 1951), leadership, administrative be- 
havior (Simon 1947), decision making, represen- 
tation, community power structure, consensus and 
cleavage (Lipset 1960), and the like. 

But theory also comes in larger packages and 
seeks to include the whole of a subject matter 
described as a discipline. This is general theory. 
At each level—singular, partial, and general—the 
behavioral revolution has stimulated the formula- 
tion of empirically oriented theory. But it is gen- 
eral theory alone that offers political science a 
basic means for identifying and assessing the full 
range of its interests and, ultimately, its intrinsic 
sense of identity as a discipline. 

General theory has shown signs of asserting it- 
self in two guises: as allocative theories and as 
system theories. Allocative theories shed light on 
the way in which the various political processes 
contribute to the distribution and use of political 
resources. In effect, they help us understand those 
forces which are at work in determining the kinds 
of policies adopted in a political system. For ex- 
ample, the group approach to politics, as systema- 
tized by Bentley and Truman, presents an equi- 
librium theory of political processes. It interprets 
policies as the outcome or equilibrium attained as 
the groups in a system achieve an adjustment or 
accommodation in the exercise of their power. 
Power theories seek out the various dimensions of 
influence that determine control over resources, 
personnel, and institutions and, thereby, over the 
formulation and implementation of public policies. 
Theories of decision making have presented a third 
alternative to the analysis of allocative processes 
in political systems. 

Equilibrium, power, and decision making, as al- 
locative theories, have tended to dominate political 
science since World War m. Not that they are 
usually self-consciously interpreted as allocative 
theories. It is in the nature of theoretical inquiry 
in political science, at its stage of development in 


the 1960s, that the status of these theories is not 
at all clearly defined. They do not see each other 
self-consciously as competitors, or even as over- 
lapping explanations at the same level of analysis. 
Nor is any theory deliberately geared to explain 
allocative problems. Political science is still at a 
very early stage in its awareness of its own theo- 
retical development and of the role of theory as an 
integral part of its growth as a science. 

Allocative theory helps in the organization of 
inquiry into political life from the inside, as it were 
—that is, with regard to its internal operations, It 
helps us to understand the forces that contribute 
to the making of authoritative allocations or bind- 
ing decisions, the kinds of the outcomes charac- 
teristic of political interactions. But if such alloca- 
tions are to be made at all, this presupposes that 
it is possible for a system of political interactions 
to continue over time. What allocative theory takes 
for granted—the persistence of a system of politi- 
cal behavior—system theory questions. It assumes 
that the logically prior problem is to explain the 
conditions under which a system of political inter- 
actions manages to persist at all. 

The general approach to a political system at 
this level has appeared in two forms, functional 
analysis and systems analysis. Functional analysis 
had just begun in political science during the 
1960s (Almond & Coleman 1960). Although it has 
yet to be elaborated in the detail it deserves, if it 
were to proceed along the lines familiar in socl- 
ology and anthropology, it would focus primarily 
on the problem of the maintenance of political sys- 
tems. It would assume that we can better under- 
stand political structures and institutions if we are 
able to depict the part they play in maintaining the 
whole complex of interactions that we call a po- 
litical system. Close analysis would reveal, how- 
ever, that a functional approach represents not a 
theory as such but, rather, only the protocol setting 
forth procedures to be used in any adequate scien- 
tific research about social relationships. But since 
it does encourage the search for patterns of rela- 
tionships, it moves in the direction of all-inclusive 
or general theory. J 

Systems analysis, however, launches us directly 
into theory construction (Easton 19654; 1965). 
It does not find it necessary to postulate self- 
maintenance as a theoretical organizing principle. 
Rather, it may be used to view political life žE a 
set of interactions imbedded in a social environ- 
ment but analytically separable from it. Political 
interactions would then constitute an open syste 
subject to influences from this environment in the 


form of inputs. The system, in turn, modifies the 
environment through the production of outputs 
peculiar to this kind of system alone—allocations 
accepted as binding by most members of a society 
most of the time. 

The key question for systems analysis is how 
such a political system manages to persist over 
time, even in the face of disturbances from its en- 
vironment—such as economic crises, social dis- 
organization due to rapid change, defeat in mili- 
tary conflict—that threaten it with destruction. 
Persistence under possible stress, it suggests, is a 
function of the fact that a political system is a self- 
regulating pattern of interactions. Such a system 
is able to react creatively to stress and to take 
actions aimed toward its own persistence through 
time, It may persist by maintaining existing struc- 
tures and processes, but it may also respond by 
changing these both marginally and fundamen- 
tally. In this interpretation, self-maintenance is 
only one mode of response and not necessarily the 
most frequent one. [See SYSTEMS ANALYSIS, article 
On POLITICAL SYSTEMS.] 

Regardless of the merits of the allocative or sys- 
tem theories that have become available since 
World War 1r, together they would offer the most 
comprehensive and most general tool of analysis 
for political science. It is true that at the present 
stage of development neither of these types of 
theories measures up to the ultimate ideals of a 
theoretical science. As in most other social sci- 
ences, theory is in a very early and rudimentary 
state, Nevertheless, it is incontrovertible that this 
initial thinking about allocative theory and system 
theory, even in the first few decades following 
World War 1, has been lighting the way toward a 
comprehensive, empirically based general theory 
of political systems. In the process, it can aid in 
bringing order out of the changes under way in the 
subfields of political science by offering positive 
criteria for testing the relevance and significance 
of alternative ways of parceling out the discipline 
for specialized research. 


Method and theory 

The increasing tendency of political science to 
take the shape of an analytic discipline rather than 
a synthetic one has been reinforced and, in some 
measure, even initiated by a major revolution in 
method that has also shaken the discipline. By the 
1960s the methods of modern science had made 
deep inroads into political research, under the 
rubric of the study of political behavior. This be- 
havioral revolution has permanently changed the 
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methodological and technical face of political 
science. It has heightened the level of scientific 
awareness among political scientists, making them 
increasingly sensitive to the role of scientific 
theory. It has also encouraged a rapid growth in 
the range and quantity of factual research about 
political life; the economical interpretation of this 
became impossible without the guiding hand of 
theory. The metamorphosis in political research 
has thus given birth to the search for a new theo- 
retical coherence in political science. 

The technical revolution. Only as a specialized 
academic discipline of politics began to take shape 
in the United States, during the nineteenth cen- 
tury, did the methods of political inquiry become 
a salient matter. In part, political science could 
emerge as a discipline separate from the other 
social sciences because of the impetus Marx had 
given to the idea of the difference between state 
and society, an idea virtually unheard of before 
his time. This interpretation made it possible for 
students of politics to contemplate the state as an 
independent focus for their interests. But in part, 
the origins of political science also lie buried in the 
Staatslehre approach, brought to the United States 
during the nineteenth century. It too identified the 
state as a central and separate subject matter, but 
it cut the state off from those roots in society which 
Marx tried to uncover. As we have seen, it inter- 
preted the state as a sociologically disembodied set 
of abstract legal-constitutional norms. However 
inadequate this formal interpretation was, it did 
offer students of politics an apparently independ- 
ent and coherent subject matter, something they 
had lacked until that time. 

It is one of those strange ironies of history that 
the very approach which helped to set political 
science on its feet as a separate academic disci- 
pline should be so quickly rejected, as it was in 
the United States. But even in the process of being 
cast off, the Staatslehre approach inspired a new 
scientific consciousness in political research. The 
revolt against arid legalism in the United States 
took place in the very name of science itself. 

The work of James Bryce is a classic and repre- 
sentative illustration. Although he was a Scot, in 
his self-conscious espousal of science and in his 
insistence (1921) that this involved the discovery 
of the facts of political life at all costs—‘It is facts 
that are needed: Facts, Facts, Facts”—he reflected 
the antilegalistic mood of the American intellectual 
environment. By the 1920s this passion for un- 
earthing facts had been thoroughly legitimated 
under the imposing title “the new science of. poli- 
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tics” (Merriam 1925), A hyperfactual era began, 
one in which the collection of brute, but it was 
hoped reliable, facts became the accepted rule and 
part and parcel of the prevailing understanding of 
science (Easton 1953). 

But in the new-found passion for “hard” data, 
American political science lost sight of the grand 
fact that in spite of its profligate use of the adjec- 
tive “scientific,” it still had a long distance to go 
to throw off the heavy yoke of the Greek classical 
tradition, The new science of politics, as it devel- 
oped in the first few decades of the twentieth cen- 
tury, had mistaken a concern for the so-called 
facts of political life for the whole of a science of 
politics. It had substituted one kind of scientific 
product—empirical, verifiable data—for the whole. 
But in the realm of method it was still operating 
on some of the major premises of the classical 
tradition. It continued to assume that the methods 
of inquiry were not themselves problematic. It had 
no clear idea of the form of the propositions in 
which its findings needed to be stated to make 
verification possible, It had done little to clarify 
its views on the relationship between facts and 
values, and as a result, political science continued 
its complete and unquestioned immersion in a pre- 
scriptive, problem-oriented approach. 

The “behavioral revolution” in political science 
after World War 1 stands as a summary label for 
the consequences of the full reception of scientific 
method, It received its important start in the 1920s 
and 1930s under the stimulus of the Chicago 
school, as represented in the work and inspiration 
of Merriam, Lasswell, and Gosnell. It brought to a 
head the many tendencies that had been evolving 
over the centuries and then exploded the discipline 
in a multitude of new directions. By the 1960s the 
behavioral methods had gone a long way toward 
becoming homogenized with the main body of po- 
litical scence. 

If we disregard particulars for the moment, what 
stands out with brilliant clarity in the behavioral 
revolution in political science is the final and com- 
plete acceptance of the idea that methods of in- 
vestigation, in all their aspects, are problematic 
and, accordingly, merit special, concentrated atten- 
tion. Once this was acknowledged, the main barrier 
to sophisticated methodological development was 
destroyed. It opened up a limitless vista of oppor- 
tunities for the acquisition of reliable methods of 
inquiry. Time alone was needed for the conse- 
quences to work themselves out. But one im- 
mediate result was the sudden awakening of the 
discipline to the need for special academic courses 


devoted to methods of research alone. Through this 
special attention, the discipline will in time be able 
to offer the same kind of innovative leadership in 
the area of methods and techniques of research 
and analysis that is characteristic of its best efforts 
in the substantive fields. 

The theoretical revolution. While the technical 
revolution was taking place, the behavioral move- 
ment was casting aside the last remnants of 
the classical heritage in political science. It has 
brought about a profound transformation in con- 
ceptions about the role that theory plays as a tool 
for political research. Thereby it has tended to 
drive political science away from a prescriptive, 
problem-directed discipline to one in which re- 
search depends increasingly upon empirically ori- 
ented theoretical criteria. 

The growing acceptance of the difference be- 
tween factual and evaluative statements, and the 
need to be aware of when and how each is being 
used, has placed the prescriptive policy orientation 
of past political science in a new perspective. In 
the long past of political inquiry, and even to a 
considerable extent in the 1960s, students of poli- 
tics have considered it a major objective to address 
themselves directly to the question of how to solve 
current political problems—how to bring about a 
responsible two-party system, to improve control 
over bureaucracies, to increase the responsiveness 
of representatives, to achieve political stability in 
a changing world, to reduce the survival chances 
of dictatorial systems, to control international con- 
flict, and the like. All these problems taken col- 
lectively have in effect defined objectives of po- 
litical research. 

With the growing commitment to the goals and 
methods of a rigorous science, political scientists 
began, by the 1960s, to show a strong inclination 
to pose questions in a sharply different way. Uae 
derstanding began to take logical and chronological 
priority over premature efforts to prescribe poli- 
cies. Not that political scientists have arrived at a 
conviction that they ought to avoid the application 
of their knowledge to the solution of practical day- 
to-day issues of society. Any science that thus di- 
vorced itself from society would soon find itself 
extinct, and justifiably so. But the belief is that 
there may often be a clear difference between the 


short-run needs of society, on the one hand, and 
the long-run needs for an adequate understanding 
ble to offer 


of political life, on the other. To be al i 
reliable judgments about how to improve an 
reform political systems, it may be necessary to 
grasp the fundamentals about their operations, re- 


gardless, initially, of the goals for which this 
knowledge might later be used. [See PUBLIC 
POLICY. | 

However, the clarification of the role of values 
and prescription in the methods of political analy- 
sis has posed a new and serious problem. And yet 
it is through efforts to meet and conquer this prob- 
lem that we can anticipate an acceleration in the 
conversion of political science from a synthetic 
science to a pure science. 

However much the desire to prescribe for the 
solution of current political issues may distract 
political science from a search for a fundamental 
understanding of political systems, the positing of 
ethical purposes did have a singular merit. At the 
very least it defined some outer limits on the range 
of research; research had to be relevant in some 
way to current social issues. But when, by the 
1960s, these ethical criteria began to disappear, the 
gates were opened wide to an unrestrained flood of 
empirical research. If it is no longer necessary to 
test the relevance of research findings by their sig- 
nificance as possible solutions to practical prob- 
lems, the choice of topics would seem to depend 
entirely on the intuitions and proclivities of the 
research worker himself. There would, conse- 
quently, be no foreseeable limit to the variety or 
volume of research carried out in the name of 
political science. If only to abort a danger like this, 
theoretical criteria could be expected to assert 
themselves, as indeed they did in the 1960s. 

Even if none of these other factors had been at 
work, the state of the technical arts in social re- 
search today would have been driving political 
science in the same direction. Political science has 
come to an appreciation of the role of rigorous 
research at the very time that the rapid growth of 
computer technology has made possible the collec- 
tion, storage, retrieval, and analysis of vast moun- 
tains of raw data. The very fact that in the past 
political science had been slow to take advantage 
of systematic methods in its research was, in the 
1960s, turned to its advantage. Since it did not 
have a tradition or vested interests in older tech- 
niques to hold it back as it moved into the area of 
more sophisticated methods, it was able to take 
advantage of the most advanced technology avail- 
able, For this reason political scientists in the 1960s 
were to be found in the lead in the development of 
information storage and retrieval systems for social 
data, [See INFORMATION STORAGE AND RETRIEVAL.] 

But these new skills brought with them a paral- 
lel danger: Even more than in the past, it has be- 
come feasible to generate data in political science 
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at an ever steeper rate. Even if purely ethical cri- 
teria had continued to set the patterns of research, 
the data that had already begun to accumulate in 
the interwar period would have cried out for some 
systematic way of bringing economy and order to 
their organization, analysis, and future growth. It 
is here that theory entered to play a major role, 
For theory imposes limits by differentiating funda- 
mental research from trivial research, by system- 
atizing and codifying knowledge, and by offering 
guidance for future research. The transformation 
of political science from a synthetic discipline to a 
theoretical discipline became, therefore, no longer 
a matter of choice; it was being forced upon the 
field out of a sense of self-preservation. 
The decades after World War 1 have witnessed 
a marked growth in scientific consciousness in all 
disciplines, accompanied by a spreading apprecia- 
tion of the place of theory in scientific methodology 
itself. This has given added impetus in political 
science to the search for scientifically more respect- 
able and acceptable criteria for analyzing the 
boundaries and intrinsic unity of the discipline. The 
decline of pure empiricism in the other social sci- 
ences has helped to provide the incentives in polit- 
ical science to search even more intensively for a 
theoretical matrix that would give direction and 
analytic purpose to empirical research and that 
would draw all the basic subdivisions of political 
science together into a conceptually consistent 
unity. Not that consensus at the theoretical level 
has been achieved, or is even in the offing. But 
experimentation with alternative theoretical frame- 
works and analytic models promises to reveal the 
intrinsic coherence of political science as a disci- 
pline. Developments of this kind give strong evi- 
dence that political science has been slowly evolv- 
ing from its synthetic past into a theoretical future. 
As this happens, political science will join its neigh- 
boring disciplines as one of the basic social sciences. 
Davin EASTON 
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POLITICAL SOCIOLOGY 


Two distinct but converging intellectual tradi- 
tions are to be found in the theoretical and empiri- 
cal writings of political sociology. Broadly con- 
ceived, political sociology is concerned with the 
social basis of power in all institutional sectors of 
society. In this tradition, political sociology deals 
with patterns of social stratification and their con- 
sequences in organized politics. It is one sree 
approach to the study of social organization anı 
societal change. By contrast, in narrower eet 
political sociology focuses on the organizationa 
analysis of political groups and political leadership: 
In this perspective, the core of political ponon 
involves the study of both formal and inform 


party organization, with its linkages to the govern- 
mental bureaucracy, the legal system, interest 
groups, and the electorate at large. This approach 
is an expression of an institutional or organiza- 
tional point of view. 

As societies strive to become modernized and as 
the role of formally organized political parties 
becomes more and more dominant, it appears diffi- 
cult to make a sharp distinction between the social 
stratification and the institutional approaches to 
political sociology. Nevertheless, these perspectives 
assume persistently different conceptions about the 
political process and are reflections of the basic 
writings of Karl Marx and Max Weber, respectively, 
both of whom have deeply influenced the emer- 
gence of a sociology of politics. 

Conceptions of the political process. From the 
formulations of Karl Marx has come the pervasive 
view that class conflict and social stratification 
derive from economic factors or from the social 
relations generated by the mode of production. But 
Marx’s fundamental contribution is not limited to 
or even dependent upon the orientation that politi- 
cal behavior is an expression of economic interests. 
To the contrary, his essential contribution was that 
he made the study of political sociology equivalent 
to the study of societal structure, or macrosociology, 
as it has come to be called. Marx’s view that the 
political system derives from the pattern of social 
stratification, rather than his specific emphasis on 
the primacy of economic factors in fashioning 
social relations, has been a dominant theme in the 
development of an empirical analysis of politics. 

Nevertheless, such an orientation has been criti- 
cized, both by political scientists and by sociolo- 
gists, because it reduces political events to a social 
by-product and fails to consider the consequences 
of differing types of political institutions on societal 
change. The social stratification view of politics has 
been described as a form of sociological reduction- 
ism that has inherent limitations because of the 
institutional and cultural factors which are eX- 
cluded. The economic determinist view of social 
stratification is also seen as a barrier to comparative 
analysis because, by implication, it assumes the 
universality of a historical pattern of industrialism 
which holds, at the most, for western Europe and 
does not apply to the United States. Moreover, it is 
inappropriate to an understanding of the develop- 
ing nations, where new forms of political organiza- 
tion are crucial in conditioning economic growth. 

It is from the writings of Weber that political 
sociologists received an intellectual impetus for a 
More autonomous and more institutional view of 


politics, As a sociologist, Weber adopted a mode of 
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reasoning which converged with that of Marx, in 
that he held a comprehensive view of social struc- 
ture as a basis for analyzing politics. However, he 
saw social stratification as encompassing both eco- 
nomic relations and social status—prestige and 
honor. Furthermore, in his essay “Class, Status, 
Party” (1921), Weber postulated that the emergence 
of modern society implied a historical process of 
separation of political institutions from economic 
and social structure. Political institutions thereby 
emerge as worthy of direct sociological inquiry be- 
cause they are an independent source of societal 
change. 

These classic formulations of a social stratifica- 
tion view and an institutional view of politics have 
persisted despite their reformulation in the light of 
historical events and intellectual criticism. As the 
division of labor has become more complex, social 
stratification theories have been reformulated as 
“interest group” theories. Politics is still seen as 
derived from the struggle and conflicts of social 
strata, but these are viewed as more differentiated 
and as expressing the demands of specific interest 
groups—economic, professional, organizational, 
and even ethnic-religious. Social stratification the- 
ories of politics have been broadened to include the 
view that the governmental bureaucracy and the 
political party itself have emerged as new strata 
and thereby as elements in the theory of interest 
groups. 

The institutional approach has come to be re- 
formulated as a theory of “societal strain.” Political 
parties are seen as mechanisms for accommodating 
the strains that exist in modern society. Thus, the 
identification of elements that condition the effec- 
tiveness of political organization in performing this 
mediating function becomes a central topic of 
sociological inquiry. Because the political party 
penetrates all sectors of society and because quasi- 
political institutions develop to assist the party in 
this mediating function, the strain theory of politi- 
cal sociology must extend widely beyond the inter- 
nal structure of party organization. The political 
sociologist addresses himself to a range of overlap- 
ping empirical problems, whether he begins with a 
concern for underlying social stratification or with 
a direct investigation of political party organization. 

Often differences between these theoretical posi- 
tions involve differences of social values and con- 
ceptions of political philosophy. It is much too 
crude to label social stratification theories as radical 
and institutional theories as moderate in their 
orientations to political change. The most that can 
be stated is that some theorists who emphasize 
social stratification in their view of political sociol- 
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ogy are also committed to comprehensive and ideo- 
logical positions of political change. By contrast, 
the exponents of institutional thinking often tend 
to be concerned with pragmatic and incrementalist 
strategies of political change. 


Social context of political power 


At the empirical level, the bulk of research in 
political sociology has been directed toward the 
investigation of the social basis of political cleavage 
and consensus. These studies are mainly derived 
from a social stratification theory of politics and 
have been characterized by a progressive refine- 
ment of categories of analysis, from broad concern 
with class and occupation to much more refined 
measures of social status, Through analysis of vot- 
ing statistics and sample surveys, political party 
affiliation and voting behavior of the mass elector- 
ate have been charted in a voluminous literature. 
In almost every nation with multiple-party election 
systems, sample surveys have been introduced. As 
a result, it is possible to describe voting behavior 
in considerable detail, in terms of such variables as 
occupation, income, education, status, ethnicity, 
and religion, Some surveys have come to include 
such data as membership in voluntary associations, 
exposure to the mass media, and contact with polit- 
ical party organization. Experimental work has 
been done on the relevance of personality and 
social-psychological variables for understanding 
voting patterns, These empirical researches have 
focused mainly on the correlates of national elec- 
tion decisions; they have not probed ongoing mass 
contact and involvement with administrative agen- 
cies of government, even though these contacts are 
very powerful factors in molding popular perspec- 
tives toward the political process. [See VoTING.] 

While this body of research is an extremely valu- 
able source of descriptive and historical documen- 
tation, the findings permit only a limited contribu- 
tion to the theoretical aspects of political sociology. 
In part, the difficulties are technical. National sur- 
veys based on a limited number of cases produce 
valid results for the society as a whole, but this ap- 
proach does not produce sufficient data to isolate 
the political behavior of specific social and eco- 
nomic subgroups on the basis of a trend analysis. 
Moreover, comparisons between nations are diffi- 
cult, if not impossible, to make with precision and 
clarity, because of the difficulty of developing ap- 
propriate standardized indexes. 

The basic difficulty in empirical studies of con- 
sensus and cleavage is the failure to articulate 
sample surveys with theoretical issues. Most of 
these findings have been interpreted as supporting 


the view that advanced industrialism produces a 
“middle-majority” politics. “Middle-majority poli- 
tics” implies the decline of class conflict between 
the working and middle classes and the emergence 
of a wider area of consensus among less differen- 
tiated social groupings. The gap between elements 
in the working class and the middle class is de- 
scribed as declining, and the political process is 
seen as transformed into a process of pragmatic 
bargaining over specific issues (see Lipset 1960), 
Middle-majority theories are primarily concerned 
with changes in occupational structure and do not 
focus on new sources of social tension and political 
conflict, such as those based on race, ethnicity, and 
religion. These stratification theories also under- 
emphasize the impact of foreign affairs on the polit- 
ical orientations of the electorate. 

While research studies document these trends 
in the transformation of social cleavage for some 
industrialized countries, especially the Anglo- 
American countries and Scandinavia, the middle- 
majority theories have been criticized for failing to 
address themselves to the persistence of working- 
class political behavior, especially in the Catholic 
countries of western Europe. It is also the case that 
the proliferation of middle-income occupations un- 
doubtedly is transforming political orientations in 
the one-party systems of industrialized communist 
societies, such as that of the Soviet Union and 
those of eastern Europe. However, even in the 
absence of adequate empirical studies, it is clear 
that these political changes cannot be understood 
in terms of changes in social stratification alone. 
The development of middle-class occupations has 
very different implications in the developing na- 
tions, in that it contributes to discontinuities in the 
social hierarchy and thereby increases the poten- 
tialities for political instability. 

Public opinion and ideology. Aggregate anal- 
ysis of the social correlates of political participation 
and voting has been augmented by extensive re- 
search on mass public opinion and political ideol- 
ogy. While these research efforts also rely heavily 
on the use of the sample survey, they represent a 
refinement in the intellectual concerns of the social 
stratification approach, since they seek to explore 
the extent to which political attitudes not only 
reflect social structure but are influenced by party 
organization and the mass media. With the growth 
of representative institutions and the spread of 
both literacy and mass media of communication, 
the processes of government, in varying degree, 
become responsive to mass opinion. In turn, politi- 
cal parties and the administrative agencies of ee 
dustrialized societies find it necessary to mobilize 


public opinion in order to achieve mass participa- 
tion in social and economic institutions. 

Systematic research into political opinion has 
produced a body of data which has considerable 
theoretical sophistication and which gives deeper 
meaning to studies of voting behavior and political 
participation. The techniques of opinion measure- 
ment enable the description of attitude structures 
toward specific political issues, political candidates, 
and political institutions. These studies focus on 
the detailed identification of those parts of the 
social structure which are characterized either by 
an absence of political orientations or by political 
orientations which are extremely weak or, at best, 
limited to very specific interests and issues. Politi- 
cal apathy has been found to be concentrated in 
lower-income groups and is a persistent aspect of 
highly industrialized societies, even with increas- 
ing levels of educational attainment. In one sense, 
the major findings drive home, to those political 
sociologists who have held an overpoliticalized view 
of man, a more realistic image that has long been 
recognized as valid by most political leaders. [See 
POLITICAL PARTICIPATION. ] 

The concept of alienation has come to figure 
prominently in empirical research into public opin- 
ion, While this concept is fundamentally diffuse, 
it focuses on understanding the social and psycho- 
logical processes which produce a withdrawal or 
disengagement from political interest and political 
participation. Political apathy appears to be a 
broader category, which includes both alienation 
and socially inherited disinterest in politics. Avail- 
able research does not permit any trend statement 
about an increase or decrease in political alienation 
but instead highlights the social groups particularly 
vulnerable to alienation, such as youth, minorities, 
and intellectuals, These researches are most rele- 
vant when they focus on the process by which a 
person becomes alienated. They imply that aliena- 
tion is not a “steady state” but an orientation which 
can gradually or suddenly be reversed and produce 
direct intervention, outside the normal channels of 
political action. 

The study of mass opinion converges with the 
analysis of popular ideologies. A sharp distinction 
must be made beween the developing and the in- 
dustrialized societies. In the developing nations the 
concept of ideology is probably not applicable 
among traditional and peasant groups. if by “ide- 
ology” is meant a comprehensive, rigidly held, and 
explicit political belief system or “world view” (this 
is not to overlook the obvious and pervasive reli- 
gious “world views” held by the mass of the popu- 
lation). But the intellectuals in these nations who 
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have been trained in Europe or have had prolonged 
contact with Western political thought are deeply 
involved in ideological controversy. Moreover, with 
the rapid expansion of literacy, the mass media, 
and urbanization, middle-class groups in these na- 
tions develop explicit political preferences. In turn, 
ideological alternatives enter mass political debates. 

By contrast, in Western industrialized nations 
this process of ideological diffusion has passed, and 
there has been a growth of consensus about many 
domestic, economic, and welfare issues which re- 
flect the changing character of social stratification. 
At the level of elite or mass opinion, it would be 
exaggerated to speak of an “end of ideology” (Bell 
1960), and more appropriate to refer to a constric- 
tion or transformation of ideology. An ideological 
outlook is still found among elements of the most 
politically active and involved. However, the bulk 
of the population, including better-educated groups, 
do not hold such ideological orientations but, rather, 
hold generalized party preferences which express, 
at best, partial ideologies or pragmatic responses 
to changing political and social circumstances. Spe- 
cific ideological components also emerge and per- 
sist with respect to religious, ethnic, and racial 
issues and conceptions of foreign affairs, and these 
can be held with great intensity by small segments 
of the population. Paradoxically, people whose 
orientation to politics is limited to specific issues— 
often issues which seem peripheral to the central 
questions of political decision making—tend to 
display ideological orientations. As a result, middle- 
majority politics in industrialized societies is com- 
patible with the emergence of minority ideological 
orientations. 

The formation of public opinion and ideological 
attitudes involves an interplay between a person's 
social and psychological background and his par- 
ticipation in organizations and associations, as well 
as his exposure to the mass media. The term “polit- 
ical socialization” refers specifically to the whole 
process of internalization of political values, in- 
cluding the impact of the family and educational 
institutions. Under conditions of rapid social 
change, the relevance of initial socialization vari- 
ables in explaining mass political perspectives must 
be amplified by an understanding of the impact of 
education and involvement in secondary associa- 
tions. In fact, the continuing task of systematic 
empirical research, especially the sample survey, is 
to help clarify the complex processes involved in 
the dynamics of public and political opinion. [See 
SOCIALIZATION. | 

Empirical studies of election campaigns reveal 
the limited extent to which shifts in political at- 
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titudes and in actual voting behavior take place in 
a given campaign, although the amount of change 
is clearly crucial in determining the outcome. It 
is undoubtedly true and obvious, as the research 
literature implies, that long-term political socializa- 
tion has greater impact than the consequences of 
exposure to mass media in a given political cam- 
paign. Nevertheless, the effects of the mass media 
—both long-term and short-term—and the impact 
of party organization are key variables in both 
maintaining and molding political opinions. This 
is particularly the case for persons who do not hold 
firm political beliefs or whose style of life is not 
rooted in memberships in voluntary associations. 
The influence of the mass media operates either 
through local activists and opinion leaders or by 
direct exposure. The notion of “the politics of mass 
society” (Kornhauser 1959) specifically refers to 
those processes which weaken community and as- 
sociational affiliations and expose individuals to 
the pressures of party organization and the mass 
media. Paradoxically enough, there is every reason 
to believe that the impact of the mass media is 
greater in multiparty, democratic systems than in 
one-party systems, where distrust of the media is 
great and where it is recognized that the political 
regime relies more on organizational controls than 
on persuasion. 


Elite analysis 


Political sociologists who have come to consider 
politics as more than a reflection of social stratifica- 
tion and mass ideology have increased their con- 
cern with the analysis of those institutions and 
social systems through which the political process 
operates. The intellectual heritage of those political 
sociologists who seek to synthesize the stratification 
and the institutional approaches to political power 
is diverse. In particular, they have been strongly 
influenced by the “elite” theorists, on the one hand, 
and, on the other hand, by the variety of writers 
who can be called macrosociologists because they 
have taken total societies as the object of their 
analysis, 

Gaetano Mosca (1896) and Robert Michels 
(1911) served as central figures in stimulating 
empirical studies of elites and the sociology of polit- 
ical organization. Their initial formulations were 
concerned with the bureaucratic features of party 
organization and had strong ideological evertones. 
Particularly in the case of Michels, the “iron law 
of oligarchy” was more a definition than it was an 
empirical generalization offered as a fundamental 
barrier to representative institutions. 

As a result of the subsequent development of a 


more objective and detailed theory of organizations, 
political parties and their auxiliary institutions have 
been subject to various forms of empirical analysis, 
Typologies of party organization have been created, 
using such categories as “patronage,” “ideological,” 
“programmatic,” and the like, but these typologies 
were at best transitional to a more detailed study 
of the specific functions that political organizations 
and political elites perform. The writings of the 
University of Chicago empirical school of political 
research, which included Charles Merriam, Harold 
Lasswell, and Harold Gosnell and which in turn 
came to be called political behavior research, were 
crucial in transforming the study of party organiza- 
tions (see, for example, Gosnell 1927; Lasswell 
1936). The effect of this tradition has been to add 
to elite analysis an interest in the effectiveness of 
differing types of party organization on the per- 
formance of such activities as the recruitment of 
new leaders, the posing of political alternatives, the 
maintenance of linkages between the electorate and 
the government bureaucracy, the mobilization of 
mass political participation, and the formulation of 
consent. The literature consists mainly of detailed 
case studies of political parties and covers a wide 
range of political systems. Comparative analysis 
mainly takes the form of paired comparisons (as 
between, for example, the United States and the 
Soviet Union) or of more-generalized models for 
describing the dilemmas facing similar groups of 
nations, such as the developing nations. 

The analysis of political party organization obvi- 
ously involves not only its internal structure but 
also its relation to the sociopolitical balance of 
society. First, there is the focal issue concerning 
the capacity of the economic and industrial sector 
to influence and control political decisions. There 
seems to be widespread agreement between political 
sociologists with differing value assumptions that, 
with the growth of a complex division of labor, 
industrial and economic organizations are con- 
stricted in their capacity for direct management of 
the political process. The complexity of economic 
organization is such, it is argued, that economic 
leaders do not have the skills or programmatic 
approach to maintain complete dominance over 
the political party system, be it a single-party or 4 
multiparty structure. The separation of ownership 
from property control contributes to this process. 
Furthermore, the development of trade union oF 
ganizations often serves as a countervailing force 
to the political power of economic organization in 
those societies where labor unions are autonomous 
organizations, Institutional analysis also implies 
a modification of economic theories of political 


power by calling attention to the growth of profes- 
sional associations, with their ability to exercise 
political power in the name of both science and 
public welfare, and to the political power that 
adheres to large governmental bureaucracies. 

Paralleling the politics of economic institutions 
is the basic balance between the political party and 
the military. The military have considerable actual 
and potential power because of the vast resources 
they command and because of the fundamental 
importance of national security. Nevertheless, per- 
sonal military dictatorships are generally absent, 
since they are incompatible with the political re- 
quirements of contemporary social structure. Politi- 
cal sociologists have sought to describe and account 
for the various forms of political balance which 
operate between modern political parties and the 
military. There is hardly a society in which the 
military do not have some political power. The 
influence of the military varies from that of a 
pressure group to that of an active coalition partner 
in the domestic political structure. In some of the 
developing states the military may serve as the 
nucleus of a modernizing oligarchy, although it 
may be a transitional oligarchy. It is striking to 
note that one-party states, such as Nazi Germany, 
the Soviet Union, and Communist China, have 
succeeded in reducing or eliminating the military 
as a major independent power base in domestic 
affairs. 

There has been a growth of professional and 
voluntary associations that tend to accumulate po- 
litical power, and these organizations have been 
studied, on a selective case study basis, as examples 
of pressure groups. The social structure of an in- 
dustrial society or one in the process of moderniza- 
tion produces a variety of groups, such as old-age, 
youth, and ethnic, cultural, and religious associa- 
tions, which generate political demands through 
their associational representatives. In a multiparty 
system these pressure groups seek direct access to 
the parliamentarians and administrative leaders 
and tend to weaken the party, Even in one-party 
systems, where the base for independent action by 
voluntary associations is limited or carefully con- 
trolled, the official mass party seeks to make use 
of such organizations as devices for communication 
and political support. 

The analysis of elites supplies a conceptual de- 
vice for understanding the patterns of integration 
of institutional power. One important convergence 
in the field of political sociology is in the progres- 
sive increase of research emphasis on elite struc- 
tures. A paradigm has come to pervade the perspec- 
tives of political sociologists: the study of social 
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stratification is augmented by concepts of public 
opinion and ideology; institutional analysis is 
elaborated by the study of elite structures. 

Elite analysis has shifted from an exclusive con- 
cern with social background as a determinant of 
elite behavior to a broader concern with the proc- 
esses of recruitment, career development, and pat- 
terns of interaction. Modern elites tend increasingly 
to be selected by criteria of achievement rather 
than on the basis of inherited social background, 
and as a result they tend to be recruited from 
broader and broader social strata. Furthermore, as 
Karl Mannheim has pointed out (1935), the sheer 
increase in the size and complexity of elite struc- 
tures brings about a growth of heterogeneity, a 
crisis in defining standards of behavior, and the 
necessity of developing new devices for achieving 
agreement and consensus among competing elites. 

The literature of national power structures tends 
to focus on the analysis of specific elite groups. In K 
particular, there are available a series of national 
studies in depth which deal with the recruitment 
and socialization of the parliamentary elites. In 
addition, attention has been paid on a comparative 
basis to the differing patterns of pressure groups, 
especially economic pressure groups, in influencing 
the political process. However, systematic research 
on the differentiation and integration of different 
elite groups, even for countries such as the United 
States and the Soviet Union, is far from compre- 
hensive and adequate. 

The persistence of substantive differences be- 
tween interest group and social strain theories of 
political sociology is reflected in differing models 
of elite behavior. In the analysis of the United 
States, the residues of economic determinism are 
to be found in C. Wright Mills's “power elite” con- 
cept (1956), in which the societal leadership is 
seen as an integrated ruling group of a capitalist 
economic system transformed, in part, by the pres- 
sures of international relations and exercising 
power on an arbitrary basis. The leadership ele- 
ments are based in the industrial and military sec- 
tors operating in conjunction with the professional 
political elite. The economic elites are dominant 
and fuse with the military, while the political elites 
have secondary and circumscribed roles. 

By contrast, a variety of writers, including Robert 
Dahl, Talcott Parsons, Daniel Bell, and Morris 
Janowitz, identify a bargaining model in the United 
States, characterized by a more pluralistic pattern 
of political power. The elites are seen as much more 
differentiated and subject to a system of counter- 
vailing checks and balances. In this approach the 
political elites are crucial to the extent that they 
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perform the mediating and adjusting role between 
the various institutional sectors of society. On the 
basis of this model, the basic political issue is not 
so much the arbitrary exercise of power by a small, 
integrated elite as it is the necessity of creating 
conditions under which a differentiated elite can 
make effective decisions. In the United States, ac- 
cording to the analysis of Shils (1956) and others, 
elite integration presents special problems because 
the creative role of the politician is not adequately 
understood and the respect accorded him by the 
other elite sectors and by the electorate at large is 
relatively low and unstable. 

Empirical research into elite structures has dis- 
tinguished between local—community, metropol- 
itan, and regional—elites and national elite sys- 
tems. Interestingly enough, for the United States 
both the power elite concept and the bargaining 
model highlight the separation of economic power 

` and political elites at the local level. A rich body of 
historical and analytical material describes the proc- 
ess of “bifurcation” of local elites in the United 
States. According to the power elite model, this is 
the result of a shift of political interest to the 
national arena; for the bargaining model, it is the 
outgrowth of the process of “democratization,” 
which brings representatives of ethnic, religious, 
and lower-status groups into political power. 


Macrosociology and political change 


The elite perspective in political sociology has 
been paralleled and broadened by those few but 
influential sociologists who specialize in the study 
of total societies and political change at the so- 
cietal level. These men were stimulated by the 
holistic approach of social anthropologists and, in 
return, have had a profound impact on political sci- 
entists who deal with comparative politics. In the 
slow and almost discontinuous development of 
macrosociology the central issue has been the anal- 
ysis of the impact of modernization on representa- 
tive institutions. In turn, studies of the spread of 
industrial institutions have served to highlight the 
significance of differing political institutions in ac- 
counting for various patterns of national develop- 
ment and the persistence of national cultures. 

Tocqueville's analysis of prerevolutionary and 
postrevolutionary France, The Old Regime and the 
French Revolution, published in 1856, and Thor- 
stein Veblen’s Imperial Germany and the Industrial 
Revolution, which appeared in 1914, stand as land- 
marks of early contributions to the study of the 
interplay of political institutions and social and 
economic development. Both men forecast intel- 


lectual trends, in that they did not produce na- 
tional case studies. Instead they were seeking to 
explain, by implicit comparative analysis, par- 
ticular sequences of societal change which were 
reflected, in the first case, in the outbreak of the 
French Revolution in contrast to the absence of 
such violence in England and, in the second case, 
in the late and authoritarian character of indus- 
trialization in Germany. 

Explicit concern with the theoretical aspects of 
macrosociology is rooted in the diverse approaches 
to the common problems of societal integration 
offered by Max Weber, Emile Durkheim, and 
Ferdinand Ténnies, whose works, among others, 
supplied the basis for the subsequent reformula- 
tions of Talcott Parsons, in The Structure of Social 
Action, published in 1937. At the theoretical-em- 
pirical level, the intellectual pioneer was W. I. 
Thomas, in The Polish Peasant in Europe and 
America, published in 1918-1920. This monu- 
mental work set forth the empirical requirements 
for comparative analysis. His standpoint was both 
intensive, in that he sought to describe and under- 
stand the cultural values of Polish society, and 
comprehensive, in that he sought to analyze the 
full range of social institutions, from family and 
kinship groups to political organization. By juxta- 
posing the development of a relatively integrated 
Polish society in Europe with the social disorganiza- 
tion of the Polish immigrants in the United States, 
he highlighted the differential role of values and of 
political institutions in the process of moderniza- 
tion and urbanization. 

The intellectual vitality of macrosociological per- 
spectives, however, derives less from formal theo- 
retical considerations and more from the dramatic 
impact of contemporary history—particularly, first, 
the rise and transformation of totalitarianism and, 
second, the rapid process of decolonization after 
World War 1. Joseph Schumpeter, in Capitalism, 
Socialism, and Democracy (1942), developed a 
comprehensive and generic analysis of the social 
and political institutions on which capitalism was 
based. His ideas about the transformation of entre- 
preneurial activities into a large-scale organization 
format, the negative role of intellectuals in the 
politics of capitalism, and the decline of representa- 
tive institutions have been seminal formulations. 
Franz Neumann’s Behemoth (1942), an analysis 
of the social organization of the Nazi party and its 
transformation of German economic and social 
structure, and Barrington Moore’s Terror and Prog- 
ress €1954), a similarly comprehensive volume 
for Russia, are most noteworthy studies in depth. 


Comparison of the different elements of totali- 
tarianism and their consequences is dealt with by 
C. J. Friedrich and Z. K. Brzezinski in Totalitarian 
Dictatorship and Autocracy (1956). 

While it is not easy to categorize the so-called 
new nations, they display patterns of similarity 
which contribute to the comparative analysis. 
There are those nations, such as Japan and Turkey, 
which were never under Western colonial rule and 
which embarked early on the process of moderniza- 
tion, However, an overriding distinction, based 
mainly on the impact of colonial experience, has 
become operative. It is possible to differentiate the 
colonial experience of South America, with its wars 
of liberation in the nineteenth century, from that 
of the so-called new nations of Africa and Asia, 
which, with notable specific exceptions, achieved 
independence quickly and without extensive vio- 
lence, after World War n. In turn, these new na- 
tions can be categorized by the type of metropolitan 
tule they experienced—British, French, Dutch, etc. 
—which could be direct or indirect and which was 
imposed on differing indigenous cultural—religious 
systems. 

The central issue in the study of new nations 
after independence hinges on the limitations and 
actual breakdown of multiparty systems in supply- 
ing the political leadership necessary for economic 
and social development. Scholarly writing in this 
area has passed from a focus on individual case 
studies to a variety of types of comparative analy- 
sis. One approach is that found in Edward Shils’s 
Political Development in the New States (1959— 
1960), where he presents a series of generalized 
governmental types, such as traditional oligarchies 
and modernizing oligarchies, and analyzes their 
political dilemmas. Gabriel A. Almond and James 
S. Coleman, in The Politics of the Developing Areas 
(1960), follow a similar approach, but they make 
use of statistical indicators to explain these types 
of political regimes. Alternatively, comparative 
analysis has been pursued by exploring specific 
hypotheses related to a particular institution, such 
as the governmental bureaucracy Or economic en- 
terprise. An example of this approach is Morris 
Janowitz’ The Military in the Political Development 
of New Nations (1964), in which the limitations 
on the capacity of the military to supply political 
leadership are in part accounted for in terms of 
internal organizational and professional factors. 

A fully comprehensive approach to comparative 
political sociology must encompass the distinction 
between industrialized and nonindustrialized na- 
tions. Such work has been stimulated partly by the 
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desire to make use of the data that are available 
and to produce quantitative comparisons and find- 
ings even though the problems of the validity of 
international statistical sources and the compara- 
bility of survey findings have not been solved. Karl 
Deutsch and his associates are representative of 
the efforts to uncover patterns of political behavior 
through refined statistical analysis of the standard 
census-type data for all the political divisions of 
the world. By contrast, more selectively and inten- 
sively, Almond and Verba (1963) have employed 
survey research techniques in countries of Europe 
and in Mexico to probe both political participation 
and socialization of fundamental political values. 

Regardless of subject matter, political sociology 
has developed a common perspective in its focus 
on political conflict and political consensus. It is 
not possible to contend that sociologists have neg- 
lected the study of political conflict for an undue 
emphasis on the study of political consensus. The 
case is that in crisis situations which result in con- 
flict or produce compromise, it is difficult to gain 
access to relevant data; thus, the development 
of the “behavioral persuasion” in the study of- 
politics does in fact encourage a focus on routine 
and ongoing processes, rather than on crises and 
decision-making points. Nevertheless, there is a 
body of monographic literature which describes in 
“natural history” terms the outbreak of political 
conflict—when the pursuit of group interest leads 
to action outside the institutionalized forms of 
political change. This type of phenomenological re- 
search has come to encompass the full range of 
politics, from community conflict to relations be- 
tween nations. Social-psychological approaches de- 
rived from the study of collective behavior or col- 
lective problem solving have been employed to 
handle these empirical materials. The sources of 
political conflict and the means by which consensus 
is created are therefore the central issues for po- 
litical sociologists, even though the practical dif- 
ficulties of studying these phenomena are con- 
siderable. 


Political sociology and political theory 

In the nineteenth century the development of 
representative institutions meant the extension of 
suffrage and an increase in the importance of parlia- 
ment as a device for sharing political power and 
resolving political conflict. In the twentieth century 
the complexities of social structure and of the gov- 
ernmental process have produced a rise in the in- 
fluence of executive leadership and a decline in the 
impact of the parliamentary process. Theorists of 
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the democratic process have therefore had to face 
the task of making an intellectual contribution to 
“institution building,” both to strengthen parliament 
and to make possible representation at new points 
in the political process. 

At this juncture, political sociology faces political 
theory. Political sociologists have been men of 
strong opinions, and they have been concerned 
with the value implications of their work. But it 
is only since the end of World War n, particularly 
under the influence of the newer types of economic 
analysis, that some political sociologists have be- 
come interested in theoretical formulations which 
explore explicitly the conditions under which polit- 
ical democracy would be maximized. If economic 
analysis is designed to maximize the use of eco- 
nomic resources, then political sociology has the 
goal of formulating social, psychological, and eco- 
nomic conditions under which political democracy 
would be maximized. Some theorists, as represented 
by Schumpeter, hold that elections are the hallmark 
of democratic society and that, therefore, the clari- 
fication of the election process is a key task of 
social research. Others, such as Robert Dahl and 
Charles Lindblom, are concerned with the formula- 
tion of criteria which encompass the practices of 
administrative and community agencies. Defini- 
tions of political democracy have been drawn by 
some theorists so as to encompass one-party sys- 
tems as well. Regardless of the particular defini- 
tions, political sociology has come to be linked to 
the analysis of the economic, social, and psycho- 
logical preconditions for political democracy. 


Morris JANOWITZ 


[Directly related are the entries POLITICAL BEHAVIOR 
and POLITICAL SCIENCE. Other relevant material may 
be found in ELITES; IDEOLOGY; INTEREST GROUPS; 
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POLITICAL SYSTEMS 
See under SYSTEMS ANALYSIS. 


POLITICAL THEORY 


The articles under this heading deal with gen- 
eral problems of political theory and their study. 
Specific concepts are discussed in AUTHORITY; 
CHARISMA; CONSENSUS; DemocracY; DUTY; 
EQUALITY; FREEDOM; GENERAL WILL; IDEOLOGY; 
JUSTICE; LEGITIMACY; MAJORITY RULE; NATIONAL 
INTEREST; NATURAL LAW; NATURAL RIGHTS; 
PowER; RESPONSIBILITY; SOCIAL CONTRACT; Sov- 
EREIGNTY; STATE. The development of political 
thought, primarily its development in the West, is 
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also reviewed in ANARCHISM; CONSERVATISM; CON- 
STITUTIONS AND CONSTITUTIONALISM; ISOLATION- 
ISM; LIBERALISM; MARXISM; MILITARISM; PACI- 
FISM; PLURALISM; SOCIALISM; TOTALITARIANISM; 
UTILITARIANISM; UTOPIANISM. For discussions of 
non-Western political thought, see CHINESE POLIT- 
ICAL THOUGHT; INDIAN POLITICAL THOUGHT. Other 
relevant material can be found in GOVERNMENT; 
POLITICAL SCIENCE; SCIENCE, articles on THE HIS- 
TORY OF SCIENCE and THE PHILOSOPHY OF SCI- 
ENCE; and in the articles listed in the guides under 
Law and RELIGION. 


Arnold Brecht 
Sheldon S. Wolin 


I. APPROACHES 
1. TRENDS AND GOALS 


I 
APPROACHES 


1. What is “theory”? 


The term “theory” is used in various ways. It 
always refers to “thought,” as distinct from “prac- 
tice” or “acting.” Not every thought is theory, how- 
ever; reminiscing, for example, is not. The original 
Greek meaning of theðria was that of a well-focused 
mental look taken at something in a state of con- 
templation with the intent to grasp it. In this sense, 
it covered apprehension of being (ontology) as 
well as causal explanation, religious and philo- 
sophical as well as empirical and logical thought. 
In current usage, two meanings must be distin- 
guished at the outset. One is as broad as the ancient 
one, or even broader. It comprises a thinker’s entire 
teaching on a subject (his Lehre), including his 
description of the facts, his explanations (no mat- 
ter whether religious, philosophical, or empirical), 
his conception of history, his value judgments, and 
his proposals of goals, of policy, and of principles. 
This broad usage prevails, in particular, in dealing 
with the history of political theory (see section 5). 
Theory, in its more technical contemporary use, 
however, has a far narrower meaning. It denotes 
“explanatory” thought only or, at least, primarily. 
A “theory,” then, is a proposition or set of proposi- 
tions designed to explain something with reference 
to data or interrelations not directly observed or 
not otherwise manifest. Mere “description” is no 
theory. Nor are “proposals” of goals, of policy, or of 
evaluations. Only the explanations, if any, offered 
for descriptions or proposals may be theoretical; 
the description or the proposal as such is not. On 
the other hand, theory does include “prediction,” 
provided that it follows from an explanation. 

No particular quality is expressed when the 
term “theory” stands alone; the explanation given 
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may be true or false, mature or childish, profes- 
sional or nonprofessional, religious, philosophical, 
or scientific—any explanation someone offers for 
a phenomenon may be called his “theory.” If used 
within a scientific context, however, theory is gen- 
erally meant to refer to scientific explanations only 
(see next section). 


2. What is “scientific” theory? 


That explanations may be either “scientific” or 
“nonscientific” is hardly controversial, but the 
meaning of this distinction is. A fundamental split 
runs through the ranks of contemporary political 
theorists and other social scientists. On one side 
are those who only regard as “science” work done 
in line with scientific method in the strict sense of 
empirical observation and logical reasoning; on 
the other side are those who use the term in a 
broader sense, in which it also accommodates other 
types of (true or putative) knowledge, in particular 
knowledge supplied by “pure reason,” intuition, 
self-evidence, or even religious revelation. The split 
shows most in moral valuations. The former group 
holds that the validity of ultimate moral standards 
of valuation cannot be established through sci- 
entific means; the second, using a broader con- 
cept of science, holds that it can. No scientific 
argument can compel a theorist to agree that the 
term “science” must be used in only one sense. 
Rather, like “theory,” it is used in various ways. It 
will here be employed in the narrower sense, as 
modified below. Not every scholar has to accept 
this use, nor is it claimed that a scientific explana- 
tion is always “better” or “truer” than a religious 
or philosophical one, but merely that it is some- 
thing else and that the knowledge it supplies is 
“transmissible qua knowledge” to a larger degree 
(see next section). 


3. Transmissibility of knowledge 


Whichever way social theorists may use the 
term “science,” they generally concede that there 
are degrees in the transmissibility of knowledge. 
We can, as a rule, communicate our “belief” that 
we know something to others with no particular 
difficulty, but we cannot always transmit our (real 
or putative) knowledge qua knowledge to them. 
Some knowledge is transmissible qua knowledge 
to anyone who is able to understand the language 
used and to carry through the experiments or tests 
referred to; some is not. This makes it advisable 
to distinguish between scientia transmissibilis and 
scientia non transmissibilis, although there are 
gradual transitions, as we shall see presently. 

First, there would be no transmissible knowl- 


edge at all, and consequently no intersubjective 
type of science, unless certain minimal assump- 
tions were accepted, all of the common-sense type, 
especially the following: (a) “consubjectivity” 
(Hocking 1956), that is, the assumption that hu- 
man beings obtain similar impressions through 
their senses; (b) some regularity in nature, with- 
out which there could be no explanations in gen- 
eral terms; and (c) some degree of human freedom 
to form an opinion on truth and falsehood, The 
validity of these assumptions cannot be scien- 
tifically demonstrated; they are “methodologically 
immanent.” As they are universally accepted, how- 
ever, or at least universally comprehended, they 
do not block transmissibility of knowledge. 

Second, while “descriptions” of observations are 
entirely transmissible in principle, other phases of 
scientific procedure are not to the same degree, 
especially not the acceptance of (a) observations 
as sufficiently exact, (b) descriptions as sufficiently 
precise, (c) results of observations as actual facts 
(“reality”), and (d) hypotheses of causal interrela- 
tions as valid. Rather, all decisions to accept are 
exposed to the possibility of challenge. Important 
as this limitation of transmissibility is, it does not 
justify the verdict that on that ground no knowl- 
edge at all is transmissible, since the knowledge of 
documentary descriptions remains fully transmis- 
sible and so does purely analytical reasoning, as in 
formal logic and mathematics. Furthermore, our 
reasons for accepting or refusing to accept and the 
reports on additional evidence through further ob- 
servations are also transmissible. Michael Polanyi, 
who in his book Personal Knowledge (1958) argues 
impressively that any type of knowledge, even sci- 
entific, is ultimately based on personal “belief” or 
“commitment,” has failed to pay attention to the 
difference between nontransmissible “decisions to 
accept” and the transmissible material to which 
they refer; even so, he does not seem to deny the 
heuristic advantages of generally recognized stand- 
ards of science. 

There is a great qualitative difference between 
the limitations of intersubjective transmissibility 
in the area of empirical knowledge and those that 
block transmissibility of moral valuations in the 
face of opposing valuations held in good faith by 
others. Here intersubjective transmissibility fails 
much earlier or indeed altogether. It should not be 
overlooked, however, that the factual assumptions 
on which evaluations are based, as well as the 
biological and psychological factors accounting for 
evaluations, and the consequences and risks R 
which the pursuit of cherished values leads, leni 
themselves to intersubjectively transmissible €x- 


aminations. Full exploitation of these methods can 
do much to reduce the significance of the cleavage 
between the two schools (see sections 6 and 9). 


4. Some distinctions 


Theory and practice. Practice may serve to 
generate, test, or refine theories. We learn by doing, 
by trial (practice) and error (bad theory). But to 
go ahead in practice with no previous theoretical 
thinking may prove very costly and lead to irrep- 
arable damage that could have been avoided by 
forming good theories before acting. If a scientific 
theory is right and complete, it is self-contradictory 
to say that something “may be good in theory but 
is not in practice.” 

Theory and description. Both “description” and 
“theory” (explanation) are legitimate functions of 
science. To achieve, if not an explanation, at least 
a full description of observable phenomena may 
significantly augment the corpus of human knowl- 
edge. Although, as defined above, mere description 
is not theory, theory may help discover describable 
phenomena and aid in getting them properly de- 
scribed; it alone can determine their “relevance.” 

Theory and hypothesis. The term “hypothesis” 
denotes a tentative assumption of facts or of inter- 
it is chiefly used in the latter sense. A 
n explaining phenomena, generally refers 
to some new or old “hypothesis” or law but itself 
is not a hypothesis; it merely employs one for the 
purpose of explanation and may even consist solely 
of that, especially in formulating a new hypothesis. 

Theory and law. If used in the realm of “is,” 
the term “law” denotes regularity of interrelations. 
Propositions intended to describe regularities (laws) 
are, in principle, always subject to challenge in 
view of conflicting observations. Strictly speaking, 
therefore, there are no “statements” of laws but 
only “hypotheses” about laws. If used in the realm 
of “ought,” the term “law” denotes a “norm of con- 
duct” that has been set by someone, man or God. 
To enumerate “nature” among the potential origi- 
nators of moral norms is logically objectionable 
unless nature is conceived of as itself having a 
moral will or purpose; otherwise the moral norms 
cannot be logically regarded as “set” by nature. If, 
however, moral norms are conceived of as set by 
God, it is not in conflict with logic to look to nature 
for intimations of the moral intentions of its 
creator. Theories of this kind, therefore, are not 
blocked by the demands of logic: but they are 
“nonscientific” (except in hypothetical terms) be- 
cause of their reference to premises that can 
be neither confirmed nor refuted with scientific 
means. 
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Theory and philosophy. A theory tries to ex- 
plain “something”; a philosophy, in one use of the 
term, “everything.” Philosophical explanations are 
theories, too, but in their most characteristic efforts, 
nonscientific ones. They begin where science in the 
narrower sense of the term leaves off. A few phi- 
losophers have attempted to reduce philosophy to 
strictly scientific work (Edmund Husserl); yet the 
prevailing use of the term leaves the philosopher 
free to follow intuition and to speculate about 
things and interrelations not accessible to science, 
at least if that is done with full regard to the results 
of scientific work. Nor is the meaning of philosophy 
limited to “explanations”; philosophers may, and 
often do, attempt to “describe” Being as they see 
it. To that extent philosophy is not “theory,” not 
even nonscientific theory, but rather “ontology.” 

The advantage of philosophy over science is its 
greater freedom in methods; its disadvantage, the 
lack of transmissibility qua knowledge. Despite this 
shortcoming, philosophy performs an irreplaceable 
function in human society because of the irrepres- 
sible human search for the unknown. 

Theory and reason. Any attempt to explain 
something is an act of reasoning. “Scientific” theory 
tends to preclude “loose” reasoning in favor of 
precisely determined types of reasoning, especially 
strictly logical deductions from premises whose 
origin is exactly stated, and logical inductions ( gen- 
eralizations) from observed facts, rendering ac- 
count of the number and type of observations. 
When, in deductive reasoning, premises are not 
won through induction but regarded as religiously 
revealed, self-evident, grasped by intuition, ac- 
cepted as traditional, or merely postulated as heu- 
ristically useful starting points, this should be 
clearly stated. Time and again there have been 
attempts to break through the walls of strictly 
scientific reasoning by recourse to principles or 
maxims considered “reasonable” in daily speech, 
especially humanitarian principles or “the founda- 
tions of Western civilization.” However, if those 
principles themselves are questioned, reference to 
a person’s intuition, received revelation, etc. offers 
no “transmissible” knowledge. What can be trans- 
mitted, even then, is thought on the consequences 
and risks inyolved in following or failing to follow 
the principles in question. 

No limits are set in scientific theory to the heu- 
ristic type of reasoning (for example, by analogy, 
intuition, guessing, etc.) engaged in during the 
preparatory stage of selecting promising topics for 
research or working hypotheses. 

Theory and models. In order not to be bogged 
down by the great number of variables in theo- 
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retical research in political science, it is often help- 
ful to construct a “model” out of assumptions and 
hypotheses, for example, a specific type of society 
or of government (see section 12), and then to 
work within the framework of that model. The 
model may reflect either conditions actually pre- 
vailing at a definite time and place or alternative 
conditions considered for the purpose of theoretical 
explorations. In either case, assumptions and work- 
ing hypotheses should be clearly stated. 


5. History of political theory 

Writing on the history of political theory has 
rarely if ever been limited to “explanations” and 
even less so to strictly “scientific” efforts in this 
direction. Rather; employing the term “theory” in its 
broadest sense, it has generally included (a) non- 
scientific types of explanation, especially those of 
a religious and philosophical character, (b) non- 
scientific value judgments, especially moral ones, 
(c) proposals offered for the selection of political 
goals and for political actions, and (d) mere de- 
scriptions of phenomena as seen by political phi- 
losophers of the past. It has, therefore, been a 
blend of philosophy, scientific theory, and descrip- 
tion, with far more space and emphasis given to 
nonscientific philosophical aspects than to strictly 
scientific ones, Typical issues have included Plato's 
“ideas” conceived of as essences having some sort 
of reality, Christian theological dogmas based on 
revelation and tradition, and philosophical doc- 
trines that claim to constitute superior knowledge, 
especially those ascribing to pure reason the power 
to reveal truth independently of experience. They 
have included, more specifically, the doctrines of 
natural law in the sense of a code of moral prescrip- 
tions allegedly to be found in nature or reason, and 
its progeny, the doctrines of natural rights. Indeed, 
natural law constituted the very core of political 
science and political theory from about 1600 to 
1900. The main topics treated under the guidance 
of the various doctrines and approaches were the 
state, its nature, origin, and proper ends; the theory 
of social contract; the proper relationship between 
church and state; sovereignty; the best form of 
government; and the implications of natural law 
for politics, 

Many of these historical efforts have been impor- 
tant stages in the development of human thought 
and Western civilization; they have influenced 
ideals, actions, and the history of institutions. As 
theories, however, most of them were based on be- 
lief rather than science in the narrower sense of 
the term. This is not meant to say that their results 
must have been false or that religious or philo- 


sophical reasoning ought to be abandoned. The 
only concern here is with the fact that the history 
of “scientific” political theory must be culled from 
its fusion with nonscientific religious or philo- 
sophical thought. 

Ideas and methods of a strictly scientific char- 
acter have rarely been entirely absent in political 
thought. The ancient Greek ability of abstraction 
and of logical and mathematical thinking laid the 
foundation of modern science. Sophists and their 
opponents amply referred to empirical observa- 
tions. The Socratic method of confronting debaters 
with questions on the meaning and the implica- 
tions of their propositions and with objections that 
led them to reconsideration, modification, or self- 
contradiction was the early prototype of good sci- 
entific method (“clarification of the meaning of 
propositions and of their implications”). Plato’s and 
Aristotle’s works abound in psychological observa- 
tions that, although not subjected to tests in the 
modern sense, still offer valuable scientific con- 
tributions or cues. Aristotle cultivated the method- 
ical gathering of factual data as the base for theo- 
retical work; he carefully examined the risks and 
consequences involved in institutional arrange- 
ments, actual or imagined ones. There has hardly 
been a political philosopher who did not take some 
time off from speculation to examine facts and 
human motives. The preponderance of religious 
authority in medieval thought, however, beginning 
with St. Augustine (354-430), led to a millennium 
of dearth in strictly scientific work. There was a 
partial revival when reason and Aristotelian thought 
were being readmitted into scientific work as sup- 
plementary sources of knowledge to the extent that 
they were not in conflict with revealed religious 
truth (Thomas Aquinas, 1225?-1274). But several 
centuries were still to pass before the typically 
modern type of scientific theory emerged in the 
realistic examination of actual conditions and 
motivations by Machiavelli (1469-1527), the 
emphasis on inductive methods by Bacon (1561- 
1626), the experimental and mathematical ap- 
proaches of Galileo (1564-1642), Kepler (1571- 
1630), and Newton (1642-1727), and the ensuing 
ascendancy of empiricism. These new methods 
stimulated scientific reflection in political theory. 
No sharp lines, however, were being drawn yet be- 
tween religion, philosophy, and science. The vari- 
ous approaches were used side by side, often 
queerly intermingled and, even if scientific in 1n- 
tent, frequently based on erroneous factual assump- 
tions (for example, social contract) or serie 
logical thinking (for example, inferring ough 
from “is,” as in natural law, or necessity from 


consistency, as in theories of sovereignty [see sec- 
tion 11]). 

Important further steps toward modern scientific 
theory include the emphasis on the significance of 
“doubt” in the works of Descartes (1591-1650) 
and Hume (1711-1776), and the rejection of 
“pure reason” as a source of knowledge by Hume 
and, less radically, by Kant (1724-1804). Kant 
restricted the reliability of pure reason as a source 
of theoretical knowledge in the realm of “is” to only 
a few conditions and steps of thought, such as the 
“categories” of mental operations; in moral (“prac- 
tical”) questions, however, his doctrine that we 
have a priori knowledge of the basic moral law 
still maintained an absolutistic approach. Another 
great scientific impulse for political theory sprang 
from the work done by Montesquieu (1689-1755) 
on the political influence of environment (geogra- 
phy, milieu, etc.) and on methods that are apt to 
check the abuse of power (separation of powers). 
The vigor with which, pursuing these approaches, 
the founding fathers of the United States studied 
the implications, risks, and consequences of insti- 
tutions and the methods by which the abuse of 
power could be checked (separation, federalism, 
a bill of rights) highlighted the development of sci- 
entific political thought. For this reason, the Fed- 
eralist became a milestone in the evolution of 
scientific political theory. But the authors blended 
scientific with religious and philosophical ap- 
proaches (trust in God, natural law, natural rights). 
Characteristic of this promiscuity are the cele- 
brated words in the Declaration of Independence 
that proclaim not that all men are equal but that 
they are “created” equal, and not that they possess 
certain inalienable rights but that they are “en- 
dowed by their Creator” with them. Indeed, to an 
extent seldom fully realized, our Western values 
are based not on science but on religion, history, 
tradition, and the creative idealism of man intent 
on forming a world in which he considers life 
worth living. 

The separation of philosophical and scientific 
ways of thinking from religious ones began in the 
seventeenth and eighteenth centuries in England 
and France, Philosophical and scientific approaches 
remained interwoven, however, until, in the course 
of the nineteenth century, they, too, began to part 
company; indeed, their separation did not grow 
sharp and consistent until the end of the nine- 
teenth century and the beginning of the twentieth. 
What finally brought it to completion, at least 
within a broad and influential section of social 
theorists, was the deliberate elimination of non- 
scientific elements in what came to be called “scien- 
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tific method” and in its offspring, “scientific value 
relativism” (see next section). 


6. Scientific relativity of political values 


“Philosophical” value relativism is dead today, 
but “scientific” value relativism is not. There is no 
need of relativity in speculative philosophical think- 
ing on values. Dogmatic relativity is rather out of 
place there. Any proposition asserting, as a com- 
pelling doctrine, that “there are no absolute values” 
or that “all values are relative’ must indeed be 
rejected; to claim absolute truth for a proposition 
that there is no absolute truth is self-contradictory. 
But an examination of values that keeps within 
the boundaries of well-defined scientific methods 
must be as relative in its results as it is in its 
methods, This is the meaning of “scientific value 
relativism.” It does not teach that all values are 
relative and that there are no absolute values. In- 
stead, it merely says that the question whether 
something is “valuable” can be examined scien- 
tifically only in relation either to (a) some goal or 
purpose for the pursuit of which it is or is not 
useful (valuable), or to (b) ideas held by someone 
regarding what is or is not valuable. “Ultimate,” 
“highest,” “absolute” values or standards of values 
are “chosen” by mind or will, or conceivably 
“grasped” by faith, intuition, or instinct; but they 
are not “proven” by science, although science can 
help a great deal in clarifying the meaning of 
ideas about values and the consequences and risks 
entailed in their pursuit (Brecht 1959, pp. 117- 
118). On these grounds, the relative approach to 
values is mandatory for “scientific” examinations 
only; it does not bind religious or philosophical 
thought. Nonscientific convictions held on the 
absolute character of certain moral values may 
well be in line with transcendental truth; they are 
if there is a God who has set the order of values 
as we feel it to be, but whether that is so is not 
for science either to affirm or to deny. The rebuttal 
that scientific arguments too are in need of “ac- 
ceptance” (Kaufmann 1944) or in need of “belief” 
or “commitment” (Polanyi 1958) does not wipe 
out the qualitative difference in transmissibility 
that severs evaluative from scientific arguments 
(see section 3). 

Historically speaking, scientific value relativism 
is of rather recent origin. It did not come to be 
overtly postulated until the turn of the century. Its 
first formulations—still imperfect in part—were 
published by German and Austrian scholars who 
had to vindicate their academic independence from 
the dominant fashion of political value judgments 
around them. Among them were philosophers Hein- 
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rich Rickert and Georg Simmel, political scientist 
Georg Jellinek, sociologist Max Weber (who in 
1904 wrote the most influential article on it), and 
the jurists Hermann Kantorowicz, Gustav Radbruch, 
and Hans Kelsen. The first four did not use the 
name “relativism”; the last three did. Less outspoken 
varieties of the same methodological principle 
(“latent” relativism) had begun to permeate the 
work of many other social scientists elsewhere— 
especially American pragmatists and lawyers such 
as Justice Holmes, French sociologists, and legal 
positivists everywhere—at about the same time, 
and have spread ever since. 


7. Scientific and legal validity distinguished 


Scientific validity of moral and legal principles 
derived from nature (natural law) has frequently 
been claimed on the ground that otherwise law 
courts, government officials, and the public at large 
would have to accept as legally valid any govern- 
mental commands, even those of the most cruel 
character, if given in a formally correct manner. 
This reasoning is fallacious, however. Rejection of 
the scientific validity of natural law does not imply 
acceptance of legal positivism; the latter is not a 
scientific necessity. Whether commands issued in 
the form of law (for example, laws declaring at- 
tempted suicide, mercy killing, adultery, homo- 
sexuality, racially mixed marriages, or violations of 
“National Socialist principles” to be punishable 
crimes, or laws ordering mass killings of a certain 
type of people) should be considered legally valid, 
that is, binding for courts, government officials, and 
the public at large, depends on human ideas and 
designs concerning the legitimate powers of gov- 
ernment. These ideas and designs are motivated 
not only, and not even chiefly, by scientific thought, 
but more generally by ideas concerning the de- 
sirable type of government and the desirable way 
of life. Whether commands exceed the powers 
people want to concede to government is a question 
quite distinct from that of violating scientific truth. 
He who denies scientific validity to higher law or 
natural law may yet deny legal (juridical) validity 
to legislative commands not because they are in 
conflict with scientific truth, but because they have 
been issued in excess of the powers conceded to 
government. 


8. Natural law and scientific political theory 


Many references to natural law as a legitimate 
source of scientific political theory either are logi- 
cally defective because they derive inferences in 
the form of “ought” from observations in the realm 


of “is” or are otherwise lacking in intersubjective 
transmissibility because they presuppose the divine 
will they want to prove. There are, however, three 
natural-law categories that can be fruitfully de- 
veloped in scientific political theory. They consti- 
tute a significant meeting ground for the two an- 
tagonistic schools. 

Universal characteristics of human nature. Our 
knowledge of typical characteristics of human 
nature is “transmissible” qua knowledge to the 
extent that the respective traits are actually shared. 
But which traits are present in all men, or even 
in all members of a large group, cannot be decided 
by pure reason, since inductive generalizations are 
in need of empirical corroboration. Often empirical 
evidence is so strong as to leave no room for 
reasonable doubt; this applies to the characteristic 
phenomena of hunger, thirst, pain, joy, sexual 
desire, desire for affection, and the ability to 
think, to remember, to reflect, and to concentrate 
attention on something that is being thought of as 
real or as only imaginary. Regarding some other 
traits, the evidence, while less compelling, may still 
be strong enough to justify the tentative “hypoth- 
esis” that a feature is universal, for example the 
desire for being respected by others and for self- 
respect. Similarly in the area of justice, there is 
strong evidence that the experience of ideas or 
feelings of justice and injustice is a universal 
human phenomenon and that even under the most 
diverse cultural environments such ideas or feel- 
ings always include postulates of (a) truth, or Vers 
acity, in discriminatory statements, (b) generality 
of the system of values applied, (c) equal treat- 
ment of what is equal under the accepted system, 
(d) respect for freedom insofar as it is not in cona 
flict with that system, and (e) respect for the limits 
of possibility set by nature. 

Empirical research on the universal nature of 
these and other human traits is highly significant 
for political theory. Tentative hypotheses, formed 
on the basis of material already available, may 
anticipate the results of further research. But the 
ever-present temptation to consider one’s own pref- 
erences universal must be resisted in strictly sci- 
entific work. Furthermore, the theorist must always 
keep in mind that universality of a human trait 
alone constitutes no sufficient ground for the infer- 
ence that it must be universally respected. The piina 
ciple that all men, in spite of their individual 
differences, ought to be treated as being essentially 
equal has its source in religion, history, tradition, 
and man’s creative idealism rather than in science, 
which can do little more than try to find out m 


what respects men are or are not equal. The only 
scientific conclusion safely to be drawn from the 
universality of human traits is that ignoring them 
will lead to resentment, opposition, and possibly 
revolt or revolution; respecting them will please 
those concerned, but will possibly antagonize 
others. These inferences are by no means negligi- 
ble. They enable us, for example, to expect wide 
support for shining acts of justice and wide oppo- 
sition to acts of flagrant injustice even where sup- 
port or opposition does not immediately show up. 

Self-ayvenging laws of human conduct (poena 
naturalis). Just as our knowledge of the nature 
of things often enables us to predict undesired 
consequences of individual behavior (for example, 
the early waste of physical strength and moral 
fiber in the dissipated life of a playboy and drunk- 
ard), so our knowledge of social interrelations 
justifies predictions of undesired consequences 
likely to follow, for example, from selfishness, 
visionless materialism, corruption, and lack of 
“virtue,” especially from absence of a spirit of 
service and devotion in public life. 

Impossibility (limited possibility). The objec- 
tion raised against a proposed political action that 
it cannot be executed, or against a proposed goal 
that it cannot be reached by the means available 
or by any means, is relevant irrespective of any 
value judgment regarding the desirability of the 
goals pursued. The limits set by nature or environ- 
ment to what can be done or attained are manifold; 
there are physical, biological, psychological, logical, 
and legal impossibilities. Most are so obvious that 
political scientists tend to overlook their theoretical 
significance. It is impossible to work continuously 
for 24 hours a day, or to work without food, or to 
work hard when ill and underfed; or not to think 
when fully awake, or to enforce a command not to 
think of something; to equalize the physical and 
mental conditions of all individuals, their state of 
health, the length of their lives, their character, 
the family life in which they grow up, the happi- 
ness of their matings, the number, length of life, 
and behavior of their children and friends, the 
satisfaction they find in their work, and many other 
conditions of personal happiness; to establish full 
economic equality or, if established for a day, to 
maintain it over any length of time; to supply 
desired goods to everyone who wants them if there 
are fewer goods than wanted; to have any two of 
the following at the same time: equality per capita, 
equality according to need, equality according to 
quality of work, equality of opportunity; to make 
economic conditions more equal than they are 
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without interfering in liberty (Brecht 1959, pp. 425 
ff.). Also, it is often impossible to reach a desired 
goal with no undesired side effects. More specifi- 
cally, it is impossible to guarantee that a person to 
whom dictatorial power is given, even if he be good 
and wise today, will be so tomorrow and that his 
successor will also be good and wise. It is impos- 
sible for one man, or a small group, personally to 
watch over many millions of people or to control 
the conduct of more than a small number of agents 
and keep well informed of their doings without the 
aid of independent institutions (ibid., pp. 437 ff.). 
The limits set by nature to a man’s direct “span of 
control” illustrate that social scientists can make 
certain nonmoral statements based on the laws of 
nature (in this case, biological ones) with the 
same assuredness as natural scientists. [See NAT- 
URAL LAW.] 


9. Political goals 


It has often been said that scientific political 
theory can deal with means only and must leave 
the deliberation of ends (goals, goal-values) en- 
tirely to politics, philosophy, or religion. This sharp 
antithesis is incorrect, however. Scientific political 
theory is able to deal with ends in various signifi- 
cant ways. It can examine (a) the meaning of 
goals or goal-values and all the implications of that 
meaning, (b) the possibility or impossibility of 
reaching them, (c) the cost of pursuing and reach- 
ing them, especially the price to be paid through 
the sacrifice of other goals or values and through 
other undesired side effects, (d) all other conse- 
quences and risks involved, and (e) implications, 
consequences, and risks of alternative goals (goal- 
values), so as to make an informed choice possible. 

To engage in these various types of legitimate 
examination of goals is the proper task of scientific 
political theory. Far greater efforts of political sci- 
entists should be diverted to the critical examina- 
tion of goals and goal-values than have been in 
the past several decades; indeed, the examination 
of goals might well be put back in the center of 
political theory, where it used to be in former cen- 
turies; however, in scientific work it should now be 
carried through with strictly scientific means only. 


10. Basic units of political theory 

There has been a growing trend recently toward 
“conceptualization” in political theory, especially a 
search for “basic” concepts apt to be useful as 
building stones for a mature political theory. To be 
useful they have to be not only clear but realistic 
in the sense that they or their analytical derivatives 
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correspond to carefully observed and politically 
relevant facts. The following concepts have been 
suggested as basic: 

Group (Arthur F. Bentley, Harold D. Lasswell, 
David Truman, David Easton, Oliver Garceau, Earl 
Latham, Richard W. Taylor, Phillip Monypenny, 
Gabriel Almond, et al.). The group is in many con- 
texts regarded as a more realistic basic concept 
in political theory than the individual [see POLITI- 
CAL GROUP ANALYSIS; POLITICAL SCIENCE]. 

Equilibrium (same authors). Borrowed from 
physics and economics, this concept is considered 
particularly useful for political theory as well, al- 
though some writers have wondered whether “dis- 
equilibrium” would not be more realistic as a focal 
point in politics since no full equilibrium ever 
seems to be established. 

Power, control, influence (George E. G. Catlin, 
Charles Merriam, Harold Lasswell, C. Wright Mills, 
Bertrand Russell, Hans Morganthau, James G. 
March, et al.). These have been called the “most 
basic” concepts for political theory. 

Action (Talcott Parsons, Edward Shils, Hannah 
Arendt, et al.). This concept, in the sense of a 
deliberate new start from a given situation directed 
toward some anticipated goal and engaged in with 
intelligence and expense of energy, puts emphasis 
on a distinctly human factor. 

Elite (Vilfredo Pareto, Harold Lasswell, Milovan 
Djilas, et al.). The idea of elite emphasizes the 
basic relevance of the phenomenon that there are 
leading groups or strata in any system of society 
[see ELITES]. 

Choice and decision making (Max Weber, Gus- 
tav Radbruch, Chester I. Barnard, Herbert Simon, 
Talcott Parsons, Arnold Brecht, et al.). These are 
concepts that have risen sharply in the esteem of 
political theorists as basic ones for descriptive as 
well as theoretical work [see DECISION MAKING]. 

Anticipated reaction (Carl J. Friedrich) and 
game (John von Neumann, Oskar Morgenstern, 
et al.). Both are subconcepts within the general 
theory of choice and decision making—and they 
have led to a particularly intensive, highly tech- 
nical series of studies [see SIMULATION]. 

Function (Robert K. Merton, Talcott Parsons, 
et al.). In the sense of a function or dysfunction 
performed—either purposively or with no purpose 
—for or against society as a whole or some part of 
it by any socially relevant factor (including usage, 
belief, behavior pattern, institution, science), this 
is primarily a sociological concept apt to serve 
descriptive purposes, but it also has some explana- 
tory merit [see FUNCTIONAL ANALYSIS]. 

There is, furthermore, a marked tendency to 


form concepts that are apt to serve as basic units 
not for one field alone but for all the social sciences 
or indeed for all science. 

Useful as the newly developed concepts are, it 
would be misleading to expect that they will sup- 
plant the continuing relevance of older concepts, 
such as institution, government, justice, liberty, 
equality, and—more basic perhaps than all others 
—cause and effect, consequences, risks, possibility 
and impossibility (see section 8), and universal 
human features. No all-comprehensive political 
theory can be based on only one of the older or 
newer concepts in isolation, and no single concept 
is likely to emerge as the most basic, most construc- 
tive one. Each deals with a different aspect of the 
multifaceted phenomena referred to when we say 
“political.” 

This warning applies, in particular, to over- 
estimating the concept of “power” as the basic unit 
of political science. This concept, not unlike that 
of happiness, which previously had long dominated 
Anglo-American political theory, is too broad and 
too vague to serve as a well-defined basic unit. It 
disregards the great variety of both means and 
purposes of power. Brute force or threat of its use, 
prestige, authority, persuasion, wealth, personal 
attraction, beauty, charisma, heroic deeds, prom- 
inence in sports or arts, humility, altruism—all 
give “power.” Even ideas have often been called 
powers. Furthermore, a person may not have 
sought the power he has and may not use it for 
political purposes. Although not apt to be treated 
as the basic unit in political theory, the concept of 
power, if used with care and qualification, is indis- 
pensable for it. We must, in particular, distinguish 
between that type of power (“p-power”) that de- 
notes the constitutional or legal right or authority 
to do something (for example, the power of a 
legislature to issue valid laws), and the factual 
power (“m-power”) to influence the use of the legal 
power or to circumvent it [see PowER]. 


11. Changes in theoretical topics 


The topics that had attracted the most attention 
up to the beginning of this century, such as the 
ends of the state and the proper goals and the 
best form of government (see section 5), have re- 
ceded to the background under the impact of the 
distinction between scientific and nonscientific 
political theory, since scientific theory is unable to 
say what is “best” or “proper” in absolute terms, 
that is, with no reference to the questions for 
whom” and “for what” it is best or proper- One 
that has been specified, scientific theory can i 
its services; but it cannot, by itself, answer e 


questions “for whom” or “for what” in absolute 
terms (see section 12). 

Likewise, the concept of “sovereignty” has lost 
much of the glamour that held political theorists 
spellbound for more than three hundred years [see 
SOVEREIGNTY]. It has not, however, completely lost 
theoretical significance. It is logically conclusive to 
argue that (a) as long as people are inclined and 
able to resort to violence, peace can be secured only 
by preventing such outbursts, if need be, coercively, 
and (b) as long as several agencies, whose meas- 
ures may be in conflict, assume that preventive 
function, peace is not secured unless one agency 
or body is given the final decision on quarrels be- 
tween them, Hence, under the two major assump- 
tions that peace is always more desirable than 
violent action and that coercive intervention by 
some higher authority is always more valuable than 
private violence, the postulate of sovereignty re- 
mains theoretically useful, provided it means no 
more than the logical derivations from those two 
assumptions. The orthodox theory of sovereignty, 
however, did involve far more than that. It included 
a sort of triangular definition of sovereignty, state, 
and law to the effect that “state” is the unit within 
which “sovereignty” determines what is or is not 
“law,” and that law is only that, and all that, which 
the sovereign state determines to be law. This con- 
ceptual triangle, persuasive by its logical consist- 
ency, cannot claim to be the only logically consistent 
theory or to be the only desirable one. Even under 
its own terms it would be more consistent to ascribe 
sovereign power to a global type of government 
rather than to isolated state governments. Further- 
more, some functions may be either so peaceful 
or so specific in character as not to need a common 
sovereign to prevent violence among the agents 
devoted to their care. Finally, certain norms may be 
considered legally valid without being laid down 
by the government (for example, those demand- 
ing respect for human dignity) or legally invalid 
despite being proclaimed by it (for example, cruel 
commands). Analysis of such problems remains a 
proper function of political theory. 

Other topics of strictly scientific political theory 
include the following: 

(a) Types of power and influence, with special 
regard to novel methods and discoveries of psychol- 
ogy, manipulation of opinions, and mass-media and 
other communication problems. 7 

(b) Types of choices and decision making, with 
special regard to distinctions to be made between 
them. 

(c) Types of values or valuations that can be 
distinguished theoretically as objects of choice (de- 
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cision), or types of goals, and the implications, 
consequences, and risks of their pursuit. 

(d) Rival forms of government, conceivable al- 
ternatives never tried, and the implications, conse- 
quences, and risks involved in each. 

(e) Details of constitutions, and the implica- 
tions, consequences, and risks of each. 

(f) Alternatives in the setup of administrative 
institutions and in the methods of manning them, 
and the implications, consequences, and risks of 
each. 

(g) Alternatives in the conduct of foreign pol- 
icy, and the implications, consequences, and risks 
of each. 


12. Scientific political theory of democracy 


This section will illustrate briefly the manner in 
which modern scientific theory tackles types of 
government. Although only democracy will be an- 
alyzed here, other types of government are subject 
to the same method of scientific examination. 

No scientific argument enables us to compel the 
use of any particular definition of the terms “de- 
mocracy,” “communism,” “socialism,” and the like. 
Neither the etymological origin of the terms nor 
the meaning that has been actually associated with 
them at a particular time and place justifies con- 
clusive statements to the effect that they should 
be used one way and not another, especially since 
their use has actually changed a great deal in the 
course of history. Scientific political theory can, 
however, greatly facilitate an orderly discussion by 
listing and, through neutral symbols, distinguish- 
ing the changing historical and contemporary uses, 
and the evolving alternative uses. Thus, a theoret- 
ical analysis of democracy (D) may list the follow- 
ing contemporary uses: 

D, =M = Majority rule in the sense that the 
majority is considered entitled to decide any ques- 
tion in any manner. 

D,=hR +iC + M[1 — (hR + iC]) = Human rights 
and independent courts considered exempt from 
executive and legislative interference, majority rule 
limited to other affairs (this is today the prevailing 
use of the term “democracy” in Western language). 

D, => gW + E = Successful promotion of the 
general welfare and of equality, regarded as both 
goal and criterion of democracy, its achievement, 
if necessary, to be taken care of by a specifically 
trained “vanguard” of the people (this is the cur- 
rent communist use of the term “democracy”), 

D., Ds, «+: = Any other meaning of the term; 
either discovered in actual usage or proposed. 

From these basic distinctions, political theory 
can proceed to further specifications by listing, in 
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each case, the ends (ideals, goals) whose pursuit 
is meant to be suggested or postulated by the term 
democracy, and the means (institutions) by which 
they are to be pursued. Under alternative D, (cur- 
rent Western use) we may tentatively put down as 
ends: government for the people, respect of human 
dignity, development of each to the best of his 
abilities, general welfare, liberty, equality (at least 
of opportunity and before the law), and justice; 
and as means: government by the people, universal 
adult suffrage, periodic elections of legislatures 
and of the chief executive or cabinet, independent 
courts, civil liberties (freedom of speech, press, 
assembly, association, religion, etc.), and other 
items. Again, both these lists of ends and means 
cannot be fixed dogmatically as the only correct 
ones, but must either refer to definite historical 
examples, such as the American, British, French, 
West German, Swiss, or Scandinavian specimens, 
or be kept open for any proposed modification or 
addition. All this amounts to building a series of 
“models” as the base for detailed work (see sec- 
tion 4), 

Each item on the list of ends is then to be sub- 
jected to a thorough cross-examination of its mean- 
ing, possible alternatives of its meaning, implica- 
tions, consequences, risks (see section 9), and of 
its compatibility with the other goal-items. When- 
ever goal-items collide, possible modifications and 
relative priorities must be explored. 

Each item on the list of means is likewise exam- 
ined as to its meaning, possible alternatives of 
meaning, implications, consequences, and risks, and 
especially regarding the compatibility or incom- 
patibility of consequences to be expected from the 
use of the respective means with the ends in the 
previous column. To illustrate: (a) The means- 
item “periodic elections” involves the emergence of 
parties and of uneven influences exercised on the 
outcome of the elections by party bosses or secre- 
taries, press lords, radio magnates, money inter- 
ests, etc. (b) Elections may be carried through in 
various ways (plurality, absolute majority, propor- 
tional representation, etc.). Scientific political the- 
ory examines the distinctly different consequences 
of these methods for the results of elections, for 
the types of candidates that are likely to be pre- 
sented and elected, for their behavior before elec- 
tion and after, especially in the legislative bodies 
(on the assumption that they want to be re-elected), 
and the influence all this is likely to have on the 
election of cabinets where that is a function of 
the assembly (majority or coalition cabinets), on the 
frequency and character of cabinet crises, on pres- 
idential elections, and the like. (c) Majority deci- 
sions of legislative assemblies differ according to 


the modes of vote taking used, particularly in the 
event that three or more alternative proposals are 
put to a vote. Small groups (like the Gauche radi- 
cale in France from 1924 to 1932 or the Free 
Democratic party in West Germany around 1960) 
may obtain pivotal importance and power quite out 
of proportion to the number of people represented 
by them. The internal organization of the legisla- 
tures (committees, rules of procedure, etc.) also 
influences the results. (d) The item “executive 
branch” involves the problem of public administra- 
tion. Scientific political theory examines the con- 
sequences and risks that must be expected from 
the establishment of any of various alternative 
systems of administrative organization, such as 
the American “departmental” or the European 
“ministerial” system, and of either a relatively 
stable body of public employees (civil service) or 
a relatively floating one (spoils system), and of the 
various types of education and training of em- 
ployees. (e) The institutional item “civil liberties” 
calls for an examination of the extent to which the 
respective institution is apt to serve the ideal goals 
or to thwart their realization. Numerous other 
topics and subtopics can be examined in the same 
way. 

The result, almost item for item, is certain to be 
that some of the consequences of the institutional 
means inevitably collide with the envisaged ends. 
These means lead, in part, to nonequality instead 
of equality, to decisions by minorities instead of 
majorities, to government not consented to instead 
of government by consent of the governed, and to 
bargaining for special interests instead of promo- 
tion of the general welfare; in the case of public 
administration they lead either to favoritism (non- 
equality) and inefficiency (no general welfare) or 
to bureaucracy (rule by minorities, nonequality, 
although possibly efficiency); in the case of civil 
liberties they lead to criminals going scot-free be- 
cause of procedural restrictions (nonjustice ) and 
to slander and blackmail facilitated by the institu- 
tional liberty of the press (nonliberty, nonjustice), 
etc. i 

Examination of these and other inconsistencies 
can be accompanied by the discussion of potential 
“remedies” (reforms), such as corrupt practices 
acts or new elections (“turn the rascals out”). ~ 

Other great problems calling for theoretical in- 
vestigation include the handicaps that, under the 
democratic rules of the political game, slow om 
or entirely prevent long-range planning not ae 
where such planning is clearly undesirable bu! 
also where, in the interest of the democratic goals, 
it is actually desired by the majority of the people- 
Under the democratic rules, no majority of today 


can bind the majorities of tomorrow, except by 
constitutional amendments (or, to a lesser degree, 
treaties and contracts), This handicap calls for 
close theoretical studies of its consequences and 
risks and of possible remedies, especially the draft- 
ing of minimal deviations from the democratic 
structure that might be considered, for example, 
when an underdeveloped country is in need of 
industrialization or other reforms that cannot be 
achieved under the democratic rules or at least 
not as quickly as desirable or necessary. 

It is theoretically of great importance also to 
examine the conditions of life, communication, 
education, public spirit, etc., that must be assumed 
to prevail in a country in order to enable the demo- 
cratic form of government to operate in line with 
expectations. Distortions that are likely to follow 
from the absence of any or all of those conditions 
must be exposed to adequate attention. 

Finally, no theory of democracy is complete 
unless it includes a comparative analysis of the 
implications, consequences, and risks that pertain 
to rival forms of government, since the claim to 
superiority raised for democracy is based, to a 
considerable extent, not on faultless excellence but 
on the graver perils that threaten from other forms 
of government, especially totalitarian ones. [See 
DEMOCRACY; DICTATORSHIP; TOTALITARIANISM. ] 

This illustration shows not only that there are 
a great number of topics that need close theoretical 
examination but that a scientific political “theory 
of democracy,” fully carried through, would fill 
many volumes. This may serve to explain why no 
such comprehensive work yet exists. Concentrating 
on a few items that seem most important to a par- 
ticular writer is feasible, of course, and may be 
useful. Also, the painstaking work on individual 
items that is going on in many countries might be 
supplemented by periodic surveys of the whole 
field, which would consolidate the results of the 
partial studies. 

Whenever a political leader, a political philos- 
opher, or a scientific theorist produces what he 
claims to be constructive ideas about a desirable 
remodeling of democracy through new formula- 
tions of goals or new modifications of means, it 
becomes the task of scientific political theory to 
examine the implications, consequences, and risks. 

ARNOLD BRECHT 
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History of Political 


I 
TRENDS AND GOALS 


Organized political societies flourished long be- 
fore men theorized about them. Political beliefs, 
including notions of authority, obedience, justice, 
and political explanations, also preceded the ap- 
pearance of theories and are to be found in the 
myths, sagas, and folklore of all ancient literate 
peoples. J 

The idea of theory as a form of systematic 
knowledge systematically pursued developed out 
of the intellectual revolution which, beginning in 
the sixth century B.c. in Greece, resulted in re- 
markable achievements in drama, science, mathe- 
matics, history, and philosophy. One of the radical 
notions which accompanied this creative outburst 
and which came to form the basic presupposition 
of political inquiry was that thought could abstract 
certain colligated phenomena from their more 
comprehensive setting and subject them to ger 
tiny, explanation, and even control. Political . e 
was gradually defined as one form of coe 
or “family,” of phenomena. The crucial step 1n | $ 
process was the attempt to understand politic: 
life as formed of both the actions between men 
and those between men and their environment. 
This understanding was pursued with progres- 
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sively less reliance upon explanations based on 
occult forces or the intervention of the gods. 


Classical political theory 

The classic form of political theory took shape 
in fifth-century Athens and was largely the work 
of Socrates and his circle. This achievement was 
consolidated by Plato and Aristotle, and it involved 
primarily an attempt to forge a synthesis of three 
elements: politics, the idea of a theory, and the 
practice of philosophy. The success of the en- 
deavor is suggested by the fact that the synthesis 
survived until the late nineteenth century, when 
science came to usurp the place of philosophy in 
the synthesis. The nature of the classical synthesis 
is best revealed by considering its constitutive 
words and their meaning. [See the biographies of 
ARISTOTLE and PLATO.] 

“Political” derives from a family of Greek words 
relating to the polis, or city-state—for example, 
politeia (“constitution”), polites (“citizen”), and 
politikos (“statesman”), all of which connoted 
public concerns and, hence, formed a contrast 
with what was regarded as private or “one’s own” 
(idion). Thus, the subject matter of political theory 
consisted of those matters and happenings that 
were of public concern. 

The second element of the synthesis, theory 
(theoria), also represents a confluence of mean- 
ings. Originally, a “theorist” (theoros) was a 
public emissary dispatched by his city to attend 
the religious festivals of other Greek cities. Theoria 
referred at first to a festive occasion, but gradually 
it acquired the connotation of a long journey un- 
dertaken to see (theorein) different lands and to 
observe their diverse institutions and values. The 
way in which these meanings were gathered to- 
gether and given a political orientation is illus- 
trated in Plato's Laws. Among the institutions of 
the projected city was one which provided for a 
few carefully selected theorists (theorei), who 
were to be sent abroad for the purpose of observ- 
ing the educational and legal systems of other 


cities and conversing with politically wise and’ 


learned men there. Upon his return, the theoros 
was required to present the results of his investi- 
gations to the highest political authority in the 
community, This cluster of notions furnished 
some of the features which were to become the 
distinguishing marks of a theory: the observation 
of practice and the collection of experience; the 
achievement of perspective upon one’s own society 
by an act of liberation which brought the theorist 
into contact with a wide range of comparative 
political experience; and, finally, the process of 
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appraising the importance of what had been ob- 
served in the light of what was known. 

As the third element in the synthesis, philosophy 
provided the conventions which were to govern 
the systematic pursuit of knowledge. They cen- 
tered on the task of establishing the grounds for 
believing some assertions to be true and others 
erroneous. In addition to signifying the quest for 
reliable knowledge, philosophia meant the love or 
passionate pursuit of wisdom—that is, a knowl- 
edge which would enable men to become wiser in 
the conduct of life. When directed to political 
matters, the aim of philosophy was, as Plato stated 
it, to make men wiser in the handling of common 
affairs and, thereby, to benefit all who lived in the 
political community. 

Although many ancient writers contributed to 
the development of political theory, Plato and 
Aristotle were the most influential in determining 
its methods and objectives for the next several 
centuries. The viewpoint of the present discussion 
is that political theory consists of certain fairly 
well-defined conventions relating to methods of 
inquiry, the constitution of the subject matter, and 
the purposes of inquiry. A new theory represents 
an attempt to challenge the prevailing conventions 
in order to substitute new ones. The history of 
political theory contains a record of these chal- 
lenges and of the changing conventions which 
have governed the practice of theorizing. For most 
of these challenges, the point of departure was 
the idea of political theory as formulated by Plato 
and Aristotle. Their formulation represents the 
truly classical paradigm, and if later alternatives 
are to be understood, some attempt must be made 
to state what that paradigm consisted of. 

(1) Political theory was the practice of system- 
atic inquiry whose aim was to acquire reliable 
knowledge about matters concerning the public 
province, Knowledge was valued as the supreme 
means for improving the quality of human life 
in the political association. As a philosophical pur- 
suit, theory sought to establish a rational basis for 
belief; as a politically inspired pursuit, it sought 
to establish a rational basis for action, 

(2) Classical theory identified the political with 
the common involvements which men shared by 
virtue of membership in the same polis. Romans 
of the republican period called their political order 
a res publica, literally, “a public thing”; the same 
idea was reflected in the sixteenth-century English 
usage of “commonweal.” The core meaning of 
“political”—a sharing of what is common—was 
eloquently expressed by Cicero: “Further, those 
who share Law must also share Justice; and those 
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who share these are to be regarded as members 
of the same commonwealth” (De legibus 1, 7.23). 
Theory was not restricted to the problems of se- 
curing and extending the common benefits of po- 
litical life; it was shaped by the sobering recogni- 
tion that there were common predicaments and a 
common fate issuing from politics and that the 
ordinary evils besetting human existence tended 
to be magnified by politics because where power 
was concentrated, the possibilities of injustice and 
violence, whether intended or inadvertent, were 
enhanced. 

(3) The polis was the basic unit of analysis 
adopted by classical theory. It was, according to 
Aristotle, the highest and most comprehensive 
association, which included all lesser associations 
within it. This belief formed the basis for the 
classical idea that theory dealt with political 
wholes or, stated differently, that a theory must 
be as comprehensive and inclusive as the political 
association itself. In working out this assumption, 
classical theory specified what were the significant 
“parts” of the polis, how they functioned, and what 
their effect was on the quality of life in the polis. 
In this way theory came to view the polis as com- 
posed of, and dependent upon, various interrelated 
structures, The structures which received most 
attention were certain structures of activity (e.g., 
ruling, warfare, the settling of disputes, education, 
religious practices, and economic production); 
certain relationships (e.g., between social classes, 
between types of superiority and inferiority, be- 
tween authorities and subjects); and certain struc- 
tures of belief (e.g., concerning the gods, the 
meaning of justice and equality). Here, as in so 
much of classical theory, a moral concern with 
the quality of political life provided the impetus 
for developing analytical methods and concepts. In 
trying to state what a “right” structure looked like, 
theory was led to inquire into the ways in which 
a wide range of factors affected the polis. For 
example, it not only examined the institution of 
private property and its relationship to inequality 
and class conflict but tried to specify the different 
effects which different degrees of inequality and 
conflict produced. Thus, theory undertook to pro- 
vide explanations of what caused certain states 
of affairs and not others and of what might cause 
other states of affairs. The task of explanation also 
accomplished one other objective: it provided cues 
indicating what among the welter of phenomena 
was significant or relevant, and in this way it sug- 
gested what it meant to think in political terms. 

(4) The notion that a political society consti- 
tuted a whole served as the basis for the abstract 


idea of an order. No idea in the history of theory 
has exercised greater influence; it is the source of 
many later conceptions of balance, equilibrium, 
stability, and harmony. To conceive of a political 
society as an order was to conceive of it as having 
a discoverable structure of one kind rather than 
another, depending on the arrangement of func- 
tions, the relationships among various subgroups 
and associations, and the institutional forms in 
existence. But, as, the classical discussion of politi- 
cal disintegration suggests, every order had an ele- 
ment of the problematic, because every order 
necessarily tended to the advantage of some and 
to the disadvantage of others. Consequently, theory 
was preoccupied with analyzing the sources of con- 
flict and with trying to enunciate the principles of 
justice which might guide the political association 
in discharging its distributive function of assigning 
material and nonmaterial goods in a context of 
competing claims. 

There was an obverse side to the concern with 
order. Like most forms of inquiry, theory was a 
response to problems, not so much problems of an 
intellectual kind as ones arising from perceived 
derangements in the world. The attempt to explain 
disorder led classical theory to develop the basic 
political vocabulary of diagnosis: instability, an- 
archy, anomie, and revolution. Here, too, investi- 
gation centered upon the conditions and causes of 
instability, adhered to a comparative approach, 
and was moved by the same therapeutic motiva- 
tion that governed theorizing about order. ; 

(5) From the beginning, classical theory in- 
sisted upon the significance of comparative studies 
for supplying a more comprehensive form of ex- 
planation and a wider range of alternatives. In 
order to cope with the many and diverse phe- 
nomena introduced by comparative studies, clas- 
sical theory developed a classification for political 
forms (e.g., monarchy, aristocracy, democracy, 
their variants, and their perversions) and a set of 
concepts which enabled the theorist to place com- 
parable phenomena side by side. Concepts such as 
law, citizenship, participation, and justice were 
used to order the relevant phenomena, thus Su 
paring the way for an explanation which wou. 
account for differences or similarities. 

(6) The theoretical imagination of the cla: A 
writers felt challenged more by the poe, A 
political phenomena disclosed by comparison t z 
by the regularities. This response was rooted in 3 
moral outlook which conceived of a constitution n 
a manifestation of a particular culture. Each ee 
stitution represented distinct beliefs pabout he 
ordering of society, the treatment of individu 
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and classes, the possession and distribution of 
power, the qualifications for participating in politi- 
cal deliberations, and the promotion of certain 
collective values. Theory undertook to appraise the 
various constitutional forms, to determine the form 
most suitable for a particular set of circumstances, 
and, above all, to decide whether there was one 
absolutely best form. 

(7) The quest for the absolutely best form of 
polity, which was among the most important pre- 
occupations of classical theory, had a profound 
bearing on the practice of theory, but it has been 
greatly underestimated and often dismissed as vi- 
sionary and utopian. It reveals, as perhaps nothing 
else can, the intellectual boldness and radicalism 
of classical theorizing; it deserves comparison with 
another achievement recorded by an ancient his- 
torian: “Anaximander the Milesian, a disciple of 
Thales, first dared to draw the inhabited world on 
a tablet.” The creation of ideal states was a way 
of teaching the fundamental element of theoriz- 
ing: the reduction of the world to manageable 
proportions and its simultaneous reassembling in 
a new way so that others could see the concate- 
nated relationships of the whole. Far from being 
an idle pastime, the projection of ideal states pro- 
vided an invaluable means of practicing theory 
and of acquiring experience in its handling. 

The preoccupation with the best form of gov- 
ernance was based upon fundamental conviction 
about the purpose which ought to govern theoriz- 
ing, Plato’s query “Can theory ever be fully real- 
ized in practice? Is it not in the nature of things 
that action should come less close to truth than 
thought?” (Republic, 473B) implicitly rejected the 
assumption that a theory ought to be a verbal 
miniature of an existing state of affairs in which 
the theoretical statements corresponded to the way 
things appeared to observation. Instead, classical 
theorizing hoped to effect an alliance between 
thought and action, which would lead to the world 
becoming the embodiment of a theory. This was 
exactly opposite to what became the main motive 
of theorizing inspired by modern science, which 
was to make theory into a miniature of the world. 

To summarize the main features of the classical 
paradigm: The quest for political knowledge was 
an attempt to organize, explain, and, hopefully, 
master the phenomenal world. A theory embodied 
an organization of concepts which often were re- 
finements of notions in everyday use, such as 
justice, equality, and law. By means of logical 
analysis and observation, these concepts were con- 
nected to form a unified construction. In binding 
together a set of concepts—for example, justice, 
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law, authority, and citizen—the “facts” of experi- 
ence and observation, as well as the constructs of 
everyday thought, were heightened, refined, and 
modified, and, ultimately, reaffiliated in a new 
way by the colligative relations established by the 
theory. This procedure of reassembling the politi- 
cal world was governed by the conviction that it 
could be better understood and explained by ex- 
posing its interconnections and that successful 
action must be preceded by a knowledge which 
simultaneously explained things as they were and 
stated what they should be. 

This approach may entail a logical confusion 
between description, explanation, and prescription 
or between empirical explanation and normative 
discourse. Without trying to evaluate this objec- 
tion, we must try to understand why the Greeks 
proceeded as they did. In Plato's formulation, re- 
liable knowledge was identified with knowledge of 
the forms. A form (eidos) was immaterial, im- 
mutable, and universal. It was apprehended by an 
arduous process of thought (the dialectic) which 
gradually progressed from reliance upon sense ex- 
perience and conventional beliefs to the contem- 
plation of pure form. Aristotle retained the eidos 
but modified it with the assumption that forms 
revealed themselves dynamically over time and 
that an intimate connection existed between forms 
and observable realities. “Observation tells” and 
“experience shows that” were favorite expressions 
of Aristotle. He held that for each class of things 
there was a nature or structure which was seeking 
realization through the matter composing it; the 
refractory nature of matter often prevented the 
perfect fulfillment of the form and, under certain 
circumstances, it was possible for art to supple- 
ment and assist nature. 


Political theory and political reality 

The vexed problems surrounding the Platonic 
and Aristotelian doctrine of forms—whether they 
are “true” or whether it makes sense to ask whether 
they are true—cannot be treated here. What is 
important to the idea of theory is the question 
which the doctrine of forms tried to answer: What 
must the world be like to permit a theory to be 
true? The classical response was that it is a world 
arranged into a hierarchy of forms. Although later 
theorists abandoned the classical formulations, 
they relied on other ideas of what the world or 
man must be like, in order to render political 
knowledge a possibility. Thus, Machiavelli as- 
sumed that “men have, and always have had, the 
same passions, whence it necessarily comes about 
that the same effects are produced” (Discourses 
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Upon Livy m, 43). Hobbes postulated “a general 
inclination of all mankind, a perpetual and rest- 
less desire of power after power, that ceaseth only 
in death” (Leviathan, chapter 11). Tocqueville 
found his starting point in the “social condition” 
of man rather than in certain traits of human 
nature: “the gradual development of equality,” 
which he traced back to the Middle Ages, is a 
fundamental “fact” which is “universal, lasting, 
constantly eludes all human interference, and all 
events as well as all men contribute to its progress” 
([1835] 1945, vol. 1, p. 6). Marx’s well-known 
conception of historical materialism provided a 
similar example of what the nature of the political 
and social world was like and of how that nature 
rendered theoretical knowledge possible. Contem- 
porary theories may use slightly different lan- 
guage, as when they refer to the “regularities of 
human behavior” or to “the expectation that the 
universe does in fact contain the essential data 
and processes that may be necessary for the solu- 
tion” of political problems (Deutsch 1963, p. 238). 
Yet the same assumption persists, that the political 
world must possess certain characteristics for 
theory to be possible. 

Although each theory rests on some notion of 
what the world is like, this knowing in itself is 
not an automatic response, but a decision. A 
theory is preceded by, and is a working out of, a 
decision to study political life in one way rather 
than another. Whether it is the classical way of 
dialectical inquiry, the Machiavellian way of jux- 
taposing contemporary and ancient practices, the 
Hobbesian procedure of developing axioms about 
human nature, or the Marxist search for the dy- 
namics of historical development, every theory 
represents a commitment to a particular way of 
viewing political realities, a particular method of 
inquiry, a particular language or way of talking 
about political subjects, and a particular distribu- 
tion of emphasis indicative of what the theorist 
deems important, The paradox that is involved in 
this enterprise—and it is a paradox common to 
all forms of theorizing, not just to political theo- 
rizing—is that while aiming at a complete under- 
standing of the subject matter of politics, it is 
deliberately selective, that is, it omits some matters 
and exaggerates others. By a complete understand- 
ing of politics is meant the ancient and persistent 
attempt to grasp the political society in the round, 
so to speak, and to explain its workings as a uni- 
fied whole. To achieve this, the theorist has been 
compelled to select what is significant and relevant 
and, above all, to reduce the world to intellectually 
manageable proportions. An important part of this 


procedure involves the establishing of perspective 
so that phenomena are reduced in scale, some 
even eliminated altogether, and others enlarged, 
The perfect expression of this procedure is a meta- 
phor employed by Tocqueville: 

My present object is to embrace the whole from one 
point of view: the remarks I shall make will be less 
detailed, but they will be more sure. I shal perceive 
each object less distinctly, but I shall descry the prin- 
cipal facts with more certainty. A traveler who has just 
left a vast city climbs the neighboring hill; as he goes 
farther off, he loses sight of the men whom he has just 
quitted: their dwellings are confused in a dense mass; 
he can no longer distinguish the public squares and 
can scarcely trace out the great thoroughfares; but his 
eye has less difficulty in following the boundaries of 
the city, and for the first time he sees the shape of the 
whole. . . . The details of the immense picture are lost 
in the shade, but I conceive a clear idea of the entire 
subject. (1835, vol. I, chapter 18) 

This same paradox of wholeness and distortion 
was present in Hobbes's Leviathan, the civil polity 
of Locke, and Rousseau’s model community and 
can be found as well in contemporary notions of 
a political “system” or of an “input-output model.” 

According to our preceding discussion, a theory 
is a complex way of organizing, seeing, explain- 
ing, and altering the world. This view suggests 
that the difference between one theory and an- 
other resides in the handling of these four ele- 
ments. To these elements one more must be 
added in order to complete the considerations: 
each theory presupposes a notion of what is plaus- 
ible or what is required for the theory to be ac- 
cepted as true. The briefest glance at the history 
of political theory reveals an astonishing variety 
of beliefs concerning what makes for plausibility: 
Aristotle’s reliance upon reasoning and observa- 
tion, Machiavelli's appeal to the facts of history, 
Hobbes’s abstract and geometrical axioms, Locke’s 
reliance upon natural law and common sense, 
Marx’s “scientific” laws of history, and the con- 
temporary appeal to “empirically verifiable hy- 
potheses.” Each of these is not only an appeal to 
a particular conception of plausibility but is ad 
companied by a conception of what is considered 
implausible. For example, Hobbes ridiculed Aris 
totelian forms of proof, while today’s theorist may 
be equally harsh on “metaphysical erpin p 

There are many disturbing questions raised i 
these considerations. Does each theory present U 
with a different political world? Is political T 
a bedlam of subjectivity and relativism? How ae 
one decide whether one theory is truer than $ 
other? Is the history of political theory merely A 
succession of different theories, instead of succe: 
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sive additions to our knowledge and understanding 
of politics? These questions cannot be answered 
here, even assuming that they can be answered 
satisfactorily at all; what can be done is to sug- 
gest some of the relevant considerations by exam- 
ining aspects of the questions as they have ap- 
peared in earlier and more recent theories. 

A useful case in point is the political theory of 
the early Christian writer St. Augustine, who, it is 
sometimes alleged, was not a political theorist at 
all. The grounds for this allegation, however, us- 
ually rest on a different notion of what is plaus- 
ible. sometimes it is said that Augustine appealed 
to revelation rather than reason, that he was un- 
systematic, or that his arguments were neither 
confirmable nor refutable by logic or fact. Obvi- 
ously, these criticisms themselves presuppose an 
agreement about what shall constitute acceptable 
proof and what makes “sense.” They also presup- 
pose a different kind of world in which theory is 
possible—a world in which God is not the imma- 
nent presence that he was for Augustine, a world 
in which supernatural forces have ceased to be 
credible. 

Yet the example of Augustine is important, not 
for its supernaturalism, but for the way in which 
it illuminates recurrent characteristics of theoriz~ 
ing. In the first place, Augustine made it clear that 
he was offering not simply a list of criticisms but 
a new and comprehensive explanation which de- 
liberately contradicted the prevailing one in funda- 
mental respects. He spent a great many pages 
criticizing the inadequacies of classical theory as 
transmitted by Roman authors—especially Cicero 
—and arguing that it had ceased to provide a 
satisfactory or plausible explanation of the politi- 
cal world. 

Second and equally crucial, he was contending 
that the political world itself had changed. Classi- 
cal theory could not truly account for the novelty 
of Christianity or for its implications. The broader 
bearing of this point cannot be stressed too much. 
It suggests one of the crucial problems which fre- 
quently bedevil the political theorist and render 
his task quite different from that of the scientific 
theorist. For the physicist, the world does not 
change, but his way of perceiving it may, as the 
contrast between Newton and Einstein shows. For 
the political theorist, the world itself changes, as 
is shown by a comparison of the Greek polis with 
the Hellenistic and Roman empires (or feudalism 
with the nation-state). Thus, political theory €x- 
hibits a twofold complexity: theoretical perspec- 
tives change in response to a changing political 
world and theoretical perspectives can differ even 
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when viewing the same world, as is demonstrated 
by contrasting the views of Paine and Burke regard- 
ing the French Revolution [see the biographies of 
BURKE and PAINE]. 

Third, as a result of the growth of Christianity 
in the ancient world, politics came to be viewed 
from a different perspective and through different 
categories, such as sin and grace. The shift in 
perspective, concept, and language resulted in a 
new way of talking about politics and political 
orders, one which extended to new notions of 
space (universal rather than in terms of mon- 
archies or empires) and of time (providential and 
apocalyptical rather than cyclical). Despite all 
these differences, Augustine retained the ancient 
theoretical purpose of trying to organize and 
account for the world. His De civitate Dei con- 
tained an explanation of human behavior, social 
conflict, imperialism, and war. Above all, it is 
one of the most sustained analyses of the inter- 
connections of order and the causes of disorder 
that has ever been formulated in Western thought. 
Despite its very different notions of plausibility, 
the theory constituted a mighty effort at account- 
ing for the way things appeared, what caused 
events to happen in the way that they did and men 
to behave as they did. Like all theories, it did not 
purport to discover new facts or provide new in- 
formation, but rather—by the use of new con- 
cepts, such as original sin, predestination, and the 
symbolism of the “two cities”—to supply a new 
interpretation of the facts. The classical writers 
did not need to be told by Augustine that men 
were wicked, any more than Hobbes’s contempo- 
raries needed to be told that political disobedience 
was a widespread fact. The function of theory is 
not to amass new facts but to disclose hitherto 
unsuspected relationships between them. This dis- 
closure is achieved by looking at the facts differ- 
ently—that is, from a new theoretical perspective. 

When we examine Augustine's theory it is ap- 
parent that we are witnessing a profound de- 
parture from the classical paradigm of Plato and 
Aristotle, The change involves a shift in perspec- 
tive, language, and a redistribution of emphasis 
among the concepts in the Augustinian theoretical 
network. The continuities are not easy to locate, 
for the change is a radical one, extending to a 
wholesale redefinition of politics and to what is 
properly political. For Augustine unredeemed man 
is incapable of developing a politics which will not 
be deeply streaked by conflict, aggression, and 
violence. At the same time, he strips the political 
order of the proud function assigned to it by the 
classical writers, the maintenance of an educative 
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ethos (paideia) which will assist in the perfection 
of men. [See the biography of AUGUSTINE.] 

The re-evaluation of central political concepts 
has occurred many times in the history of political 
theory. For example, Plato defined political activity 
largely in terms of eradicating conflict; Polybius 
and Machiavelli, on the other hand, perceived a re- 
vitalizing force in contending political groups and 
interests; Madison deplored conflict but figured 
that the costs of eradication were excessive [see 
the biography of Mapvtson]. Or, to take another 
example, the scope of what is political has also 
varied enormously. For Plato the political com- 
prised all significant relationships in society; for 
many liberal writers of the eighteenth and nine- 
teenth centuries, what was political was restricted 
to the narrow functions of enforcing law, main- 
taining peace, and defending the society from 
foreign invasion and domestic authority, while 
others have treated it like some dangerous Gulliver 
and bound it with innumerable legal restraints. 
Some writers have, like Rousseau, exalted political 
participation into the highest form of social ac- 
tivity, while others have described it as of negli- 
gible significance, and still others have praised the 
opposite value of “political apathy.” Although this 
state of affairs, where the same concept is as- 
signed different weight and significance, is often 
deplored for preventing the cumulative growth of 
political knowledge, it is important to recall that 
as long as one of the major functions of a theory 
is to account for events happening in the world 
and as long as that world is a changing one, the 
diversity of meanings is inevitable. It might even 
be argued that far from being a deficiency, it is one 
of the achievements of the language of Western 
political theory that it has been sufficiently elastic 
and adaptable to be used for a wide variety of 
circumstances. 

Although every theory strives to present a 
rounded picture of the political world in terms set 
by the theory itself, the danger of an autarchic 
condition in which each theory presents a surface 
impenetrable to criticism is minimized by certain 
practices inherent in theorizing. Most of the major 
political theories of Western history have disputed 
ground held by previous theories. Thus, Augustine 
attacked the classical writers; Machiavelli tried to 
dismiss the utopian tradition; Hobbes disputed 
Aristotle and the Schoolmen; Marx contended with 
liberal and radical theories: and contemporary 
writers feel compelled to combat the “ancient 
myth about the concern of citizens with the life 
of the democratic polis” (Dahl 1961, p. 281) or, 
alternatively, the nineteenth-century liberal theory 


of rational citizenship (Berelson et al. 1954, pp, 
306 ff.). The act of disputation itself necessitates 
the acceptance of some common matters; other- 
wise the disputants risk being ignored. Thus, 
Augustine accepted much of the vocabulary estab- 
lished by his predecessors; his discussion of power, 
authority, government, and law presupposed an 
understanding rooted in a pre-existing tradition 
of political discourse. Like all previous and subse- 
quent theorists, Augustine introduced new mean- 
ings, yet much of his thought followed customary 
significations and, hence, rendered itself suscep- 
tible to external criticism. Further, Augustine ad- 
dressed himself to certain common or public facts 
—for example, the sack of Rome, the decline of 
Roman power—and hence it was possible to scru- 
tinize his construction of the facts from a very 
different theoretical vantage point. What was true 
of Augustinian theory was true also of later theo- 
ries: the particular perspective of a theory does 
not isolate it from criticism and judgment. A tradi- 
tion of common conceptual language with fairly 
stable meanings combined with the existence of 
public facts makes it possible to appraise every 
theory and assign it worth. 

It is often said that a theory should be ap- 
praised in terms of criteria regarding the consist- 
ency and clarity of definitions and the adherence 
to logical reasoning. Granting the importance of 
these criteria does not advance us very far. They 
are too formal to provide anything more than 
minimal requirements. Far more complicated and 
interesting problems are raised by the criterion of 
“adequacy,” which may refer either to the suffi- 
ciency of evidence, the methods of proof, the 
range of matters included, or the solutions pro- 
posed. Each of these raises different kinds of ques- 
tions and each remains a continuing source of 
controversy. What deserves notice, however, is 
that the purposes and conventions which inform 
a particular theory may enable it to fulfill some 
criteria, while preventing it from fulfilling others. 
For example, Machiavelli may have achieved his 
purpose of describing the actual behavior of rulers, 
but his theory is almost wholly useless for the 
purpose of analyzing the obedience of citizens, a 
subject which dominated political discussion dur- 
ing the last half of Machiavelli’s century. 


The quest for scientific political theory 

Since the seventeenth century, there have been 
repeated expressions of dissatisfaction with D 
state of political theory. These have been inspirec 
mainly by the example of progress in modern m 
entific knowledge. The complaint, as voiced bY 
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Hobbes and later by others, has contrasted the 
steady, incremental advance of scientific knowl- 
edge with the seemingly static condition of political 
knowledge and has located the cause in the failure 
of political theory to adopt scientific methods of in- 
quiry. It is essential, however, to separate two 
problems: the adoption of scientific methods— 
whether mathematical, empirical, or a combina- 
tion of the two—and the problem of incremental 
knowledge. It is too easily assumed that the sys- 
tematic exploitation of scientific inquiry is the only 
method for steadily adding to our supply of knowl- 
edge. Science—particularly its method of organiz- 
ing inquiry—may be the most efficient and power- 
ful means, but it is not the only means. 

The history of political theory supplies an im- 
portant illustration of this. If, as it is sometimes 
suggested, the progress of science has been the 
result of the careful and exhaustive working out 
of a given theory in order to discover how much 
it can account for and what problems it presents, 
a roughly analogous situation has occurred over 
long stretches of the history of political theory. 
It is no exaggeration to say that the theories of 
Plato and Aristotle have served longer as models 
of political inquiry than any comparable pair of 
theories in the natural sciences. Not only did each 
of these writers found research institutions which 
were occupied with amassing new facts and clas- 
sifying them, but both theories provided the frame- 
work whereby writers of later centuries tried to 
account for new phenomena and to resolve new 
problems, For example, during the struggles be- 
tween the papal and secular powers of the Mid- 
dle Ages, apologists for both sides attempted to 
apply Aristotle's theory to a state of affairs that he 
had never anticipated. Although modern textbook 
writers are inclined to deride this achievement by 
pointing out that Aristotle's theory had been de- 
signed to explain the polis, not the imperium or 
sacerdotium, it is well to recall that among scien- 
tific theories, one of the basic criteria for prefer- 
ring one theory over another is its capability for 
being extended to cover phenomena not antici- 
pated by the theory. The history of political theory 
is full of such examples of viable theories which 
have been amplified by writers who have followed 
in the footsteps of the giants: Augustinianism and 
Thomism have persisted as the basic theoretical 
outlook for countless Roman Catholic theories; 
Locke’s theory was used to analyze conditions in 
Revolutionary America and Revolutionary France; 
Marxism has been extended to cover diverse prob- 
lems of colonialism, imperialism, and the newly 
emerging nations. 


The foregoing suggests that in the history of 
political theory there has been a succession of 
theories which have been fruitfully used for long 
periods of time to explain and account for political 
events. The same history also shows that dissatis- 
faction with the capabilities of a theory has led to 
new forms of theory. Dissatisfaction, for the most 
part, has issued not from intellectual reasons 
alone; that is, it has not been their lack of logical 
consistency, their infidelity to fact, or their sterility 
that has caused theories to be abandoned, but 
rather changes in the political world, giving rise 
to problems that seem to be insoluble by means of 
accepted theories. Moral and political motives, not 
purely intellectual ones, have been the primary 
inspiration for new theories. The first great ex- 
ponent of scientific politics was Hobbes, and yet 
every one of his theoretical writings originated in 
his avowed purpose of settling the basis of au- 
thority and providing secure grounds for obedience 
during an age of revolution. Only during the past 
half century has scientific politics sought to dis- 
entangle itself from moral and political concerns, 
and even now there are sufficient protests among 
contemporary political scientists to make naive dis- 
engagement seem disingenuous. 

Despite these continuities, however, the quest 
for a scientific theory of politics has altered the 
character of theorizing in several significant ways. 
Before examining these, it is necessary to note 
the conditions under which such a quest became 
intellectually compelling. This means asking not 
only what view of the world was held which made 
scientific theory possible but also, more crucially, 
what kind of political world came into being which 
rendered the application of scientific methods 
fruitful and its criteria of truth plausible. 

Prior to the sixteenth century, the political 
orders of Europe were characteristically decentral- 
ized. Uniform legal systems were a rarity, and cen- 
tralized bureaucracies still in the formative stage. 
Order itself was not a presupposition of daily life 
but a precarious achievement. Habits of civility 
were dictated by local loyalties, and the whole 
complex of national duties, rights, and codes of 
deference were only slowly being developed. In 
brief, political life lacked those qualities of uni- 
formity, regularity, routine, and settled expecta- 
tions that we now take for granted. 

Against this background, it is not surprising 
that political theory had eschewed making regu- 
Jarities in political behavior and processes the 
basis of theorizing and had, instead, been more 
intent on establishing them. To the extent that 
scientific theory presupposes order and regularity 
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in phenomena, political societies prior to the es- 
tablishment of the modern centralized, bureaucra- 
tized state could not furnish the necessary basis 
for scientific theorizing. To be sure, Machiavelli 
ridiculed those who denied such regularities—“as 
if the sky, the sun, the elements, men, were 
changed in motion, arrangement, and power from 
what they were in antiquity” (Discourses Upon 
Livy 1, preface)—and both he and Hobbes con- 
tended that under specified conditions it was pos- 
sible to predict how men would respond, because 
it was possible to know the mainsprings of human 
motivation [see the biography of MACHIAVELLI]. 

Indeed, Hobbes had made the first sustained at- 
tempt to incorporate politics into a scientific ex- 
planation which would comprehend matter, man, 
and society. Basing his theory upon the laws of 
motion, which he believed were operative uni- 
versally, he proposed a deductive science of poli- 
tics, which would proceed from simple to more 
complex forms of social motion. The concept of 
the state of nature provided an imaginary condi- 
tion in which the laws of human psychology were 
best observable; it was then possible to reason out 
the structure of political society which would best 
accord with the laws of psychology governing hu- 
man behavior. Hobbes’s theory was hardly empiri- 
cal by contemporary standards, yet it remains the 
first systematic effort to assimilate political to 
scientific and mathematical reasoning. At the 
same time, the Hobbesian political order was in 
no sense a replication of what the political world 
was like, but rather a projection of what it must 
or should be. Stated differently, Hobbesian theory 
was not a report of regularities but an attempt to 
create them, as was suggested by his wistful re- 
mark (Leviathan, chapter 31) that “this writing 
of mine may fall into the hands of a sovereign” 
who would put the theory into practice. [See the 
biography of Hosses.] 

In the three centuries that have elapsed since 
Hobbes wrote Leviathan, the major Western so- 
cieties, on the whole, have developed and enforced 
complex mechanisms for directing and ordering 
human behavior, Bureaucracies, legal systems, the 
police, together with national conceptions of citi- 
zenship and of authority, have conjoined to pro- 
duce sustained regularities in political life, far 
greater than in any previous era. At the same time, 
the development of industrialization has added a 
powerful element to the routinization of life, and 
the appearance of the modern city is the symbol 
of a condition wherein men live lives of remark- 
able similarity and uniformity. This list could be 
extended to include the growth of standardized 


education and the spread of popular literature, but 
the point would be the same: the destruction of 
localism and of discrepancies in wealth, educa- 
tion, culture, and power and their replacement by 
uniformities have created the conditions for a 
science of politics. 

In the eighteenth century the popularization of 
Newtonianism gave a strong impetus to the search 
for “laws” governing society and politics, laws 
which would express the true political and social 
“nature of things” rather than the will or prefer- 
ences of rulers, priests, and aristocrats. One of the 
earliest and most ambitious attempts at a science 
of politics was made by Montesquieu. Defining 
“law” as “the necessary relations arising from the 
nature of things,” he tried to show that various 
types of political societies were an embodiment 
of necessary relationships arising out of the con- 
ditions of that society. His list of conditions was 
based on geography, economic occupations, re- 
ligion, government, and mores; these operated as 
forces which determined the kinds of laws, prac- 
tices, and institutions which would prevail in a 
particular society. Each society exhibited a definite 
“set,” or “spirit,” in the way that it fitted its rela- 
tionships to its conditions; each society was, in a 
sense, a system of interrelationships in which po- 
litical, social, religious, and economic institutions 
sustained and modified each other. When rightly 
adjusted, the system represented a kind of New- 
tonian equilibrium. Although Montesquieu favored 
a constitutional monarchy tempered by aristocratic 
privileges and corps intermédiaires, the compara- 
tive basis of his theory tended to sanction all po- 
litical forms as the natural products of adaptation 
to environment, save only for despotism, which he 
viewed as a monstrous aberration. [See the biog- 
raphy of MONTESQUIEU.] 9 

By showing how the complex facts of a society 
were informed by a common structure, Montes- 
quieu’s typology of political systems permitted com- 
parison between political societies. During the last 
half of the eighteenth century, however, theorists 
tended to follow a different path toward a science 
of politics. They turned to the nature of man 
rather than of things and found psychological laws 
of human behavior, usually involving some prin: 
ciples of attraction and aversion or pleasure P 
pain and, on this basis, erected theories ve 
claimed to be empirical and universal. Yet, in t X 
case of the major writers in this tradition, suc. 
as Helvétius and Holbach in France and pent 
in England, the theories remained dominantly 3 
priori, with facts serving primarily as allo 
[see the biography of BENTHAM]. Above all, ther 
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was nothing scientifically neutral about their for- 
mulations. All three writers were critics and re- 
formers who were mainly intent on having the 
political world reflect their theories rather than 
having their theories reflect the world. 

Although many of the major theorists of the 
nineteenth century—such as Hegel, Tocqueville, 
and J, S. Mill (excluding the Logic)—practiced 
theory in ways dominantly traditional, there was 
a growing tendency toward making politics a sci- 
entific study [see the biographies of HEGEL; MILL; 
TOCQUEVILLE]. Nevertheless, it is not easy to 
characterize this tendency, because so many di- 
verse understandings of science were competing. 
For Comte and his followers, “positive” science 
was the search for the smallest possible number 
of “invariable natural laws” which controlled “all 
phenomena” [see the biography of ComTE]. To the 
social Darwinists science meant classifying phe- 
nomena according to categories of evolution and 
struggle made famous by Darwin and now adapted 
to social and political uses [see SoclaL DARWIN- 
IsM]. For the Marxists, science meant extending 
the “necessary” laws of history, which were laws 
of economic development, to explain the past and 
present and to predict the future [see MARXISM]. 

Toward the close of the nineteenth century, 
however, the social sciences had sufficiently de- 
veloped to enable men like Durkheim and, some- 
what later, Max Weber, to fight clear of most of 
these earlier disagreements about the nature of 
science and to lay down some basic ideas about 
what a theory should be [see the biographies of 
DurkHEIM and WEBER, MAX]. 

Twentieth-century concepts of political theory 
have continued to evolve beyond the point attained 
by Weber and Durkheim, particularly by borrowing 
from the fields of social psychology and psycho- 
analysis and by seeking to utilize mathematics and 
statistics, The result has been a profound change 
in the notion of what theory is and what are the 
appropriate conventions governing its use. Most 
theoretical efforts now aim at establishing knowl- 
edge by methods that are empirical and quanti- 
tative; to a considerable degree, they aspire to a 
form of knowledge that will be precise, rigorous, 
verifiable, and predictable. Like its predecessors, 
contemporary theory embodies a decision about 
what is worth studying, how it shall be studied, 
and what shall be accepted as knowledge. 


Contemporary political theory 

Although it is risky to attempt a characteriza- 
tion of contemporary notions of theory, some of 
the main features may be indicated. It should be 
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mentioned that sharp disagreements exist, not 
only among those who purport to share the same 
scientific orientation, but between them and others 
who remain loyal to traditional methods. What 
follows is an attempt to designate the main char- 
acteristics of what is new—that is, of the scientific 
conception of theorizing. 

The alliance with philosophy has been severed, 
at least temporarily. Although there are some signs 
of attempts being made to utilize contemporary 
philosophical techniques of language analysis and 
its variants, most theorists proceed on the as- 
sumption that the adoption of scientific methods 
obviates the need for elaborate philosophical 
techniques. 

This development has been accompanied by the 
abdication of the traditional attempt to formulate 
synoptic pictures, or epitomes, of the whole so- 
ciety. Although conceptions of a “political system” 
are widely utilized, these are avowedly artificial 
constructions which are not intended to represent 
actual societies either literally or in the transfigur- 
ing way of older theories. They are artifacts whose 
sole justification is their utility, which remains a 
moot point. The contemporary conventions reject 
so-called grand or comprehensive theories and 
prefer to pursue testable hypotheses. According to 
one representative viewpoint, “Whether the propo- 
sition is true or false depends on the degree to 
which the proposition and the real world corre- 
spond” (Dahl 1963, p. 8). “Analysis” has tended 
to replace “theory” as the preferred expression; 
this change is accompanied by a determination to 
utilize whatever methods appear to have scientific 
authority; survey data, psychological and socio- 
logical findings, decision making, bargaining, 
communications theories, etc. Those of an empiri- 
cal and quantitative persuasion frequently express 
the hope that by patient and systematic investiga- 
tion it will be possible to establish tested proposi- 
tions of ever-increasing generality and that, grad- 
ually, an interconnected and logically consistent 
series of propositions will culminate in a general 
theory of universal validity. 

As we have previously noted, the scientific im- 
pulse in eighteenth-century theory took the form 
of seeking “laws” which would embody the true 
“nature of things.” Twentieth-century theory has 
perpetuated the quest but has abandoned the as- 
sumption that there is any underlying “nature” to 
political phenomena. In this respect contemporary 
theory is in the tradition of Hobbes, who was the 
first to launch a systematic attack against the 
classical notion that there was a natural structure 
appropriate to every political form and that man 
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possessed a nature which required a political asso- 
ciation for its fulfillment. The divorce of politics 
from nature and of theory from nature did not 
become complete until the twentieth century. One 
effect has been to liberate the theoretical imagina- 
tion, for instead of seeking to approximate natural 
structures, it has felt free to experiment with all 
manner of artificial models and constructs, insist- 
ing that such artifacts are to be judged solely by 
their “pay-off” in explanatory or predictive power. 
Accordingly, the “real world” is discussed in very 
different terms, usually as a kind of whirl of phe- 
nomena to which a theory is “plugged in” in an 
admittedly arbitrary way. Theory no longer seeks 
to grasp the political world synthetically; rather 
it seeks to slice into it. The political world is, as 
one contemporary theorist has described it, “a re- 
source” which is to be exploited on the expectation 
that the universe does contain “the data and proc- 
esses” necessary to the solution of theoretical 
problem (Deutsch 1963, p. 238). Theorizing tends 
to be sustained by the belief that the political 
world exhibits sufficiently recurrent regularities 
and repetition of causal sequences to allow for the 
testing of generalizations. Theory thus becomes 
the search for what is “repetitious, ubiquitous, and 
uniform” (Easton 1965a, p. 15). “It is the task of 
theory to detect in the welter of the unique facts 
of experience that which is uniform, similar, and 
typical” (Morgenthau 1959, p. 18). 

The concern with regularities forms part of 
an outlook which contrasts sharply with the tra- 
ditional preoccupation with the dangers and de- 
rangements besetting the political order. Tradi- 
tional theory had been powerfully influenced by 
the hope of providing knowledge for action; its 
language, concepts, and values were primarily 
those of the actor. Contemporary theory, with its 
emphasis upon objectivity, scientific detachment, 
and testable hypotheses, tends to be governed by 
the values of inquiry rather than of potential 
action, This appears most strikingly in systems 
theory, where conceptions such as equilibrium, 
homeostasis, inputs and outputs are, whatever 
their value for research, wholly irrelevant to 
action. For the present, at least, theory appears 
to have surrendered the critical function, which 
has been one of its dominant characteristics since 
Plato. 

Throughout the first half of the present century, 
most advocates of a scientific approach to theory 
accepted a naive distinction between “facts” and 
“values” and contended that science could not pro- 
nounce on questions of preference but only deal 
with matters of fact. Gradually, this austere posi- 
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tion has been relaxed into an attitude which in- 
sists that scientific knowledge can contribute to 
the clarification of choices, by indicating that 
some choices are unfeasible and that others are 
too costly. It is still insisted, however, that “new 
theory tends to be analytic, not substantive, ex- 
planatory rather than ethical . . .” (Easton 1965a, 
p. 22). It is fair to say that most scientifically 
minded theorists today are bored by the fact- 
value controversy and are trying to negotiate an 
armistice along the lines of a division of political 
theory into “empirical” and “normative” theories. 
The former would represent theorizing based 
upon scientific methods of collecting and classify- 
ing data and of testing hypotheses by statistical 
or mathematical methods. Its goal would be the 
empirically verified hypothesis. To normative the- 
ory would be assigned an ill-assorted collection 
of activities whose common element would be a 
lack of scientific methods. It would include all 
questions regarding values, all historical studies, 
and all conceptual inquiries. 

One would be hard-pressed to concoct a better 
solution for the sterilization of political theory. Not 
only does it rest on a profound ignorance, or even 
arrogance, regarding the nature of traditional 
theories and their subtle blend of empirical ob- 
servation and theoretical speculation, but more 
strikingly, it flies in the face of the kind of ex- 
perience which the proponents of such a divi- 
sion would be expected to be the first to recognize 
—namely, the history of theory in the natural 
sciences. Historians of science are generally agreed 
that few, if any, of the great scientific theories 
have been the product of strictly empirical zS 
search. Such a belief is a remnant of the Baconian 
myth concerning the primacy of induction. It has 
also been argued by philosophers of science, such 
as Popper, that theories which are shown to be 
wrong can still be of inestimable value, for ea 
nation of them allows us to be quite precise in 
locating the source of the error. But there are 
other reasons which are equally telling and should 
caution against a too easy acceptance of the dis- 
tinction between empirical and normative ea 

The assumption that there can be some pna 
purely empirical theory derives from the ee 
positivist assumption that facts can be known oe 
curately if only we could lay aside our ean 
and biases. Although it is always admitted 
facts rest on a principle of selectivity, this ai 
sion does not dispose of the issue. The on 
is that we know facts only by means of e 
guistic-conceptual apparatus with which we ote 
and perceive; facts are only interesting when 
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served through theoretical concepts, and hence, 
facts present themselves as “theory-laden.” The re- 
sult is that when the empirical theorist employs a 
procedure in which observations are matched with 
theory, his enterprise is more circular than he is 
usually willing to admit, for the observations them- 
selves represent theory-laden facts. 

It is also important to recognize that the indis- 
pensability of concepts in all forms of theorizing— 
empirical and otherwise—raises special problems 
which cast doubt upon the desirability of rendering 
empirical theory autonomous. Although the selec- 
tion of concepts for use in empirical inquiry will, 
of necessity, be determined by their empirical rele- 
vance and utility, concepts are abstractions, not 
empirical entities. Moreover, the nature of con- 
cepts shares a family resemblance with the con- 
cerns of so-called normative theory, not because 
concepts are “idealized” constructions, like “fric- 
tionless bodies” in physics, but because they possess 
the basic characteristic of all normative construc- 
tions—that is, they are discriminatory and selec- 
tive. Every concept is a distillation achieved by 
deciding to disregard some features of reality and 
to emphasize others. 

Perhaps the most important problem besetting 
contemporary political theory is not the question 
of whether theory ought to be firmly wedded to 
the methods and outlook of the natural sciences, 
but rather what version of science it will choose: 
the rigorous, fact-minded, anticonceptual view 
which believes that cumulative knowledge is the 
result of patient and dogged application of sci- 
entific methods or the view of science as an imag- 
inative undertaking, with its full share of specula- 
tion, playfulness, proclivity to error, and its ability 
to imagine worlds as yet undreamed of —an ability 
which would maintain the critical, projective 
quality that has enabled past theories to speak 
meaningfully to the quandaries of political ex- 


istence. 
SHELDON S. WOLIN 


[See also POLITICAL SCIENCE.] 
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The Encyclopaedia of the Social Sciences, pub- 
lished between 1930 and 1935, contained no special 
article on comparative politics or comparative gov- 
ernment, There is an article “Government,” followed 
by articles on individual governments; but no ex- 
plicit comparative themes are treated, nor do they 
appear elsewhere under other topics. Much of the 
great tradition of political theory, in contrast, is 
essentially comparative, classificatory, typological. 
Comparison is such an intrinsic methodological 
assumption that it is not separated out as a spe- 
cific subfield or approach. This background sug- 
gests the thesis of this article: that contemporary 
comparative politics is a movement rather than a 
subfield or subdiscipline. 

The case for comparative politics as a move- 
ment is quite persuasive. We start from a tradition 
in which comparison is an intrinsic aspect of polit- 
ical theory, then move to a situation in which it 
practically disappears along with the creative polit- 
ical theory of which it is a part, and arrive at the 
contemporary situation, in which it is a salient and 
separate part of the political science curriculum. 


The theory of democratic progress 


How can we explain this development? Perhaps 
the fate of the 


we may begin by commenting on 
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Aristotelian classification scheme and theory of 
political change in the history of political science. 
On the surface it would appear that this Aristo- 
telian macrotheoretical tradition continues straight 
on through subsequent millennia, up to and in- 
cluding Dahl's Modern Political Analysis (1963). 
And yet it would be misleading to say that the 
Aristotelian approach to comparison continues as 
a dominant intellectual construct into present-day 
political science. Enlightenment political theory 
moved away from the relativistic and cyclical ap- 
proach of Aristotle to a unilinear approach to polit- 
ical history and political development. This is 
particularly marked in British, French, and Ameri- 
can political theory, where at first democracy was 
justified as the best form of government on the 
basis of natural law and social contract, and then, 
as the democratic revolution spread, as the inevi- 
table direction of human history. Locke and Rous- 
seau are typical of the first approach to democrati- 
zation, while Tocqueville is among the first to view 
it as historically inevitable. His Democracy in 
America (1835) reflects a growing conviction that 
democratic politics is the political form of the 
future. America is the laboratory from which he 
seeks to derive some sense of the political, social, 
moral, and cultural consequences of this inevitable 
democratization. Ostrogorskii (1902) focused on 
the development of democracy in Britain and 
the United States, convinced that this was to be the 
trend of the future but deeply troubled by the 
growing elitism and bureaucratism of the mass 
political party in Britain and America. The impor- 
tant point is that Tocqueville and Ostrogorskii were 
both concerned with comparisons within the demo- 
cratic framework and were interested in non- 
democratic systems principally as base lines against 
which democratic systems can be evaluated. 

The influence of social and political setting on 
political theory is reflected vividly in the contrast- 
ing reactions to democratic development of Wood- 
row Wilson (1889) and Bryce (1921) and those 
of Pareto (1916), Michels (1911), and Mosca 
(1896). Both the Anglo-American theorists and 
the continental European theorists reject the Aris- 
totelian relativistic typology and cyclical theory 
of political change. Pareto, Michels, and Mosca! 
argue that all political rule is oligarchical or elitist, 
regardless of its formal legal or ideological charac- 
teristics, while the British and American political 
theorists see a sweeping historical movement in the 
direction of constitutional and democratic forms. 
Woodrow Wilson’s The State (1889) is an inter- 
esting syncretic product. On the one hand, it 
shows the great influence of nineteenth-century 
German political theory, with its massive ethno- 
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graphic and historical learning, and the persistence 
of the Aristotelian categories, but at the same time 
the work is suffused with an evolutionary, demo- 
cratic faith. In Bryce’s Modern Democracies (1921) 
faith turns into conviction. He speaks in his intro- 
duction of “. . . the universal acceptance of democ- 
racy as the normal and natural form of government.” 

Thus, on the eve of the development of American 
political science as a university-based, professional 
discipline, the theory of democratic progress dom- 
inated the field and justified a loss of interest in 
the classification and comparison of types of polit- 
ical systems or in the general theory of political 
change. The answers to these questions were viewed 
as self-evident, and normative speculation about 
the relative value of different kinds of political 
systems, and even empirical study of nondemo- 
cratic political systems, were pointless, or at best 
were useful, in the language employed in W. B. 
Munro's preface to his The Governments of Europe 
(1925), to make possible the “. . . comprehension 
of the daily news from abroad.” 

One might say, therefore, that it was the Enlight- 
enment itself, and in particular the parochial 
American “populistic” and “progressive” version of 
that faith, which resulted in an attenuation of 
interest in political comparison and typologies, as 
well as of the political theory of which that interest 
was a part. 

Broadly speaking, this unilinear evolutionary 
theory of political systems and political develop- 
ment characterized American political science as 
it began to develop into a substantial university- 
based discipline in the first decades of the twentieth 
century. As separate departments of political sci- 
ence began to appear in these decades, the prin- 
cipal stress in the curriculum was on American 
politics. Research and teaching proliferated around 
themes having to do with problems of American 
democracy. The field of public administration was 
essentially concerned with the development of a 
professional public service and with the introduc- 
tion of rational organizational and management 
practices in American government. Constitutional 
law was concerned principally with the conserva- 
tism of the Supreme Court and with what some 
constitutional lawyers thought of as the usurpation 
by the judiciary of law-making and constitution- 
making functions. The “real” functioning of the 
Congress and the executive also were fields of 
teaching and research interest. But the distinctive 
development during this period was the study of 
the informal, or the nonlegal, aspects of politics— 
the role of political parties, the political machine, 
the lobby and pressure groups, and the popular 
press. While students in these various fields were 


concerned with generalizing about these phenom- 
ena, their theories were based essentially on Amer- 
ican experience, and practically the whole burden 
of the research effort in the growing profession of 
political science was on American political institu- 
tions and processes. 

The older tradition of comparison and political 
theory survived in the work of such scholars as 
Carl Friedrich and Herman Finer. Both Finer’s 
Theory and Practice of Modern Government (1932) 
and Friedrich’s Constitutional Government and 
Democracy (1937) are works in the older political 
science tradition. They are comparative, and they 
treat varieties of forms of government and of gov- 
ernmental institutions and processes in the context 
of some of the great themes of political theory. But 
it is of interest that both of these scholars focus 
their work predominantly on constitutional, demo- 
cratic systems. 

This approach to political science teaching and 
research was characteristic of American political 
science in the first four decades of the twentieth 
century, roughly until the end of World War 1. 
This was the time when the profession was devel- 
oping its own departments of political science (or 
government) and when the membership of the 
American Political Science Association was expand- 
ing from around two hundred at the turn of the 
century to roughly three thousand at the end of 
World War 11. The American profession consisted 
in the main of students of American public ad- 
ministration, American public law, American polit- 
ical parties and pressure groups, Congress and the 
executive, and American state and local govern- 
ment. International relations and European gov- 
ernments were essentially minor themes, and 
students of non-Western political systems were 
oddities, working in isolation. 

The breakdown of this pattern was the conse- 
quence of the frustration of Enlightenment expec- 
tations resulting from the spread of fascism an 
communism in the 1920s and 1930s and increasing 
with the emergence of communism in the post- 
World War m period as an unambiguously com- 
peting form of modernization. Another factor 
making for the rejection of the earlier confidence 
in democratic development was the emergence a 
the post-World War 1 period of the many ne 
nations of Asia, the Middle East, and Africa, W! 
their confusing variety of cultural and structur 
patterns and developmental processes. 


The comparative politics movement 4 

p itic 

Comparative politics as a movement in a a 

science acquired momentum after Wordi Be 
(Interuniversity Summer Seminar . . - 1953; 


ring 1953; Kahin, Pauker, & Pye 1955; Macridis 
1955; Almond 1956; Heckscher 1957; Neumann 
1957; Rustow 1957; Apter 1958; Eckstein & Apter 
1963). Among the principal intellectual influences 
which fed into it are (1) the growing body of data 
on non-Western political systems; (2) the intro- 
duction into foreign political studies of concepts 
and methods that had emerged in research on 
American political processes; (3) anthropological, 
psychological, and psychoanalytic theories of cul- 
ture and personality; and (4) the concepts and in- 
sights of historical sociology and sociological theory. 

Acquisition of non-Western data. During the 
1950s young political scientists streamed into Asia, 
the Middle East, Africa, and—somewhat later— 
into Latin America, producing monographic studies 
often as sophisticated analytically as the best of 
American and European political studies. The dom- 
inance of the political science profession by Amer- 
icanists and Europeanists began to be challenged 
by a young generation of political scientists whose 
field experience was non-Western. The require- 
ments of their research gave them a sensitivity and 
sophistication in the use of sociological and anthro- 
pological methods and theories that their Ameri- 
canist and Europeanist colleagues often lacked 
(Apter 1955; 1961; Binder 1962; Pye 1962; Weiner 
1962). Parliamentary institutions, bureaucracies, 
political parties, and interest groups in these new 
and developing nations often had quite a different 
significance than they had in Western nations, 
particularly the United States. Thus, in their search 
for the effective policy-making and policy-imple- 
menting processes in these nations they were led 
to look for their functional equivalents. “System,” 
“process,” and “functional” concepts, anthropologi- 
cal field methods, and anthropological and socio- 
logical theory had a natural appeal to these students 
of non-Western political systems. 

The behavioral movement. A second signifi- 
cant influence in the development of the compara- 
tive politics movement was the behavioral move- 
ment, which had its origins in studies of American 
electoral and political processes. One significant 
channel in this process of intellectual diffusion 
was the Committee on Political Behavior of the 
Social Science Research Council, which stimulated 
the organization of the Committee on Comparative 
Politics, which in turn played an important role 
in diffusing into the field research in non-Western 
and European areas many of the insights and 
methodologies that had developed in American 
political studies. The principal contributions from 
American political studies adapted to studies of 
foreign areas were the “process” frame of refer- 
ence (Herring 1940; Schattschneider 1942; Key 
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1942; Truman 1951); the emphasis on the non- 
formal aspects of political processes—political par- 
ties, interest groups, media of communication, and 
public opinion (Ehrmann 1957; Eckstein 1960; 
Weiner 1962; LaPalombara 1964)—and the newer 
and more rigorous methodologies employed in the 
studies of elections and public opinion (Lazarsfeld, 
Berelson, & Gaudet 1944; Campbell, Gurin, & Mil- 
ler 1954), 

The culture and personality approach. A third 
influence feeding into the comparative politics 
movement was the so-called psychocultural ap- 
proach developing out of the work of Freud and 
some of his disciples and the psychoanalytically 
oriented social scientists from the 1920s on, in- 
cluding such figures as Harold D. Lasswell (1930; 
1948), Ruth Benedict (1934), Margaret Mead 
(1928-1935), Abram Kardiner (1939), Ralph 
Linton (1945), and Nathan Leites (1948), Their 
work, particularly in response to World War 1 
problems, on the German, Japanese, Russian, and 
American national characters, created a sensitivity 
among students in the comparative politics move- 
ment to these aspects of politics and public policy. 
The political culture approach in comparative poli- 
tics was greatly influenced by this psychoanthropo- 
logical literature. It sought to relate cognitive and 
attitudinal patterns in national and subnational 
populations to the characteristics and functioning 
of political systems, through the use of cross-section 
survey methods, studies of particular elite groups, 
and the like (Almond & Verba 1963; Pye & Verba 
1965). 

Historical sociology and sociological theory. A 
fourth intellectual current was that of historical 
sociology and sociological theory. In particular, the 
work of such sociological theorists as Max Weber 
(1906-1924; 1922), Ferdinand Ténnies (1887), 
Talcott Parsons, and Edward Shils (see Parsons & 
Shils 1951; Shils 1959-1960) influenced the efforts 
of some of the newer students of comparative poli- 
tics to develop theoretical frameworks capable of 
ordering and codifying the research results and the 
insights produced by this extraordinary empirical 
research effort, The system concepts of Talcott 
Parsons and of the information theorists influenced 
the work of a group of political theorists and 
students of comparative politics. These included 
the theoretical contributions of David Easton (1953; 
1965), Karl Deutsch (1963), David Apter (1965), 
Gabriel A. Almond and James Coleman (1960), 
and Lucian Pye (1962; 1966). 


Achievements and prospects 
If one speculates about the future of the com- 
parative politics movement, a number of points 
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are suggested. In the first place, the need to codify 
the growing accumulation of research on political 
systems from all of the various culture areas of the 
world and at all levels of structural differentiation 
and secularization has brought about a return to 
the classic themes of political theory. The classifi- 
cations of Almond (1956), Shils (1959-1960), 
Dahl (1963), Apter (1965), Almond and Coleman 
(1960), and others reflect a return to the Aristo- 
telian tradition—but now with greater theoretical 
sophistication and with more, and more reliable, 
information. Similarly, the concern with theories 
of political development among these and other 
authors is again indicative of a return to the classic 
theoretical problems of political change. The future 
of this classificatory and developmental interest 
would appear to lie with political theory rather 
than with a special subdiscipline of comparative 
politics. 

A second development brought about by the com- 
parative politics movement is the breakdown in the 
parochialism of the theories of various special in- 
stitutions and processes, such as bureaucracy 
(LaPalombara 1963; Riggs 1964), political parties 
(Duverger 1951; Neumann 1956; LaPalombara & 
Weiner 1966), interest groups, and the like. Here 
again, one cannot see a future in a comparative 
politics subdiscipline for these developments but, 
rather, in the more adequate development of the- 
ories of particular processes and institutions in the 
functioning of political systems. 

Third, the studies of specific political systems 
now broadly grouped under the heading of com- 
parative government and politics in most political 
science curricula are shifting from a configurative 
approach to one that employs schemes of classifi- 
cation, generic categories of a functional kind, and 
is illuminated by comparison. It makes no sense 
to include these studies of individual political sys- 
tems under the heading of comparative politics, 
since all political systems will be treated compara- 
tively, whether within the framework of theoretical 
courses, which will group them into classes and 
varieties, or in the more intensive analyses of in- 
dividual cases, which will draw upon general 
classificatory and developmental theories, and upon 
specific institutional and process theories. 

Another and more recent development in com- 
parative politics would again seem to have long-run 
implications for the development of empirical and 
normative political theory rather than for the de- 
velopment of a special discipline of comparative 
politics. As political scientists have become increas- 
ingly concerned with the adaptation and transfor- 
mation of political systems, and particularly with 


problems of public policy relating to the new 
nations, there has been an increasing tendency to 
focus on the interaction of whole political systems 
with their domestic and international environ- 
ments, since it is at this level that it becomes pos- 
sible to explain political change. This most recent 
development among students of comparative poli- 
tics and political development holds out the pros- 
pect of bridging the discontinuity between empirical 
and normative political theories. As methodologies 
are developed that will make possible precise char- 
acterization of the interaction of political systems 
with their environments, the problem of the ethical 
evaluation of political systems becomes more of a 
rigorous, empirically based exercise. This, of course, 
is not to say that empirical performance is the 
same thing as ethical evaluation. It can, however, 
provide the information essential to evaluation and 
can test hypotheses regarding the ethical proper- 
ties of varieties of political systems. 

Finally, the most recent preoccupation in the 
field of comparative politics—with political devel- 
opment and with the logic of a theory of resource 
allocation to effect political change—again holds 
out the prospect for the enrichment of political 
theory rather than for the future of a particular 
subdiscipline within the field. Thus, this interest 
too may be assimilated into a general body of polit- 
ical theory. 

It is difficult to see, therefore, that comparative 
politics has a long-run future as a subdiscipline 
of political science. Rather, it would appear that, 
like the political behavior movement which pre- 
ceded it, its promise lies in enriching the discipline 


of political science as a whole. 
GABRIEL A. ALMOND 


[See also MODERNIZATION; POLITICAL ANTHROPOLOGY; 
POLITICAL CULTURE. Other relevant material may be 
found in GovERNMENT; LEGAL SYSTEMS, article on 
COMPARATIVE LAW AND LEGAL SYSTEMS; POLITICAL 
BEHAVIOR; POLITICAL SCIENCE; PUBLIC ADMINISTRA- 
TION; PUBLIC LAW, article on COMPARATIVE PUBLIC 
Law.] 
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POLLUTION 


One of the great puzzles in comparative studies 
of religion has been the reconciliation of the con- 
cept of pollution, or defilement, with that of holi- 
ness. In the last half of the nineteenth century, 


Robertson Smith asserted that the religion of prim- 
itive peoples developed out of the relation between 
a community and its gods, who were seen as just 
and benevolent. Dependent on a sociological ap- 
proach to religion, Robertson Smith continued al- 
ways to draw a line between religious behavior, 
concerned with ethics and gods, and nonreligious, 
magical behavior. He used the term taboo to de- 
scribe nonreligious rules of conduct, especially 
those concerned with pollution, in order to distin- 
guish them from the rules of holiness protecting 
sanctuaries, priests, and everything pertaining to 
gods. The latter behavior he held to be intelligible 
and praiseworthy and the former to be primitive, 
savage, and irrational—“magical superstition based 
on mere terror.” 

He clearly felt that magic and superstition were 
not worth a scholar’s attention. But Sir James 
Frazer, who dedicated The Golden Bough to Robert- 
son Smith, tried to classify and understand the 
nature of magical thinking. He formulated the two 
principles of sympathetic magic: action by con- 
tagion and action by likeness. Frazer followed 
Robertson Smith in assuming that magic was more 
primitive than religion, and he worked out an evo- 
lutionary scheme in which primitive man’s earliest 
thinking was oriented to mechanical ideas of con- 
tagion. Magic gradually gave way to another cos- 
mology, the idea of a universe dominated by super- 
natural beings similar to man but greatly superior 
to him. Magic thus came to be accepted as a word 
for ritual which is not enacted within a cult of 
divine beings. But obviously there is an overlap 
between nonreligious ideas of contagion and rules 
of holiness. Robertson Smith accounted for this by 
making the distinction between holiness and un- 
cleanness a criterion of the advanced religions: 


The person under taboo is not regarded as holy, for he 
is separated from approach to the sanctuary as wel 
as from contact with men, but his act or condition 1S 
somehow associated with supernatural dangers, bate 
ing, according to the common savage explanation, 
from the presence of formidable spirits which are 
shunned like an infectious disease. In most savage a 
cieties no sharp line seems to be drawn between t 
two kinds of taboo . . . and even in more ae ee 
nations the notions of holiness and uncleanness © z 
touch . . . [to] distinguish between the holy a 
the unclean, marks a real advance above savagery: 
({1889] 1927, p. 153) 

Frazer echoes the notion that confusion between 
uncleanness and holiness marks primitive He 
ing. In a long passage in which he considers mi 
Syrian attitude to pigs, he concludes: ‘Some 


: said 
this was because the pigs were unclean; others 


| 


it was because the pigs were sacred. This . , . points 
to a hazy state of religious thought in which the 
ideas of sanctity and uncleanness are not yet 
sharply distinguished, both being blent in a sort of 
vaporous solution to which we give the name of 
taboo” ([1890] 1955, vol. 2, part 5, p. 23). 

The work of several modern-day students of 
comparative religion derives not directly from 
Frazer but from the earlier work of Durkheim, 
whose debt to Robertson Smith is obvious in many 
ways. On the one hand, Durkheim was content to 
ignore aspects of defilement which are not part of 
a religious cult. He developed the notion that magi- 
cal injunctions are the consequence of primitive 
man’s attempt to explain the nature of the uni- 
verse. Durkheim suggested that experimentation 
with magical injunctions, having thus arisen, has 
given way to medical science. But on the other 
hand, Durkheim tried to show that the contagious- 
ness of the sacred is an inherent, necessary, and 
peculiar part of its character. 

His idea of the sacred as the expression of soci- 
ety's awareness of itself draws heavily on Robertson 
Smith’s thesis that man’s relation to the gods, his 
religious behavior, is an aspect of prescribed social 
behavior. It followed, for Durkheim, that religious 
ideas are different from other ideas. They are not 
referable to any ultimate material reality, since 
religious shrines and emblems are only themselves 
representations of abstract ideas. Religious experi- 
ence is an experience of a coercive moral force. 
Consequently, religious ideas are volatile and fluid; 
they float in the mind, unattached, and are always 
likely to shift, or to merge into other contexts at 
the risk of losing their essential character: there is 
always the danger that the sacred will invade the 
profane and the profane invade the sacred. The 
sacred must be continually protected from the pro- 
fane by interdictions. Thus, relations with the 
sacred are always expressed through rituals of sep- 
aration and demarcation and are reinforced with 
beliefs in the danger of crossing forbidden bound- 
aries. [See the biographies of DURKHEIM; FRAZER; 
SMITH, WILLIAM ROBERTSON.] 

If contemporary thinkers were not already well 
prepared to accept the idea that “religious” restric- 
tions were utterly different from primitive super- 
stitions about contagion, this circular distinction 
between two kinds of contagion could hardly have 
gone unchallenged. How can it be argued that con- 
tagiousness is the peculiar characteristic of ideas 
about the sacred when another kind of contagious- 
ness has been bracketed away by definition as 
irrelevant? 

This criticism of Durkheim's treatment of sacred 
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contagion is implicit in Lévy-Bruhl’s massive work 
on primitive mentality (1922). Lévy-Bruhl docu- 
mented a special kind of outlook on the universe, 
one in which the power to act and to be acted upon 
regardless of restrictions of space and time is widely 
attributed to symbolical representations of persons 
and animals, He himself explained the belief in 
such remote contagion by the dominance of the 
idea of the supernatural in the primitive view of 
the world. And since he would expect “supernatu- 
ral” to be equated with Durkheim's “sacred,” he 
seems to have seen no conflict between his and 
the master’s views. [See the biography of Livy- 
BRUHL.] 

We cannot accept Durkheim's argument that 
there are two kinds of contagion, one the origin of 
primitive hygiene and the other intrinsic to ideas 
about the sacred, because it is circular. If we ap- 
proach the problem of contagion in Lévy-Bruhl's 
terms, then the scope of the answer is broadened: 
there is not simply a residual area of magical be- 
havior that remains to be explained after primitive 
religious behavior has been understood but rather 
a whole mentality, a view of how the universe is 
constituted. This view of the universe differs essen- 
tially from that of civilized man in that sympathetic 
magic provides the key to its control. Lévy-Bruhl 
is open to criticism; his statement of the problem 
is oversimple. He bluntly contrasts primitive men- 
tality with scientific thought, not fully appreciating 
what a rare and specialized activity scientific think- 
ing is and in what well-defined and isolated con- 
ditions it takes place. His use of the word “pre- 
logical” in his first formulation of primitive thinking 
was unfortunate, and he later discarded it. But 
although his work seems to be discredited at pres- 
ent, the general problem still stands. There is a 
whole class of cultures, call them what you will, 
in which great attention is paid to symbolic de- 
marcation and separation of the sacred and the 
profane and in which dangerous consequences are 
expected to follow from neglect of the rituals of 
separation. In these cultures lustrations, fumiga- 
tions, and purifications of various kinds are applied 
to avert the dangerous effect of breach of the rules, 
and symbolic actions based on likeness to real 
causes are used as instruments for creating positive 


effects. 


The cultural definition 

If we are not to follow Robertson Smith in treat- 
ing the rules of uncleanness as irrational and be- 
yond analysis, we need to clear away some of the 
barriers which divide up this whole field of inquiry, 
While the initial problem is posed by the difference 
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between “our” kind of thinking and “theirs,” it is a 
mistake to treat “us” the moderns and “them” the 
ancients as utterly different. We can only approach 
primitive mentality through introspection and un- 
derstanding of our own mentality. The distinction 
between religious behavior and secular behavior 
also tends to be misleadingly rigid. To solve the 
puzzle of sacred contagion we can start with more 
familiar ideas about secular contagion and defile- 
ment, In English-speaking cultures, the key word 
is the ancient, primitive, and still current “dirt.” 
Lord Chesterfield defined dirt as matter out of place. 
This implies only two conditions, a set of ordered 
relations and a contravention of that order. Thus 
the idea of dirt implies a structure of ideas. For us 
dirt is a kind of compendium category for all events 
which blur, smudge, contradict, or otherwise con- 
fuse accepted classifications. The underlying feel- 
ing is that a system of values which is habitually 
expressed in a given arrangement of things has 
been violated. 

This definition of defilement avoids some histori- 
cal peculiarities of Western civilization. For exam- 
ple, it says nothing about the relation between dirt 
and hygiene. We know that the discovery of patho- 
genic organisms is recent, but the idea of dirt ante- 
dates the idea of pathogenicity. It is therefore more 
likely to have universal application. If we treat all 
pollution behavior as the reaction to any event 
likely to confuse or contradict cherished classifica- 
tions, we can bring two new approaches to bear on 
the problem: the work of psychologists on percep- 
tion and of anthropologists on the structural analy- 
sis of culture. 

Perception is a process in which the perceiver 
actively interprets and, in the course of his inter- 
preting, adapts and even supplements his sensory 
experiences. Hebb has shown that in the process 
of perception, the perceiver imposes patterns of 
organization on the masses of sensory stimuli in 
the environment (1949; 1958). The imposed pat- 
tern organizes sequences into units—fills in missing 
events which would be necessary to justify the 
recognition of familiar units. The perceiver learns 
to adjust his response to allow for modification of 
stimuli according to changes in lighting, angle of 
regard, distance, and so forth. In this way the 
learner develops a scheme or structure of assump- 
tions in the light of which new experiences are 
interpreted. Learning takes place when new ex- 
perience lends itself to assimilation in the existing 
structure of assumption or when the scheme of past 
assumptions is modified in order to accommodate 
what is unfamiliar. In the normal process of inter- 
pretation, the existing scheme of assumptions tends 


to be protected from challenge, for the learner rec- 
ognizes and absorbs cues which harmonize with 
past experience and usually ignores cues which are 
discordant. Thus, those assumptions which have 
worked well before are reinforced. Because the se- 
lection and treatment of new experiences validates 
the principles which have been learned, the struc- 
ture of established assumptions can be applied 
quickly and automatically to current problems of 
interpretation. In animals this stabilizing, selective 
tendency serves the biological function of survival. 
In men the same tendency appears to govern learn- 
ing. If every new experience laid all past interpre- 
tations open to doubt, no scheme of established 
assumptions could be developed and no learning 
could take place. 

This approach may be extended to the learning 
of cultural phenomena. Language, for example, 
learned and spoken by individuals, is a social phe- 
nomenon produced by continuous interaction be- 
tween individuals, The regular discriminations 
which constitute linguistic structure are the spon- 
taneous outcome of continual control, exercised on 
an individual attempting to communicate with 
others. Expressions which are ambiguous or which 
deviate from the norm are less effective in com- 
munication, and speakers experience a direct feed- 
back encouraging conformity. Language has more 
loosely and more strictly patterned domains in 
which ambiguity has either more or less serious 
repercussions on effective communication. Thus 
there are certain domains in which ambiguity can 
be better tolerated than in others (Osgood & Sebeok 
1954, p. 129). 

Similar pressures affect the discrimination of 
cultural themes. During the process of encultura- 
tion the individual is engaged in ordering newly 
received experiences and bringing them into con- 
formity with those already absorbed. He is also in- 
teracting with other members of his community 
and striving to reduce dissonance between his 
structure of assumptions and theirs (Festinger 
1957). Frenkel-Brunswik’s research among s 
children who had been variously exposed to racið 
prejudice illustrates the effects of ambiguity a 
learning at this level. The children listened ti 
stories which they were afterwards asked to rec i 
In the stories the good and bad roles were not a 
sistently allocated to white and Negro characte 
When there was dissonance between their ae 
lished pattern of assumptions about racial V as 
and the actual stories they heard, an aa ee 
effect was received. They were unable to recall i 
stories accurately. There are implications here a 
the extent to which a culture (in the sense o 


consistent structure of themes, postulates, and 
evaluations) can tolerate ambiguity. It is now com- 
mon to approach cultural behavior as if it were sus- 
ceptible to structural analysis on lines similar to 
those used in linguistics (Lévi-Strauss 1958; Leach 
1961). For a culture to have any recognizable char- 
acter, a process of discrimination and evaluation 
must have taken place very similar to the process 
of language development—with an important dif- 
ference. For language the conditions requiring clear 
verbal communication provide the main control on 
the pattern which emerges, but for the wider cul- 
ture in which any language is set, communication 
with others is not the only or principal function. 
The culture affords a hierarchy of goals and values 
which the community can apply as a general guide 
to action in a wide variety of contexts. Cultural 
interaction, like linguistic interaction, involves the 
individual in communication with others. But it 
also helps the individual to reflect upon and order 
his own experience. 

The general processes by which language struc- 
ture changes and resists change have their ana- 
logues at the higher level of cultural structure. The 
response to ambiguity is generally to encourage 
clearer discrimination of differences. As in lan- 
guage, there are different degrees of tolerance of 
ambiguity. Linguistic intolerance is expressed by 
avoidance of ambiguous utterances and by pressure 
to use well-discriminated forms where differences 
are important to interpretation and appropriate re- 
sponses, Cultural intolerance of ambiguity is ex- 
pressed by avoidance, by discrimination, and by 
pressure to conform. 


The functions of pollution beliefs 


To return to pollution behavior, we have already 
seen that the idea of dirt implies system. Dirt avoid- 
ance is a process of tidying up, ensuring that the 
order in external physical events conforms to the 
structure of ideas. Pollution rules can thus be seen 
as an extension of the perceptual process: insofar 
as they impose order on experience, they support 
clarification of forms and thus reduce dissonance. 

Much attention has been paid to the sanctions 
by which pollution rules are enforced (see Steiner 
1956, p. 22). Sometimes the breach is punished by 
political decree, sometimes by attack on the trans- 
gressor, and sometimes by grave or trivial sanc- 
tions; the sanction used reflects several aspects of 
the matter. We can assume that the community, 
insofar as it shares a common culture, is collec- 
tively interested in pressing for conformity to its 
norms, In some areas of organization the com- 
munity is capable of punishing deviants directly, 
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but in others this is not practicable. This may hap- 
pen, for example, if political organization is not 
sufficiently developed or if it is developed in such 
a way as to make certain offenses inaccessible to 
police action. Homicide is a type of offense which 
is variously treated according to the relationship 
between killer and victim. If the offender is him- 
self a member of the victim's group and if this is 
the group which is normally entrusted with pro- 
tection of its members’ interests, it may be held 
contradictory and impossible for the group to inflict 
punishment. Then the sanction is likely to be 
couched in terms of a misfortune that falls upon 
the offender without human intervention, This 
kind of homicide is treated as a pollution. 

We would expect to find that the pollution beliefs 
of a culture are related to its moral values, since 
these form part of the structure of ideas for which 
pollution behavior is a protective device. But we 
would not expect to find any close correspondence 
between the gravity with which offenses are judged 
and the danger of pollution connected with them. 
Some moral failings are likely to be met with 
prompt and unpleasant social consequences. These 
self-punishing offenses are less likely to be sanc- 
tioned by pollution beliefs than by other moral 
rules. Pollution beliefs not only reinforce the cul- 
tural and social structure, but they can actively 
reduce ambiguity in the moral sphere. For example, 
if two moral standards are applied to adultery, so 
that it is condemned in women and tolerated in 
men, there will inevitably be some ambiguity in the 
moral judgment since adultery involves a man and 
a woman, A pollution belief can reduce the am- 
biguity. If the man is treated as dangerously con- 
tagious, his adulterous condition, while not in itself 
condemned, endangers the outraged husband or 
the children; moral support can be mustered 
against him. Alternatively, if attention is focused 
on the pollution aspect of the case, a rite of puri- 
fication can mitigate the force of the moral con- 
demnation. 

This approach to pollution allows further ap- 
plications of Durkheimian analysis. If we follow 
him in assuming that symbolism and ritual, 
whether strictly religious or not, express society's 
awareness of its own configuration and necessities, 
and if we assume that pollution rules indicate the 
areas of greater systematization of ideas, then we 
have an additional instrument of sociological anal- 
ysis. Durkheim held that the dangerous powers 
imputed to the gods are, in actual fact, powers 
vested in the social structure for defending itself, 
as a structure, against the deviant behavior of its 
members. His approach is strengthened by includ- 
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ing all pollution rules and not merely those which 
form part of the religious cult. Indeed, deriving 
pollution behavior from processes similar to per- 
ception comes close to Durkheim’s intention of 
understanding society by developing a social theory 
of knowledge. 

Pollution rules in essence prohibit physical con- 
tact. They tend to be applied to products or func- 
tions of human physiology; thus they regulate con- 
tact with blood, excreta, vomit, hair clippings, nail 
clippings, cooked food, and so on, But the anthro- 
pologist notes that the incidence of beliefs in physi- 
ological pollution varies from place to place. In 
some communities menstrual pollution is gravely 
feared and in others not at all; in some, pollution 
by contact with the dead is feared, in others pollu- 
tion of food or blood, Since our common human 
condition does not give rise to a common pattern of 
pollution observances, the differences become in- 
teresting as an index of different cultural pattern- 
ing. It seems that physiological pollutions become 
important as symbolic expressions of other unde- 
sirable contacts which would have repercussions on 
the structure of social or cosmological ideas. In 
some societies the social definition of the sexes is 
more important than in others. In some societies 
social units are more rigorously defined than in 
others, Then we find that physical contact between 
sexes or between social units is restricted even at 
second or third remove. Not only may social inter- 
course be restricted, but sitting on the same chair, 
sharing the same latrine, or using the same cook- 
ing utensils, spoons, or combs may be prohibited 
and negatively sanctioned by pollution beliefs. By 
such avoidances social definitions are clarified and 
maintained. Color bars and caste barriers are en- 
forced by these means. As to the ordered relation of 
social units and the total structure of social life, 
this must depend on the clear definition of roles 
and allegiances. We would therefore expect to find 
pollution concepts guarding threatened disturb- 
ances of the social order. On this, nearly everything 
has been said by van Gennep. His metaphor of so- 
ciety as a kind of house divided into rooms and 
corridors, the compartments carefully isolated and 
the passages between them protected by cere- 
monial, shows insight into the social aspects of 
pollution. So also does his insistence on the rela- 
tive character of the sacred: 


Sacredness as an attribute is not absolute; it is brought 
into play by the nature of particular situations. . . . 
Thus the “magic circles” pivot, shifting as a person 
moves from one place in society to another. The cate- 
gories and concepts which embody them operate in 


such a way that whoever passes through the various 


positions of a lifetime one day sees the sacred where 
before he has seen the profane, or vice versa. Such 
changes of condition do not occur without disturbing 
the life of society and the individual, and it is the 
function of rites of passage to reduce their harmful 
effects. (Gennep [1909] 1960, pp. 12-13) 


Van Gennep saw that rites of transition treat all 
marginal or ill-defined social states as dangerous. 
His treatment of margins is fully compatible with 
the sociological approach to pollution. But van 
Gennep’s ideas must be vastly expanded. Not only 
marginal social states, but all margins, the edges 
of all boundaries which are used in ordering the 
social experience, are treated as dangerous and 
polluting. 

Rites of passage are not purificatory but are 
prophylactic. They do not redefine and restore a 
lost former status or purify from the effect of con- 
tamination, but they define entrance to a new 
status, In this way the permanence and value of 
the classifications embracing all sections of society 
are emphasized. 

When we come to consider cosmological pollu- 
tion, we are again faced with the problem unre- 
solved by Lévy-Bruhl. Cosmological pollution is to 
the Westerner the most elusive, yet the most inter- 
esting case. Our own culture has largely given up 
the attempt to unify, to interpenetrate, and to cross- 
interpret the various fields of knowledge it encom- 
passes. Or rather, the task has been taken over by 
natural science, A major part of pollution behavior 
therefore lies outside the realm of our own experi- 
ence: this is the violent reaction of condemnation 
provoked by anything which seems to defy the ap- 
parently implicit categories of the universe. Our 
culture trains us to believe that anomalies are only 
due to a temporarily inadequate formulation of 
general natural laws. We have to approach this 
kind of pollution behavior at second hand. 

The obvious source of information on the place 
of cosmic abnormality in the mind of the prim- 
itive is again Lévy-Bruhl. Earthquakes, typhoons, 
eclipses, and monstrous births defy the order of the 
universe. If something is thought to be frightening 
because it is abnormal or anomalous, this implies 
a conception of normality or at least of categories 
into which the monstrous portent does not fit. a 
more surprising that anomaly is taken to be, ‘i 
clearer the evidence that the categories which 1 
contradicts are deeply valued. à 

At this point we can take up again the pe 
of how the culture of civilization differs fron ae 
which Lévy-Bruhl called primitive. Recalling 
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dirt implies system and that pollution beliefs in- 
dicate the areas of greatest systematization, we 
can assume that the answer must be along the 
same lines. The different elements in the primitive 
world view are closely integrated; the categories of 
social structure embrace the universe in a single, 
symbolic whole. In any primitive culture the urge 
to unify experience to create order and wholeness 
has been effectively at work. In “scientific culture” 
the apparent movement is the other way. We are 
led by our scientists to specialization and compart- 
mentalism of spheres of knowledge. We suffer the 
continual breakup of established ideas. Lévy-Bruhl, 
looking to define the distinction between the scien- 
tific and the primitive outlook, would have been 
well served if he had followed Kant's famous pas- 
sage on his own Copernican revolution. Here Kant 
describes each great advance in thought as a stage 
in the process of freeing “mind” from the shackles 
of its own subjective tendencies. In scientific work 
the thinker tries to be aware of the provisional and 
artificial character of the categories of thought 
which he uses. He is ready to reform or reject his 
concepts in the interests of making a more accurate 
statement. 

Any culture which allows its guiding concepts to 
be continually under review is immune from cos- 
mological pollutions. To the extent that we have 
no established world view, our ways of thinking are 
different from those of people living in primitive 
cultures, For the latter, by long and spontaneous 
evolution, have adapted their patterns of assump- 
tion from one context to another until the whole of 
experience is embraced. But such a comprehensive 
structure of ideas is precarious to the extent that 
it is an arbitrary selection from the range of pos- 
sible structures in the same environment. Other 
ways of dividing up and evaluating reality are con- 
ceivable. Hence, pollution beliefs protect the most 
vulnerable domains, where ambiguity would most 
weaken the fragile structure. 


Emotional aspects of pollution behavior 


Pollution beliefs are often discussed in terms of 
the emotions which they are thought to express. 
But there is no justification for assuming that ter- 
ror, or even mild anxiety, inspires them any more 
than it inspires the housewife’s daily tidying up. 
For pollution beliefs are cultural phenomena. They 
are institutions that can keep their forms only by 
bringing pressure to bear on deviant individuals. 
There is no reason to suppose that the individual in 
a primitive culture experiences fear, still less un- 
reasoning terror, if his actions threaten to modify 
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the form of the culture he shares, His position is 
exactly comparable to a speaker whose own lin- 
guistic deviations cause him to produce responses 
which vary with his success in communicating. The 
dangers and punishments attached to pollution act 
simply as means of enforcing conformity. 

As to the question of the rational or irrational 
character of rules of uncleanness, Robertson Smith 
is shown to have been partly right. Pollution beliefs 
certainly derive from rational activity, from the 
process of classifying and ordering experience. 
They are, however, not produced by strictly rational 
or even conscious processes but rather as a spon- 
taneous by-product of these processes. 


Mary DOUGLAS 


[See also CASTE; COMMUNICATION; MAGIC; MYTH AND 
SYMBOL; RITUAL.] 
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I 
THE FIELD OF DEMOGRAPHY 


Demography is the quantitative study of human 
populations. Its basic materials are censuses, vital 
statistics, and, increasingly, sample surveys. Its 
central concerns are the measurement and dis- 
covery of uniformities in the basic processes of 
human birth, death, population movement, and 
population growth; these phenomena are treated 
in both their socioeconomic and their biological 
contexts. The methods of demography are em- 
pirical and statistical and make as much use of 
advanced mathematics as those of any branch of 
the social sciences. Like anthropology and psy- 
chology, demography bridges the social and bio- 
logical sciences. 

The word “demography” was apparently first 
used by Achille Guillard in his Éléments de sta- 


tistique humaine, ou démographie comparée in 
1855. Its Greek origins are demos (people) and 
graphein (to draw, describe). Until recently, the 
word had not attained general usage among Eng- 
lish-speaking peoples, and those using it were fre- 
quently the victims of typographers and casual 
audiences who insisted on confusing it with “de- 
mocracy.” The most widely used textbook on popu- 
lation problems in the United States during the 
period from 1930 to 1960 does not mention the 
word in the first three editions and refers to it only 
casually in the fourth and fifth (Thompson & Lewis 
1930). However, the word is gaining both wider 
usage and wider meaning as describing the scien- 
tific aspect of population study. 

Formal demography. A distinction is commonly 
made between “formal,” or “pure,” demography 
and broader population studies. Formal demogra- 
phy is a well-defined, technical subject with a 
highly developed mathematical methodology. It is 
concerned primarily with the measurement and 
analysis of the components of population change, 
especially births, deaths, and, to a smaller extent, 
migration. It is concerned with population struc- 
ture—that is, the age, sex, and marital composi- 
tion of the population—as it contributes to the 
understanding of population change. 

Formal demography has provided some of the 
most fruitful uses of mathematical models in the 
social sciences. These include the construction of 
life tables, which provide the terms within which 
life insurance and social security systems must 
operate; intrinsic rates of reproduction and other 
sophisticated measures of natality, which have 
contributed much to the understanding of the secu- 
lar decline of birth rates in the West and their fluc- 
tuations after World War m; stable population 
analysis, which has provided methods of estimating 
birth rates and rates of population growth in the 
absence of reliable vital statistics and often with 
grossly inaccurate census data; and population pro- 
jections, based on the analysis of trends in the 
components of population change. Such ptoje 
are in great demand for purposes of economic A 
velopment, market research, city planning, e 
tional planning, estimating future labor supp» 
and numerous other approaches to measuring 
people as producers and consumers. 

For actuarial purposes, demograph h 
defined in the narrow sense of formal demograp ð 
(Cox 1950). This definition also prevails in certa 
European countries, notably Italy. y 

Broader usage. A broader and inerea 
popular usage of the term “demography” ine ie 
studies of demographic variables in their $0 


y is usually 
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contexts as well as their biological contexts. In this 
approach, demographic changes are viewed as part 
of, and as both cause and effect of, their social 
environment. Demographic analysis has also come 
to mean “not only the statistical manipulation of 
population data, but more important, the study 
of such data as a method of solving empirical 
problems” (Spengler & Duncan [1956a] 1963, 
p. xiii). 

In addition to the components of population 
change, demographic variables are rather arbi- 
trarily assumed to include those mass measure- 
ments of population that are commonly enumerated 
in population censuses: the size and distribution 
of the population, its biological composition by age 
and sex, and certain of its more measurable socio- 
economic characteristics. Among these variables, 
data on population size, geographical distribution 
(including rural-urban), age-sex composition, and 
marital status are generally considered to be more 
specifically the subject matter of demography. 
Other socioeconomic characteristics, such as race, 
language, religion, education, occupation, and in- 
come, are studied in their own right and often in 
relation to group differences in birth rates, death 
rates, and migration. 

The broader definition of demography inevitably 
involves interdisciplinary aspects, since demogra- 
phers and other social scientists may study human 
populations in relation to other variables. Thus, a 
study of rural-urban migration may be considered 
demographic if it is primarily concerned with 
measurement and quantitative uniformities, eco- 
nomic if it is concerned with economic causes and 
effects, and sociological if it deals with sociological 
causes and effects. There is no firm line of demar- 
cation, nor should one be expected. Beyond the 
specific discipline of formal demography there is 
no distinguishing criterion of demography as such, 
other than its involvement in quantitative meas- 
ures of human populations. Thus, it can be said 
that the subject matter of demography includes 
those population theories that are based on quanti- 
tative observation and generalization, but not those 
that are formulated a priori at a more abstract and 
polemical level. 


The development of demography 

Censuses in various forms were conducted early 
in human history. A Roman census was taken 
under Caesar Augustus the year Christ was born 
(thus forcing Joseph and Mary to journey to Beth- 
lehem to be enrolled in the place of their ancestry); 
and a major census was conducted in China in the 
year 2 A.D., during the Han Dynasty. 


While such scattered censuses are of great in- 
terest for historical demography, modern censuses 
usable for scientific pursuits are essentially of nine- 
teenth-century origin, Regular decennial censuses 
were initiated in the United States in 1790 and in 
France and England in 1801. Important exceptions 
are the population registers in Scandinavia, which 
date back to 1686 in Sweden and 1769 in Denmark. 

Origins in vital statistics. Empirical demogra- 
phy began in the context of vital statistics with the 
work of John Graunt. His Natural and Political 
Observations Upon the Bills of Mortality, pub- 
lished in 1662, is a study of current reports on 
burials and christenings for a population of about 
500,000 persons in the vicinity of London. The 
very title of this early work illustrates the fact that 
demography has roots in both biological and social 
inquiries. In the 1670s, William Petty built on 
Graunt’s work to write a treatise on what he de- 
scribed as “political arithmetick.” Gradually demog- 
raphy was absorbed into the new and more general 
study of statistics to the point that, in the United 
States, for example, concern about the accuracy 
and implications of vital statistics in the city of 
Boston led to the establishment of the American 
Statistical Association in 1839. 

The line of succession in the biostatistical tra- 
dition of demography is a notable one (see Lorimer 
1959). Interest in biostatistics led to attempts to 
explain population change in terms of mathemat- 
ical processes, particularly logistic curves. Popula- 
tion change was conceived in terms of mechanical 
models by nineteenth-century researchers such as 
Adolphe Quetelet and Pierre Francois Verhulst and 
in biological terms by Pearl and Reed (1920), 
Alfred J. Lotka (1925), and G. Udny Yule (1925). 
While mechanical and biologically deterministic 
models did not ultimately prove convincing, they 
did lead to major developments in the statistical 
analysis of the processes of birth and death. Out- 
standing among these were Lotka’s contributions 
in computing reproduction rates and intrinsic, or 
true, rates of natural increase, holding constant the 
age structure of the population. The use of mathe- 
matical models, both stochastic and deterministic, 
has enjoyed a rebirth in demography, perhaps be- 
cause of the vogue of model-building in social 
sciences generally, and certainly because of the 
capabilities of modern computer technology. 

Malthusian theory. The economic approach to 
demography is commonly traced to the work of 
Thomas Malthus, although he readily admitted the 
contributions of his predecessors. His Essay on the 
Principle of Population, first published in 1798, is 
undoubtedly the most discussed work in the field 
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of demography. While he modified his position in 
successive versions of the volume, Malthus held the 
general view that population tends to increase 
faster than the means of subsistence and thus to 
absorb all economic gains, except when checked by 
moral restraint, vice, and misery. 

Malthus’ influence was enormous, but his view 

was later challenged in three ways. First, during 
the nineteenth century economic growth far out- 
distanced population growth among European peo- 
ples, resulting in rising levels of living. Second, 
despite rises in the level of living, birth rates de- 
clined, first in France and Sweden, and by 1880 
in western Europe generally. Finally, liberals and 
Marxists challenged the Malthusian view by assert- 
ing that poverty was the result of unjust social 
institutions rather than of population growth. Re- 
gardless of the validity of this argument, it caused 
population study to be associated with the conserva- 
tive social philosophy of Malthus, an association 
which retarded the growth of demography as a 
scientific discipline. As one author says, 
The marriage of demography and economics while 
both were immature—“Parson” Malthus officiating— 
resulted in a stormy and unfruitful union. Both the 
dynamics of interactions among economic factors and 
the dynamics of vital trends in relation to population 
structure were long neglected in a hasty synthesis that 
placed undue emphasis on the relation of population 
to resources and the corollary theory of a hypothetical 
fixed “optimum.” This led to the fallacious assumption 
that increase of population is necessarily advantageous 
in a country with low density of population and to an 
unwarranted pessimism about the possibilities of eco- 
nomic progress in densely populated countries. (Lori- 
mer 1957, p. 21) 

Dissatisfaction with the Malthusian approach 
led to the divorce of demography from economics 
and to a continuing suspicion among some econo- 
mists that demography overemphasizes the force of 
population growth and that population control in 
underdeveloped areas is in some way a diversion 
from, and even a threat to, the central purpose 
of economic development. Therefore, the econom- 
ics of population change in the contemporary world 
has not received the attention that its importance 
would seem to merit, although it has not been en- 
tirely neglected. Now largely separated from eco- 
nomics, demography in the English-speaking world 
is commonly taught as a branch of sociology. 

The birth rate in Western society. In its socio- 
logical aspects, population study emerged as a sci- 
ence in the 1920s and 1930s. Up to that time, 
births, deaths, and natural increase had generally 
been regarded as biologically given, exogenous both 
to the economy and to the social system. But by 
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the interwar period, the general reduction of the 
birth rate in Western society had become highly 
visible and was the principal focus of demographic 
inquiry. Demographic studies demonstrated the so- 
cial, as opposed to the biological, correlates and 
determinants of the decline in the birth rate. 
Thus, empirical studies showed that the decline 
was not the result of biological sterility but of vol- 
untary restriction of births, predominantly by 
methods requiring a high level of motivation. It is 
perhaps not widely known even today that reduc- 
tion of the birth rate through contraception was 
initially and predominantly the result of “male” 
methods, specifically the practice of coitus inter- 
ruptus and, later, the use of the condom; the 
principal “female” method of birth control was 
abortion [see FERTILITY CONTROL]. The feminist 
birth control movement was noisy, often coura- 
geous, and certainly psychologically important, but 
there is little empirical evidence that the methods 
it advocated or the services it provided ever played 
a large part in the reduction of the birth rate in the 
Western world. In much of the Western world the 
chief methods of birth control are still coitus inter- 
ruptus and abortion, although these are being in- 
creasingly replaced by new methods that have be- 
come generally available only since 1960, 
Another major concern of demographers was 
“differential fertility,” that is, the general observa- 
tion that birth control was first adopted by the 
more urbanized, the better educated, and the upper- 


income groups, with consequent rural-urban, edu- 
cational, and class differences in birth rates. 
more than 


Demographers were reluctant to do ; 
measure these differences, but their observations 
reinforced concern about population “quality,” since 
class differences in natality clearly seemed to favor 
the reproduction of the socially and economically 
unsuccessful, and possibly also the genetically 
inferior. 

The obvious role of social environment and 
untary control in the reduction of the birth a 
gave rise to a new realization that birth and deat 
rates, like migration, are effects as well as ae 
of social changes—dependent as well as indepen 
ent variables with reference to the society in wi 
they occur. Some felt that the existing trends ee 
disturbing and even alarming, threatening rac 
suicide,” on the one hand, and deterioration o: 
“quality,” on the other. 

The reaction to the declining birt n 
strongest in the totalitarian countries that for a 
tionalistic reasons adopted “pronatalist” pone a8 
policies. These policies included such meas 
propaganda for larger families, family allowance», 


vol- 


h rate was 
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suppression of induced abortion, marriage loans 
with deductions for each child born within the 
period of repayment, “motherhood medals” for par- 
ents of large families, and various privileges in 
taxation and in the social insurance schemes. Prior 
to World War m such policies were adopted in 
fascist Italy, Nazi Germany, the Soviet Union, and 
Japan. Some of these measures, without the na- 
tionalistic overtones, were also adopted in France 
and Sweden [see PopuLATION, article on POPULA- 
TION POLICIES]. 

In the English-speaking countries reaction to the 
declining birth rate was slower. In the United 
States the problem was given intensive study by 
the National Resources Committee (U.S. National 
Resources . . . 1938). In Great Britain and Canada 
family allowances were adopted after World War 11; 
and in Great Britain a Royal Commission on Popu- 
lation was created and issued a report that is a 
landmark in British thinking on this point (Great 
Britain 1949-1954), 

The effect of these pronatalist policies is con- 
troversial, The only clear cases of moderate success 
are rises in birth rates achieved in Nazi Germany 
and in the Soviet Union. In Germany the suppres- 
sion of abortion and the promotion of earlier mar- 
riages and childbirth by marriage loans were con- 
tributing factors in the rise in the birth rate from 
14.7 in 1933 to 20.3 in 1939 (Kirk 1942; Demo- 
graphic Yearbook 1948, p. 262). But a substantial 
part of this increase was almost certainly due to the 
general improvement in economic conditions, espe- 
cially the reduction of unemployment. In the Soviet 
Union the suppression of abortion was especially 
successful in raising birth rates in the cities, be- 
cause abortion had formerly been legal and had 
even been provided as a health service of the state. 
In the mid-1950s abortion was again legalized and 
the birth rate in the Soviet Union is now quite low. 
In the period since World War 1, French observers 
have expressed the view that family allowances 
have contributed to maintaining a higher birth rate 
in that country. 

Theory of the vital revolution. In Western coun- 
tries there were important regularities in the de- 
clines in death rates and birth rates, whether 
viewed in time or space. The spread of declines, 
first in death rates and later in birth rates, is a 
well-documented example of cultural diffusion. 
From these regularities were derived a number of 
general propositions concerning the sequence of 
demographic developments in the modern world 
Taken together, these propositions have come to 
be called the “theory of the vital revolution,” or 
the “theory of the demographic transition.” 
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According to this theory, in premodern societies 
both birth and death rates are high; reproduction 
is inefficient because it is necessary to have high, 
relatively unrestricted natality in order to match 
the waste of a high death rate. The initial effect of 
modernization is a fall in the death rate, apparently 
owing to higher levels of living and to the intro- 
duction of epidemic controls and other elementary 
public health measures. Without a comparable de- 
cline in the birth rate, there is a growing margin 
of births over deaths and an accelerating rate of 
population growth, At a later stage of socioeco- 
nomic development, with the achievement of gen- 
eral literacy, urbanization, and industrialization 
(as in the Western countries and in Japan), the 
size of family is reduced by birth control and the 
birth rate falls, eventually reducing the rate of 
population growth. 

The theory of the demographic transition gave 
a plausible interpretation of demographic events 
in Western countries up to World War 11. Demog- 
raphers sought to extrapolate it in time, by pro- 
jecting the then existing trends, and in space, by 
assuming that in the process of modernization the 
rest of the world would follow the same sequence 
of events as in the West. Population “projections” 
based on extrapolation of natality and mortality 
trends gained wide currency and were commonly 
regarded as estimates rather than as the mechan- 
ical extrapolations that they really were. 

It is ironic that demographers developed the 
techniques for projecting certain long-standing 
trends in the components of population growth, 
especially in natality, just at a time when these 
trends were about to dissolve, New attitudes favor- 
ing earlier marriage and more children appeared 
in the very societies where the great majority of 
families had been practicing birth control. The re- 
covery of the birth rate in Western countries just 
before, during, and especially after World War 1 
violated the projections of previous trends and 
those formulations of the demographic transition 
that considered Western countries to be approach- 
ing a stationary or declining population. While the 
prolonged “baby boom” after World War 1 has re- 
ceded, all Western countries continue to have 
modest rates of population growth. 

The demographic transition has not fully ma- 
terialized in non-Western cultures, aside from 
Japan. The transition has largely run its course in 
countries inhabited by peoples of predominantly 
European race and heritage, including Europe, the 
Soviet Union, and the overseas countries of chiefly 
European background. As yet, very few non-Euro- 
pean countries have clearly reached the stage of 
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declining birth rates and slower rates of population 
growth. There has not been a continuum of coun- 
tries moving through the stages of transition, as 
Western countries were doing at the time when 
the theory was developed. Instead, there is now a 
sharp division between countries that have passed 
through the “vital revolution” and now have modest 
birth rates and growth rates, and countries in 
which the death rates, but not the birth rates, have 
been going down, so that the rate of population 
growth has been accelerating. This division sepa- 
rates the developed from the underdeveloped world, 
and, aside from the important exception of Japan, 
it also separates the peoples of European race and 
tradition from the peoples of the rest of the world. 
But there are clear indications that other non- 
European countries such as Taiwan and Korea may 
now be entering the later phases of transition, and 
the development of government policies and espe- 
cially of better contraceptives may well accelerate 
the transition. As yet, no country has become 
“modern”—in the sense of achieving mature socio- 
economic development—without also undergoing a 
decline in its birth rate. 

Later methodological developments. Two ma- 
jor developments have occurred in demography 
since World War n as a result of the problems and 
failures of previous population projections: the 
development of a complex methodology to analyze 
trends in natality and the use of field surveys to 
determine causal factors affecting the number and 
timing of births. 

The attempts to obtain better forecasts of future 
population resulted in successive improvements in 
the measuring of reproduction. These included the 
refinement of the crude birth rate to a population 
standardized by age; Lotka’s historic breakthrough 
in determining the true birth and death rates, given 
the continuation of existing age-specific natality 
and mortality rates; the analysis of births by dura- 
tion of marriage, developed primarily in Europe 
(since such data were not available in the United 
States); the analysis of cohort fertility, that is, the 
number, spacing, and order of births per woman 
in her total reproductive experience (Whelpton 
et al. 1965); and stable population analysis, that 
is, the analysis of mathematical regularities in the 
interrelations of births, deaths, age structure, and 
population growth at fixed schedules of natality 
and mortality (Coale & Demeny 1966). Some of 
the methods in this area of growing demographic 
importance involve elaborate computations that 
are being facilitated by advances in computer 
technology. 


Despite great technical virtuosity and much 
better understanding of the various ways in which 
natality may be measured, it is too early to tell 
whether these efforts have added much to the ac- 
curacy of population forecasts, although they have 
improved understanding of current natality trends. 
Cohort fertility analysis certainly provides mean- 
ingful interpretations of past trends and provides 
the basis for reasonable estimates of the total num- 
ber of children a cohort of women will bear well 
before they have actually reached the end of their 
reproductive period. In the United States women 
born during the period 1905-1914 show the small- 
est average number of births and stand at the end 
of the long fertility decline that began in the early 
nineteenth century. It seems certain that later 
cohorts of women—that is, those born during the 
period 1915-1935—will show a larger average 
number of births (U.S. Department of Health... 
1966, p. 6). 

After World War 1, couples in the United States 
married earlier, had their children earlier, and had 
more children than their parents. The result was 
the higher birth rate in the United States and in 
some other Western countries during the postwar 
period. Recent trends suggest that these forces are 
receding. 

The official statistics used in the sophisticated 
methods of analysis referred to above have not 
given the reasons for the changes that have oc- 
curred. These have been sought in the second of 
the recent major developments in demographic 
methodology: field studies of attitudes, motiva- 
tions, and expectations relating to family size. In 
the United States demographers have conducted a 
series of national sample surveys of attitudes and 
practices in relation to family planning. These 
surveys are documenting the changing aspiration’ 
of American families with regard to the size of 
family desired and the changing practices of mar- 
riage, spacing of children, and methods of family 
limitation (for example, see Whelpton et al. 1965; 
Westoff et al. 1961; 1963). 


Demography and the population explosion 


Since World War n, interest in population 1 
lems has re-emerged with great force. This is Te 
lated to the rapid progress achieved in on 
or postponing deaths throughout the world, whic 
has occurred without a compensating decline in 
the birth rate in the underdeveloped areas. The n 
sult has been the acceleration of population growl 
dramatized in a vast and sometimes controversi: 


n” The 
literature about the “population explosion. Th 


prob- 
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rate of world population growth, over 2 per cent 
per year, is accelerating [see POPULATION, article 
On POPULATION GROWTH]. 

World population growth in the present period 
is historically unique, and there is a general realiza- 
tion that it leads to serious obstacles to the whole 
range of socioeconomic development, whether in 
food, health, education, housing, or the general 
level of living. Population growth is a major block 
to the satisfaction of rising aspirations in the 
underdeveloped countries (Coale & Hoover 1958). 

In both developed and developing countries the 
rapid growth of metropolitan areas with their wide 
range of problems has also been classified as a 
“population problem” on the mistaken assumption 
that this is primarily due to the birth rate rather 
than to migration, which has been the principal 
factor. It is recognized that population growth and 
concentration have contributed to a wide range of 
problems involving the quality of education, the 
urban sprawl, the contamination of the atmos- 
phere, and, according to some, the mediocrity of 
mass culture and the cacophony of much of 
modern life. 

As logical results of these concerns, social scien- 
tists have become more interested in the population 
problem, whether defined as a social problem or 
as an area of scientific interest. Demography has, 
of course, provided analysis of the population 
trends and prospects that underlie this concern. 
Prior to World War m demographic studies were 
largely confined to countries with adequate cen- 
suses and vital statistics—that is, chiefly countries 
of the Western world. In recent years there has 
been growing interest in the demography of the 
underdeveloped world, where great ingenuity and 
methodological skill are often needed to make up 
for the absence of reliable data. This changing 
emphasis has been reflected, for example, in the 
much greater attention given the underdeveloped 
areas in the Second World Population Conference 
in Belgrade in 1965, as compared with the First 
World Population Conference in Rome in 1954 
(World Population Conference 1966, p. 2). 

The growth of applied demography. Demogra- 
phy has supplied much of the factual background 
for the population policies and family planning 
programs that have been adopted by several of the 
developing countries. Much of the demographic 
literature now relates to attitude studies and to 
action experiments designed to introduce family 
planning in these areas. 

On the world scale, the field surveys of knowl- 
edge, attitudes, and practices (KAP studies) con- 
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cerning family size and family planning are per- 
haps the broadest cross-cultural series of studies 
on any subject in the social sciences. Sample sur- 
veys of this type have been done in some twenty 
countries in Africa, Asia, and Latin America. These 
surveys have revealed the feasibility of obtaining 
useful information on such delicate subjects in all 
societies; the existence of a substantial “market” 
for contraceptive information and services almost 
everywhere; and actual contraceptive practice in 
close relation to the degree of modernization of the 
country or socioeconomic class concerned [see 
FERTILITY CONTROL]. 

Although demography has been described as 
“observational,” as opposed to experimental, at 
least some demographers have embarked on con- 
trolled experiments in relation to efforts to reduce 
birth rates. In several instances, the KAP studies 
have included an action component, that is, experi- 
mentation in introducing family planning to the 
population concerned. These studies have met with 
varying success, depending on the characteristics 
of the populations and the methods of education 
and motivation employed. Such attempts have been 
successful where the population was already in the 
stream of modernization, had experienced increas- 
ing survival of infants, and had achieved some 
success in avoiding births through postponement 
of marriage and through contraception and abor- 
tion. Projects of this nature have been working in 
Taiwan, Korea, and Ceylon. However, where the 
population had not already made serious efforts to 
reduce family size, as in India and Pakistan, the 
results of such experiments have been limited 
(Conference . . . 1962; International Conference... 
1966; Studies in Family Planning). 

A number of countries have adopted national 
programs for introducing family planning: Com- 
munist China, South Korea, Taiwan, Malaysia, 
Singapore, Ceylon, Pakistan, Turkey, the United 
Arab Republic, Tunisia, Morocco, Barbados, Ja- 
maica, Honduras, and Chile. The list grows each 
year. Most of these programs are as yet too new 
and too small-scale to have achieved measurable 
success in reducing the birth rate. However, their 
chances for success may change rapidly with the 
improvement of contraceptive technology, notably 
oral contraceptives and plastic intrauterine devices. 
All contraception involves motivation; the newer 
methods are effective because they help the less 
strongly motivated and those living in circum- 
stances where earlier methods were impractical. It 
seems likely that these and further improvements 
will revolutionize the whole approach to family 
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planning in developing and developed countries 
alike (International Conference . . . 1966). 

There is controversy among demographers about 
the scientific relevance of applied studies in their 
field. There is less controversy about the role of 
getting better information on vital statistics and 
population growth, since these pursuits serve both 
academic and applied ends. In over half the world, 
including most of the countries that have adopted 
population policies, official vital statistics are in- 
adequate for determining either the true levels or 
year-to-year changes in birth rates, death rates, and 
population growth. New approaches that are being 
undertaken to obtain better estimates include 
closely supervised registration in sample areas, 
repetitive surveys, and various combinations that 
have come to be called “population growth estima- 
tion studies.” The real measure of the success of a 
national family planning program is reduction in 
the birth rate and in the rate of population growth, 
and demographers are therefore becoming increas- 
ingly involved in problems of measuring population 
growth, as well as in the evaluation of the results 
of family planning programs. 


Demography in the social sciences 


Demography is generally considered an inter- 
disciplinary subject with strong roots in sociology 
and weaker, but still important, connections with 
economics, statistics, geography, human ecology, 
biology, medicine, and human genetics. It is rarely 
thought of as a completely separate discipline, but 
rather as an interstitial subject or as a subdivision 
of one of the major fields. In English-speaking 
countries persons who describe themselves pro- 
fessionally as “demographers” usually hold posi- 
tions of broader definition as sociologists, econo- 
mists, or statisticians, or are in offices and research 
organizations whose titles include the word “popu- 
lation” but only rarely “demography,” a word that 
is more commonly used in other countries. 

Interest in demography has led recently to the 
rapid multiplication of offices and institutes de- 
voted specifically to population research. In the 
United States there are some ten such institutes 
associated with universities. There are important 
centers of demographic study in France, Italy, the 
United Kingdom, Australia, and Japan. The United 
Nations sponsors three regional centers of demo- 
graphic training (in Santiago, Chile, for Latin 
America; in Bombay, India, for Asia; and in Cairo, 
United Arab Republic, for north Africa), and others 
are under consideration. Some of the most rapid 
progress in demographic work is occurring at 


newly established centers in eastern Europe and in 
the developing areas. 

Demography is, in one sense, a general servant 
of other social sciences. It evaluates and initially 
digests the vast reservoirs of social data compiled 
in censuses and vital statistics. It provides sub- 
stantial raw material for the study of social, polit- 
ical, and economic change. Moreover, in spite of 
past disappointments over their accuracy, projec- 
tions and estimates are continually requested from 
demographers for a multitude of planning purposes. 

Demography has not had a prominent place in 
social scientific theory except in that of economics, 
through the works of Malthus. It is quantitative, 
empirical, and methodologically rigorous. Its sub- 
ject matter is statistical categories that only indi- 
rectly reflect real social groups. Its quantitative 
documentation of specific social change is not 
readily integrated into the structural—functional 
theories prevailing in sociology, which are com- 
monly speculative and hard to subject to empirical 
tests. 

Within its own domain demography has had its 
greatest successes in the analysis of mortality [see 
LIFE TABLES; MorTALITY]. Its greatest interest and 
virtuosity has been in the study of natality [see 
FERTILITY]. The stepchild of demography is migra- 
tion, which up to now has defied the application 
of refined measurements comparable to those de- 
veloped in the other two fields. Although interna- 
tional migration is no longer of great demographic 
importance, internal migration in the United 
States, for example, is the chief cause of variations 
in rates of population growth between states, cities, 
and regions. The importance of migration, as well 
as the advantages of the new computer technology, 
may bring greater talent and resources into this 
field [see MIGRATION]. 

In recent years perhaps the most neglected area 
in demography has been on the important frontiers 
between demography and economics, in such areas 
as population growth in relation to capital forma- 
tion; application of life table principles to labor 
force changes; the economics of health, morbidity, 
and mortality; and the economic implications © 
different levels of morbidity and mortality. ma 
are also unexplored possibilities for more fruittu: 
cross-disciplinary work between demography ve 
the fields of anthropology, politics, geogtaP 7 
ecology, and genetics [see GENETICS, article A 
DEMOGRAPHY AND POPULATION GENETICS]. T 
are beginnings, but as yet the joint efforts in the 


related fields are minimal. Duprey KRK 
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n 
POPULATION THEORIES 
In this disi ion we will leave aside such sub- 
jects as population genetics, eugenics, and the 
geographical distribution of people in a particular 
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area. We will concentrate instead upon those theo- 
ries which relate population size to environment. 
This relationship is expressed, for instance, by the 
terms “overpopulation” and “underpopulation” and 
by the concept of an optimum state which lies in 
between. 

It is appropriate to start with a short historical 
overview. 

Antiquity. Since overpopulation in China goes 
back to very ancient times, it is not surprising that 
Chinese philosophers, especially Confucius, enter- 
tained the notion of a numerical balance between 
population and environment. They also looked for 
means to check the increase in numbers. The 
foundations of a theory of optimum population 
level, fully stated only in the twentieth century, 
can be found in their writings. 

In ancient Greece, the earliest thinkers favored 
the expansion of population. Plato, however, be- 
came the foremost exponent of the restrictionist 
point of view, although he was by no means its 
first representative. In his Laws, following Hippo- 
damos, he advocated an absolute limit to the num- 
ber of citizens and suggested means to that end 
which would frighten today’s moralists. In Plato's 
Laws, however, as well as in his Republic and in 
Aristotle's Politics, the perspective is rather narrow, 
and any concern for the whole of mankind is well- 
nigh missing. Fear of overpopulation finally con- 
tributed to the decay of Hellenic civilization. At 
this point, Polybius analyzed the causes of the low 
fertility of the Greek population and proposed rem- 
edies—but it was too late. 

Early Rome was characterized by a fertility cult, 
as was typical of many rising civilizations, includ- 
ing Greece. This ideal found practical support in 
military and political expansion. From Ennius to 
Varro, all the writers of the republican period 
claimed that the primary function of marriage 
was to provide citizens—if not soldiers—for the 
state. 

Later writers denounced the evils of depopula- 
tion, and several emperors, notably Augustus, in- 
troduced legislation to encourage parenthood. In 
spite of these efforts, Rome's lack of men finally 
contributed to its downfall, since it was unable to 
hold back the barbarian invasions. 

The Middle Ages. Medieval population theory 
was not, generally speaking, grounded in eco- 
nomics. Since the concepts of state, race, and 
nation made little sense to medieval man, political 
concern for population growth could not crystal- 
lize. Gradually, however, the depopulation ensuing 
from disastrous invasions gave rise to the feeling 
that population growth in itself was a good thing 


and a mark of divine favor. This view was to sur- 
vive a long time, and to this day it has not com- 
pletely vanished. 

The setting up of a social hierarchy could only 
strengthen this line of reasoning. The leisure 
classes dreaded any decrease in the numbers of 
those whose labor they were exploiting (the analogy 
with a herd was becoming obvious). Thus, the 
return of the populationist tradition coincided with 
the reorganization of Western society after the 
era of feudalism. 

The Enlightenment. The populationist theory 
which appeared during the seventeenth century 
and flourished in the eighteenth century rested on 
paternalistic premises. According to an early state- 
ment by Guillaume Budé: “The king's glory is in 
the multitude of the people.” Similarly, Perriére 
asked: “Who will carry the weapons, if men are 
lacking?” Jean Bodin was even more explicit: 
“There is no wealth nor strength but in men.” 

Mercantilism was a natural extension of popu- 
lationist theory from the political to the economic 
field. Mercantilist theory was nationalistic and ex- 
pansionist in outlook; it aimed at increasing both 
employment and number of men by processing as 
much raw material as possible within the national 
boundaries. In fact, it was less of a scientific theory 
of population than a political attitude. Political 
concern with increasing the population as a means 
of promoting wealth and power for the lord, the 
king, and later for the state, extended through the 
centuries, well beyond the period of mercantilism. 
Most orthodox thinkers in the eighteenth century 
instinctively took a populationist position. The 
frankest statement of that position was given by 
Turmeau de la Morandiére, in one brutal sentence: 
“Subjects and beasts must be multiplied.” We shall 
see why nationalism, in its early stages, and dic- 
tatorships are more prone to populationist argu- 
ments than are democratic regimes. In Germany, 
Hermann Conring also attributed the power of 
states mainly to population. In England, at m 
same period, another confirmed populationist T 
William Petty, who founded the science of “po 
ical arithmetic,” or demography, and who ier 
his truly brilliant gifts to raise questions $0 peaks 
that they opened up vast areas to scientific inv 
tigation. 

Sn France, the physiocrats and their lea 
Francois Quesnay, thought that proper regula 
of landed property by means of capital invest a 
should take precedence over e ma ‘ 
Quesnay’s Tableau économique (written 2 
and revised in 1766) contained the first anne 
model of a national economy and was in this 


spect a forerunner of modern national accounting 
and the matrix developed by Leontief [see INPUT- 
OUTPUT ANALYSIS]. 

Adam Smith took little interest in the population 
question, which, as a matter of fact, baffled him 
ightly. The first to raise it in a formal way was 
Thomas Malthus. 

Malthus and his contemporaries. The autocratic 
ruler who wields absolute power without avowed 
obligations always desires a greater number of sub- 
jects. The scope of this observation extends beyond 
the field of demography; it applies to the sovereign 
as much as to the collector of objets d’art, to the 
head of a family as much as to the office manager. 
But when his subjects—or rather, the objects in 
his possession—have rights or occasion some ex- 
pense, the ruler’s point of view changes, and he 
grows increasingly cautious with regard to num- 
bers. Such a situation existed in seventeenth- 
century England, where the poor laws resulted in 
an increasing burden for the ruling class at the 
same time that the poor themselves were ceasing 
to rely on charity and were becoming conscious of 
their rights. This new dispensation was the cue 
for Malthus. 

The desire of Malthus and his supporters to see 
poor families restrict their progeny has been de- 
scribed alternatively as altruistic and egoistic. In- 
deed, it is both things at the same time: pity for 
the paupers and dread of having to share with 
them. 

Many members of the ruling classes, including 
a majority of classical economists, agreed with 
Malthus that it was advisable to keep down the 
size of the labor force. Reacting to Malthus and 
his school, the contemporary socialist writers took 
an opposite stand: Owen, Fourier, Proudhon, and 
Marx were all unwilling to ascribe pauperism to 
excessive population and therefore rejected the 
remedies advocated by Malthus. The egoism dem- 
onstrated by Malthus, particularly in his famous 
statement that a man born into a world already 
owned by others has no inherent right to a place 
at nature's feast, elicited violent indignation. This 
reaction is still partly responsible for several of the 
misconceptions that surround the population prob- 
lem. Let us try, then, to clear up these misunder- 
standings by presenting the problem in the light of 
the most recent scientific developments, while not 
forgetting that one’s point of view on the subject 
may vary with one’s social status. 


f 


The concept of optimum population 


The concept of optimum population corresponds 
to the idea of a happy medium: it is assumed that 
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since we talk about overpopulation and under- 
population, there must be some intermediate stage 
that avoids either defect. We have already seen 
that this notion of the proper limit, this concern 
about avoiding both overpopulation and under- 
population, has a lengthy history. Plato aside, it 
can be found in rudimentary form in the works of 
many authors in various countries, from Giovanni 
Botero to Voltaire to the classic liberal economists. 
However, it was first stated in scientific terms only 
at the beginning of the twentieth century, when 
the word optimum was applied to population by the 
German economist Julius Wolf. He was soon fol- 
lowed by other economists, such as Knut Wicksell 
(Sweden), Paul Mombert (Germany), and Edwin 
Cannan (England). The notion of optimum popu- 
lation was further elaborated during the interwar 
period by A. B. Wolfe and H. P. Fairchild in the 
United States and by Lionel Robbins and A. M. 
Carr-Saunders in England, among others. Mean- 
while, it was criticized mostly by writers on the 
Continent, such as Corrado Gini, Eugène Dupréel, 
W. E. Rappard, and Adolphe Landry. 


Determining optimum population 

Talking of an “optimum population” implies try- 
ing to find out how many inhabitants it would be 
desirable to have if a given end is to be achieved. 
The end itself remains to be specified, and there 
are a number of such ends that can be proposed. 
Among them are the following: 


Full employment, that is, work for all persons of 
working age; 

Power, that is, the full range of means that can be 
set to work to obtain a collective end, whatever the 
end may be; 

Long life and good health; 

Knowledge and culture; 

Aggregate welfare, or, put ina slightly different way, 
the aggregate income of the population; 

Number of years lived by the population as a whole; 

Average standard of living. 


Still other objectives, such as social harmony, 
family stability, and so on, may be conceived; thus 
Plato had a political view of optimum population 
and Eugéne Dupréel an aesthetic one. Here we 
shall only discuss two of these objectives: the eco- 
nomic objective, that is, the average standard of 
living, and the objective of the collectivity, which 
is much the same thing as power. The pursuit of 
either of these objectives leads to a specific opti- 
mum population size. 

Minimum and maximum population. Before 
proposing a definition of optimum population, we 
have to specify the extremes on either side of the 
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Figure 1 — Minimum and maximum population 


optimum. In other words, we have to specify the 
range between minimum and maximum population. 

Minimum population is determined jointly by 
economic and biological considerations. Life is 
very hard to sustain without at least some division 
of labor. Besides, a minimum number of people is 
required in order to provide marital partners for 
all, without too much consanguinity or disparity 
of ages. On these grounds, it has been estimated 
that an isolated population cannot safely fall under 
five hundred or six hundred inhabitants [see 
GENETICS, article on DEMOGRAPHY AND POPULA- 
TION GENETICS]. 

Maximum population is the highest that could 
conceivably subsist on the total product, provided 
it is distributed fairly among all. 

In both minimum and maximum, the population 
is drawn toward the bare subsistence level—a very 
low standard of living indeed. 

In Figure 1, the horizontal axis measures popu- 
lation size, and the vertical axis measures produc- 
tion per head. The minimum population is equal 
to p, and the maximum population to T. The sub- 
sistence level is ST or mp. Between these two ex- 
treme positions, there must be at least one popula- 
tion size that maximizes the standard of living. 
This is the optimum population, P, with a standard 
of living equal to MP. 

The general model. In order to specify the 
position of the optimum population, P, we cannot 
avoid using a very simplified model. We shall re- 
view its defects at a later point. 

In Figure 2, population size is still measured on 
the horizontal axis. Curve AA’ denotes marginal 
productivity, that is, the increase in total produc- 
tion resulting from the addition, one by one, of 
extra individuals to the population. Initially this 
curve rises, thanks to the division of labor and the 
sharing of overhead costs; the latter part of the 
curve slopes downward, reflecting saturation and 
diminishing returns. The curve’s highest value (J) 


is reached at population size K. Curve BB’ repre- 
sents total production; it is the integral of curve 
AA’, Its point of inflection, I, is for population 
size K, the same population size that maximizes 
curve AA’, 

Point M is touched by a tangent from O, the 
origin. Production per capita (measured by the 
tangent of angle MOP) is highest for that point M; 
on the right of M, it decreases. Curve mRS also 
indicates output per capita, but this time as meas- 
ured on the vertical axis. It is maximized at popu- 
lation size P, corresponding to curve BB’. Curves 
AA’ and mRS cross at R, since marginal produc- 
tivity and output per capita are equal at that point. 
On the production axis, OV, mp, or ST represents 
subsistence level. 

The way in which optimum population is de- 
rived can now be described as follows: 

If n is population and f(n) is total production, 
then curve AA’ represents f’(n), curve BB’ repre- 
sents f(n), and curve mRS represents f(n)/n. The 
economically optimum population, ns, is given by 
f'n.) = f(m)/ny. Furthermore, V being the sub- 
sistence level, both ne (the minimum population ) 
and ny (the maximum population) are given by 
f(tm)/M%m = f(r) /ny =V. A second kind of op- 
timum population, n,, is that which maximizes 
power, in a sense to be described below. This op- 
timum population occurs where curve AA’ inter- 
sects the horizontal at V, f’(n,) = V. 

Now we have at our disposal all the elements re- 
quired to assess the state of a population that is 


PRODUCTION AND 
PRODUCTION PER CAPITA 


POPULATION 
of living: 


Figure 2 — Optimum population, standard 
and power 


subject to environmental pressure. Going from left 
to right in Figure 2—that is, in the direction of 
population growth—we find: 


p = minimum population, 

K = population with highest increase in pro- 
duction as a function of population 
size, 

P = optimum population from an economic 
point of view, 

Q = optimum population insofar as power is 
concerned (see below), 

T = maximum population. 


Optimum power. When we speak of optimum 
power, we have in mind not only military power 
but also any objective involving the collectivity. 
Such an objective can even be the wealth of a des- 
potic ruler or sovereign who aims at the highest 
possible income by exploiting the population. 

If we ignore the case of minimum population, 
which has no practical importance here, we see that 
power, defined in these terms, is whatever exceeds 
the subsistence level. Power can therefore be de- 
scribed as the area in Figure 2 between curve AA’ 
and straight line VS. Maximum power, N, is at- 
tained for a population size Q. Beyond population 
size Q, power decreases, for when marginal pro- 
ductivity is less than average production, any addi- 
tional person can be fed only at the expense of the 
surplus. 

Since point Q is on the right of P, the popula- 
tion that is optimum for maximizing collective 
power is always larger than the population that is 
optimum from the economic point of view. This 
difference has important social and political impli- 
cations. It explains why, all other things being 
equal, totalitarian states have more of a stake in 
acquiring large populations than have democracies, 
which tend to be more concerned with the stand- 
ard of living. Thus it will easily be seen that in any 
sociopolitical system the effect of any military or 
other collective burden is to raise the size of a 
country’s optimum population. Let us refer, at this 
point, to Figure 3, in which the upper total pro- 
duction curve is identical with curve BB’ in Fig- 
ure 2, with the maximum at M, when P is optimum 
population size. 

The lower curve in Figure 3 is the result of a 
constant shift on the vertical axis, so that the tan- 
gent through O, the origin, now passes through 
point M’. On the horizontal axis, M’ corresponds 
to P’, which is well to the right of P. The way in 
which collective burdens increase optimum popu- 
lation can also be demonstrated as follows: Let 
n, (the P of Figure 2) be the optimum population 
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POPULATION 
Figure 3 — Collective burdens increase optimum size of 
population 


size when there is no collective burden. Production 
per capita is f(n)/n, which is maximized (differ- 
entiate) when nf'(n)-— f(n) = 0. Thus, as was 
noted for Figure 2, ,f’(n,) — f(m) = 9. 

Now if a collective burden, K, is added, usable 
production per capita is reduced to [f(n) — K]/n. 
This function will be maximized when NFR) — 
f(n) + K=0, or 

7 f(a) KB 

Hi On) pi mat 0. 
The only way to achieve this equality is for n to be 
larger than n,, since f'(n) is a decreasing function 
of n in the region in question. 

The influence of fixed costs. The important con- 
clusion reached in the preceding section can now 
be generalized: a country that, for any reason, has 
to meet fixed costs or, at the very least, costs that 
are disproportionate to the size of its population 
will benefit if it succeeds in increasing its popula- 
tion. For instance, if the aged members of a popu- 
lation are to be helped by retirement pensions or 
other means, the economically active members of 
that population will benefit from an increase in 
numbers, so as to spread the burden over a larger 
number of shoulders. In such a case, optimum pop- 
ulation size is increased because of two factors: 
the number of retired persons, since these are eco- 
nomically inactive, and the number of economically 
active persons, for reasons already explained. From 
this we can judge what kinds of problems arise 
when, for instance, the retirement age is lowered: 
not only do many of the economically active cease 
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contributing to production by becoming economi- 
cally inactive, but a need also arises for an increase 
in the economically active population. Let us spell 
out the same point in figures. Suppose that a 
population of ten million happens to be of eco- 
nomically optimum size. If one million of its 
economically active members suddenly become eco- 
nomically inactive, the standard of living is going 
to drop. In order to raise the standard of living to 
the highest level possible for that population, it 
would be necessary to attract a number of immi- 
grants in excess of one million—two or three mil- 
lion, for example. 


Evolution of optimum population 


Optimum population has increased considerably 
since the 1860s in the developed countries. In every 
case, the increase in optimum population has been 
faster than the increase in actual population. This 
trend has resulted from (a) the importance of fixed 
costs in modern society; (b) the fact that industrial 
production is less subject to decreasing returns 
than is agricultural production; (c) the fact that 
technological progress, contrary to what is generally 
thought, does increase the number of jobs. 

Under an agrarian economy, the populations of 
European countries traditionally exceeded the op- 
timum. This was demonstrated when, as usually 
happened, the standard of living improved after 
the population had been reduced by a severe epi- 
demic, such as the Black Death in the fourteenth 
century. 

Today's situation is entirely different. Let us 
consider the example of Switzerland. In the late 
eighteenth century the population probably did not 
exceed 1.7 million inhabitants. Nevertheless, some 
of them were forced to emigrate, and Swiss men 
often served in foreign armies. We can assume 
that the optimum size of the Swiss population at 
that time was somewhere between one million and 
1.5 million people. Today the population is over 
five million, which must still be below the economic 
optimum, since it has been necessary to call on 
700,000 workers from abroad. 

The reversal of the secular trend in migration, 
which instead of flowing out of Europe, as it did 
for so long, is now flowing in Europe's direction, 
also shows how much the optimum population must 
have grown in the course of the twentieth century 
[see MIGRATION]. 


Review of the optimum population concept 


Early in this century, when the concept of opti- 
mum population was given systematic expression, 
writers on the subject thought that a definitive 


solution had been found for the population prob- 
lem, at least in theory, and that the only difficulty 
remaining was to apply the solution in practice, 
This opinion did not survive long under scrutiny, 
especially since the facts were against it. The con- 
cept of optimum population is indeed too static; 
it implies that population size can vary without any 
change occurring in the other factors that could 
affect the population’s standard of living. 

Besides, even if it were possible, at a given point 
in time, to calculate a country’s optimum popula- 
tion correctly, it would still be difficult to base a 
population policy on that calculation. For example, 
if it could be demonstrated that 400 million inhabi- 
tants would fare better in present-day India than 
the actual 500 million, there could be no question 
of killing 100 million people in order to improve 
the others’ lot. Conversely, if it could be shown that 
the optimum population of France is 60 million 
instead of its present 47 million, or that that of 
Australia is 30 million instead of its present 11 
million, there could be no question of suddenly 
raising the population of these two countries by 
such large numbers. Even if enough immigrants 
could be found, there would still be the problems 
of fitting them all out and providing them with 
housing, schools, and so on. Thus, there is also a 
question of time involved. 

Considerations such as these lead to a less static 
orientation or, in other words, to the study of pop- 
ulation dynamics, But before we turn to that sub- 
ject, it should be stressed that the concept of opti- 
mum population is a usable one provided one is 
discussing, as we were above, the way in which a 
specific factor affects the optimum. It is, therefore, 
a useful tool, and one that, if used correctly, would 
enable us to explain many historical phenomena 
and in future to avoid certain dangerous sociologi- 
cal misconceptions. 


Optimum population growth 
Technological progress and employment. The 
problems of development, and therefore the very 
concept of an optimum rhythm of population 
growth, hinge on a very important phenome 7 
which has been too often neglected by economus 7 
because the subject breeds superficial prejudis 
and discourages empirical research. That impor 
phenomenon is the relationship between technolo; 
cal progress and employment. s A 
tt is OSEI believed that technological prog: 
ress has the effect of reducing employment on 
though this opinion, which is very old, a fet 
constantly contradicted by the facts, it never e: 
continues to command belief and regains steng 


with every technological innovation, as it has with 
automation. If this opinion were based on fact, if 
the economy really were headed toward the “push- 
button” economy imagined by Simonde de Sismondi 
(the king of England, alone on his island, enjoy- 
ing a very high standard of living through the oper- 
ation of handles and levers), then population 
growth would become useless. Furthermore, the 
economic system would have to be altered dras- 
tically. But actual trends have been in the opposite 
direction. The gainfully employed population is 
very much larger now in the developed countries 
than before the industrial revolution, while it is in 
the underdeveloped countries—that is, in the coun- 
tries that lack machines—that underemployment 
is most severe. 

This is not the place to expound a theory of em- 
ployment and technological progress. Let us con- 
tent ourselves with emphasizing the fundamental 
roles of population growth and of the proliferation 
of consumer needs. The orientation of consumption 
toward goods that contain an ever-increasing pro- 
portion of value added by manufacture and by 
services continually raises the number of jobs and 
at the same time reduces the threat of overpopu- 
lation. Of course, other factors may prevent full 
employment in a specific country. It is even con- 
ceivable that there will be a constant increase in 
the number of available jobs and yet that full em- 
ployment will never be reached. 

We are now in a position to approach the ques- 
tion of an optimum rate of population growth. 

The concept of optimum growth rate. When a 
population grows very rapidly, it is confronted with 
a major need for new capital goods. It is therefore 
compelled to devote a large part of the national 
income to capital investment, and, as a result, to 
lower its standard of living. If, on the contrary, the 
population decreases or remains constant, it suffers 
various troubles that are difficult to analyze but 
that have results which are quite obvious. Not one 
single historical or present-day instance can be 
cited of a declining or stagnating population that 
has enjoyed or is now enjoying any real economic 
expansion. 

If we follow a chain of reasoning analogous to 
the one that led to the concept of optimum popula- 
tion size, we come to the conclusion that there 
must exist, between the two extremes of rapid 
growth and decline or stagnation, some interme- 
diate condition of the population that guarantees 
the highest possible rate of growth in the standard 
of living. Let us call such a condition the optimum 
rate of growth, as opposed to the static optimum, 
defined above, of population size. 
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The concept of optimum rate will become clearer 
if we examine first the costs, and second the advan- 
tages, of growth. As previously, we are concerned 
only with the economic aspects. 

The cost of growth. Once again, we shall sim- 
plify the problem by turning to a model, If, as a 
result of certain investments, J, national income 
increases in each of the years following the invest- 
ments by a quantity r, the ratio r/I is the national 
rate of interest, T. The reciprocal, I/r, is called the 
capital—output ratio. For instance, if an investment 
of 1,000 million increases national income by 300 
million a year, the national rate of interest of that 
investment is 30 per cent. More often than not, the 
national rate of interest is between 20 and 50 
per cent. 

Let us accept the simplifying assumption that, 
for all the investments made each year, the capital- 
output ratio remains constant, so that the increase 
in national income is equal to IT. If the ratio I/r 
remains constant every year, national income will 
grow exponentially at a rate (IT)/r. And if popu- 
lation itself remains constant, the increase in per 
capita national income, or in average standard of 
living, will follow the same geometrical progression. 

Let us now introduce population increase into 
the model. By demographic investment we mean all 
outlays or efforts aimed at avoiding a deterioration 
of living standards as a result of growth. Many 
authors have used the same capital—output ratio 
for population increase as for the growth of national 
income that results from economic investments. 
For example, if 4 per cent of the national income 
has to be invested in order to raise national income 
by 1 per cent in each of the years following the 
investment, these authors assume that 4 per cent of 
the national income would also be required in order 
to accommodate a population growth rate of 1 per 
cent without a decline in income per capita. This 
equation of one type of return on investment with 
another is a mere statistical convenience that may 
sometimes be very remote from fact. Except when 
the rates coincide by chance, the assumption is 
justified only in instances where all persons added 
to the population are completely unproductive, i.e., 
contribute nothing to national income. 

Some idea of the size of the required demo- 
graphic investment can be obtained by adding to 
the cost of the technical means of production the 
cost of educating a man to the level at which he 
can make use of them—a process that may add up 
to 10 or 12 years of work. Consequently, a 1 per 
cent increase in a population would, if we assume 
that this population is stable, absorb 10 or 12 per 
cent of its national income. This figure is much 
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higher than the one quoted previously for the 
capital—output ratio. It would be only too easy to 
infer from this that any growth of population is 
costly in itself and that, from a strictly economic 
point of view, it is in a population's interest not to 
grow, and even to decrease, But this conclusion 
runs counter to too many empirical findings not 
to be highly suspect. Moreover, the advantages of 
population growth should be taken into considera- 
tion, and a clear distinction made between advan- 
tages of a purely economic nature and the social 
advantages that have an effect on the economy. 

Economic advantages of growth. We have al- 
ready mentioned overhead, or fixed costs, and the 
fact that in an industrial economy the classic law 
of diminishing returns is superseded by a more 
favorable law and sometimes even by a marked 
increase in returns. 

Another factor, which has been neglected but 
which should certainly be mentioned, is the way in 
which growth facilitates the correction of every 
mistake, every structural defect or imbalance, in 
any aspect of population distribution—geographi- 
cal, occupational, or other. When a faulty propor- 
tion, of whatever kind, has to be corrected in any 
organism whatsoever, this can be done by ampu- 
tation—that is, by taking something away. But in 
social matters it is hard to take anything away 
from what already exists, and strong opposition is 
likely to be encountered. Population growth, how- 
ever, makes it possible to improve existing struc- 
tures without encountering these problems, since 
faulty proportions can now be corrected by adding 
what is needed instead of taking away what is not 
needed. 

The advantages of growth are still more striking 
in cases where the proportions that need to be 
corrected are continually changing. This is espe- 
cially true of technological progress and economic 
progress, both of which are continually modifying 
the occupational distribution of the labor force, not 
only among the three traditional sectors—primary, 
secondary, and tertiary—but also among the occu- 
pational subdivisions of each sector. In a stationary 
population with low mortality, the economically 
active population is replenished by the younger age 
groups at a rate of only 2.2 per cent a year. Such 
a rate is not sufficient to ensure the occupational 
reallocation required by technological change. The 
result, relatively speaking, is an excess of people 
in obsolete trades and a consequent slowing down 
of economic progress. 

Social advantages of growth. The first advan- 
tage of a growing population that should be men- 
tioned here is its age distribution. Since mortality 


has declined in most human populations and since, 
as a result, the absolute number of the aged is 
bound to increase for years to come, a population 
that remains about the same size is bound to con- 
tain fewer young people and fewer adults of work- 
ing age. In other words, the average age of the 
population is going to increase rapidly. When a 
population ages in this way, there are certain mate- 
rial consequences that have been fairly well in- 
vestigated and certain moral consequences that so 
far have hardly been studied. However, it is gen- 
erally acknowledged that the spirit of enterprise 
does not flourish in an aging population; nine- 
teenth-century France, and perhaps modern Ireland, 
are significant instances of this. Nor is an aging 
electorate the only factor. Not many new busi- 
nesses are created when the generations hardly 
replace one another, since the younger generation 
is content to wait until jobs are vacated by the 
older. In such a population most positions of power 
and responsibility are held by elderly people, and 
thus institutional rigidities set in. It seems that 
there is a certain analogy in this respect between 
social and biological organisms. 

The large proportion of one-child families in a 
stationary population is another reason why the 
spirit of enterprise is weakened in both the older 
and the younger generations. 

Estimating the optimum growth rate. AS far as 
research on a specific country is concerned, the 
optimum growth rate has been just as neglected 
as the optimum population size. There is, of course, 
no question of proposing a single rate for all econ- 
omies. In an agrarian economy with an arrested 
technology, growth is advisable only if the country 
is underpopulated. Even then the optimum grow! 
rate will be slow, unless underpopulation is very 
marked, as was the case in the United States ani 
Canada during the eighteenth and nineteenth cen- 
turies. 

Thus the concept of optimum grow 
better suited to an industrial economy. 
the more abundant or accessible natural sree 
are through international trade, and the faster wee 
nological progress, the higher the rate of populatio 
growth should also be. For western Europe yee 
recent years, a growth rate of between 0.5 oa 
per cent per year seems to have been necessary 
by no means excessive. ’ 

The problems of measurement in thi: 
from the fact that it is harder to meas' 
vantages of population growth—advan ede 
as we have seen, are in part social—than ee 
advantages. Even if all the elements involved a 
known, it would still be necessary to choose 
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ure the ad- 
tages that, 


tween maintaining current standards of living and 
making sacrifices for the sake of future genera- 
tions. Finally, it should be remembered that demo- 
graphic trends are slow and far-reaching, so that 
the consequences of any change that is too sudden 
may be felt for years afterwards. 

Social and political aspects of growth. Although 
it is now possible to outline a theory of the influ- 
ence of population growth on economic develop- 
ment and standards of living, various elements 
needed to apply it correctly are still missing. This 
uncertainty has only exacerbated political and class 
conflict over population policies. Marxist doctrine, 
for instance, has never recognized the possibility 
of a population problem, especially one of over- 
population. This position is out of date; it still 
shows the effects of the violent reaction against 
Malthus’ egoistic view of society. However, there 
have been significant departures from this position 
n such countries as Poland and Yugoslavia, where 
the birth rate—and thus the growth rate—has been 
voluntarily reduced in order to make it possible to 
meet the cost of demographic investments. In West- 
ern countries, many Marxists take a more realistic 
view of population growth than is warranted by 
orthodox Marxist doctrine. 

In capitalist countries there is still some fear of 
the effects of population growth on employment, 
although recent trends in Germany, Austria, Swit- 
zerland, the Netherlands, Italy, and other countries 
have demonstrated the weakness of economic the- 
ory on this subject. Meanwhile, these same coun- 
tries dread population increase in the underdevel- 
oped countries of the Third World—an attitude 
that is reminiscent of Malthus but that, since 1962, 
is no longer opposed by the Soviet Union. 

The underdeveloped nations themselves are in- 
fluenced by two conflicting trends. Tradition, con- 
cern for power or prestige, and sometimes the 
availability of wide-open spaces lead to a favorable 
view of present rapid growth rates. This attitude 
is exemplified by the countries of Latin America 
and tropical Africa. On the other hand, concern 
about standards of living and fear of overheavy 
demographic investments are arguments in favor 
of—or at least not against—the acceptance of 
birth control. This attitude is encountered mostly 
in the Far East and the Arab world. 


All too often in the history of ideas authors have 
believed themselves to have given the final answer 
to a problem and to have built a definitive and 
universally valid theory. Such experiences warn us 
to be far more cautious. 

Despite considerable progress made since the 
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early 1950s in understanding these phenomena, 
one cannot suppress a feeling of dissatisfaction. 
Building a coherent theory of the interrelations of 
economy and population requires continuous and 
careful appraisal of actual developments. Because 
of the practical complexity of such an undertaking, 
prejudice and facile theories always tend to have 
the upper hand. As a result, continuous observation 
of the facts leads to conclusions different from 
those currently held. These conclusions give food 
for thought. In particular, they suggest that eco- 
nomic research should focus on the working popu- 
lation and its problems rather than on finance, 
monetary policy, or materials. These latter are only 
the results of something else; as economic phe- 
nomena, they lie only at the surface, and lead all 
too often to neglect of the essential factor: men as 
producers and consumers. Statisticians, demog- 
raphers, and economists should all work together 
to establish an all-encompassing, usable theory of 


population and economic development. 
ALFRED SAUVY 


[Directly related is the entry POPULATION, articles on 
OPTIMUM POPULATION THEORY and POPULATION 
POLICIES. Other relevant material may be found in 
FERTILITY; FERTILITY CONTROL; MORTALITY; PLAN- 
NING, ECONOMIC; PLANNING, SOCIAL; and in the 
biographies of BODIN; CARR-SAUNDERS; FOURIER; 
Grint; MaLTHUS; MARX; OwEN; PETTY; PLATO; 
PROUDHON; QUESNAY; SIMONDE DE SISMONDI; WICK- 
SELL.] 
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m 
OPTIMUM POPULATION THEORY 


The term optimum population denotes a popula- 
tion of such size that, given this population, a 
specified indicator of “welfare” is maximized. This 
indicator may be output, or consumption per 
capita, or some more complicated social welfare 
function. How large a population must be to be of 
optimum size thus depends upon the content of 
the welfare indicator selected; it will be greater, 
in proportion, as population-oriented elements 
enter into this content. 

Optimum size of population also depends, given 
some welfare index, upon other determinants of 
the value of this index. Thus, if W denotes this 
index, W = f (P, X, Xs, Xa, ***, Xa), Where P denotes 
population and the x; denote other conditions, 
changes in which affect the value of W and could 
conceivably affect that value of P with which, 
under given conditions, the maximum value of W 
is associated. 

The population P is therefore of optimum size 
when, with conditions X, Xa, Xa,- *, Xn essentially 
constant, neither an increase nor a decrease in P 
can augment W. This optimum may not be invari- 
ant; a change in the ceteris paribus assumption 
may make greater or smaller the P with which a 
greater or smaller maximum value of W now is as- 


sociated. The augmentability of the optimum is 
limited, however; a point is reached when further 
improvement in conditions Xi, Xa, X3,” **, Xa NO 
longer requires further increase in P for W to be 
maximized with respect to P, and thereafter such 
improvements increase W, but do not enlarge the 
optimum. A change in the value of W, dependent 
as it is on many sorts of change, does not indicate 
whether population maladjustment (defined be- 
low) is changing; it is only change in W, in rela- 
tion to change in P (with other conditions essen- 
tially given), that indicates this. The optimum 
concept is applicable to sovereign states, regions 
within states, and metastates. 

Optimum size and optimum growth rate. The 
advantages or disadvantages associated with a 
population's being, or not being, of optimum size 
must be analytically distinguished from those as- 
sociated with the process whereby a population 
changes size; for the former are associated with 
differences in population size as such, whereas the 
latter are associated with positive or negative popu- 
lation growth. Population growth, as such, may 
make for advantage; it also entails costs—costs in 
the form of the inputs required to train, accommo- 
date, and equip population increments, as well as 
costs in the form of such unfavorableness in age 
composition as is associated with population growth. 
Further population growth is desirable, until an 
optimum size has been attained. Some of the rates 
at which this growth might proceed are preferable 
to others, and some particular rate might even be 
preferable to all the others. This last rate could then 
be designated as the optimum rate of growth, and 
alternative rates could be described as nonoptimal. 
This article does not, however, deal with this pat 
ticular optimum or with advantages and disad- 
vantages of population growth as such. pi 

Population maladjustment. When the size o: 
a state's population deviates from what is optimum 
for it, such a state is experiencing population 
maladjustment. Let M, A, and O, respectively, he 
note degree of population maladjustment, actu 
al population, and optimum population. T 
M = (A — O)/0; and negative maladjustment $ a 
nifies “underpopulation,” while. positive male i 
ment signifies “overpopulation.” Although som? j 
the circumstances that underlie O also conditio. 
A, O is looked upon as normally independent of 
Insofar, however, as changes in A generate change 
in conditions underlying O, it may be said that 
is functionally connected with A. 

Yet, as a rule, changes in A don 
novel effects; they merely change 
preferred positions on existing indiffe: 
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transformation curves, etc. To this statement, 
changes in A attributable to migration are most 
likely to be exceptions, inasmuch as such changes 
may produce changes in the technologies or the 
tastes of the affected countries. Underpopulation 
is more easily corrected than overpopulation as 
defined above; its correction merely requires popu- 
lation growth, whereas that of overpopulation de- 
pends on suitable autonomous changes (for exam- 
ple, in tastes, technology, external relations), whose 
occurrence may not be very probable. 

History of optimum population theory. Pre- 
Malthusian writers occasionally complained of 
population maladjustment, but they did not define 
a population optimum or measure population mal- 
adjustment in relation to it. Presumably, they were 
not inspired by the attempts of Plato and Aristotle 
to fix what amounted to an optimum number of 
citizens for a Greek polis and, in addition, to indi- 
cate roughly how many slaves and noncitizens 
would be required. Malthus and some of his fol- 
lowers suggested the existence of an optimum; but 
they were concerned primarily with showing that 
positive population maladjustment is virtually in- 
evitable, and that, consequently, it is essential to 
favor institutions that tend to retard population 
growth. 

J. S. Mill ({1848] 1961, pp. 191-192) defined 
an optimum population, although he did not so 
name it: “After a degree of density has been at- 
tained, sufficient to allow the principal benefits of 
combination of labour,” he wrote, “all further in- 
crease tends in itself to mischief, so far as regards 
the average condition of the people.” Mill thus sup- 
posed the optimum would remain constant, al- 
though he implied that it had increased in the past. 
However, it was not until the late nineteenth cen- 
tury and thereafter that considerable attention was 
given to this concept, usually an optimum defined 
in terms of per capita income and reflecting a bal- 
ance between “increasing returns” in manufactur- 
ing and related industry and “diminishing returns” 
in extractive industry. Whether increase in popu- 
lation was accompanied by increase in positive 
maladjustment, therefore, turned on which of 
these two forces was the more powerful. 

On this issue, authors differed. Some expected 
that improvements in prospect were likely to in- 
crease optimums further. Others (for instance, 
Knut Wicksell) believed that populations in most, 
if not all, settled countries are of supraoptimum 
size and unlikely to derive much relief from 
optimum-increasing forces; they reasoned, there- 
fore (partly on the basis ofa generalized marginal 
productivity theory), that average income would 


increase most rapidly if growth in numbers was 
restrained and improvements in methods, together 
with increases in capital, were concentrated upon 
increasing output per head. Most discussions of the 
optimum were policy-oriented; they were designed 
above all to force proponents of population growth 
(of whom there were many in military, ecclesi- 
astical, commercial, and other circles) to face the 
question, Cui bono? 

In the 1920s, concern with overpopulation, 
stimulated in part by interpretations of the causes 
of World War 1, intensified interest in optimum 
population theory. This interest declined somewhat 
in the 1930s, as the rate of natural increase fell 
in developed countries, only to be revived after 
World War ur, as adverse effects of population pres- 
sure came to be more widely recognized. 

Formal theory. Theory respecting optimum 
population must identify this population, given 
some welfare indicator, and it must account for 
the tendency of output to grow, under essentially 
ceteris paribus conditions, at a rate that both varies 
and differs from that at which population grows. 

Specification of the optimum can then proceed 
as follows. Let the chosen indicator be output or 
consumption per capita; and let it be assumed that 
(within limits and for reasons examined below) 
output grows faster than population. This is as- 
sumed to be true even though output-governing 
conditions, other than increase in population, re- 
main constant, with the exception of adjustment 
of input use to this population change, together 
with the distribution of activities in space and 
among occupations. 

Then, to facilitate identification of the optimum, 
when population increases while other output- 
affecting variables are held constant, let T denote 
total output; S, total savings or provision for capi- 
tal formation; C, total consumption (C = T-S); 
and A, actual population. Let t and c, respectively, 
denote output and consumption per capita; and m, 
marginal output per capita (m= AT/AA); then 
if T increases faster than A, with m increasing 
initially faster, and then slower, than t, t will be at 
a maximum when m and t coincide. 

The population size associated with the coinci- 
dence of m and t is sometimes designated the op- 
timum, although this designation is applicable only 
if there is no need, or incentive, to form capital, 
that is, when there is a glut of agents of produc- 
tion, complementary to labor, and with zero mar- 
ginal productivity (as the equality of m and t 
implies). In all probable situations, however, these 
agents will be scarce and command a return, and 
further increments of capital will increase t; should 
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the condition m =t accidentally come into exist- 
ence, it would be unstable, for dissaving would 
result, and t, as well as the magnitude of the opti- 
mum, would decline. 

A stable optimum population may coincide with 
the point of equivalence of m to c, a point at which 
a return, A(t — m), is imputable to the comple- 
mentary agents and there are corresponding sav- 
ings, S; this population is necessarily larger than 
that associated with m = t, since the marginal curve 
intersects the average consumption curve (which 
lies below the average output curve) to the right 
of where it intersects the average output curve. 
Even so, c will be about as high when m = c and S 
is saved as it would have been at the point m = t, 
had the state then impounded an amount corre- 
sponding to S and thus reduced actual consump- 
tion at that point. When c<t, the population 
optimum associated with m = c, although not un- 
stable because of dissaving (there being saving), 
might increase in size as a result of the continuing 
capital formation made possible by savings, S. 
Given existing technology, this is only a possible 
outcome, however; for the increase in capital per 
head may raise t and c and yet not produce 
changes that increase the optimum. Indeed, many, 
and probably most, of the changes that increase t 
and c do not also increase the optimum, O, and do 
not therefore reduce population maladjustment, M; 
they merely make it more bearable. 

The optimum is further increased when actual 
consumption is reduced not only by savings, S, but 
also by the more or less unavoidable use of re- 
sources for purposes other than the supplying of 
the domestic population with consumer goods and 
services and capital. Illustrative of this is the em- 
ployment of resources for the supplying of security 
against possible foreign enemies, or for the sup- 
port of unproductive idlers, or for unilateral pay- 
ments of tribute and so-called economic aid to 
foreign states. Let all such outlays be denoted by 
L; then consumption, C, on the part of the produc- 
tive population (hitherto supposed to embrace all 
persons), becomes not T—S but T — S -— L. Ac- 
cordingly, m will coincide with average consump- 
tion (T — S — L)/A at a point lying to the right of 
the point where c approximates (T — S)/A; the 
population of optimum size will be larger, there- 
fore, and t will be smaller when L is positive than 
when L is zero. In general, threats to a nation’s 
security enlarge its optimum-size population and 
diminish its average consumption. 

Values underlying choice of optimum. A na- 
tion’s population optimum is further enlarged when 


it seeks to maximize not, for instance, consump- 
tion per capita but some other social welfare func- 
tion into which populousness enters as a substitute 
for output or consumption—or, in other words, up 
to the point where a further increment in popu- 
lousness does not compensate for the diminution 
in per capita income and consumption associated 
with it. The size of the optimum population corre- 
sponding to this point depends on how powerful 
are the tastes favoring populousness; it may oc- 
casion problems not associated with smaller opti- 
mums. Thus when this optimum is reached and 
families can therefore average barely more than 
two children, unsatisfiable tastes for children may 
occasion discomfort, This optimum also conduces 
to relatively greater inequality in the distribution 
of pretax income (although this can be offset by 
differential taxation) than do smaller optimums. 
Moreover, it would result in greater positive popu- 
lation maladjustment, should tastes for populous- 
ness decline markedly. 

Of more fundamental concern than these vari- 
ous formulations is the rationale underlying “in- 
creasing returns,” or the sometime tendency of 
output to increase faster than population or the 
labor force (here supposed to constitute a constant 
fraction of the population), even though other con- 
ditions (technology, capital stock, etc.) remain un- 
changed. One source of this tendency is the indi- 
visibility of some complexes of agents of production 
with which labor in general is combined, such as 
the transport system or much of the apparatus of 
state, Indivisibilities of this sort are most important 
when over-all population density is low; they be- 
come of little or no importance after density has 
become relatively high, even though quite lumpy 
increments are sometimes added to a esse 
equipment. The second, and principal, source o 
this tendency is extension of the market, together 
with facilitation of the extension of various forms 
of the division of labor. Within limits, enlargement 
of the labor force permits greater technical su 
division of productive activities in economic Ee 
geographical space and, at the same time, by ae 
viding an enlarged market, makes much of 


its 

subdivision sufficiently profitable to wane 
undertaking. A point is finally reached when ; 
ncome is 


bers are sufficiently great and average } 
sufficiently high to provide as large a market oe 
required to support the best of current and ca 
pective technological possibilities. This point A 
be approached at a rate proportionate to the sp! A 
at which small-scale plants and firms pecora 
efficient as large-scale undertakings, and as in 
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national and interregional trade remains unim- 
peded and, hence, able to add, for particular 
products of specialized undertakings, external to 
internal markets. 

Some limiting factors. In the past, it was sup- 
posed that discovery and invention were positively 
associated with population density, up to a point, 
because interpersonal stimulation and communi- 
cation were facilitated by increase in density. This 
supposition, while not without limited validity, is 
now seldom of much import; today, international 
interchange of technical information, together with 
improved internal means of communication, sup- 
plies most of the stimulus and information essen- 
tial to discovery and invention. 

To the limits to “increasing returns” already 
noted—namely, the overcoming of indivisibilities 
and the exhaustion of benefits consequent upon 
extension of division of labor and markets—there 
must be added a third: the emergence of factors 
that may become operative even before the other 
two limits are reached. This third type of limit be- 
comes operative when whatever inputs have been 
available gratuitously (for example, fresh water ) 
or under conditions of constant supply price either 
are no longer susceptible to further increase in 
supply or are susceptible to such increase only 
under conditions of rising real cost. While the ad- 
vent of such a limit may be partially counterbal- 
anced by substitution at the technological, as well 
as the consumer, level, it is never wholly counter- 
balanced save through technological changes that 
often, and perhaps usually, are the result in large 
part of input investment that might otherwise have 
been used to increase average output. Furthermore, 
should the limitational factor or its replacement 
be a wasting resource (for example, fossil fuels), 
its scarcity would increase more rapidly than other- 
wise, and so might prove less susceptible to tech- 
nological alleviation. 

It is doubtful if the populations of many ad- 
vanced countries (barring such countries as Aus- 
tralia and South Africa) are of less than optimum 
size, given that consumption per capita is the in- 
dicator to be maximized. It is probable that tech- 
nological improvements, together with increase in 
capital per head, have long been almost entirely 
responsible for increases in output per man-hour 
and that larger populations have not been essen- 
tial, or in a free-trade world would not have been 
essential, to the optimum exploitation of these im- 
provements. It is now quite possible, furthermore, 
that additional technological improvements, to- 
gether with increase in capital per head, may re- 


duce the optimum size of population. Should this 
come about, both positive population maladjust- 
ment and average consumption will increase, as, 
indeed, they seem to have been doing for many 
years. Maladjustment will increase further, of 
course, as physical environmental limitations to 
growth of output make themselves felt. 

Tendency to optimum. Although some authors 
have suggested that populations tend to assume 
optimum size, this inference is of quite limited 
validity. Populations are likely to respond signifi- 
cantly to changes in the degree of population mal- 
adjustment only if these changes generally modify 
the ability of men to satisfy their aspirations and 
only if men can respond thereto by regulating their 
fertility or otherwise controlling their numbers. 
This sort of regulation is possible in a country in 
which control of the apparatus of state is highly 
centralized and the welfare function to be max- 
imized is one prescribed largely by those in control 
of this apparatus; powerful incentives, depriva- 
tions, and propaganda can then be used to modify 
fertility and perhaps also mortality. In a modern 
state in which individual sovereignty is great, how- 
ever, increase in population maladjustment is 
likely to make for a decline in fertility only if such 
an increase slows down the rate of growth of aver- 
age income more than the rate of growth of the 
aspirations to be realized through the expenditure 
of this income; for then, family size, if not the 
rate of family formation, will be subjected to 
greater restraint. 

In underdeveloped and backward states, a con- 
trasting situation may be encountered. In these 
states, an increase in positive population malad- 
justment is likely to be accompanied by a diminu- 
tion in the rate of population growth, when, even 
under optimum conditions, average income sup- 
plies little beyond pressing needs, An increase 
in population maladjustment then tends to lower 
average income somewhat, and hence to worsen 
mortality, as well as to intensify some of the re- 
straints to which fertility is subject. 

The optimum and policy. Theory relating to 
the optimum, as such, is not very well adapted to 
the formulation and execution of population policy, 
even though population maladjustment is very 
common and the marginal social benefit of popu- 
lation growth seldom coincides with the marginal 
social cost of such growth. It is rarely, if ever, pos- 
sible to determine precisely what population size 
is optimum for a country, under given conditions; 
and it cannot be supposed that these given con- 
ditions will be static. Nor is it possible to “zero in” 
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on the optimum point, after the manner of re- 
vealed preference theory. It is possible, however, 
given some welfare index and working criteria of 
changes in this index, to narrow the zone within 
which the defined optimum almost certainly lies. 
One may choose a tentative optimum point, inquire 
carefully if a larger or a smaller population is de- 
sirable, and then move this point in whatever di- 
rection reduced maladjustment seems to lie. There 
will emerge an “optimum” zone, within which the 
true optimum may be assumed to lie. Even though 
the precise location of the true optimum is not 
determinable, it may be assumed that welfare per 
capita at any point within this zone is not likely 
to be notably different from that associated with 
any other such point. 

Some believe, however, that the smallest of the 
populations lying within such a zone are the ones 
to be preferred; then the rate at which depletable 
resources are used is depressed, and (should tech- 
nological or other changes make for a reduction 
in the magnitude of the optimum) positive mal- 
adjustment almost certainly will be less than it 
would have been had the larger of the populations 
lying within the optimum zone been chosen. This 
belief rests upon the supposition that whereas a 
condition of underpopulation is easily remedied, 
one of overpopulation often is irremediable, 

If the actual population of a country corresponds 
to one lying within such an optimum zone, it may 
be inferred, when there is little uncertainty respect- 
ing the boundaries of the zone, that further popu- 
lation growth is not desirable; and this inference 
becomes all the stronger when the actual popula- 
tion corresponds to one situated in that portion of 
the zone where the largest populations are found. 
It is then highly probable, given the welfare func- 
tion chosen for maximation, that the marginal 
social cost of a further increment in population 
would outweigh its marginal social benefit. Under 
these circumstances, it becomes clearly desirable, 
should the marginal private benefit of population 
growth still appear to exceed its marginal private 
cost, to introduce incentives and deprivations suited 
to bringing these benefits and costs into balance. 

It is not likely, however, that such action would 
be taken in the typical polity where short time- 
horizons, ideologies, and superstitions play impor- 
tant roles. While modern governments may subsi- 
dize family planning, they tend to more than offset 
this by somewhat easing tax burdens and costs 
associated with the production and rearing of 
children. It is to be expected, therefore, that the 
net effect of government policies will almost in- 


variably be to re-enforce the forces making for 
positive population maladjustment, until it þe- 
comes very pronounced. 

JOSEPH J. SPENGLER 


[See also LABOR FORCE; MIGRATION; POPULATION, arti- 
cle on POPULATION POLICIES; and the biography of 
MALTHUS.] 
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IV 

POPULATION COMPOSITION 
The term “population composition” refers om 
various social and biological categories into re 
the members of a population may be classed: a 
composition of a population at any one ware A 
only a reflection of its social history but 


it 
indicator of the kinds of social problems that 
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will face in years to come. It is therefore of great 
interest to governments to obtain, by means of 
censuses, systems of vital registration, and so forth, 
the most complete and most accurate information 
possible concerning population composition. The 
analysis of population composition forms an integ- 
ral part of demography, the science of population. 

Although the number of traits that distinguish 
one person from another is theoretically unlimited, 
the number of individual characteristics generally 
included in the study of population composition is 
rather small. Only characteristics that are socially 
and demographically relevant are considered, and 
even then the technical and financial problems of 
recognizing and recording individual attributes tend 
to impose stringent limitations, The characteristics 
by which populations are usually classified include 
such basic demographic variables as age, sex, and 
marital status, and such elementary indicators of 
social organization as nationality, race or color, 
language, religion, education, Jabor-force status, 
occupation, and industrial classification. This in- 
formation, as already stated, is generally gathered 
by means of census enumerations and through the 
registration of births, deaths, and marriages. Al- 
though the “vital statistics” generated by the latter 
method refer only to people who have experienced 
a vital event in a given year, they can be used to 
estimate certain characteristics for the total popu- 
lation and, thus, to maintain a continuous inven- 
tory of characteristics. For example, the registra- 
tion of births and deaths by race permits a fairly 
accurate estimate of the racial composition of the 
population in intercensal periods. Another source 
of information, of growing importance, is the 
sample enumeration (or microcensus, as it is 
called in some countries), which provides data 
needed in intercensal intervals and also permits 
questioning about characteristics that cannot be 
included in complete enumerations. 

The specific population characteristics included 
in census and other official statistics vary from 
nation to nation, depending upon their relevance 
in each case. Thus, questions about race or lan- 
guage are omitted in countries which are ethnically 
homogeneous; likewise, questions about literacy 
have been deleted from the censuses of many 
advanced nations. On the other hand, some goy- 
ernments, for reasons of policy, deliberately omit 
certain important characteristics from their enu- 
merations. For example, no question has ever been 
asked about religious affiliation in the United States 
decennial census, because of a rigorous interpre- 
tation of the constitutional separation between 
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church and state. In India enumeration of caste 
membership was dropped after independence, as a 
matter of social policy. 


Sex composition 


Most easily measurable, and of fundamental im- 
portance both demographically and socially, is a 
population’s composition by sex. In any given area 
the distribution of the sexes tends to be unequal, 
owing to the operation of any or all of three factors. 
First, a differential sex ratio at birth is universal; 
more males are born everywhere than females. On 
the other hand; at all ages the death rates for 
males are usually higher than those for females. 
Finally, migration is sex selective; in long-distance 
migration males tend to outnumber females, but 
in short-distance movements females usually pre- 
dominate. 

The most frequent measure employed in the 
study of sex composition is the sex ratio: the num- 
ber of males per 100 females. It is easily computed 
by dividing the number of males in the population 
by the number of females and then multiplying by 
100 (in European countries the ratio is often ex- 


Table 1 — Sex ratio (males per 100 females) in selected 


countries 

Country Censusyear Sex ratio 
Ceylon 1953 111.2 
Pakistan 1961 110.7 
Libya 1954 107.9 
Tunisia 1956 107.2 
Israel” 1948 106.9 
Malaya 1957 106.4 
India 1951 105.6 
Argentina 1947 105.1 
Cuba 1953 105.0 
Australia 1954 103.2 
Canada 1956 102.8 
Panama 1960 102.7 
Ireland 1956 101.9 
New Zealand 1956 101.1 
South Korea 1955 100.0 
Thailand 1960 99,5 
Brazil 1950 99.3 
Bulgaria 1956 99.1 
Norway 1950 98.3 
Belgium 1947 97.4 
United States 1960 97.1 
Chile 1952 96.4 
Italy 1951 94.4 
Hungary 1960 93.3 
United Kingdom 1951 92.4 
France 1954 92.2 
West Germany 1950 88.2 
Austria 1951 86.6 


* Jewish population only. 
Source: Adapted from Demographic Yearbook 1960, 
table 1. Copyright © United Nations 1961. 
Reproduced by permission. 


364 POPULATION: Population Composition 


pressed as the number of females to a hundred 
males). Table 1 shows the sex ratio of a number 
of selected countries throughout the world. It indi- 
cates a variation from less than 87 males per 100 
females in Austria to 111 males per 100 females 
in Ceylon and Pakistan. The countries where the 
sex ratio exceeds 100 fall into two distinct groups. 
First, there are underdeveloped areas, where fe- 
males in the childbearing ages have higher mor- 
tality rates than males. In this category belong 
such countries as Ceylon, Pakistan, Libya, Tunisia, 
and India. The second group consists of countries 
which have experienced large-scale immigration 
which is predominantly male. This explains the 
relatively high sex ratios of Israel, Malaya, Argen- 
tina, Australia, Canada, and New Zealand, for ex- 
ample, and of Cuba and Panama, which have at- 
tracted young male laborers from other Caribbean 
countries. An unusual case is that of Ireland, a 
country with the relatively high sex ratio of 101.9. 
The explanation probably lies in the fact that emi- 
gration from Ireland is predominantly female; citi- 
zens of Ireland may enter the United Kingdom 
freely, and young Irish women avail themselves in 
large numbers of the opportunity to obtain posi- 
tions in the neighboring country. 

At the other end of the scale are the countries 
with low sex ratios. The imbalance of the sexes in 
those countries is the result of a combination of 
war losses and long-standing emigration. France, 
Germany, and Austria have been heavily affected 
by war losses, whereas emigration has played the 
greater role in accounting for the low sex ratios of 
the United Kingdom and Italy, for example. 

In countries where migration plays a minor role, 
the sex composition is determined almost entirely 
by the interplay of birth and death rates. In the 
industrially advanced countries the sex ratio at 
birth is usually around 105 or 106, but mortality 
differentials favor females at all ages, so that the 
sex ratio declines steadily with advancing age and 
reaches 100 between the ages of 40 and 50. The 
total populations of these countries tend to have 
sex ratios between 94 and 97. In the underdevel- 
oped countries maternal mortality is high enough 
to offset the lower mortality rates of older females; 
consequently, the sex ratio for the total population 
in such areas averages around 100. 

Migration also exerts a powerful influence upon 
the sex ratio within a given country. Internal mi- 
gration generally consists of short-distance move- 
ments, which in their net effect shift the popula- 
tion from rural to urban areas. Urban migration 
usually includes more women than men. Occupa- 


tional opportunities open to women are greater in 
urban areas; women are therefore strongly at- 
tracted to the cities. As a result, sharp differences 
may arise between the sex ratios of rural and urban 
areas. In general, cities have low sex ratios, while 
rural—-farm areas have high ones. There are, how- 
ever, some exceptions; cities which are steel or 
mining centers or naval bases have high sex ratios. 

The ratio of the sexes in a given area has an 
important bearing upon the incidence of marriages, 
births, and deaths. The number of possible mar- 
riages depends in part upon the proportion of the 
sexes in the marriageable ages. The surplus of 
women in the urban areas reduces their chances 
of finding a marriage partner, whereas their short- 
age in rural areas may create difficulties for eligible 
young men. An imbalance of the sexes will delay 
marriage and reduce the number of legitimate 
births. Because mortality rates among males tend 
to be higher than among females, a dispropor- 
tionate sex ratio also affects the over-all death rate. 


Age composition 

Age, as well as sex, is a primary characteristic 
of the individual. Since it is related to physical 
capacity and mental maturity, every society uses 
age as a major building block in its social organ- 
ization. Social roles and responsibilities are as- 
signed in accordance with an individual's age. The 
age composition of a society, therefore, determines 
the number of people available for important social 
categories. Of special importance here is the size 
of the labor force and of those groups wholly de- 
pendent upon the working force—children and the 
elderly. In addition to serving as an important 
basis for the ascription of social status, age 1S bio- 
logically related to fertility and mortality. i 

The obvious connection between age and a wide 
range of demographic and social phenomena 
makes the classification of a population by age 
groups especially important. In conjunction wi 
sex, the age distribution provides the basis for i 
detailed demographic analyses. Birth and deal : 
rates can be meaningfully compared and w 
preted only when the age and sex composition E 
the population is taken into account. It is mea 
indispensable for the construction of life tables 
for making population projections. 

Data on age are less reliable than sex dal 
nonliterate societies individuals often do not ah 
their age, and census takers have to resort to a é 
mates. Even in literate populations there is a oo 
nite tendency for respondents to concen tial as 
numbers ending with zero or five, and on 
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numbers. For purposes of refined analysis it is, 
therefore, necessary to calculate correction factors 
to offset the effects of age heaping. 

The current age structure of any population is 
the product of birth, death, and migration rates 
which have operated in the past. A comparison of 
the age composition of various countries, illus- 
trated in Table 2, shows the existence of wide 
variations between different parts of the world. 
One can distinguish several major types of age 
structures. At the one extreme are the under- 
developed countries of Asia, Africa, and Latin 
America, which encompass most of the surface of 
the earth and include the greater part of the 
world’s population, These countries, characterized 
by high fertility and high mortality rates, have a 
very similar age composition. They all have a high 
proportion of children—approximately two-fifths 
of their population is in the 0-14-year age group. 
Over one-half the population is in the 15-64-year 
age range, which contains most of the economically 
active people. The proportion of people over 65 
years is quite small, generally below 4 per cent. 
The high proportion of children means a heavy de- 
pendency burden on the economically active mem- 
bers of the population. 

At the other end are the economically advanced 
countries of northwestern and central Europe, 
North America, and Oceania, where birth and 
death rates are both low. In these countries chil- 
dren under 15 years constitute only one-fifth to 


Table 2— Percentage distribution of selected popula- 
tions by three broad age groups 


AGE GROUP 


Census 

Country year 0-14. 1564" 65+ 
Dominican Republic 1950 44.5 52.6 29 
1950 43.8 52.5 3.7 


Paraguay 

Venezuela 1950 419 554 2.7 
Malaya 1950 41.3 55.5 3.2 
South Korea 1952 Al = 55.2 3.7 
Egypt 1947 381 588 31 
Turkey 1950 38.0 586 3.4 
Japan 1950 35.4 596 49 
Yugoslavia 1951 30.9 635 5.7 
Australia 1954 28.5 632 83 
United States 1950 271 647 82 
Denmark 1950 263 646 91 
West Germany 1950 23.6 672 93 
Switzerland 1950 23.6 668 96 
Sweden 1950 23.4 66.3 10.3 
United Kingdom 1951 225 667 10.8 
France 1954 23.3 646 121 
Belgium 1947 20.6 687 107 


Adapted from Demographic Yearbook 1960, 
table 5. Copyright © United Nations 1961. 
Reproduced by permission. 


Source: 
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one-fourth of the total population; about two-thirds 
of the population are in the 15-64-year age range; 
and the proportion over 65 ranges from 8 to 12 
per cent. These age structures are the result of 
major fertility declines, which have had the effect 
of sharply reducing the proportion of children and 
increasing, first, the percentage of persons of work- 
ing age and, in time, that of the aged. The popula- 
tions of France and Sweden, for example, where 
the decline of the birth rate began early, were 
already comparatively aged at the turn of the 
twentieth century. But in most countries of this 
group the modifications of the age structure are a 
relatively recent phenomenon. In the New World 
substantial immigration has also contributed to 
enlarging the proportion of adults in the 15-64- 
year age range. The result has been an immediate 
economic advantage, since this has meant a light 
burden of dependency on a broad base of eco- 
nomically active age groups. However, the reduc- 
tion of fertility means that the present extraordi- 
narily broad groups of people in the working ages 
will not be fully replaced in the near future, Conse- 
quently, the proportion of those over 65 will con- 
tinue to rise. 

A third group of countries are now in a tran- 
sitional stage. After drastic reductions in their 
mortality, these countries have recently begun to 
experience substantial fertility declines. Analyses 
which have measured the effects of changes in 
mortality and fertility separately have shown con- 
clusively that fertility is the main factor that 
affects the age structure of any given population. 
Countries with currently declining fertility will 
therefore experience a progressive aging of their 
population: Japan and some eastern European 
countries are cases in point, As economic develop- 
ment progresses, other countries will also enter the 
transitional phase; but the age structure of the 
presently underdeveloped countries is not likely to 
change appreciably in the near future. 

In the economically developed countries the 
phenomenon of aging has attracted widespread 
public attention, since it has created considerable 
social and economic problems, In an agricultural 
economy societies have a familistic social organ- 
ization. There the aged not only are maintained 
within the family household but occupy a high social 
status. By contrast, in industrial societies the aged 
are in a much less enviable position. They tend to 
maintain separate households and to drop out of 
the labor force, often involuntarily. They can play 
fewer socially useful roles, and those who cannot 
live on their savings become dependent upon their 
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children or the state. These changes in status have 
had not only economic but also major psychological 
repercussions, which require attention and ameli- 
oration. The situation was viewed with considera- 
ble alarm in the 1930s, when the birth rate of 
most Western countries reached a new low and 
the populations of many of them appeared to be 
aging very rapidly. The postwar resurgence of the 
Western birth rate, however, has reversed the 
trend and resulted in a measure of population re- 
juvenation in a number of countries—for instance, 
France and the United States. Nevertheless, the 
absolute number of old people continues to rise, 
and the concern about their welfare is greater 
than ever. 

Age composition and the incidence of aging vary 
also among subgroups of the population within a 
given country. Age differentials exist whenever 
groups exhibit different fertility and mortality 
levels; their age structure is also affected by 
migration. Thus, the rural-urban migration of 
the younger age groups causes rural populations to 
be older, on the average, than city populations. 
Considerable differentials are often found, also, be- 
tween various ethnic and religious groups. For ex- 
ample, the higher fertility of Roman Catholics gives 
them a younger over-all age structure than that of 
Protestants or Jews in the same countries. Differ- 
ential mortality also creates significant age differ- 
ences between the sexes. In the developed coun- 
tries females live, on the average, from four to 
seven years longer than males; consequently, in 
such societies the proportion of old women is con- 
siderably higher than that of old men. 


Marital status 


In every human society the family functions as 
the basic unit, charged with the continual replace- 
ment of the generations. This involves not merely 
procreation but also caring for the children over 
a long period of time and training them for the 
performance of socially useful roles. Because of 
this important part played by the family, informa- 
tion about the marital condition of the population 
is generally included in official census enumera- 
tions, and changes in family formation are care- 
fully studied by demographers, sociologists, and 
economists. Differences in marital status have a 
direct bearing on both fertility and mortality. The 
age at which people get married and the proportion 
of the population that eventually gets married is 
an important determinant of fertility, since in gen- 
eral fertility is higher in places where marriages 
are concluded early and where the proportion mar- 
ried is high. Mortality also varies with marital 


status; both married males and married females 
have lower death rates than do unmarried persons 
of the same ages. In part these differentials arise 
from the fact that marriage is selective of indi- 
viduals in better health. Partly, however, the great- 
er longevity of the married may result from the 
better personal care and greater regularity which 
usually characterize married life. 

International comparisons of marital status re- 
quire considerable caution because in certain 
countries a high proportion of couples live in con- 
sensual unions and may or may not report them- 
selves to the enumerators as married. This problem 
exists in all countries, because of those who are 
legally or de facto separated. As a result, the num- 
ber of married males is not equal to the number of 
married females in the census statistics of any 
country. Despite these difficulties of classification, 
the percentages of married people in various age 
groups, as reported by their respective national 
censuses, show some interesting variations. First 
of all, there are great differences in the age at 
marriage, as can be seen from the data presented 
in Table 3. India and Turkey, for example, show 
pronounced patterns of early marriage, whereas 
the tendency to postpone marriage is clearly evi- 
dent in such countries as Ireland, Spain, and 
Japan, In addition, there are substantial variations 
in the frequency of marriage. The proportion mar- 
ried at different ages undoubtedly reflects varia- 
tions in social organization and in cultural patterns 
between different societies; but the connections 
between marital status and social structure are 
complex and do not permit easy generalizations. 

The only generalization which applies univer- 
sally is that the percentage married reaches its 
maximum for females at an earlier age than for 
males. As Table 3 shows, women in the 15-24-year 
age group are much more likely to be married than 
men. However, after the age of 35 the proportion 
of married men exceeds that of married women 
everywhere except in Ireland, where male ma 
is unusually high and the reversal occurs i 
after age 44. In all countries the disproportion he 
comes quite pronounced after age 45 because a 
number of widows increases much more rapidly 
than the number of widowers. at ener 

As in the case of age and sex composition, 4 
also exist differences in marital status an 
subgroups of the population within a given aa Bs 
try. Perhaps the most pronounced differentials K 
the residential ones which are the effects of m j 
tion. The cities attract a disproportionate pe 
age of unmarried women but a relatively a us 
portion of single men. By contrast, the suburbs 
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Table 3 — Percentage married in selected countries, by age group and sex 


Census 15-24 
Country year Male Female 
India 1951 442 79.8 
Turkey 1955 340 61.9 
Egypt 1947 147 527 
Bulgaria 1956 226 46.9 
United States 1950 21.9 42.4 
Yugoslavia 1953 19.8 33.6 
Bolivia 1950 20.2 33.2 
United Kingdom 1951 12.5 27.2 
France 1954 93 23.5 
Finland 1950 11.3 228 
Sweden 1950 8.2 22.7 
Japan 1955 48 17.0 
Spain 1950 3.0 10.9 
Ireland 1951 24 86 


the open country contain a very high proportion 
of married women. 


Occupational composition 

When we turn from the more strictly demo- 
graphic to the social attributes of a population, it 
cannot be denied that occupational status assumes 
a place of paramount importance. Men do not live 
by bread alone; but the need to make a living re- 
quires the sustained effort of a very substantial 
proportion of the population in all societies. More- 
over, the way this living is made largely deter- 
mines the general social and economic position of 
the worker and his family. Lastly, the classification 
of workers by occupation and industry provides a 
major measure for any comparative analysis of the 
stage of economic development reached by differ- 
ent societies, as well as an index of regional dif- 
ferences within a given country. 

Labor-force status, The first step in the analy- 
sis of occupational characteristics is the separation 
of the total population of a country into two groups 
—those who are economically active, also called 
the working force or labor force, and those who 
are economically inactive. Actually, this distinc- 
tion is hard to make in primitive societies or in 
grossly underdeveloped areas, where the low level 
of technology forces virtually the entire population 
to participate in the production of goods and serv- 
ices. However, with the rise of a market economy, 
there is an increase in the proportion of people 
who consume goods and services but do not pro- 
duce them, and the distinction between economi- 
cally active and inactive becomes significant. Yet 
even in highly developed societies it is no easy 


matter to delineate adequate criteria for distin- 


AGE GROUP 
25-34 35-44 45+ 
Mole Female Male Female Mole Female 


82.8 89.1 87.7 78.6 76.9 46.5 
84.9 90.6 92.6 88.3 89.0 56.4 
70.8 87.4 90.9 80.9 90.2 44.4 
86.4 90.0 95.9 NP 88.9 71.3 
79.1 847 87.0 84.3 78.8 61.2 
80.6 78.7 91.2 78.2 83.2 56.7 
741 744 85.2 75.4 78.8 54.1 
72.0 79.8 86.2 82.0 80.8 58.2 
70.0 79.1 83.3 81.8 79.2 54.5 
68.1 71.8 83.6 73.1 78.0 48.6 
60.9 76.5 78.9 80.2 73.0 56.3 
71.1 80.4 95.3 819 85.3 56.9 
53.8 62.8 84.6 74,2 79.9 52.9 
32.2 53.7 58.4 69,1 59.8 48.1 


Source: Adapted from Demographic Yearbook 1960, table 10. Copyright © 
United Nations 1961. Reproduced by permission. 


guishing between those who are economically 
active and those who are not. [See LABOR FORCE, 
article on DEFINITIONS AND MEASUREMENT, ] 
These difficulties notwithstanding, statistics on 
the labor force are included in the census tabula- 
tions of most countries. Although different coun- 
tries vary considerably in their definitions and 
classification procedures, these differences affect 
mainly the statistics for women, youths, and aged 
persons because these are the groups in which rela- 
tively large proportions of workers are engaged in 
unpaid family work or work only part time, season- 


Table 4 — Percentage of economically active population 
in selected countries 


AGED MALES 
Consus 20-59 YEARS ENGAGED IN 
Country year Malos Females AGRICULTURE 
United Kingdom 1947 97.6 38,0 6 
Belgium 1947 91.7 25.8 14 
United States 1950 90.8 34.0 15 
West Germany 1950 94.8 41.7 16 
Australia 1947 96.0 23.5 19 
New Zealand 1951 96.1 25.3 22 
Canada 1951 94.9 25.8 24 
Sweden 1950 95.9 32.7 25 
France 1946 94.6 51.0 26 
Denmark 1950 96.7 44.8 30 
Chile 1952 95,0 29.1 37 
Japan 1950 94.7 52.8 40 
Cuba 1953 93.8 15.6 47 
Greece 1951 93.7 17.5 50 
Ceylon 1946 94.1 28.6 51 
Egypt 1947 99.2 99.3 52 
Hungary 1949 95.5 36.7 52 
Portugal 1950 92.8 21.6 52 
Brazil 1950 95.5 13.3 63 
Bolivia 1950 95.9 64.3 68 


Source: Adapted from Demographic Yearbook 1956, 
tables 11 and 12. Copyright © United Nations 
1957, Reproduced by permission, 
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ally, or intermittently. By contrast, the statistics 
for men in the working ages are much more com- 
parable, because the overwhelming majority of 
males in these age groups are economically active. 
Indeed, the proportion of males between the ages 
of 20 and 60 years who are in the labor force in 
various countries is almost uniform. Thus, in the 
20 countries listed in Table 4, which represent 
widely differing social and economic structures on 
six continents, the proportion of men aged 20-59 
years who are economically active ranges from 91 
to 99 per cent. By contrast, labor force participa- 
tion rates for women range all the way from 13.3 
per cent for Brazil to 99.3 per cent for Egypt, with 
15 of the 19 countries reporting less than half of 
the females in this age range as economically 
active. 

The total size of a country’s economically active 
population depends to a large extent on the size 
and on the age and sex composition of the total 
population. Demographic factors establish the max- 
imum limit of the number of people who can par- 
ticipate in economic activities. Such factors are 
the principal determinants of the size of the male 
labor force, since in all countries nearly all males 
pursue some gainful activity from the time they 
enter adulthood until they reach old age. However, 
the number of women, young people, and old per- 
sons who are economically active is strongly 
affected by a variety of nondemographic, cultural, 
and economic factors. The type and organization 
of economic production, the level of income, and 
the relative value placed on such noneconomic 
activities as formal education, retirement, child 
rearing, and leisure-time pursuits strongly influence 
the labor-force participation rates of these three 
groups. In underdeveloped countries, children be- 
gin to work at a tender age and few old people can 
afford the luxury of retirement. In industrially 
advanced nations, almost all youths are in school 
until they are 16 or 17 years old and a sizable pro- 
portion continue their formal education well into 
* their twenties, while pension plans facilitate and 
often force the retirement of the elderly. The ex- 
tent to which women participate in the labor force 
depends on the type of employment available to 
them, their marital status and childbearing pat- 
terns, and the degree of their social emancipation. 

Industry and occupations. In addition to sepa- 
rating the total population into the two basic cate- 
gories of those who are economically active and 
those who are not, most national censuses further 
classify the economically active according to the 
industries in which they are engaged and the types 
of occupations they follow. The relative distribu- 
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tion of the labor force by major lines of endeavor, 
such as agriculture, manufacturing, trade and 
commerce, transportation, and service, provides 
one of the most useful indicators of the stage of 
economic and cultural development a particular so- 
ciety has attained. The same is true of the division 
by major occupation groups, such as professional 
and technical workers, managerial and clerical 
employees, sales workers, farmers and fishermen, 
craftsmen, industrial operatives, laborers, and serv- 
ice workers. Major shifts have occurred in the 
occupational composition of all economically ad- 
vanced countries as the processes of urbanization 
and industrialization have proceeded. The propor- 
tion of the labor force engaged in agricultural 
activities has declined rapidly, while the percentage 
engaged in manufacturing, trade, transportation, 
and services has sharply increased. In the United 
States, for example, in 1840 nearly 70 per cent of 
the economically active were in agricultural occu- 
pations; a century later the proportion had fallen 
to less than 20 per cent. Under the impact of tech- 
nological change, the redistribution of the labor 
force is a continuing process. 

Unfortunately, national variation in the defini- 
tion of the various industry groups and in the con- 
tent of the occupation categories greatly limits the 
validity of international comparisons. Neverthe- 
less, a rough measure of the relative degree of in- 
dustrialization can be readily obtained by compat. 
ing the proportion of the male labor force engaged 
in agriculture. The smaller the percentage of men 
engaged in agriculture, the more highly developed 
is a country’s economy. The proportion of males in 
the labor force who are engaged in agriculture is 
shown in column 3 of Table 4. Although these 
20 countries show remarkable uniformity in the 
percentage of their male population who are co 
nomically active, they diverge widely in degree 0 
industrialization. Countries like the United King- 
dom, Belgium, the United States, and West i 
many all have reached an advanced stage of n = 
trialization, with less than one-sixth of their E 
labor force engaged in agriculture. In sharp © 
trast stand underdeveloped areas like Bolivia a 
Brazil, where two out of every three males 
still occupied in agricultural pursuits. 


Educational status 


Next to occupation, to whi 
lated, education ranks as an elemen le 
importance in the composition of modern popei 
tions. The educational achievements of es cant 
vidual largely determine his level of living, M 
tural opportunities, his social status and P 


ch it is closely 7° 
t of major 
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They influence his health and survival chances, 
his mental attitudes, and his entire outlook on life. 
By the same token, it is well recognized that the 
degree of literacy and the total amount of school- 
ing received by the population as a whole plays a 
major role in its social, cultural, and economic 
well-being and influences the rate of its develop- 
ment. Educational status therefore serves as a most 
useful index of the socioeconomic position of indi- 
viduals and of the class composition of a whole 
society. 

Educational data are collected by the censuses 
of most nations, but until recently the amount of 
educational information has been quite limited. 
The simplest and most widely available measure of 
educational status is the proportion of the popula- 
tion unable to read and write [see LITERACY]. In 
most advanced countries illiteracy has been almost 
eliminated, and therefore questions on literacy are 
no longer included in population censuses. Thus, 
the data for Sweden stop with 1930, and those for 
Canada stop with 1931. Mliteracy still constitutes 
a serious problem, however, in many countries. 

Because literacy rates are merely crude approxi- 
mations of educational achievement, an increasing 
number of national censuses have recently begun 
to employ more refined measures. They ascertain 
the number of years of school completed by each 
respondent and the proportion of the population 
in specified age groups who are attending school. 
Unfortunately, the great diversity in the structure 
of national education systems precludes any valid 
international comparisons. Within a given society, 
however, statistics on years of school completed, 
cross-classified by age, sex, residence, and other 
characteristics, serve as very useful indexes of 
cultural achievements and of needed improve- 
ments. In the United States, for example, the 
median level of school years completed by the total 
population 25 years of age and over rose from 
8.6 in 1940 to 10.6 in 1960. Particularly significant 
is the fact that the proportion of those who have 
at least some college education rose even more 
steeply, from 10.1 per cent of the total population 
25 years of age and over in 1940 to 16.5 per cent 
in 1960: in the same 20 years, the percentage of 
college graduates increased from 4.6 to 7.7. The 
trend of college enrollment has been rising very 
rapidly since World War 11. Of all American stu- 
dents who were graduated from high school in 
1959, 42 per cent were enrolled in college in 1960, 
on either a full-time or part-time basis. In the 
United States a higher education may soon become 
the rule, rather than the exception. 

Although educational gains have been widely 
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shared by males and females, rural and urban 
population, white and nonwhite groups, there still 
persist sharp differences in the average level of 
schooling. In 1960 the median number of school 
years completed by the nonwhite rural farm popu- 
lation was only 5.7 years, or just about half the 
11.5 years of the urban white population. This 
extreme contrast proves that even a country like 
the United States, where educational progress has 
been rapid and widespread, still has far to go 
before all elements of the society are assured a 
reasonably adequate amount of schooling. 


Ethnic and religious composition 


Most of the remaining population divisions that 
have major demographic and social significance 
refer to ethnic and religious differences. Cultural 
diversity generates differentials in fertility, mor- 
tality, migration, marriage patterns, educational 
attainments, and socioeconomic status. In many 
societies, therefore, classification by ethnic and re- 
ligious characteristics is essential for any useful 
analysis of population data. Thus, as the data in 
the preceding paragraph indicate, a meaningful 
analysis of the educational status of the United 
States population should show statistics subdivided 
not only by age, sex, and residence but also by 
race. 

The divisions created between different seg- 
ments of a population by religious differences 
have ranged all the way from violence and war- 
fare to segregated school systems. In most Western 
countries religion now plays a less drastic role than 
formerly; but there are some exceptions. In the 
Netherlands, for example, Roman Catholics and 
Protestants have developed an almost completely 
dual social structure, from separate schools to 
separate labor unions and separately organized 
leisure activities, including sports. In most other 
nations, the influence of religion has become atten- 
uated and individuals often are not consciously 
aware of it. But its effects on behavior patterns 
still persist in many subtle ways; in the United 
States, for instance, most fertility differentials have 
declined noticeably in recent years but differences 
between Roman Catholics and Protestants remain 
undiminished. 

The most important characteristics with respect 
to which ethnic groups are usually compared are 
race, color, national origin, and language. In im- 
migration countries all of these factors, in various 
combinations (and often intertwined with re- 
ligion), give rise to important cultural differences. 
In such countries, place-of-birth statistics are use- 
ful in identifying immigrant minorities, and in 
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highlighting problems of acculturation and assimi- 
lation. In most areas which have been subjected to 
conquest and colonization by Europeans, race or 
color differences divide the population into several 
segments, which remain sharply differentiated in 
all demographic and social respects. In all such 
countries statistical breakdowns by race are indis- 
pensable for demographic and sociological analysis. 

In most of Europe, of course, racial composition 
is not as important as in areas of European colon- 
ization, but nationality and language differences 
play a prominent part in many countries. The 
history of Europe abounds with incessant clashes 
over nationality differences, which have resulted 
in numerous wars and frequent persecution of na- 
tional minority groups. The outstanding exception 
to this pattern of conflict is modern Switzerland, 
where four language groups and the adherents of 
three major religions have long been integrated 
into a stable and harmonious unity without losing 
their distinctive identities. By comparison, the 
animosity between the Belgian Flemings and Wal- 
loons, which shows no signs of abating, appears a 
‘curious anachronism in the very center of the 
rising European Community. 

It is likely that in the long run the all-pervasive 
processes of industrialization and urbanization will 
have the effect of imposing a similar socioeconomic 
structure upon most, if not all, societies, thereby 
greatly attenuating the importance of internal 
ethnic and religious diversity. In the meantime, 
however, statistics on nationality, language, race 
or color, and religion remain indispensable in most 
countries, not only for analytic purposes but also 
for the making of social policy. 

Kurt B. MAYER 


[See also AGING; FERTILITY; LABOR FORCE; LITERACY; 
MIGRATION; MINORITIES; NuPTIALITY.] 
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v 
POPULATION DISTRIBUTION 


Population is in essence a quantitative concept 
used to denote aggregates of various kinds. The 
study of population distribution is concerned with 
how and why the unit parts of such aggregates are 
distributed over space in one way rather than an- 
other. Particular patterns of population distribu- 
tion are seen to be closely associated with varying 
rates of population growth and migration. As ap- 
plied to human beings, this topic has obvious 
theoretical and empirical interest, not only for 
sociology, but also for economics, anthropology, 
political science, and all those disciplines con- 
cerned with the organization and structure of com- 
munities, regions, economies, or other groupings 
that denote relatively stable patterns of human 
settlement. 

In the study of any population, emphasis can 
be placed either on the whole or on its parts. His- 
torically studies of population distribution have 
shown a strong tendency to involve a restricted 
area, such as one city or one section of a country, 
and, at least until recent years, they have been 
descriptive rather than analytical in scope (Bogue 
1959). Many are case studies of the size and com- 
position of the populations of minor divisions with- 
in a state or district or of changes in these over 
time. Quite often, population trends in a local area 
are contrasted with those in some larger region of 
which the area is a part. 


Measuring population distribution 


The concern of geography with space has bee 
clearly demonstrated for a long time. It is a 
space itself, however, that is of interest tO p 
geographer, but rather the persons and objec! 


occupying space and their relations to each other 
and to a habitat. ne 
The conventional method which geograp 


have used to present data on population ast 
tion is mapping. Various cartographic tech 4 
such as dot maps and linear-distance maps nee 
been developed (James 1954). Although 4 A 
have a certain convenience, they lack’ precis! 


and are useful only as first approximations to 
understanding the distributions they depict. How- 
ever, their utility is enhanced when they are used 
in conjunction with statistical measures of various 
kinds. Obviously maps, charts, and tables are de- 
pendent on the collection of census data or on 
data from registration systems or special surveys. 
They are no more than graphic representations of 
the basic figures on distributions of populations. 
It must be emphasized that any measures which 
summarize the distribution of population are likely 
to be strongly influenced by the choice of area for the 
collection of data, [See GRAPHIC PRESENTATION. ] 

Measures of density. All measures of popula- 
tion distribution are concerned with variations in 
numbers or characteristics of a population over 
time or by place. The simplest measure is a ratio 
of numbers to land area, Where information on 
number of people and amount of land is available, 
it is a simple matter to use the ratio P/A, popu- 
lation divided by area, to yield a population density. 
So crude a measure is only slightly improved by 
substituting population per square mile of arable 
land. Furthermore, density ratios are averages that 
give no indication of significant internal variations. 
Nevertheless, countries, provinces, and other areas 
are often compared in terms of relative degrees of 
concentration or dispersion of population -by the 
use of a series of density ratios. 

Such comparisons may be useful when there is a 
rather high degree of both geographic and cultural 
uniformity among regions. Just the opposite, how- 
ever, is frequently true, In consequence, ratios of 
population density are often not comparable. In 
the absence of other information it is not very help- 
ful to know that in 1961 the population density of 
the United Arab Republic was 27 times that of 
Australia, Such a figure must be interpreted, and 
it is of little consequence that large areas of both 
countries are uninhabited desert. What is signifi- 
cant is that there exist differences in social and 
economic organization which produce quite diverse 
patterns of land utilization and dissimilar func- 
tional relations with external areas. Both of these 
result in differences in ability to provide support 
for the population of each country at particular 
levels of living and in specific distribution patterns. 

Around the world the highest densities are 
found at two extremes. In crowded underdeveloped 
countries both fertile lands and nearby large com- 
mercial—industrial cities support dense populations 
at low levels of living. In modernized nations the 
highest densities appear in and around those large 
cities that are best situated to mediate in the ex- 
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change of goods and services on a national and 
world-wide basis and, by so doing, to support their 
people at relatively high levels of living. 

Measures of spacing. In addition to density 
ratios, various other measures of spacing and con- 
centration are employed to indicate the extent to 
which numbers are massed or dispersed over a 
given territory (Duncan 1957). The Lorenz curve 
has been adapted to measure variation in popula- 
tion distribution, and related indexes of concen- 
tration and of dissimilarity have been developed. 
The fundamental problem remains that when dif- 
ferent sets of areas are used as a base, different 
and even opposing results may be obtained. Conse- 
quently any interpretation of an index of this kind 
must be directly related to the system of subareas 
adopted. 

Measures of spacing have evolved both from the 
community studies of plant ecologists and from 
the economists’ use of models of areas within 
which an even distribution of population is hy- 
pothesized. Various centrographic methods meas- 
ure central tendency by computing centers of 
population or of activity. Such measures have been 
popularized by their use in census volumes, par- 
ticularly in the United States, The mean center 
conveniently summarizes the distributions within a 
particular area at one or more points in time. 
Simultaneously computing the centers for several 
different groupings, such as population, urban 
population, ethnic groups, farm population, and 
manufacturing, will summarize general trends 
within a region. However, the choice of the mean 
point (center of gravity) or some alternative de- 
vice, such as centilides or quartilides, will depend 
on the problem under investigation. [See GEOG- 
RAPHY, article on STATISTICAL GEOGRAPHY. ] 

Using an analogy from physics, John Q. Stewart 
and William Warntz (1958) have described the 
structure of particular population distributions by 
the use of population potential models expressed 
by the coefficient N/d, numbers divided by dis- 
tance. These are measures of the influence, or 
“drawing power,” of people at specific distances 
from one another. They are illustrated by the at- 
traction of a major population center for migrants 
initially located at various distances. Finally, use 
has been made of several categorical measures 
that involve the classification of populations by 
size of community, distance from designated cen- 
ters, rank-size, function, and so on. Examples are 
the division of urban centers into metropolitan and 
nonmetropolitan and of villages into industrial, 
agricultural, and suburban. The population dis- 


372 POPULATION: Population Distribution 


tribution “found” by the categorical approach will 
depend heavily on the scheme chosen by the 
investigator. 


World population distribution 


The single most striking fact about the distribu- 
tion of the world’s population is its unevenness. 
Roughly nine-tenths of all people are crowded into 
no more than a quarter of the land surface. There 
are three broad areas of heavy massing of popula- 
tion. The most prominent is in Asia, where the 
most densely populated areas are Japan, South 
Korea, mainland China and Taiwan, Java (in 
Indonesia), India, and east Pakistan, Also heavily 
settled are the greater part of the European penin- 
sula, the northeast coast and Great Lakes rim of 
the United States, and parts of the Caribbean and 
of coastal South America, Non-Soviet Asia, which 
constitutes not quite 20 per cent of the land sur- 
face of the globe, is occupied by an estimated 
60 per cent of the world’s people, and this percent- 
age is increasing. By contrast, South America, with 
13 per cent of the land surface, presently accounts 
for only 6 per cent of the world’s people. It too is 
growing at a rapid rate. 

Differences among areas in natural or social 
conditions produce marked variability in the degree 
of population concentration. The causes are in 
part geographic. The significance of climate as a 
determinant of population distribution, a theme 
familiar since Montesquieu, was enlarged upon in 
the nineteenth and early twentieth centuries by 
such geographers as Friedrich Ratzel. Not only do 
climatic conditions limit the spread of population 
by maintaining arctic, desert, and tropical areas 
where settlement of any sizable numbers is diffi- 
cult or impossible, they also influence the fertility 
of lands, such as the valley of the Nile or the flood 
plains of the Ganges, where natural conditions 
encourage a massing of human beings. Similarly, 
topographic features may provide favorable or un- 
favorable conditions for settlement. For instance, 
only a minute fraction of the world’s people live 
at any great altitude above sea level. 

In the modern world cultural factors are of 
greater importance for population distribution than 
all but the most extreme geographic conditions. 
The existence of local natural resources may be 
an immediate cause for the settlement or rapid 
development of specific areas. Nevertheless, iron 
ore, coal, and other natural resources have no real 
significance except in a culture technologically 
and economically able to exploit them. The growth 
of large communities on favored coastal and river 
sites is a function of the development of trade, 


particularly of industrial commerce. In both cases 
the likelihood of concentration of population at a 
given point will be related to the existence of com- 
peting sites and to the requirements of an economy 
for specific raw materials or finished goods. The 
size, and even the existence, of particular places 
will also vary according to such outside conditions 
as economic costs, transport time, or changing 
routes and market requirements. 


Distribution of urban population 


Marked changes in human organization are re- 
flected in the redistribution of population into an 
increasing number of larger centers. Between 1900 
and 1960 alone the number of cities in the world 
with more than a million inhabitants increased 
from 10 to 63. Around 18 per cent of the people 
of the world now live in cities with populations of 
100,000 or more. This urban trend has been ac- 
companied in the modernized nations by a pre- 
cipitous decline in the proportion of the population 
living on farms. In the United States, for instance, 
only some 5 per cent of the population lived in 
places with 2,500 or more people in 1790, when 
the first census was taken. By contrast, in 1960, 
70 per cent lived in such places or in the built-up 
fringe of large urban centers. Farm families made 
up 95 per cent of the population in 1790 and only 
9 per cent in 1960. 

Rapidly spreading urbanization, as well as the 
concentration of population in an increasing num- 
ber of growing cities, has had an obvious and Hy 
found effect on population- patterns, not only in 
Europe and North America, but increasingly in 
the settled portions of the globe. History peo 
examples of isolated cities with several ee 
thousand people, usually commercial centers lo 
cated in fertile agricultural areas, or administrative 
capitals supported by the wealth of such wa 5 
regions. A network of large cities, on the 0! 
hand, is based on the creation of an e: 
technology and involves the interchange of ME 
goods and services, not only between a i 
cities and their immediate hinterlands, but a 
among cities, regions, and nations that are voit 
scattered. Thus, in the 30 years from 1888 to ee 
the population of Japan's six largest oles A 
from under 2.5 million persons to over 6 "987 
(Taeuber 1958, p. 47). Between 1947 ante A 
Singapore’s population grew by almost a qu arägë 
a million people, representing an annual ar a 
growth rate of some 3 per cent. Such grow = but 
supported not by any local natural restr 
by the pre-eminence of Singapore as a trade 
in its region. 


The reasons for urban growth are complex, 
whether we are dealing with South Africa or north- 
ern Europe, Asia or the Americas. In terms of 
purely demographic variables, cities grow because 
of natural increase, net in-migration, or a com- 
bination of these. In terms of cultural variables, 
growth is dependent on technological developments 
in industry, transportation and communication, 
agriculture, science, and medicine. Such develop- 
ments have reduced agricultural labor require- 
ments, made possible the exploitation of distant 
as well as local resources, and established cities as 
the centers of perceived opportunities for migrants. 
They have made close living possible for masses of 
people by controlling epidemics and providing for 
the necessary transportation of persons, food, and 
goods. 

Urban concentration. In the industrialized na- 
tions the pattern of population distribution now 
exhibits a high degree of concentration. Even in 
those agricultural countries that have not achieved 
full economic development, cities are claiming a 
rapidly increasing percentage of the total popula- 
tion. In Malaya, for example, the proportion of the 
population living in centers of 1,000 or more 
people increased from 35.1 per cent in 1947 to 
50.5 per cent in 1957 (Caldwell 1963, p. 41). 
Indeed, since World War n, urban growth rates 
have tended to be highest in the less developed 
nations, In such countries rates of growth for 
smaller cities have commonly been higher than for 
the largest city, although absolute increments have 
remained highest for the major centers. 

The increasing concentration of population in 
urban areas, however these are defined, is a world- 
wide phenomenon. But in general the more devel- 
oped nations are also the most urbanized. For in- 
stance, in England and Wales fully 80 per cent of 
the population was defined as urban in 1961. Other 
countries with over 60 per cent of their populations 
in the urban category are, in descending order of 
per cent urban, Australia, Israel, Denmark, Sweden, 
East and West Germany, Scotland, Canada, the 
United States, Chile, New Zealand, Japan, Bel- 
gium, and Venezuela. By contrast, countries with 
under 25 per cent of their population in the urban 
category included, in the early 1960s, Ghana, 
Kenya, Senegal, the Sudan, Tanganyika, Togo, 
Cambodia, India, Indonesia, Pakistan, Thailand, 
and North Vietnam (percentages based on data 
compiled from official sources by the author; 
compare Demographic Yearbook 1 962, Table 9, 
pp. 304-315). 

The definition of urban population varies mark- 
edly from country to country and may shift over 
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time. Nevertheless, by any definition, industrial 
growth has been closely tied to urban growth. A 
concentration of population in urban areas may 
also be found in countries like Denmark, where 
commercial agriculture is of prime importance. In 
addition, there are many developing countries, 
such as Thailand or Egypt, which have one or more 
cities growing at a more rapid rate than is com- 
mensurate with their increase in employment op- 
portunities (a trend sometimes called “overurban- 
ization”). It has been alleged that heavy net 
migration to such cities reflects the “push” of ex- 
cess population off the land, but such a view fails 
to explain the long history of rural population 
pressure without migration. Proponents of this type 
of theory also tend to overlook the fact that mar- 
ginal urban employment often represents a gain 
in the migrant’s level of living (Sovani 1964, 
pp. 117-119). 

Number and size of centers. There is a notice- 
able difference among countries in the distribution 
of urban population by size of center. In India, for 
instance, there were, in 1951, 11 cities with popu- 
lations over 500,000 and another 59 cities with 
populations between 100,000 and 500,000. Despite 
this, slightly less than 7 per cent of the country’s 
total population lived in cities of 100,000 or more, 
whereas nearly 17 per cent were dwellers in places 
(many of them agricultural villages) of 2,000 to 
5,000 population. By contrast, in Australia there 
are only 6 centers containing more than 100,000 
people, but in 1947 some 51 per cent of the total 
population could be counted in the large-urban 
category and only 8 per cent in the small-urban 
grouping (see Gibbs 1961, p. 405, Table 1). The 
total population in India’s “urban places” (that is, 
in all those centers of 2,000 or more people) was 
24 times greater than that in places of comparable 
size in Australia, Obviously this is, to a consider- 
able degree, a function of differences in the total 
numbers of inhabitants of the two countries— 
441,631,000 in India in 1961 and 10,508,000 in 
Australia in the same year. What is significant is 
that the ratio of urban to rural population and, 
within the urban category, the ratio of residents in 
large centers to those in small ones are both totally 
different in the two countries, reflecting profound 
differences in the extent to which each is urban- 
ized. This in turn suggests important national dif- 
ferences in economic and social organization. 

Primate cities. Another way to denote degrees 
of urban concentration is to compare countries 
with respect to the percentage of either the total 
population or the total urban population that is 
found in the largest city. Great variability will be 
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found from country to country. For instance, the 
more than 14,000,000 people in the New York 
metropolitan area in 1960 were still only 8 per 
cent of the total U.S. population of 179,323,175. 
But in Greece the 1961 population of 1,852,709 
for metropolitan Athens constituted 22 per cent of 
the national population, and in Uruguay in 1963 
the 1,173,114 inhabitants of Montevideo made up 
46 per cent of the total population. 

In many countries, particularly those of small 
or medium size, the urban population shows a pro- 
nounced concentration in a single center. Such 
places, termed “primate cities,” tend to mass within 
their limits not only population but also a variety 
of economic, social, and political functions. These 
diverse functions—commerce, government, manu- 
facturing, recreation, and so on—are the basis of 
support for relatively large local populations. It is 
sometimes alleged that this concentration at one 
point in space inhibits the growth of rival cities 
and thus holds back national development by focus- 
ing the means of growth in a single limited area. 
This argument remains unproved and ignores the 
stimulus to development which is achieved by a 
concentration of growth factors. Moreover, primacy 
may be a temporary phenomenon since, although 
it is logical to suppose that the primate city bears 
some fairly stable size relation to the cities beneath 
it in rank, the assumption is not borne out by any 
available data. Estimates of the 1955 populations 
of the metropolitan areas in ten countries show 
the size of the largest urban area to vary from 
1.1 times that of the next largest, in Canada and 
Italy, to 16.3 times in South Vietnam (California, 
University of ... 1959). 


Ecological considerations 


From the ecological viewpoint, an area consti- 
tutes an environment to which a collective adap- 
tation has been made by a population. The pattern 
of distribution of human beings over an area is 
thus a generalized indication of their economic 
organization and life style [see EcoLocy, article on 
HUMAN ECOLOGY]. It is also an index of the degree 
to which there have developed complex, well- 
differentiated social orders, in which both activities 
and populations exhibit high degrees of specializa- 
tion as interdependent unit parts of areally or- 
ganized social structures. Such social orders, or 
communities, show relatively high rates of both 
migration and social mobility, subjects of consider- 
able prominence in distribution research. 

A central problem for ecology, then, is to under- 
stand the collective adaptation, to a particular 
environment, of an aggregate with identifiable 
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characteristics. Among human beings such adapta- 
tion varies with the existing technology and results 
in distinctive and modifiable patterns of social or- 
ganization. Functionally these human communities 
are affected by their position in a hierarchy of 
interdependent communities, ranked in size and 
distributed over space. The physical and social 
structure of the community may be viewed as 
responses to its predominant functions, whether 
that community be hypothetical or real [see CITY, 
article on COMPARATIVE URBAN STRUCTURE]. How- 
ever, the relationships among these factors—spe- 
cifically those between numbers of people and their 
distribution in space—have not as yet been sys- 
tematically explored. Current knowledge depends 
rather heavily on a combination of logical assump- 
tions, conjecture, and limited, unrelated observa- 
tions of particular populations that are often 
unrealistically treated as closed systems. Interde- 
pendence of parts is well established in theory, but 
quantitative proof is largely lacking (see, however, 
the demonstration of economic interrelations in 
Duncan et al. 1960). 

Nevertheless, ecological studies continue to dem- 
onstrate how varying distributions are reflected in 
clearly differentiated patterns of organization. For 
example, Donald Bogue (1949, part ur) has shown 
how variations in population density, related to 
distance from metropolitan cities, were associ- 
ated with variations in the average amounts of 
trade, services, and manufacturing activities in the 
metropolitan communities of the United States in 
1940. The growth of cities has been shown to n 
accompanied by an increasing differentiation 0 
specialized parts, with a relative concentration 0 
population and of functions in particular sectors. 
The location of functional areas and their relations 
to one another vary in time and space, but m 
tendency for an uneven distribution of people ani 
activities to exist has led to the development © 
theories of urban structure based on zones, sopa 
multiple nuclei, and functional diferenri E 
(Duncan et al. 1960, chapter 2; Harris & Ullm: 
1945). 

Ecological processes. 
even distribution has also led ecolog reat 
phasize such processes as concentration, Cen a 
ization, segregation, invasion, and uoces ta K 
order to describe aspects of the process of pop! e 
tion redistribution in local areas. The study o 

A i articularly 
massing of people (concentration), p: traliza- 
around points of specialized activity (cen ofa 
tion), or the tendency for people or achive 
similar sort to locate together and to develop ee 
of relative homogeneity (segregation) seems SI 


ficially to present few methodological difficulties. 
Closer examination, however, shows that these 
common concepts are difficult both to define and 
to measure. A major problem is that none of the 
processes is independent of area. Accordingly, the 
selection of different areal boundaries will alter 
the substantive findings. For instance, in calculat- 
ing the population density of a city, it makes a 
great deal of difference whether the base for cal- 
culation is the central part of a metropolitan area 
or some larger unit that includes suburban or even 
agricultural land. Similarly, the various indexes 
that have been constructed to measure residential 
segregation have frequently been criticized for fail- 
ing to measure what their authors thought they 
were measuring, The difficulty, again, is in the 
areal base unit selected. 


Rank-size distribution of places 


The question of distribution of population by 
rank-size of community has received considerable 
attention. As early as 1826 the German economist 
J. H. von Thiinen, in Der isolierte Staat, suggested 
that ideally a single city on uniformly flat land 
devoted to agricultural activity would locate cen- 
trally and become surrounded by concentric rings 
of land use, Over a century later Walter Christaller 
(1933) argued in a similar vein that a central 
place, whose function it is to provide services for 
a surrounding hinterland, can exist only where 
there is an amount of productive land sufficient to 
support it. The central-place theory is an ideal- 
type formulation and therefore cannot be expected 
to provide an accurate description of the distribu- 
tion of a specific population, Nevertheless, it is 
useful in identifying a number of factors involved 
in place distribution and in providing a generalized 
measure from which deviations are to be expected 
in reality, The explanations of such variance are 
not apt to be covered by any single theory. [See 
CENTRAL PLACE.] 

A number of attempts have been made to for- 
mulate a more precise statement of the distribution 
pattern of communities as a function of their size 
(Berry & Garrison 1958). One example is the work 
of George K. Zipf (1949), who developed a rule of 
rank-size relations to describe a general tendency 
toward rank frequency in all human activity. As 
applied to communities, Zipf’s hypothesis attempts 
to make specific the common observation that 
in most inhabited regions there are a few large 
places and an increasing number of smaller ones. 
The number of inhabitants of a center multiplied 
by its rank would thus yield a constant. Testing 
this simple rank-size rule with places of 1,000 or 
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more population in the United States in 1950, 
O. D. Duncan and A. J. Reiss, Jr. (1956, pp. 26-28) 
concluded that it did not provide a good fit for the 
distribution of places by size in that year. Despite 
this they judged the rule in a generalized form a 
useful summary of the widely noted inverse rela- 
tionship between the size of a community and the 
number of such centers, The same authors went 
on to demonstrate that there are significant differ- 
ences in age, marital conditions, income, and other 
demographic and social factors by size of place. 
This confirms the importance of studies of the 
changing distribution of population into metropoli- 
tan, urban, and rural categories, supplemented as 
necessary by finer classifications, 


Vincent H. WHITNEY 


[See also CENTRAL PLACE; EcoLoGy, article on HUMAN 
ECOLOGY; GEOGRAPHY; MIGRATION; RANK~SIZE RELA- 
TIONS; SOCIAL MOBILITY; and the biographies of 
RATZEL; THÜNEN.] 
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vI 
POPULATION GROWTH 


“Population growth” seems a partial term for a 
field of study where decline, fluctuations, and sharp 
changes can also occur, unless it is understood 
that “growth” is not necessarily positive or mon- 
otonic. Population growth is the sum of natural 
increase (births minus deaths) and net migration 
(immigrants minus emigrants); but each of these 
balances can be negative, and so can be their sum. 
Population growth equals natural increase only 
where net migration is nil or negligible. 

Ordinarily, population growth is studied by the 
comparison of rates in time and space—for exam- 
ple, by comparing annual or average annual rates 
of growth. This presupposes the absence of com- 
pelling events disruptive of orderly time sequences, 
such as the invasion of a territory, removals or 
displacements of minorities, and casualties sus- 
tained in famine, epidemic, war, or a large natural 
disaster. Such events are said to cause population 
“gains” or “losses,” and balance sheets can be 
drawn up after their occurrence. But it is not pos- 
sible to make a clear separation between population 
changes caused by disruptive events and all other 
kinds of population changes. In a long historical 
perspective, all societies are prone to incur crises; 


the victims, constituting “excess deaths” in the 
short run, are then part of the normal long-run 
risks of society. Again, war losses are not confined 
to deaths from deliberate military action but in- 
clude excess deaths due to deterioration in food, 
shelter, and medical care, a birth deficit due to the 
mobilization, captivity, or death of potential fathers, 
etc. Such losses are, therefore, incalculable. 


Population growth from 1950 to 1960 


Two types of population growth rates are used: 
exponential rates (per 1,000 per annum), for com- 
parison with other demographic measures, and 
compound interest rates (per cent per annum), for 
comparison with growth in nondemographic quan- 
tities, e.g., income. There is a slight difference in 
per-unit value of these rates, since the exponential 
rate refers to population of midyear, or the mean 
of the period, while the compound interest rate 
refers to that at the beginning of each year. 

Table 1 shows world population counts and pop- 
ulation growth rates, as compiled by United Na- 
tions demographers, for the period 1950-1960. 
Many of these figures are unofficial, and some are 
very rough, The four most populous countries con- 
tain half the world’s people within one-third of 
the world’s land. The 20 countries which follow 
have 30 per cent of the world’s inhabitants on 
one-sixth of the world’s land. The 205 less popu- 
lated countries and territories, not separately 
shown, extend over large land areas, including 
Canada, Australia, and large portions of Africa and 
South America. i 

From 1950 to 1960 the world’s population in- 
creased by an amount (482 million) larger than 
the 1960 population of India (433 million) or of 
Europe outside the Soviet Union (425 million). 
The 1950-1960 gain in China alone approximately 
equaled the 1950 population of Japan; other com- 
parisons of this sort are no less spectacular. Ex- 
ceptionally high rates of growth were shown by 
Indonesia and by Pakistan, which by 1960 had 
both surpassed Japan in population. Comparisons 
between the growth rates of European and non- 
European nations are also instructive; Brazil, for 
instance, with a growth rate of 3 per cent per 
annum, not only is growing twice as fast as any 
of the western European nations except Switzer- 
land but also has, since 1950, drawn far ahead of 
any of them in absolute numbers. ’ 

Among gains in average density not shown in 
the table, it should be noted that during 1950- 
1960 some 800 persons were added per square 
kilometer of land in Hong Kong and about 160 per 
square kilometer of habitable or cultivable land in 
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Table 1 — Estimated 1950 and 1960 population, land area, population gain 1950-1960, gain per square kilometer, 
and average annual rate of growth: 24 most populous countries and rest of the world 


POPULATION AVERAGE 
Millions LAND AREA POPULATION GAIN ANNUAL 
1950 Million square Per square GROWTH’ 

i (midyear) 1960 kilometers Millions kilometer Per cent 
China 

(Chinese People’s Republic) 560.0° 650.0° 9.6 90.0* 9 1.5 
India 359.2 432.8 3.0 73.6 25 19 
Soviet Union 180.0 214.4 22.4 34.4 2 18 
United States 152.3 180.7 9.4 28.4 3 17 
Japan 82.9 93.2 0.4 10.3 26 1.2 
Indonesia 767 94.2 1.5 17.5 12 21 
Pakistan 75.0° 100.0° 09 25.0° 28 2.9 
Brazil 52.3 70.5 8.5 18.2 2 3.0 
United Kingdom 50.6 52.5 0.2 19 10 0.4 
West Germany p 

(German Federal Republic) 47.8 53.2 0.2 5.4 27 11 
Italy 46.6 49.6 0.3 3.0 10 0.6 
France 41.7 45.7 0.5 4.0 8 0.9 
Nigeria 37.5 50.0° 0.9 12.5° 14 2.9 
Spain 27.9 30.3 0.5 2.4 5 0.8 
Mexico 25.8 35.0 2.0 9.2 5 3.1 
Poland 25.0 29.7 0.3 47 16 17 
Turkey 20.9 27.8 0.8 69 9 29 
South Korea 

(Republic of Korea) 20.5 247 0.1 42 42 1.94 
Egypt 

(United Arab Republic) 20.4 26.0 1.0 5.6 6 2.5 
Philippines 20.3 27.4 0.3 71 24 3.0 
Thailand 19.5 26.4 0.5 6.9 14 3.1 
Burma 18.5 22.3 07 3.8 5 19 
East Germany! 

(German Democratic Republic) 18.4 17.2 0.1 1.2 -12 —0.7 
Argentina 17.2 21.0 2.8 3.8 1 2.0 
Other countries? 519.0 623.4 68.9 104.4 2 19 

World total 2,516.0 2,998.0 135.8" 482.0 4 1.8 


2 


. Compound interest rate. 


r 


. Population estimates very rough. 

. The 1960 total of 99.95 millions was adjusted for un 
to adjust the 1950 total. This probably accounts fi 

|. 1955-1960 average: 2.8. 

. Habitable and cultivable area: 35,580 square kilometers. 

. Includes East Bi 

. Consists of 205 countries and territories. 


2 


ze 


. Antarctic not included. 


Egypt. In Table 1 the density gain of South Korea 
stands out as the largest, and average density in- 
creased by more than 20 persons also in India, 
Japan, Pakistan, West Germany, and the Philip- 
pines. Some fast-growing countries (€.g., Brazil 
and Mexico) had smaller density gains than some 
slow-growing countries of initially high density 
(e.g, the United Kingdom and Italy). But the 
measure is summary and can easily deceive. Gains 
in average density have been slight in the United 
States and Argentina, while, with rapid metro- 
politanization, the proportion of those countries’ 
inhabitants living in high-density zones has marked- 
ly increased. Also, average density is unrepresenta- 


\derenumeration of the 1961 census as esti 
‘or the much higher rate of growth than the 


Sources: Adapted from Uni 


mated by the Planning Commission's report. No attempt was made 
w officially calculated one of 2.1 per cont per annum for 1950-1960. 


ited Nations 1966 (population); Demographic Yearbook 
1965 (areas). Population gain and growth rates calculated by the author. 


tive of density in those minor portions of China, 
Indonesia, and Pakistan which contain the major- 
ity of those countries’ populations. 

The countries in Table 1 can be grouped by 
1950-1960 growth rates as follows: (1) 2.5 per cent 
or higher—Brazil, Nigeria, Mexico, Turkey, South 
Korea (based on 1955-1960 data only), Egypt, the 
Philippines, and Thailand, totaling 288 million 
inhabitants in 1960; (2) 1.5 to 2.4 per cent— 
China, India, the Soviet Union, the United States, 
Indonesia, Pakistan (based on 1950 and 1960 offi- 
cial estimates), Poland, Burma, and Argentina, 
totaling 1,745 million inhabitants in 1960; (3) 
0.5 to 1.4 per cent—Japan, West Germany, Italy, 
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France, and Spain, totaling 272 million inhabitants 
in 1960; and (4) 0.4 per cent or less—the United 
Kingdom and East Germany, totaling 70 million 
inhabitants in 1960. 

The above list excludes the 205 less populated 
countries and territories, but it is evident that the 
great majority of the world’s population was sub- 
jected during 1950-1960 to rates of growth be- 
tween 1.5 and 2.4 per cent per annum and that 
growth rates greater than 2.5 per cent were not 
exceptional. In these respects the 1950s have been 
unlike any earlier phase in mankind's history. 
Except where there occurred settlement of mostly 
vacant territory, no sizable population has ever 
before grown at a rate as high as 2.5 per cent. 
Sustained growth of a large population at a rate 
as high as 1.5 per cent was relatively exceptional 
in the decades before 1950. 


World population growth up to 1750 


If the history of Homo sapiens is counted in 
hundreds of thousands of years, the present world 
population could have resulted from an initial 
small tribe with an annual rate of growth of 0.01 
or 0,02 per cent. The notion that any human pop- 
ulation would grow so steadily at so minimal a 
rate is, of course, absurd. Both growth and decline 
must have occurred at different epochs in different 
areas, but during some critical epochs growth may 
have prevailed significantly over decline. Thus, 
very few million individuals could have subsisted 
at any time on the earth’s surface with a paleolithic 
culture, whereas a neolithic technology and social 
organization might have sustained a substantial 
number-of millions, 

The major early metal-using civilizations (those 
of the Euphrates, Nile, Indus, and Yellow rivers) 
probably supported several million inhabitants 
each, all of them dangerously exposed to destruc- 
tive predatory attack. Establishing large areas of 
comparative peace and security, the Han empire 
of China, ASoka’s empire in India, and Roman 
rule around the Mediterranean basin combined 
larger populations, between 50 and 100 million 
each (Durand 1960; Russell 1958; Boak 1955). 
The world’s population at the beginning of the 
Christian era has been estimated at 300 million. 

Undoubtedly there was population decline when 
these empires crumbled and more still when reli- 
ance on military prowess became a substitute for 
the then irrecoverable social organization that the 
empires had sustained. Previously sizable popula- 
tions in Central America, Mesopotamia, and south- 
east Asia have also vanished at various times, 
possibly owing to similar cultural changes. But his- 
torical demographic research has been focused 


mainly upon Europe and China. According to J. C. 
Russell (1958), Europe’s population was 33 mil- 
lion in A.D. 1, had fallen to 18 million by about 
600, rose to 70 million after the end of the thir- 
teenth century, and was reduced by the Black 
Death to possibly little more than 40 million. 
John Durand (1960) has estimated that China’s 
population was 71 million in A.D. 2, had fallen to 
37 million by 705, rose to more than 120 million 
in the thirteenth century, and was reduced to 60 
million in the fourteenth. Inexplicably, these ups 
and downs in Europe and China were nearly syn- 
chronized. Little is known about population changes 
elsewhere in the world during the same centuries; 
if we had more knowledge of such changes, it 
might be guessed how much the world total had 
shrunk by the seventh century, how much it had 
risen by the thirteenth, and how much it decreased 
again in the fourteenth. These guesses cannot jus- 
tifiably be made without a prior marshaling of large 
amounts of relevant evidence, most of it very hard 
to find. 

As estimated by W. F. Willcox (1931) and 
revised by Alexander Carr-Saunders (1936), world 
population may have totaled 545 million by 1650, 
rising to 728 million by 1750; an average annual 
rate of world population growth of 0.3 per cent 
is thereby suggested, not impossible for that period 
or for times such as the rise of the ancient em- 
pires or a few of the centuries between A.D. 700 
and 1300. Since growth did not occur in every area 
and probably met with temporary reversals even 
in the areas of growth, a long-term world average 
of 0.3 per cent can be consistent with rates two to 
five times as high within relatively favored areas 
in undisturbed times, and such rates would have 
resulted under premodern peacetime levels of fer- 
tility and mortality in organized states. Such a rate 
might have caused the 300 million population of 
Roman times to increase to the size of the present 
world population within one millennium instead of 
two. Actually, severe cutbacks occurred in the first 
few centuries A.D. and again in the fourteenth 
century. 


Growth in world regions, 1750-1960 


Tables 2 and 3 link the world population esti- 
mates of Carr-Saunders (1936), used here for 
1750-1900, with more-recent estimates, for 1920- 
1960 (United Nations 1966). Comparable data a 
1910 are lacking, but rough figures can þe inserte i 
since the two series are in substantial agreement 
The figures are for the traditional continents; rete 
Europe includes the Soviet Union up to the Ur 
Mountains. 4 sld 

It will be seen that, by half centuries, wo! 
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Table 2 — Estimates of world population (in millions), for selected periods, 1750-1960 


Asia’ Europe* Africa 
1750 479 140 95 
1800 602 187 90 
1850 749 266 95 
1900 937 401 120 
1910 1,018 450 135 
1920 1,044 461 143 
1930 1,149 505 164 
1940 1,278 541 191 
1950 1,417 536 222 
1960 1,706 593 273 


North Latin World 
America® America® Oceanio total 
13 n1 2 728 
57 18.9 2 906 
26 33 2 1171 
81 63 6 1,608 
100 75 7.5 1,786 
5 90 8.8 1,862 
134 107 10.4 2,069 
144 130 11.5 2,296 
166 162 13.2 2,516 
198 212 16.3 2,998 


o. Includes portions of Soviet Union and Turkey, as traditionally assigned to Asia and Europe respectively. 


b, United States, Canada, Greenland, and minor areas. 
c. Caribbean, Middle and South America. 

d, Includes Hawalli, 

e. Author's rough conjecture (see text). 


population growth accelerated gradually until the 
late nineteenth century and sharply thereafter. 
This happened despite a nineteenth-century slow- 
down in Asia (particularly China) and a twentieth- 
century slowdown in Europe. Africa's population 
probably languished under the slave raids until 
pacification caused its renewed growth. The small 
indigenous populations of North America and 
Oceania were swamped by European settlers, who 
arrived at a high rate in relation to initially small 
numbers; but the growth rates of these two con- 
tinents decreased as the settlement populations 
themselves attained larger size. In Latin America 
an initially substantial indigenous population is 
believed to have declined severely after the Spanish 
conquest; the partly mixed population eventually 


Table 3 — Average annua! 


I rates of world population growth (per cent), 


Sources: Adapted from Carr-Saunders 1936 (estimates for 1750-1900); 
United Nations 1966 (estimates for 1920-1960); figures for 1910 
are author's rough conjectures. 


resumed growth, and its present rate of natural 
increase is of a level unsurpassed by any other 
population of such size. 

If we examine the same data by decades, it 
becomes clear that the world’s population was, 
at the turn of the century, already growing at the 
rate it almost maintained until 1950—in spite of 
a number of setbacks: World War 1 and a world- 
wide influenza epidemic in the decade from 1910 
to 1920; the economic depression of the 1930s; 
and World War 1 in the 1940s. Accordingly, an 
unprecedentedly high rate of growth emerged in 
the relatively undisturbed 1950s, Europe (includ- 
ing most of the Soviet Union ) suffered the heaviest 
war losses and fell increasingly behind the world 
average rates of growth, although even here growth 


for selected periods, 1750-1960 


North Latin World 

Asia* Europet Africa America” America® Oceania’ totol 

1750-1800 0.5 0.6 - 3.0 WW - 0.4 
1800-1850 0.4 07 - 3.1 11 - 0.5 
1850-1900 0.4 0.8 0.5 2.3 13 2.2 0.6 
1900-1950 0.8 0.6 1.2 1.4 1.9 1.6 0.9 
1910° 0.8 1.2 1.2 2.1 18 2.3 1.0 

i RD 0.3 0.2 0.6 14 18 16 0.4 
1920-1930 1.0 0.9 14 1.5 1.8 17 i 
1930-1940 MW 0.7 1.5 0.7 2.0 1.0 MW 
1940-1950 1.0 -0.1' 1.5 1.4 2.2 14 0.9 
1950-1960 1.9 1.0 21 1.8 27 21 1.8 


. Includes portions of Soviet Union and Turkey, 

. United States, Canada, Greenland, and minor areaž: 
Caribbean, Middle and South America- 

. Includes Hawaii. 

e. Author's rough conjecture (see text)- 

f. Net decrease due to heavy war losses- 


esre 


as traditionally assigned to Asia and Europe respectively. 


Sources: Adopted from Carr-Saunders 1936 (estimates for 1750-1900); 
United Nations 1966 (estimates for 1920-1960); figures for 1910 
are author's rough conjectures. 
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was faster in the 1950s than in any decade since 
1910. The manner in which Africa’s population 
growth has accelerated is guesswork, but there is 
no doubt that the current rate is high. Immigration 
has been of decreasing effect in the Americas and 
in Oceania. A large recovery from the depressed 
birth rates of the 1930s brought population growth 
in North America and Oceania back to a high level 
in the 1950s. Because of decreasing death rates, 
the rate of natural increase has risen enormously 
in Latin America, very considerably in Asia, and 
at least appreciably in Africa, 


Projections and forecasts 


Methodological considerations. The dynamics 
of population growth has come to be recognized as 
a complex subject and far from self-contained. On 
the scale of decades, potentials for growth depend 
on the age composition of the population. But 
changes in age composition occur also as a result 
of changes in the components of growth: prin- 
cipally, changes in the birth rate; to a minor ex- 
tent, relative changes in age-specific death risks 
and also migration, in those instances where this 
is large and affects particular age groups. Although 
the crude birth rates and crude death rates roughly 
reflect fertility and mortality trends, both they and 
the consequent natural increase are in part a 
function of varying age composition and should 
be interpreted accordingly. 

A fall (or rise) in mortality affects population 
growth very directly, especially in high-fertility pop- 
ulations which have age structures such that any 
considerable change in death risks causes a large 
change in the crude death rate. The present rapid 
accelerations of population growth in Africa, Asia, 
and Latin America are thereby explained. 

A change in fertility does not attain its full 
effect on population growth until the passing of 
several decades. The effect on the birth rate be- 
comes large when the reduced (or increased) num- 
bers of children reach the ages of parenthood. In 
the still longer run, the proportions of persons of 
advanced age can become considerably altered, 
thereby affecting the level of the crude death rate. 
The higher proportion of aged persons explains 
why there are higher death rates in European coun- 
tries with superior health conditions than in some 
poorer countries elsewhere. 

These few observations do not exhaust the ana- 
lytic complexity of the subject. It must be added 
that the fertility, mortality, and migration changes 
themselves occur in response to changing economic 
and social conditions, attitudes, and culture. 

The future of world population. A population 
projection (or forecast) is calculated, age group 


by age group, by applying data and trends in age- 
specific fertility and mortality rates to an initial 
set of data or estimates of the population’s age 
composition. The quality of the basic data is, of 
course, important and may require detailed ana- 
lytic appraisal or the construction of estimates cor- 
rected for detectable sources of error, Judgment is 
necessary, furthermore, concerning factors likely 
to affect levels and trends in fertility, mortality, 
and migration in the future and concerning the 
likely nature and extent of those effects. An area 
of plausible foresight may then become circum- 
scribed that, if sufficiently narrow, can provide a 
basis for economic and social decision making. 
Since plausibility of the moment is sensitive to any 
new item of information, projections intended to 
have predictive relevance may have to be revised 
frequently. 

Tables 4 and 5 show calculations of future world 
population, as prepared by the United Nations 
(1966). The distinction here between “projection” 
and “forecast” lies in the intended greater predic- 
tive relevance of the latter. A projection may show 
the consequences of given trends, whether these 
are believed likely or not. The consequences of 
trends or changes judged to be likely constitute a 
forecast. . 

The “constant fertility, no migration” projection 
shown in Tables 4 and 5 measures the long-range 
effect of continuance of world-wide population 
trends which were typical of the 1950s: a continu- 
ing tempo in the decline of mortality, maintenance 
of current levels of fertility, and no migration. 
The effects of such trends, calculated separately 
for each region, are then summarized. On these 
assumptions, population growth in most regions 
would accelerate much beyond the 1950-1960 rate 


and the world’s population would total 7,500 mil- 
lion by the century's end. 


Table 4 — Projections and forecasts of world population 
(in millions), 1960-2000 


“Constant fertility, 


no migration” 
je “Medium” forecast 


Estimate projection 


1960 1980 2000 1980 2000 

South Asia B65 1,446 2702 1420 27 

East Asia 794 1,143 1,811 1,041 Lae 

Europe" 425 496 S71 4798 aa 

Africa 273 458 860 449 fe 
Soviet Union 214 295 402 278 R i 

Latin America 212 387 756 32: a 
North America® 199 272 388 262 a | 
Oceania* 157 22.0 32.5 22.6 4 j 
World total 2,998 4,519 7,522 4,330 613 | 


a. Without Europeon portions of Soviet Union and Turkey- 
b. Includes Hawaii: 
e Excludes Howoii. 


| 
Source: Adapted from United Nations 1966. l 


Table 5 — Projections and forecasts of average annual 
rates of growth (per cent) of world popula- 
tion, 1960-2000 
“Constant fertility, 


no migration” “Medium” 
projection forecast 

Estimate 1960- 1980- 1960- 1980- 

1950-1960 1980 2000 1980 2000 
South Asia 2.2 2.6 3.1 2.5 21 
East Asia TS 1.8 2.3 1.4 1A 
Europe" 0.8 0.8 0.7 0.6 0.5 
Africa 2.1 2.6 3.2 2.5 27 
Soviet Union 17 1.6 1.6 1.3 1.2 
Latin America 2.7 3.1 3.4 29 27 
North America® 1.8 1.6 1.8 1.4 1.5 
Oceania® 2.1 17 2.0 1.8 ee 
World total 1.8 2.1 2.6 1.9 1.8 


a. Without European portions of Soviet Union and Turkey. 
b. Includes Hawaii. 
c. Excludes Hawaii. 
Source: Adapted from United Nations 1966, 


The forecasts employ varied assumptions, made 
separately for each region, as to plausible decreases 
in mortality, some continuance of recent currents 
of migration, and the possibility that sooner or 
later substantial decreases in fertility might ap- 
pear in different parts of the world. Three sets of 
forecasts were originally calculated: two aiming 
near the upper and lower margins of probability, 
and the “medium” set near the center of the range 
of what present information suggests to be fairly 
probable, Only the medium forecasts are shown 
here. The information available at the time the 
calculations were made suggested as most probable 
a continuance of world population growth at ap- 
proximately its recent rate for at least three and 
possibly several more decades, this being a net 
outcome of probable acceleration in some regions 
and slowdown in some others. For the year 2000 
the medium forecast leads to a world total of about 
6,000 million; the high, to 7,000 million; and the 
low, to 5,400 million. 

It is evident that many decades will have to 
elapse before ordinary processes can reduce the 
current tempo of world population growth to a 
moderate rate, The intervening phase of rapid 
growth adds human numbers so large as to be 
difficult to imagine. One may justly wonder 
whether human ingenuity will suffice either for a 
sufficiently rapid strengthening of social organiza- 
tion to cope with the increasing tasks or for a 
sufficiently rapid spread of motivation for and 
confidence in methods of family limitation through- 


out the world. 
Joun V. GRAUMAN 


[See also FERTILITY; FERTILITY CONTROL; Foon, article 
PLANNING, ECO- 


on WORLD PROBLEMS; MORTALITY; 
Nomic; and the biography of Wit1cox.] 
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VIL 
POPULATION POLICIES 


Population policies may be defined as legislative 
measures, administrative programs, and other gov- 
ernmental action intended to alter or modify exist- 
ing population trends in the interest of national 
survival and welfare. Many aspects of public policy 
and of social change in general have an impact 
upon demographic trends. Population policy em- 
braces those aspects of public policy that are de- 
signed to counteract the unwanted demographic 
effects of over-all policy and of other social forces. 
Most frequently, attention is focused upon efforts 
to maintain, increase, or restrain the rate of growth 
of a population. Thus, the major purpose is to con- 
trol population size, but consideration may also be 
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given to influencing its composition and its geo- 
graphic distribution. 

The quantitative aim of population policy is em- 
phasized here, partly because policies now in force 
are concerned primarily with affecting size and rate 
of change and partly because the inclusion of non- 
quantitative, or qualitative, aims would make pop- 
ulation policy virtually synonymous with public 
policy in general. Biological quality, which is the 
object of measures designed to control the genetic 
structure of a population, is sometimes regarded 
as properly the concern of population policy. Al- 
though some countries have enacted legislation 
that authorizes abortion and sterilization on eu- 
genic grounds, the number of such operations is 
quite small and their effect upon the biological 
constitution of the population as a whole is prob- 
ably negligible. Differential fertility, on the other 
hand, may have some effect, but existing policies 
do not attempt to control that aspect of demo- 
graphic change. Current action in this area is 
largely confined to those sections of immigration 
policy that are selective as to racial or ethnic 
origin, 


Historical background 


Government concern over matters of population 
is not a new phenomenon, State intervention, in 
the form of laws or decrees encouraging marriage, 
taxing the unmarried, subsidizing families with 
children, regulating immigration and emigration, 
fixing a legal minimum age for marriage, and the 
like, have existed since ancient times (Glass 1940, 
chapter 2). In general these measures represented 
a populationist philosophy that equated power and 
prosperity with large numbers. 

The expansionist motivation in population policy 
reached a climax in Germany, Italy, and Japan 
during the period between the two world wars. 
Intensive pronatalist propaganda, cash payments 
to families with children, the rewarding and hon- 
oring of motherhood, the repression of birth con- 
trol, the regulation of emigration, and the enact- 
ment of “eugenic” laws, all reflected the drive for 
larger native and racially “pure” populations, and 
they were directly associated with the political and 
territorial ambitions of the Axis powers. 

During the same period, policies with a some- 
what similar content but a different rationale were 
taking shape in other countries where very low 
rates of growth were evoking fears of an impending 
decline in numbers. Fertility rates were below re- 
placement levels in many of these countries, and 
although only France and Austria actually recorded 
an excess of deaths over births, it was considered 


but a matter of time until most of western Europe 
would be experiencing a natural decrease of pop- 
ulation. At the same time, sustained economic de- 
pression was precipitating a new concept of social 
justice, and governments were taking steps to pro- 
tect workers against the risks of unemployment 
and to guarantee a minimum family wage that 
would take account of the number of dependents 
supported by each worker. 

Although it was not clear to what extent the 
low birth rates then current were a continuation of 
the secular trend and to what extent they were a 
temporary phenomenon, it was thought that low 
marriage rates and low fertility within marriage 
had an essentially economic explanation. Conse- 
quently, the attempt to sustain or increase the 
birth rate became linked to the development of 
social security programs, particularly those aspects 
of social security that contribute to the economic 
security of the family. Because of this linkage, it is 
sometimes difficult to say whether measures favor- 
ing the family, maternity, and infancy have a 
demographic as well as a welfare intent, unless 
the government concerned specifically so states. 
Insofar as these programs do have a demographic 
intent, they are distinguishable from the popula- 
tionist policies described above, in that they are 
not expansionist in the imperialist sense but, rather, 
are animated by a desire to avoid population 
decline or, at most, to achieve a gently increasing 
population. 

Similar programs have developed in the Soviet 
Union and eastern Europe, as an integral part of 
the plan to build the socialist state. The popula- 
tionist overtones of these programs stem in part 
from the old controversy between Malthus and 
Marx, in which Marx took the position that “over 
population” was a misnomer for imperfect social 
organization, and in part from a felt need, in the 
Soviet Union at least, for a larger population. But 
in these countries, as elsewhere, the nature of 
population policy—and even the question of whether 
a policy exists—is to some extent a matter of the 
interpretation a government chooses to make of its 
actions and programs. Thus, a nation’s stated pol- 
icy is not necessarily an exact statement of its 
purposes of the moment. Indeed, the prevailing 
pattern of social and political organization is such 
that much of national policy in any area takes se 
through a series of compromises between contend 
ing pressures and hence has elements of ambiguity, 
not to say of ambivalence. 

After World War m, with the emergence of new 
nations and a growing awareness of the economic 


problems of underdeveloped countries, population 


a 


policies that represented a different point of view 
began to develop. In many of the underdeveloped 
countries mortality was falling rapidly, as a result 
of large-scale preventive measures, but fertility re- 
mained high and rates of increase as great as 2.5 
to 3.0 per cent per annum were either recorded or 
in prospect. Such rates were without precedent in 
Western experience. They implied a possible dou- 
bling of population within a generation and aroused 
fears that the effort to raise levels of living would 
be impeded by the necessity to provide subsistence 
for the increasing numbers. Policies that favor re- 
ducing or stabilizing the rate of population growth 
have therefore begun to evolve in some of the 
densely populated underdeveloped countries of the 
Far East and the Caribbean. 


Elements of policy 

Ideally, population policy involves the examina- 
tion of past and current demographic trends and 
their causes; an appraisal of the future demo- 
graphic changes implied by these trends; an evalu- 
ation of the social and economic consequences of 
expected patterns of change, in the perspective of 
what is regarded as the national interest; and 
finally, the adoption of measures designed to bring 
about desired changes or prevent undesired ones. 
Demographic trends are a function of changing 
relations between the forces of fertility, mortality, 
and migration, whether in the population as a 
whole or differentially in its various segments. 
Policy makers are therefore logically concerned 
with understanding the factors of change in these 
three processes and with ways and means of in- 
fluencing the direction and amount of change in 
each of them, However, practical considerations are 
such that most of population policy, as it exists 
today, is directed at influencing fertility, although 
the trends and effects of migration and mortality 
may also be carefully studied, for changes in them 
can be the precipitating factors that render popu- 
lation a “problem.” 

Migration. Control of international migration 
as a means of adjustment between high-density 
countries and low-density countries holds only lim- 
ited possibilities, principally because of national 
feelings, political differences, ethnic preferences, 
problems of assimilation, and fears of the eco- 
nomic consequences of inundation from abroad. 
The immigration laws of the so-called countries of 
immigration (Australia, New Zealand, and most of 
the countries of western Europe and the Americas) 
are generally restrictive, setting limitations upon 
the number and source of immigrants and barring 
those who, for political, social, or medical reasons, 
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are considered undesirable (International Labor 
Office 1954). Although some migration from high- 
density to low-density countries is encouraged, it 
is carefully controlled, often through bilateral 
agreements between the governments concerned. 
In these programs quantitative aims are present, 
but nonquantitative considerations usually take 
precedence. 

From the point of view of the densely populated 
underdeveloped countries, there are, thus, no avail- 
able outlets that could possibly siphon off the cur- 
rent and prospective increases in numbers. In effect, 
the solution of what are regarded as demographic 
problems is almost strictly a national affair. Ad- 
justments of population to resources or to a pro- 
gram of economic development must be effected 
principally within national borders, 

Insofar as the redistribution of population is a 
demographic process, the regulation of internal 
migration may be regarded as population policy, 
This aspect of public policy has not, on the whole, 
been so regarded, its aims being generally quali- 
tative rather than quantitative; but government 
action in this area can have great significance and 
can benefit from demographic analysis and demo- 
graphic insights. The important internal migrations 
of the past have, like other demographic processes, 
been largely unplanned and unguided; chief among 
them has been the movement from farm to city. 
Problems of urban congestion and of urban and 
rural slums exist almost everywhere. Modern pro- 
grams of resettlement, city planning, urban renew- 
al, relocation of industry, and aid to agriculture 
are examples of government attempts to cope with 
such conditions, and some of them utilize a demo- 
graphic approach and have in mind a partially 
demographic solution—namely, to influence the 
redistribution of population. This facet of policy 
could be increasingly emphasized if social and eco- 
nomic planning should become more prevalent and 
more sophisticated; it would take on a quantitative 
character if security considerations were to become 
a compelling factor. In the underdeveloped coun- 
tries economic progress will depend in part on the 
success with which population distribution and re- 
distribution are taken into account and made to 
contribute to, rather than impede, development. 

Mortality. Broadly defined, population policy 
includes measures intended to affect the death rate. 
But the purpose of such measures is to improve the 
health of the population, not to control the rate or 
direction of numerical change. To include the total- 
ity of such measures would be equivalent to iden- 
tifying health policy with population policy. Manip- 
ulation of the death rate in order to control the rate 
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of growth is not feasible, because there is only one 
policy in relation to mortality that is socially ac- 
ceptable—namely, to reduce it. In industrial coun- 
tries where there is a desire to encourage population 
growth, death rates are already so low that there 
is very little to be gained, in the way of growth, 
from further reductions at most ages. Thus, popu- 
lation policy, as such, does not place a great deal 
of emphasis on the reduction of mortality, although, 
of course, other aspects of welfare policy do em- 
phasize it very strongly. Indeed, the concept of 
public health has been extended in most of these 
countries, by means of national health insurance 
schemes or national health service systems, to in- 
clude individual medical care in the realm of public 
responsibility. 

In the underdeveloped countries where there is 
a desire to restrict the rate of growth, it might be 
possible to relax the struggle against mortality and 
allow death rates to rise. However, such a policy 
not only is offensive from the humanitarian point 
of view but would also run the risk of defeating 
the basic objective of improving the living condi- 
tions of the general population. Governments that 
allowed death rates to rise would soon realize that 
a competent labor force, essential to economic de- 
velopment, cannot be achieved without some safe- 
guards to health and some positive measures to 
insure adequate diet and tolerable working con- 
ditions. Furthermore, skills require training, prac- 
tice, and education of some sort. Meeting these 
requirements would contribute to a fall in the death 
rate, even if the government, as a matter of popu- 
lation policy, took no positive steps to reduce mor- 
tality. And there is the added circumstance that, 
because of modern technical knowledge and facili- 
ties, the basic preventive measures necessary to 
control the spread of infectious disease—mass in- 
oculation, spraying, and elementary sanitation— 
are relatively inexpensive and easy to administer. 


Control of fertility 


Under present political, cultural, and technologi- 
cal circumstances, the principal focus of efforts to 
influence population trends necessarily centers on 
the control of fertility. It so happens that, in gen- 
eral, countries wishing to stimulate growth are 
low-fertility countries and countries wishing to 
restrain growth are high-fertility countries. 

Only three countries can be said to have co- 
herent, carefully constructed, and frankly stated 
population policies: France, representing a strong 
pronatalist view; Sweden, representing a more tem- 
pered pronatalist view; and India, representing an 
antinatalist view. A much larger number of coun- 


tries have taken cognizance of population problems 
in one way or another: by appointing commissions 
to study the question and make proposals; by issu- 
ing statements of official attitudes; or by enacting 
legislation which probably has inherent demo- 
graphic aims, although other objectives may be the 
only ones acknowledged. A brief description of the 
policies of France, Sweden, and India, along with 
some indication of similar or relevant specific meas- 
ures taken in other countries, should give perspec- 
tive on how developments in this area are moving 
in the world in general. 

France. The essentials of French population 
policy are set out in the Code de la Famille, which 
came into force in 1940 (“Décret relatif a la fa- 
mille . . .” 1939). Its purpose is both to encourage 
family formation and childbearing in numbers 
sufficient to maintain a moderate increase in popu- 
lation and to counteract the general aging of the 
population. Specific provisions to this end include, 
on the one hand, positive measures for financial aid 
to marriage and child rearing and, on the other 
hand, repressive measures restricting the use of 
induced abortion and contraception, Subsequent 
legislation has introduced some changes in the pro- 
gram, the general effect of which has been to im- 
prove its administration and increase its benefits. 

The principal economic measure is the system 
of family allowances. Monthly cash allowances are 
payable to all families having two or more children 
under 15 years of age; in special circumstances 
the age limit is as high as 20 years. Reflecting pro- 
natalist intent, allowances are higher for the third 
and subsequent children than for the second child. 
Furthermore, families with only one wage earner 
receive allowances beginning with the first child, 
and the allowance per child is higher. In addition, 
prenatal and maternity allowances are available to 
all women. The amount of allowances is based on 
the current minimum wage of metalworkers, and 
so it tends to vary with the cost of living. As of 
1960, three-child single-wage families received al- 
lowances equal to at least 133 per cent of the i 
wage. The prenatal allowance averaged 21 per ceni 
of the base wage, and the maternity allowance, 
payable after the birth, was equal to 200 per Es 
of the base wage for the first birth and ee 
cent for subsequent births occurring within 
years of a prior birth. 

Further benefits to married coup 
ernment loans for various purposes, tax! 
and rebates on the costs of public servic 
services in aid of the family have taken 
of subsidies to school canteens, boarding 5 ne: 
vacation camps, day nurseries, and kindergartens; 


les include gov" 
ax reductions, 
es. Soci: 

the form 
chools, 


the provision of household help; and family coun- 
seling. Certain benefits, available to persons cov- 
ered by the social security scheme and contingent 
upon attachment to the labor force, are regarded 
as part of France’s program to compensate family 
expenses and have been regularly included in the 
computation of the costs of France's population 
policy. Principal among these benefits are reim- 
bursement for most of the cost of the medical care 
of the spouse and children of the insured, includ- 
ing maternity care of insured women and spouses 
of insured men, paid maternity leave for insured 
women, and leave with pay for the father at the 
time of the birth of a child. 

On the repressive side, the Code de la Famille 
re-embodied earlier legislation which made birth 
control propaganda, sale or advertisement of con- 
traceptives, and incitement to abortion illegal. The 
condom, considered a prophylactic, may be freely 
bought and sold, but other devices are forbidden. 
[For the laws governing abortion in France and 
other countries, see FERTILITY CONTROL.] 

Supplementing the pronatalist policy is the 
encouragement of immigration of a type that is 
considered compatible with both manpower and 
demographic needs. A final aspect of French policy 
is the existence of the Institut National d'Etudes 
Démographiques, created in 1945 to conduct re- 
search in problems relevant to population, follow 
studies and developments in other countries, and 
explore all possible means of increasing the num- 
ber and improving the quality of the population. 

Sweden. Demographically, Sweden is similar to 
France and most of the other countries of western 
Europe. Like France, Sweden has a closely rea- 
soned, highly developed population policy that is 
oriented toward sustaining the þirth rate. But in 
Swedish policy, consideration of individual welfare 
and personal freedom have taken precedence over 
pronatalist aims wherever the two were in conflict. 
Also, much more emphasis is placed on payments 
in kind and on the provision of institutional or 
social services in behalf of the family. 

Government action in matters of population be- 
gan in 1935, with the appointment of a population 
commission. On the basis of the deliberations and 
recommendations of that commission and of a sec- 
ond commission, which functioned from 1941 to 
1946, Sweden has developed a well-coordinated 
program, the themes of which are voluntary parent- 
hood and child welfare (Myrdal 1941; Gille 1948). 
Family allowances are payable in behalf of each 
child under the age of 16. Reflecting a welfare, 
rather than a pronatalist, emphasis, this is a flat- 
rate allowance beginning with the first child, and 
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the rate per child does not increase with the num- 
ber of eligible children. The amount relative to the 
cost of child care is somewhat lower than that paid 
in France, but the system of supplementary serv- 
ices and allowances in kind is much more exhaus- 
tive. The supplementary aids include marriage 
loans for the purchase of household equipment, a 
comprehensive system of maternal and child wel- 
fare centers, housing and fuel grants for families 
of moderate means with two or more children, free 
school meals, home-help services, holiday travel for 
mothers and children of families in difficult cir- 
cumstances, and tax relief. The sickness and ma- 
ternity insurance scheme covers all resident citizens 
and registered aliens, Maternity leave is compul- 
sory, and the costs of confinement are borne by the 
state. Women employees may not be dismissed 
because of pregnancy or childbirth. 

In keeping with the aim of voluntary parenthood, 
contraceptive advice is given at hospitals and health 
centers, contraceptives may be purchased at all 
pharmacies, the laws against induced abortion have 
been relaxed, and sex education has been made a 
regular part of the school curriculum. The object 
of this part of the program is to improve the qual- 
ity of the population, as well as influence popula- 
tion growth in the direction desired. Abortion has 
been legalized to the degree that medical boards 
may authorize the interruption of pregnancy on 
rather broadly defined therapeutic or eugenic 
grounds, taking into account the general social, 
medical, and psychological circumstances of the 
woman involved. Sterilization may be authorized 
for similar reasons. It was hoped that the more 
lenient attitude toward induced abortion would in 
the end reduce the number of such abortions; 
however, it is not yet possible to ascertain whether 
this program has had the desired effect (Gille 
1955). 

India. In India, population policy is oriented 
toward restraining the rate of increase, on behalf 
of economic development and of raising the level 
of living of the people. This policy was initiated in 
1952, with the first five-year plan, and subsequent 
action has put increasing emphasis upon the need 
to reduce the widening gap between a lowering 
death rate and a persistently high birth rate. The 
third five-year plan, promulgated in 1961, stated, 
“The objective of stabilising the growth of popula- 
tion over a reasonable period must be at the very 
centre of planned development” (India 1961, 
p. 675). The plan calls for a large-scale program 
of education and motivation for family planning, 
provision of birth control advice and contraceptive 
supplies, and government-sponsored research in 
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demographic trends, contraceptive methods, and 
family-planning motivation. Family-planning clin- 
ics have been established in a large number of 
rural areas, and family-planning services are avail- 
able at urban medical and health centers. [See Com- 
MUNITY, article on COMMUNITY DEVELOPMENT.] 
These activities are steadily expanding, with the 
government subsidizing the manufacture and dis- 
tribution of contraceptives, The question of how to 
make the program more effective is under constant 
study. At the request of the government, the United 
Nations sent a team of experts to India in 1965, “to 
assess the problems involved in accelerating the 
adoption of family planning by the people and to 
advise the Government on action that might be 
taken for this purpose” (United Nations 1965, 
pp. 9-10). 

The law against induced abortion has not been 
relaxed, and pregnancy may be artificially termi- 
nated only to save the life of the mother. Voluntary 
sterilization, however, is regarded as an acceptable 
means of preventing births, and the practice seems 
to be spreading. Another facet of Indian policy has 
to do with the effort to raise the average age at 
marriage. In this connection, the third five-year 
plan places special emphasis on the education of 
women and on the provision of new employment 
opportunities for women. 

Programs in other countries. In Europe all the 
rest of the countries have adopted programs that 
resemble those of France or Sweden in various ways 
and to varying degrees, but in none of them has 
population policy been as fully developed or as 
clearly stated. These countries have family allow- 
ance schemes and social insurance or national 
health service systems that cover all or part of the 
costs of maternity and the medical care of workers 
and their dependents. 

Measures in the Roman Catholic countries of 
western and central Europe and in the Nether- 
lands are similar to those in France but less inten- 
sive. They may be characterized as favoring natality 
and as repressing the practices of birth control 
and abortion. Family allowances are generally of 
the progressive type; laws regarding birth control 
propaganda and the sale of contraceptives tend to 
be restrictive; induced abortion is prohibited. Al- 
though pronatalist influences contributed to the 
enactment of these measures, they do not neces- 
sarily have an announced or a sustained pronatalist 
intent. Two countries—Italy and the Netherlands 
—have been encouraging emigration. 

Programs in the Scandinavian countries, the 
United Kingdom, and Finland are similar to those 
of Sweden. The general attitude is pronatalist, but 


demographic considerations take second place to 
considerations of immediate welfare. Family allow- 
ances tend to be of the flat-rate type; supplemen- 
tary aids to the family and institutional services 
are provided; legislation regarding birth control is 
permissive. Supplementary aids to the family are 
especially well developed in the United Kingdom, 
where the population question was studied in depth 
by a royal commission and where the program has 
had the benefit of the commission's findings and 
proposals (Great Britain 1949). In the attempt to 
combat induced abortion, legal strictures against 
abortion have been relaxed in Denmark, Finland, 
and Iceland; abortion and sterilization are per- 
mitted on medical, social, or eugenic grounds; and 
family planning is actively encouraged. In Norway 
only medical and eugenic grounds for abortion are 
recognized; in the United Kingdom, only medical 
grounds. Measures in the overseas Commonwealth 
countries of Canada, Australia, and New Zealand 
resemble those of the United Kingdom, In addition, 
these three countries have, since World War 1, 
instituted policies promoting immigration of a 
selective type. 

Policies in the Soviet Union and in eastern Eu- 
rope since World War 11 have been pronatalist in 
tone. Family allowances of the progressive type 
predominate, and special awards for mothers of 
large families are a persistent feature. Since 1955, 
legislation against abortion has been greatly liber- 
alized in most of these countries; in some the 
operation is obtainable at the woman’s request. 
The change was made on humanitarian and health 
grounds and was declared a measure intended to 
combat the dangers and frequency of induced abor- 
tion. The use of contraception, rather than abortion, 
is strongly advocated. The new policy does E 
necessarily indicate abandonment of the an i 
Malthusian point of view, but there have been Dri 
expressions of the opinion that rapid populate 
increase may interfere with economic deve ah 
ment (Mauldin 1960, p. 197; Brackett & Huye! 
1962). : aothe 

Outside of Europe, the Soviet Union, an eh 
Commonwealth countries mentioned, naon 
family allowance schemes, inclusive health ins a 
ance or health service systems, and related E 
ures in favor of the family have not yet pee 
very far, but programs of limited scope oF © a 
extragovernmental type have begun to grow ae 
many countries (U.S. Social Security sane 
tion 1961). The European type of program dto 
be seen as a culture trait that is being am E 
all parts of the world. As such, it has no virtu- 
acquired an aura of rightness that makes it 


ally independent of population policy. Changes of 
policy from pronatalism to antinatalism or indif- 
ference are, therefore, unlikely to dislodge it, 
although they may bring about some alteration of 
specific provisions. 

Policies resembling that of India are emerging 
in a number of other densely populated underde- 
veloped countries. Intensive government-sponsored 
programs to promote family planning have been 
launched in Pakistan and the Republic of Korea. 
In Taiwan the government gives informal support 
to a program that is conducted by private organi- 
zations. The first five-year plan of Turkey reversed 
prior policy and provides for family-planning edu- 
cation, Iran’s third plan mentions the need to popu- 
larize family planning. In Tunisia a policy favoring 
birth control is under study and a family-planning 
campaign has been started. In the United Arab 
Republic the charter promulgated by the president 
in 1962 stated that family planning was one way 
of alleviating the problem of low per capita pro- 
duction. Official interest has also been demon- 
strated, by government approval or support of 
planned parenthood programs, in Hong Kong, 
Malaysia, Thailand, Barbados, Ceylon, and the 
Philippines. China has liberalized its laws on abor- 
tion and sterilization and, despite strong commit- 
ment to anti-Malthusianism, has begun to encour- 
age birth control and advocate late marriage 
(United Nations 1964; Nortman 1964; Tien 1963; 
Studies in Family Planning). 


Effects of population policy 

Not much can be said about the effects of popu- 
lation policy. As far as the three countries with 
well-defined policies —France, Sweden, and India— 
are concerned, evidence that the desired effects are 
being produced is inconclusive. In France fertility 
is above prewar levels, and French analysts believe 
that French policy accounts for that fact. In 
Sweden, however, the crude birth rate of 1960 was 
the lowest in Europe and probably the lowest in 
the world. Growth rates remain high in India. 
Elsewhere current levels and recent trends in na- 
tional birth rates appear to bear no consistent 
relation to the presence, purpose, or content of 
national policy. 

It seems clear that once the majority of a popu- 
lation has recognized at the personal level the 
desirability of controlling family size, it will act 
without much regard for the position of the law 
or official policy. Thus, the secular fall in Western 
fertility took place over a period when birth con- 
trol was officially opposed and policy, if any, favored 
increase. The experience in Japan since World 
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War 1 is another case in point. Japanese law on 
abortion and sterilization was liberalized between 
1948 and 1954, and at the same time the use of 
contraception, rather than abortion, was urged 
upon the population. Recorded abortions increased 
rapidly and soon were approaching the number of 
live births. By 1960 the birth rate had fallen to a 
level comparable with that in: Western industrial 
countries, But the resort to abortion and the fall 
in the birth rate began before 1948; the practice 
had undoubtedly existed on a wide scale for some 
time (Taeuber 1958, p. 278). Legalization may 
have caused some of the increase in the number 
of abortions, but the important fact, from the soci- 
ological point of view, is that a great many people 
wanted to limit the size of their families. In other 
words, when such a conviction has arrived, ways 
and means of attainment will be found. An anti- 
natalist policy has the problem of instilling that 
conviction; a pronatalist policy, of dispelling it. 
The question of whether governments can or will 
provide incentives strong enough to change behav- 
ior in this area is still open. 

The period of the 1960s was one of rapid devel- 
opment in the area of population policy, especially 
among underdeveloped countries. Many influences 
—social, political, economic, religious—were at 
work, both in and outside of government and at 
both national and international levels. For instance, 
the U.S. Congress, in passing the 1966 Food for 
Freedom bill, specifically authorized the president 
to use the local currencies acquired in sales of food 
under the Food for Freedom program to set up 
birth control clinics in any nation covered by the 
program, if its government so requested, In Janu- 
ary 1967 President Johnson, in his State of the 
Union message, formally endorsed the “export” of 
birth control as a continuing policy. The elements 
of a domestic population policy also began to 
emerge, with the deepening involvement of the 
U.S. Department of Health, Education and Wel- 
fare in birth control activities, and the support of 
these activities by certain members of the Congress 
(see U.S. Congress 1966). But the Roman Catholic 
church, in spite of mounting internal criticism and 
considerable evidence that many Catholics either 
were using or wished to use some form of “artifi- 
cial” contraception, still had not modified its tradi- 
tional stand by mid-1967. In the meantime, most 
governments of Latin America—the region with 
the world’s highest rate of population growth—had 
yet to evolve population policies, although pri- 
vately sponsored conferences in 1965 and 1967 
allowed, for the first time, some public expression 
of concern. During the same period, other govern- 
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ments, such as that of the United Arab Republic, 
whose natural resources seemed inadequate to sup- 
port their rates of population growth, were under- 
taking extensive birth control programs for the 
first time. 

But it was impossible to foresee the net impact 
of these forces upon the content and effectiveness 
of official policy in matters of population. A con- 
temporary survey of government opinion, covering 
53 countries in various stages of economic devel- 
opment, found that while many governments were 
aware of problems associated with population 
changes, there were differences between them in 
the interpretations they gave to such changes and 
in the policies they considered acceptable for deal- 
ing with the problems that are created (United 
Nations 1964). It must be concluded that knowl- 
edge about the interaction between population 
trends and economic growth is still imperfect and 
that there is plenty of room for honest disagree- 
ment about which population policies will be most 
effective in securing the general welfare. 


Hore T. ELDRIDGE 


[See also FERTILITY CONTROL; Foon, article on WORLD 
PROBLEMS; POPULATION, article on POPULATION 
THEORIES.] 
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POSITIVISM 


There are two positivisms: that of the nineteenth 
century and that of the twentieth. Common to both 
is a continuation of the eighteenth-century philos- 
ophy of the Enlightenment. Metaphysics and theol- 
ogy are again brought before the bar of reason, 
with the insistence that the institutions appealing 
to them for justification be reformed or replaced. 
Science is claimed to provide the standards applied 
in this critique. The name “positivism” derives 
from the emphasis on the positive sciences—that 
is, on tested and systematized experience rather 
than on undisciplined speculation. 

The older positivism of Auguste Comte viewed 
human history as progressing through three stages: 
the religious, the metaphysical, and the scientific. 
His positivism was presented as articulating and 
systematizing the principles underlying this last 
(and best) stage. Law, morality, politics, and re- 
ligion were all to be reconstituted on the new sci- 
entific basis. Traditional religion, for instance, was 
to be replaced by a religion of humanity and reason, 
with rituals and symbols appropriate to the new 
doctrine (Simon 1963). Comte’s evolutionary and 
scientistic perspectives were shared by such men 
as Herbert Spencer and Thomas Huxley, but con- 
temporary movements of thought have been very 
little influenced by the older positivism. 

Twentieth-century positivism came to be known 
as logical positivism, to distinguish it from the 
older philosophy. (The movement itself preferred 
the name logical empiricism. ) The adjective points 
to the importance of the rationalist component in 
the modern view, which owes as much to Leibniz, 
inventor of the differential calculus and one of the 
pioneers of mathematical logic, as to Hume and 
the later British empiricists, like John Stuart Mill. 

Twentieth-century positivism. Modern positiv- 
ism began in the early 1920s with the establish- 
ment of the so-called Vienna circle by Moritz 
Schlick in association with Rudolf Carnap, Otto 
Neurath, Herbert Feigl, and a number of mathe- 
maticians and scientists. A few years later Hans 
Reichenbach and others in Berlin developed closely 
related ideas. In the late 1930s the center of the 
movement shifted to Chicago, where Carnap ac- 
cepted an appointment. There, under the influence 
of C. W. Morris, the contributions of American 
pragmatism made themselves felt. The movement 
came increasingly to be called scientific empiricism, 


which reflected its broader outlook; an Interna- 


tional Encyclopedia of Unified Science was pub- 
lived Journal of Unified 


lished, as well as a short- ! 
Science. Positivism as such lasted into the 1940s, 
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but continued to be indirectly influential by way 
of its impact on British analytic philosophers, es- 
pecially Gilbert Ryle and A. J. Ayer. 

Among the movements contributing to the rise 
of logical positivism, three are especially worthy 
of notice. First, around the turn of the century, a 
number of scientists—Karl Pearson in England, 
Pierre Duhem in France, Ernst Mach in Austria, 
and others—were directing attention to the logical 
structure of scientific theory, proposing, and to 
some extent carrying out, a reconstruction of sci- 
ence on a strictly empiricist and even phenomenal- 
ist basis and looking to the replacement of pictorial 
models by axiomization. This line of thought 
reached its culmination in Einstein’s special theory 
of relativity, which positivists later widely adduced 
as illustrating the intimate connection between 
meaning and verification, apropos of the concep- 
tion and measurement of space and time. A few 
years later Bertrand Russell embarked on a pro- 
gram of reducing mathematics to logic, along lines 
previously followed by Gottlob Frege. In collabora- 
tion with A, N. Whitehead he wrote the monumen- 
tal Principia mathematica (Whitehead & Russell 
1910-1913), which provided a comprehensive 
symbolic logic that was to become the language of 
the new philosophy. In the early 1920s his pupil 
Ludwig Wittgenstein published the superlative and 
important Tractatus logico-philosophicus (1921), 
which laid out the philosophical implications of the 
new logic in concise and often cryptic form. Finally, 
the political situation in central Europe after World 
War 1 helped to shape logical positivism in the 
spirit of the Marxist critique of ideology; also, anti- 
clericalism gave particular relevance to a philos- 
ophy that denied meaning to even the questions 
posed by theology. 


Philosophy and metaphysics 

The positivist conception of the nature of philos- 
ophy marked a radical departure from the prevail- 
ing view. Philosophy is not a doctrine embodying 
“wisdom”—it is an activity; it is neither a theory 
nor a way of life but rather a way of analyzing 
what is said in the course of living or in theorizing 
about life. The business of philosophy is not to 
arrive at a certain set of propositions embodying 
a suprascientific truth; its business is to make prop- 
ositions clear. Schlick looked forward to the day 
when there would be no more books on philosophy 
but all books would be philosophically written. 

As a distinctive activity, philosophy consists in 
analysis, While the synthetic method, as practiced 
by mathematics and science, builds up conclu- 
sions from initial assumptions or data, analysis, 
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as Russell in particular emphasized, digs down to 
the foundations. It looks to presuppositions rather 
than to outcomes; it aims at laying bare the “logi- 
cal atoms” out of which our complex ideas are 
compounded. (The synthetic aspect of thought was 
later provided for in the positivist ideal of a “unified 
science.”) The name “logical positivism” calls at- 
tention to both the form and the matter of the new 
philosophy: its method is logical analysis, and its 
subject matter is the positive sciences. The later, 
so-called analytic philosophy, especially as it de- 
veloped in England (in such journals as Analysis 
and Mind), differs in both respects from the logical 
positivism by which it was so deeply influenced: 
its method is more linguistic than logi¢al, and its 
subject matter is provided as much by the discourse 
of law, morality, and everyday life as by what 
positivism calls “the language of science.” 

Clarity and meaning. The governing ideal of 
the activity of analysis is clarity. Russell, taking 
science and mathematics as his exemplar, insisted 
that it is better to be clearly wrong than vaguely 
right. Knowledge grows by disproof as well as by 
confirmation, but intimations and adumbrations of 
the truth are of no great cognitive value. Opposi- 
tion to the positivist movement in recent years has 
crystallized in the slogan “Clarity is not enough”; 
whether because of the intrinsic nature of philo- 
sophic problems or because of the limitations of 
the resources we bring to bear on their solution, 
the ideal of clarity is not just unattainable but 
misguided. According to this view, the great ques- 
tions of human nature and destiny will not yield 
to exact treatment. To give up muddleheadedness 
for simple-mindedness is a worthless exchange. 

From the outset, however, logical positivism re- 
jected these “great questions” as meaningless. The 
problems taken to be most characteristically phil- 
osophical—those of metaphysics—are in fact 
pseudoproblems, which are incapable of solution 
not because of their profundity but because they 
pose nothing to be solved. The questions asked 
have the form of questions but are lacking in con- 
tent. Philosophy need not decide between alterna- 
tive answers, since all are equally uncalled for. 
Thus, for instance, agnosticism is as much to be 
rejected as is theism or atheism, because the agnos- 
tic, in maintaining that the answer is unknown, 
acknowledges the genuineness of the question. To 
the categories of truth and falsehood, into which 
statements were previously classified, the positivist 
added a third category: nonsense. It is indeed this 
third classification that is the distinctive concern 
of philosophy; to decide whether statements are 
true or false is the business of science. What phi- 


losophy does is to show, by logical analysis, which 
statements are eligible for scientific consideration 
and how they are to be considered. 

:To do this work, philosophy needs what Karl 
Popper has called (while repudiating this use) a 
“criterion of demarcation”"—a way to distinguish 
meaningful from meaningless statements. Such a 
criterion positivism found in the so-called verifi- 
ability theory of meaning. (Whether it is indeed 
a theory, or rather a rule or stipulation, was a 
matter of controversy within the movement, as well 
as with its critics.) The verifiability principle al- 
lows meaning only to statements capable of verifi- 
cation, and it allows only so much meaning as is 
verifiable (unless they are statements of pure logic 
or mathematics). A way of testing whether a state- 
ment is true or false is necessary to the statement’s 
having meaning, and as the slogan had it, its mean- 
ing lies in its method of verification. By this last 
formulation, positivism is closely linked with oper- 
ationism (Bridgman 1927). 

A satisfactory formulation of the criterion occu- 
pied much of the attention of the positivist move- 
ment (Hempel 1950), but none was universally 
accepted, even within the movement. If, like Ock- 
ham's razor, with which the principle was often 
compared, it is to free us from “surplus meanings,” 
the problem is how to shave close without cutting 
into the flesh. The difficulties are twofold. On the 
one hand, the criterion must be made loose enough 
to allow entry to the whole of science. Thus, falsi- 
fication is as acceptable as verification; for Popper 
(1934) it is the fundamental requirement. Some 
degree of confirmation or disconfirmation is all that 
can be asked for (Carnap 1936-1937), and the 
possibility of verification may be either a technical, 
physical, or merely logical possibility ( Reichenbach 
1938). On the other hand, a criterion liberal enough 
to allow for statements containing theoretic 
terms, whose verification may be extremely me 
and indirect, may readmit ideologies, myths, an 
ultimately metaphysics. 


Logic and mathematics A 

A major concern of modern positivism, a 
is central to both its method and its content, : 
the nature of language. Philosophy does not analyze 
things, as science does, but rather our ne 
things—or, more precisely, the language in v 
our ideas are expressed. The object of any ar 
sophical inquiry is accordingly known as the obi; 
language; the language in which the igui z 
is formulated is the metalanguage. In par a 
cases the two languages may coincide, in wee 
or in part; but one must always distinguish betw 


using a word and mentioning it—that is, saying 
something about the word itself. Statements that 
purport to be about objects but that can be ana- 
lyzed as (or replaced by) statements about lan- 
guage were called pseudo-object sentences by 
Carnap; many characteristic statements of meta- 
physics were taken to be of this kind. Thus, Witt- 
genstein’s assertion “The world is the totality of 
facts, not of things” (1921) might well be ren- 
dered as “Science is the totality of true sentences, 
not of names or predicates.” The notions of a 
statement’s being “about” something and of one 
statement’s being “replaceable” by another later be- 
came the focal points of much analysis and discus- 
sion, under the rubrics designation and synonymy. 

Following Charles Peirce, the nineteenth-century 
American philosopher, later thinkers classified lan- 
guage under sign processes in general; Morris 
(1938) formulated a widely used theory of signs, 
which was largely a codification of distinctions 
rather than a theory in the strict sense. Signs may 
be analyzed in three “dimensions,” or aspects, of 
their working: in relation to other signs (which is 
the province of syntactics or logical syntax); in 
relation to what they signify (semantics); and in 
relation to their users (pragmatics). In its early 
years positivism was preoccupied with syntax; 
later, semantics became the chief concern. Com- 
paratively little was done in pragmatics before the 
positivist movement as such came to a close [see 
SEMANTICS AND SEMIOTICS]. 

Logic and scientific purpose. Logic was identi- 
fied as the syntax of the language of science and 
later broadened to comprise its semantics also. 
Thereby, logic was taken to be definitively freed 
from both psychology and ontology. The laws of 
logic are neither principles of reason nor truths 
of being but are rules of language or the conse- 
quences of those rules. These, however, are logical 
consequences, so that the analysis of any given 
logic presupposes a logic used in the analysis; but 
this regression was not regarded as a vicious one. 
For every language there are rules of formation, by 
which its signs can be combined into sentences, 
and rules of transformation, by which, given cer- 
tain sentences, certain others can be asserted. The 
tule of modus ponens, for instance, allows us to 
assert the sentence B, given the sentences A and 
“If A then B” Because of this rule, the sentence 
“A, and A implies B, together imply B” is a logical 
truth, and the second implication is a logical im- 
plication. 

The rules may 
be asserted regar 
These sentences are then kn 


also allow certain sentences to 
dless of what others are given. 
own as postulates of 
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the system; if they can be so interpreted that their 
truth, and not merely their assertibility, is guaran- 
teed by the rules of the language, then these sen- 
tences, together with their consequences, are logical 
truths. Which rules are adopted for a language is 
a matter of convention; it is not the business of 
philosophy to prohibit certain modes of expression 
or inference (Carnap’s principle of tolerance). 
Thus, there are “many systems of logic and many 
languages proposed as “the” language of sci- 
ence. The question is always whether a language 
of a given structure is adequate for the purposes of 
science (or for some other special purpose). In 
particular, positivism promulgated the thesis that 
everything can be said in an extensional language, 
that is, one in which the truth of compound sen- 
tences is determined solely by the truth of their 
components and in which predicates designate 
classes rather than properties. But whether a cer- 
tain language is judged to be “adequate” for the 
purposes of science depends on one’s convictions 
as to what there is to be said. On this score, the 
issues dividing positivism from its critics remained 
unresolved and, indeed, largely unformulated. 

Foundations of mathematics. The logic of pos- 
itivism is not merely a symbolic but a mathemati- 
cal logic. Symbols have been used in logic since 
Aristotle, but only as abbreviations or auxiliaries, 
In the new logic everything hinges on the rules for 
the use of the notation. It is the focus on the 
combination and transformation of symbols that 
makes the logic mathematical. Mathematics, ac- 
cording to the positivist view, is itself a language. 
It does not tell us anything about the world, but 
it allows us to transform given statements into 
others and explores the possibilities of such trans- 
formations. 

By the turn of the century, mathematics had 
been put into postulational form. Questions then 
arose as to the nature of the postulates and the 
justification of the rules associated with them. 
These questions of the foundations of mathematics 
occupied much of the attention of the positivist 
movement. Russell held that mathematics is re- 
ducible to logic by defining numbers as certain 
classes of classes and by defining arithmetical oper- 
ations on numbers as certain logical operations on 
classes. Thus, Principia (Whitehead & Russell 
1910-1913) begins with purely logical postulates 
(such as “q implies p or 9”); eventually it pre- 
sents a proof of “1 +1 = 2.” In opposition to this 
logicist school, the intuitionists, led by L. E. J. 
Brouwer, looked at mathematics from the stand- 
point of pragmatics rather than semantics: mathe- 
matics is essentially a human activity; we cannot 
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meaningfully speak of the existence of a mathe- 
matical entity without being able to construct it. 
A third school, the formalist, following David Hil- 
bert, was concerned only with syntax—the occur- 
rence of mathematical symbols in certain com- 
binations—without regard to how the symbols are 
interpreted or used. 

Each of these approaches involved serious diffi- 
culties. The logicist view encountered certain para- 
doxes, especially in relation to class membership 
(Is the class of all classes that are not members 
of themselves a member of itself? ). The intuition- 
ist must make special provision for the infinite 
processes that are fundamental to large parts of 
mathematics—for instance, in connection with 
limits and continuity. Especially important results 
were achieved that set absolute limits to the formal- 
ist program, Gédel (1931) proved that any form- 
alism sufficiently rich to allow for the formulation 
of arithmetic also allowed for the occurrence of 
statements which, although true, could not be 
proved to be true within that formalism. 

Out of these various endeavors a whole new 
discipline of metamathematics emerged, in which 
questions about the nature of various mathematical 
statements and proofs are themselves treated in a 
rigorous mathematical way (Tarski 1956). Through 
the so-called new mathematics in elementary edu- 
cation, the elements of logic (set theory) are now 
becoming known to every schoolboy. 


Theory of knowledge 


Fundamental to any question about the scope 
and validity of human knowledge is some concep- 
tion of the nature of truth. The positivist emphasis 
on the analysis of the language of science was 
sometimes suggestive of the coherence theory of 
truth: a statement is accepted as true because of 
its relation to other statements that provide evi- 
dence or arguments for it. In the main, however, 
the positivist position was that ultimately certain 
statements (protocol sentences) are accepted on 
the basis of direct experience that is not itself 
verbalized. Truth is correspondence with fact, as 
disclosed by experience. This view, which goes 
back to Aristotle, was refined by Russell and Witt- 
genstein, who analyzed the correspondence in logi- 
cal terms. A proposition is true if it has the same 
structure as the fact it asserts. However, it is only 
the logical structure of the proposition that is 
involved, not the grammatical structure of the 
sentence formulating the proposition. Thus, “the 
present king of France” is not a logical constituent 
of the statement “The present king of France is 
bald,” but only its grammatical subject. Yet, how 


exactly to determine logical structure, whether of 
propositions or of facts, remained to some extent 
obscure and at any rate controversial (Ryle 1932; 
Hampshire 1948). 

Analytic and synthetic truths. Of special inter- 
est to positivism was the development by Tarski 
(1944) and others of the so-called semantic con- 
ception of truth. Here, also, truth is a matter of 
correspondence, but interest is focused on the way 
in which the truth of complex statements is de- 
finable by the truth of other, simpler expressions. 
The procedure is applicable, however, only to exact 
languages. 

Basic to the positivist theory of knowledge is 
the difference beween logical and factual truth. In 
positivism this difference reduces to that between 
analytic and synthetic statements. For Kant, analy- 
tic statements were those whose predicates were 
contained in their subjects (“Every effect has a 
cause”). Positivists regarded analytic statements as 
fundamentally either definitions or tautologies: 
compound statements which remain true for all 
possible combinations of truth-values of their con- 
stituents (“Either it will rain or it will not rain”). 
However, a satisfactory definition of “analytic” re- 
mained elusive, and in later years serious doubts 
were raised as to whether even the sharp distinction 
between “logical” and “factual” is tenable (Quine 
1953; see, however, Grice & Strawson 1956). 

A fundamental tenet of positivism is that only 
analytic truths can be known a priori. Metaphysics 
is rejected because, as Kant saw, it lays claim to 
synthetic a priori knowledge. However, “analytic 
and “a priori” were often defined, in effect, in terms 
of one another. If this is not done, some critics 
held, counterexamples to the positivist position can 
be provided. 

The problem of induction. The most important 
of these putative instances is some form of be 
so-called principle of induction. As Hume saw, wr 
principle is not analytic and therefore is not ae 
able a priori, yet it cannot be inductively grounde i 
a posteriori, without vicious circularity. Some ee 
tivists (such as Wittgenstein and Schlick) het 
that induction is not a matter of a “principle bu 
only of a rule, so that the question of its es 
does not arise. But even a rule calls for pares 
tion. In the main, positivists approached the Te 
tion in terms of a more general concern with t 
nature and foundations of inductive logic. de 

Inductive logic, it was widely agreed, 1s eer, 
mentally a matter of probability. But how pro ee 
ity is to be interpreted raised important ae 
even within the positivist movement. ae 
provides a probability calculus by which given Pr 


abilities allow for the calculation of others sought 
for. The question is what exactly we are given and 
whether this same calculus allows us to attach a 
determinate probability to, for example, a scientific 
hypothesis. Reichenbach (1935) defended the view 
that probabilities are essentially frequencies in the 
long run and that the frequency interpretation can 
be applied throughout. Carnap (1950), while 
acknowledging the importance of the frequency 
interpretation for certain cases, developed a con- 
ception of logical probability to be employed in the 
logic of confirmation. Each position faces acknowl- 
edged difficulties, some of which have in the mean- 
time been bypassed by the development of a third 
conception—that of “subjective” or “personal” prob- 
ability (Savage 1954). 

Operationism and the unity of science. What- 
ever the logic of induction, positivists agreed that 
inductive knowledge of extralogical truths can only 
be empirical. From Hume and Mach, positivism 
acquired a strong phenomenalistic bent: all knowl- 
edge can be cast in the form of statements about 
immediate experience (Carnap 1928). Alterna- 
tively, it can be formulated on a realistic basis (the 
“thing-language”). Most important is the claim— 
the thesis of “physicalism’—that everything can 
be said, in principle, in the language of physics. 
Closely connected with this thesis is the positivist 
thesis of the unity of science, which holds that 
there is no fundamental cleavage between Geistes- 
wissenschaft and Naturwissenschaft. Science has 
but one method; it is unified as to terms, in the 
sense of physicalism; and there is, again in prin- 
ciple, a unity of scientific laws, all of which can be 
derived from some single, comprehensive theory. 
The thesis of the unity of science, however, was 
of incomparably greater significance as a program 
than as an established philosophical doctrine. 

It was with respect to the unity of terms that 
most progress was made. Operationism, which was 
positivistic in spirit if not in origin, formulated 
conditions for the introduction of any term into 
the language of science: the specification of oper- 
ations for measurement or verification. It appears, 
however, that the meaning of a term cannot be 
identified or even univocally associated with these 
operations, for it is characteristic of science that 
there may be several quite different ways of meas- 
uring the same magnitude or of verifying the 
same hypothesis. A greater difficulty is that cer- 
tain terms are connected with observations, not 
directly, but only by way of their relation to other 
terms; symbolic operations are thus called for. But 
once such operations are admitted, much of the 
force of the operationist requirement is dissipated. 
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In the positivist movement, this difficulty cen- 
tered on the status of theoretical terms. Theory, 
according to the positivist view, is significant pri- 
marily as an intermediary between observations 
(or experiments). What is required is a specifica- 
tion of how theoretical terms can be brought into 
relation with observables. To this end Carnap 
(1936-1937) developed a theory of reduction sen- 
tences, which are partial definitions, as it were; no 
theoretical term is capable of being completely 
defined by way of observables. This is not to say, 
however, that theory posits an ontological domain 
other than what can be observed. To be sure, theory 
not only describes observable facts but also ex- 
plains them, But explanation is essentially a matter 
of prediction: to explain a fact is to adduce a law 
from which, together with appropriate initial con- 
ditions, the fact can be deduced—that is, predicted 
(Hempel & Oppenheim 1948), Here, too, problems 
of detail persisted, and some recent philosophers 
of science have tended to separate explanation 
from prediction and to emphasize the part played 
in explanation by unifying patterns. 


Positivist ethics 

As to ethics, some of the logical positivists (eg, 
Schlick 1930) espoused a naturalistic hedonism, 
akin to the liberal utilitarianism of nineteenth- 
century thought. But the distinctively positivistic 
view (Ayer 1936; Reichenbach 1951) applied the 
criterion of verifiability to moral judgments and 
concluded that these are strictly devoid of meaning. 
More accurately, a distinction was introduced be- 
tween two kinds of meaning, which came to be 
known as cognitive and emotive. The former is 
characteristic of scientific discourse, is expressed 
in declarative sentences, and is capable of being 
true or false; the latter is characteristic of the dis- 
course of politics, religion, morality, and art, and 
is expressed in imperatives or exclamations. The 
first conveys beliefs, whereas the second conveys 
attitudes (Stevenson 1944), Ethical statements do 
not embody propositions, but rather constitute 
commands, exhortations, and the like. 

Much of the severe criticism directed against 
this position begged the question of whether it 
“robs morality of any foundation,” although atti- 
tudes may be as firmly grounded in character and 
as effective on action as are beliefs. More philo- 
sophic objections were addressed to the workability 
of the distinction between the two sorts of meaning 
and to the question of whether the positivist analy- 
sis applies to moral judgments rather than only to 
expressions of moral sentiment. The later develop- 
ment of the positivist view gave rise to various 
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“deontic” logics, the precise postulational treatment 
of ascriptions of rights and duties—and related 
notions—in ways connected with the tradition of 
analytic jurisprudence, on the one hand, and with 
the utility theory of modern economics, on the 
other. 


The influence of positivism 


The persistence of identifiable schools of phi- 
losophy, each engaged in continuing polemic with 
other, opposing schools, seems more characteristic 
of the European scene than the American. At any 
rate, the dispersal of European scholars at the out- 
break of World War m marked the beginning of 
the end of logical positivism as a movement. The 
increasing diversity of viewpoints within the move- 
ment, as well as more widespread misunderstand- 
ing of its claims, made for increasing reluctance 
to identify with it. Moreover, as time passed there 
was a progressive softening of what had been taken 
to be its distinctive doctrines. The verifiability cri- 
terion was broadened; semantics and even prag- 
matics assumed more importance, as compared 
with syntax; and principles became programs to be 
espoused rather than theses to be defended. The 
revolutionary and even utopian impulse in some 
of the early positivists (for instance, Neurath) be- 
came dissipated. 

In philosophy, positivism had a marked impact 
on analytic philosophy, which is in a way its heir, 
and positivism is largely responsible for the central 
position in philosophic training accorded to mathe- 
matical logic almost everywhere. But its influence 
was much greater on science, and on the borders 
between science and philosophy, than on phi- 
losophy itself. On its empirical side, positivism 
added—especially in psychology and sociology—to 
the growing emphasis on observation and data, as 
against the theoretical and even speculative bent 
of the preceding generation or two. Positivism may 
also have contributed to the emergence of “be- 
havioral science” as something more than an 
alternative designation for the more traditional 
disciplines. It must be noted, however, that the posi- 
tivists were not, on the whole, inclined toward a 
strict behaviorism: both Carnap and Reichenbach 
were quite sympathetic to psychoanalytic ideas, 
for instance. The positivist interest in the logic of 
measurement and in the nature of probability at 
least coincided with, if it did not directly contrib- 
ute to, the growth of such disciplines as psycho- 
metrics and sociometrics. 

It is on its logical side, however, that positivism 
exerted its most unmistakable and distinctive in- 
fluence. The increasing interest during the last sev- 


eral decades in the application to empirical ma- 
terials of various logical and even mathematical 
systems is clearly indebted to the positivistic phi- 
losophy of science. In mathematics itself—espe- 
cially in foundation studies—a strong claim can 
be made for the value of postulational and even 
formal approaches. More dubious is the fruitful- 
ness of their application in the physical and bio- 
logical sciences (Reichenbach 1944; Woodger 
1952). In the social sciences the influence of posi- 
tivism can be recognized in the concern with 
“miniature systems” and “model building.” It may 
be too early to assess the value of this tendency. 
One recognizable danger may be identified as the 
“semantic myth”: that if concepts are introduced 
by the explicit operational definition of terms and 
if assumptions are clearly stated as postulates, the 
scientific significance of the undertaking is assured. 

In sum, the influence of positivism has been 
on form rather than substance—on methodology 
rather than on content. It has given new vigor to 
the ideals of clarity and precision of thinking, in & 
perspective in which the emphasis on theory is con- 
joined with an equal emphasis on the ineluctability 
of empirical data. But too much self-consciousness 
as to methodology may have a repressive effect on 
the conduct of scientific inquiry. Unintentionally, 
and even contrary to its own purposes, modern 
positivism may have contributed to a “myth of 
methodology”: that it does not much matter what 


we do if only we do it right. 
ABRAHAM KAPLAN 


[See also Ernics, article on ETHICAL SYSTEMS AND 
SOCIAL STRUCTURES; HISTORY, article on THE PHI- 
LOSOPHY OF HISTORY; PROBABILITY; SURVEY ANAL- 
ysıs; and the biographies of COMTE; PEARSON; 
Peirce; ScHLICK; WHITEHEAD.] 
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POUND, ROSCOE 


Roscoe Pound (1870-1964), an American legal 
philosopher and law teacher whose published 
works and public addresses have influenced world- 
wide juristic thought during this century, was born 
and reared in Lincoln, Nebraska, a university com- 
munity near what was then the western frontier. 
Despite his vast learning, he remained true to his 
vigorous, kindly, and optimistic Middle Western 
culture. From his father, a lawyer, he may have 
derived some of his political conservatism, and 
from his mother, a college graduate, he received 
encouragement for his ardent intellectual curiosity. 
At one time he specialized in botany, earning a 
pup. degree in that field from the University of 
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Nebraska and making significant scholarly con- 
tributions. His interests had earlier turned to law, 
and he spent a year as a student in the Harvard 
Law School. Returning to Nebraska, he was ad- 
mitted to the bar and practiced law. In 1901 he 
was appointed a commissioner (auxiliary judge) 
of the state Supreme Court, where he gained 
valuable insights into the work of the judiciary. 
Yet his academic interests prevailed. From 1903 
on he was, successively, a member of the law 
faculties of Nebraska, Northwestern, Chicago, and 
Harvard universities. At Harvard he was dean of 
the law school from 1916 to 1936, later becoming 
a university professor, a high academic honor. 

Sociological jurisprudence. Pound’s philosophy 
of law was influenced by Hegel's philosophy of his- 
torical evolution. Revolting against the German 
historicism of Savigny, with its reverence for the 
ancient Volksgeist, and against its American coun- 
terpart, the sanctification of English common law, 
Pound put forth his conception of legal change as 
experience developed by reason and tested by 
further reason. From William James, as well as 
from Hegel, he was led toward an evaluative theory 
of law based on the selection, reconciliation, and 
compromise of conflicting claims. From the soci- 
ology of Lester F. Ward and Edward A. Ross he 
obtained some of his enduring ideas. Both had an 
avowed belief in moderate social meliorism, which 
Pound came to share. Ward's conception of social 
forces and of their susceptibility to directive guid- 
ance by the human mind is analogous to Pound's 
theory of social interests, yet Pound did not use 
the term “social forces”; he avoided Ward's em- 
phasis on biological evolution and adopted none of 
Ward's whimsical “economic paradoxes.” Ward’s 
book The Psychic Factors of Civilization (1893), 
along with Hegel’s Philosophy of History and Jhe- 
ring’s Geist des römischen Rechts (“The Spirit of 
the Roman Law”), were important precursors of 
Pound’s Interpretations of Legal History (1923). 
From Ross (1901), Pound seems to have derived 
his view of law as only one means of social con- 
trol, but Pound expressly rejected the theory of 
social instincts, on which Ross relied. Thus, 
Pound’s sociological jurisprudence became an 
analytical—evaluative discipline, which he later dis- 
tinguished from the analytic—descriptive discipline 
that he called “sociology of law” (1943a). 

In his famous 1906 address to the American Bar 
Association, which aroused a storm of protest, 
Pound attacked the complacency of lawyers who 
mechanically follow outmoded rules of judicial 
procedure. In 1907 there appeared another influ- 
ential address, “The Need of a Sociological Juris- 


prudence.” Then came three of his best-known 
articles, each of which attacked some aspects of 
the common law and its American developments. 
In “Mechanical Jurisprudence” (1908) he acknowl- 
edged his homage to James’s pragmatism and pro- 
tested the judicial habit of applying mechanically 
outmoded rules in various fields, from constitu- 
tional law to court procedure. In “Liberty of Con- 
tract” (1909) he contrasted the theoretical Hegelian 
liberty of the industrial employee with his actual 
liberty and thus lent support to the long struggle 
for collective bargaining. In “Law in Books and 
Law in Action” (1910) he showed the disparity be- 
tween the legal rules as to liability for personal 
injuries and the verdicts of juries, thus preparing 
the way for workmen’s compensation laws, which 
abolished the common law defenses of employers 
(contributory negligence, assumption of risk, and 
the fellow-servant rule), imposed strict liability for 
industrial accidents, and dispensed with jury trials. 

Pound presented a more systematic treatment of 
these problems in a series of articles on sociological 
jurisprudence (1911-1912). Then followed a series 
of short books presenting aspects of his philoso- 
phy. He cited law and legal theory from American, 
from medieval, and from modern English and con- 
tinental European sources and from Roman law. 
By 1960 he had published 24 books and 287 major 
articles and addresses. His five-volume work Juris- 
prudence (1959) was chiefly a summary of earlier 
writings. 

Pound's great contribution to juristic thought 
was his view of law as an instrument of social 
engineering and his drive to discover how “to make 
[human] effort more effective in achieving the pur- 
poses of law” (1903, p. 34). Although he acknowl- 
edged his indebtedness to Oliver Wendell Holmes 
for this “functional conception of jurisprudence 
(see the letter to Holmes, dated November 10, 1919, 
in Sayre 1948, p. 276), Pound elaborated the idea 
and made it his own. The list of legal means 1$ 
never exhausted. Since law is only one means of 
social control, the lawmaker needs to study the 
social effects of legal institutions and legal doc- 
trines in relation to these other controls: a sock 
ology of law. Pound’s admiration of Eugen Ehr- 
lich’s work in legal sociology gained Ehrlich many 
American followers. The studies of William 
Moore, Herman Oliphant, William O. Douglas, $ 
other American legal realists were partly inspir 
by this new philosophy. 

pound believed tak professional study of the 
means of making legal precepts effective WO at 
lead to better remedies: money damages ee 
adequately recompense the badly injured employee: 


and 


rehabilitation training is a better means of restitu- 
tion. A sociological legal history, by revealing the 
occasion and the need for old law, aids the present 
generation to discard more intelligently the tradi- 
tional doctrines that are no longer needed. From 
Raymond Saleilles, Pound took the idea of making 
penal treatment fit the criminal, rather than the 
crime. In 1920 he became a codirector of the Cleve- 
land survey of criminal justice. The results of that 
survey led to improvements in criminal law and its 
administration in crowded urban areas. He re- 
mained a steadfast admirer of the legal profession 
despite its conservatism, and of judicial justice, 
the only possible “justice according to law.” His 
imaginative mind, viewing legal phenomena with 
the aid of pragmatism and Midwestern shrewdness, 
Hegelian and American idealism, produced a down- 
to-earth jurisprudence that could point to practical 
results, 

Theory of social interests. Practical results, 
however, often have a shorter life than do ideas. 
Pound's chief contribution to systematic legal phi- 
losophy is his theory of social interests, on which 
he began to work as early as 1913 (see the letter 
to Holmes, dated February 22, 1913, in Sayre 1948, 
p. 270). From Bentham and Jhering he took the 
idea of interest as a basic element of legal protec- 
tion: a right is a legally protected interest. From 
this idea of individual interests, Pound developed 
the concept of public interests (the interests of gov- 
ernmental units as property owners) and social 
interests (the claims and values of society which 
the legal order strives to maintain). He shared 
Jhering’s view that society is supreme and the 
state should be subordinated to it. Lawmaking and 
adjudication are processes in which a balancing of 
interests or a reconciliation of competing claims 
takes place; so dominant are the social interests 
that individual interests are entitled to protection 
only to the extent that one or more social interests 
will be furthered or maintained. The factory owner's 
claim to freedom from official interference in the 
operation of his machinery is supported by the 
social interest in the security of acquisitions, yet 
it may have to give way partly to the social interest 
in health and safety, as implemented by laws re- 
quiring certain safeguards on dangerous machin- 
ery. Again, the father’s individual interest in the 
upbringing of his minor children must give way to 
the social interest in security from juvenile aggres- 
sion and in the conservation of human resources 
(the juveniles). This argument was used to sup- 
port juvenile-court legislation when it was attacked 
as unconstitutional. 

Pound's theory sounds like the death knell of 
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nineteenth-century individualism until one finds 
that security and conservation rank high in three 
of his six classes of social interests, and that social 
interest in the individual human life—economic, 
political, cultural—is another value expressed in 
social legislation, Pound was a patient and opti- 
mistic idealist. His theory of social interests is 
much less radical than it seems at first reading. 
Five stages of legal history. Pound’s concept of 
the five stages of legal history resulted from his 
search for the ends of law, as developed in legal 
rules and doctrines. His first stage, primitive law, 
presupposes a clan-divided society and a weak 
central government that seeks chiefly to prevent 
blood feuds and to maintain the peace by providing 
a tariff of compensations for injuries, as exempli- 
fied in the Anglo-Saxon and other early laws. This 
stage was later expanded to include some extant 
primitive societies, as described by anthropologists. 
In the second stage, strict law, the end sought 
was certainty and security in the administration 
of legal remedies, to be attained by rigid enforce- 
ment of narrow procedural rules and by formalism 
in legal transactions. In ancient Rome, in this 
stage, the head of the family alone had full legal 
personality, and in the corresponding English stage 
the wife and minor children were in subjection to 
the husband-father. In English law the system of 
writs and of common law pleading, and in Roman 
law the period of jus strictum, exemplify this stage. 
In the stage of equity and natural law, the third 
stage, the unmoral attitude of strict law gave way 
to an insistence on justice in the ethical sense, and 
the concept of legal personality was broadened to 
include dependent members of the family. In 
Roman law this came about in the classical period, 
from Augustus to the early part of the third cen- 
tury A.D., and in England and continental Europe 
the corresponding transition came about in the 
seventeenth and eighteenth centuries. From this 
period in England date the married woman's sepa- 
rate estate in equity and the equity of redemption, 
by which a debtor—mortgagor who had not paid 
strictly on time could get the chancery court to 
compel the mortgagee, who by strict law now had 
legal title to the land, to accept a delayed payment 
and relinquish the land. In this and many other 
ways the formal rigors of the earlier period were 
diminished, and so was the emphasis upon security. 
A revival of this emphasis brought about the 
fourth stage, the maturity of law, in which the 
undue fluidity of law, resulting from the infusion 
of morals, was gradually corrected and the law be- 
came more stable without sacrificing all of the 
modifications effected in the preceding stage. As 
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Pound said, history does not repeat itself in mere 
cycles but moves upward in spirals. The idea of 
equality was carried over from the equity stage and 
led to the concept of equality of opportunity to exer- 
cise one’s faculties and to use one’s substance. In 
the nineteenth century in England, security and 
equality were reconciled in the safeguarding of the 
property and contracts of individuals. A similar 
stage is said to have been reached in Roman law, 
and in nineteenth-century Continental law. 

The socialization of law, the fifth period, brought 
out the emphasis on social interests rather than on 
individual interests, the limitations on property and 
contract, and many types of social legislation 
(1959, vol. 1, chapter 7). 


The dependence of law upon sociology and other 
social sciences was emphasized in Pound's earlier 
writings and was repeated in a recent essay ( Amer- 
ican Society for Legal History 1962, p. vii). Yet the 
social phenomena from which he derived his social 
interests were chiefly legal phenomena (1943b), 
and only in his survey of crime did he turn to 
criminological data. In his best work he found 


his own fertile insights to be sufficient, without’ 


benefit of any conclusions derived from an em- 
pirical sociology. The main contribution of socio- 
logical jurisprudence is a sustained and systematic 
demonstration that law is one means of social con- 
trol and that its merits or demerits are to be judged 
by its social consequences. 

EpwIn W. PATTERSON 


[For the historical context of Pound's work, see JURIS- 
PRUDENCE; Law, article on THE SOCIOLOGY OF LAW; 
LEGAL SYSTEMS; and the biographies of BENTHAM; 
EHRLICH; HEGEL; HOLMES; JAMES; KANTOROWICZ; 
LLEWELLYN; Ross; Warp, LESTER.] 
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POVERTY 


Poverty has always had several not entirely 
separable meanings and is always defined accord- 
ing to the conventions of the society in which it 
occurs. For administrative reasons definition may 
also take the form of fixing an absolute criterion of 
poverty (e.g., a “poverty line”). We may distin- 
guish three meanings: (1) social poverty, 
(2) pauperism, and (3) moral poverty. A 

Social poverty implies not merely economic in- 
equality (of property, income, living standards, 
etc.) but also social inequality, that is, a relation 
of inferiority, dependence, or exploitation. In other 
words, it implies the existence of a social stratum 
definable by, among other things, lack of wealth. 
In this sense poverty is relative, implying no par- 
ticular level of income or amount of property, al- 
though in ` seindustrial and underdeveloped e 
mies the level normally qualifying the ne 
(but not always the class) as “poor” is one not far 
removed from subsistence. able 

Pauperism describes a category of people una” = 
to maintain themselves at all, or to maintain ee 
selves at the level conventionally regarded as m 
mal, without outside assistance. At any given tie 
this implies the fixing of a minimum standar ar 
low which men are not supposed to fall and a a 
also implies a model of social relations that 1 


cates which paupers have a claim upon public 
assistance and who is to assist them. Pauperism 
arose historically beyond the border of the func- 
tioning primary social group (e.g., the kinship 
group) within which the economically dependent 
can expect assistance or maintenance without spe- 
cial institutional provisions. It therefore reflects the 
fluctuating fortunes of such primary groups, and 
in recent periods their secular diminution and 
functional decline. A man’s wife and children are 
not ipso facto paupers, but widows and orphans 
are perhaps the earliest clearly defined category of 
persons with a call upon public assistance. Some 
societies make a further distinction between types 
of paupers who “deserve” assistance and the equally 
indigent who do not. 

Moral poverty defines the place of poverty in the 
value system of a society or of its subgroups and 
institutions; that is, it defines whether poverty is 
morally acceptable and what status it confers or 
prevents the poor man from enjoying. It is there- 
fore difficult to separate from (1) and (2), except 
where it finds expression in specific bodies of men, 
in the past normally religious bodies, who volun- 
tarily undergo poverty. In stratified societies several 
values of poverty will normally coexist; for ex- 
ample, it will be a “shame” or a punishment for 
sin for some, a cause of pride for others, or both 
at the same time. 

The origins of poverty. The social category of 
the poor arises in stratified societies in which the 
upper and lower strata have direct experience of 
each other, It may also arise in situations where 
economically unequal groups coexist spatially, e.g-, 
townsmen and peasants or nomads and agricultural- 
ists, However, with spatial coexistence it is likely 
that the poorer will distinguish themselves from 
the richer and vice versa by qualitative rather than 
quantitative criteria—for example, as (poor) peas- 
ants and (rich) townsmen, rather than simply as 
“the poor” and “the rich.” Even elsewhere poverty 
is rarely the only criterion of stratification as seen 
from below. It is combined with numbers (Aris- 
totelian democracy); very commonly with labor 
(“the laboring poor”); or with a complex concept 
of “the people” or “the community,” aS perhaps in 
the phrase “the poor commons” familiar in English 
usage from the later Middle Ages. 

Seen from above, however, poverty may ade- 
quately define the lower strata, as in medieval legal 
terminology, where they are simply arme liute 
(poor people). The poor are normally contrasted 
with the rich; and a causal relationship is often 
assumed, as in the German proverbs “Poverty is 
the rich man’s cow” and “Poverty is the hand and 
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foot of wealth.” However, the contrast is also and 
at all times with the power and privilege that go 
with wealth; and consequently poverty always im- 
plies weakness and a low position in the social 
hierarchy, or personal inferiority. That the upper 
social classes are rich, the lower poor, is generally 
taken for granted. In the industrial period these 
broad distinctions are increasingly supplemented 
or replaced by the familiar distinction between eco- 
nomically defined classes. The “laboring poor” be- 
came the “laboring classes” and eventually the 
“working class.” 

Pauperism. Although poverty was assumed to 
be normal and, in the ordinary course of events, 
irremediable for most people, the basic view of 
pauperism in preindustrial societies was that it was 
abnormal and called for remedial action. The 
question was by whom. Intermittent general pau- 
perization, such as that due to famines and other 
catastrophes, required general public measures of 
remedy and prevention. The traditional “policy of 
provision” that imposed on the public authorities 
the duty of ensuring regular food supplies at rea- 
sonable prices, rather than the “poor law,” is the 
ancestor of modern welfare policies. Under normal 
circumstances it was assumed that the primary 
communities of men (in practice, determined by 
kin or locality) should and would be able to make 
adequate provision for members unable to main- 
tain themselves. This assumption survived the 
emergence of recognized supralocal pauperism in 
the late Middle Ages and led to an almost universal 
tendency to make each local community formally 
responsible for poor relief within its boundaries. 
It dominated public policy until the twentieth cen- 
tury. Relief, except in the case of general catas- 
trophes, was the formal and institutionalized sup- 
plementation of the voluntary aid for the helpless 
within primary communities, whether “natural,” 
like the kinship group that remained the tacit 
model, or “artificial,” like the various kinds of 
“protherhoods” or corporations. 

These assumptions, destined to break down with 
economic and social evolution, were first felt to be 
inadequate when the pauper was frequently an 
able-bodied adult not belonging to the community 
in which he claimed relief, or to any community 
(as in cities with a large fringe of the socially mar- 
ginal), but especially when he appeared as the 
migrant or vagrant “sturdy beggar.” Modern Euro- 
pean pauper policy began in the mid-fourteenth 
century with attempts to stem such interstitial 
pauperism (prohibitions of beggary). The regula- 
tion of beggars was then gradually transformed 
into the regulation of other kinds of paupers. From 
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the sixteenth century on, central authority tended 
to control, supplement, or establish a basically local 
system of financing and administering poor relief. 
Central control appeared in the most elaborate form 
in England, although local autonomy tended to 
revive there subsequently. In regions that were 
Protestant or economically advanced or both, the 
direct intervention of central and local authorities 
tended to prevail. In Roman Catholic or less devel- 
oped regions, there was a somewhat greater sys- 
tematization of private organized charity. Given 
the circumstances of its emergence, pauper policy 
set out to provide not only relief but also, and per- 
haps chiefly, social reorganization, often conceived 
as the restoration of a lost norm of traditional 
social and economic stability. However, increas- 
ingly it might also aim at assisting the development 
of a capitalist economy. The two aims were incom- 
patible, although in the short term both tended to 
rely partly on penal sanctions against the able- 
bodied unemployed and to make compulsory labor 
a condition of relief. 

Before the industrial revolution this incompati- 
bility was more visible in the theory of moral pov- 
erty than in the practice of relief. The ideologies 
of preindustrial societies left a large place for pov- 
erty as an ideal, or a comfort for the poor, express- 
ing such values in the comportment of saints, holy 
men, mendicant orders, etc. The value systems of 
societies devoted to economic development left 
room at best for the ideal of nonacquisitiveness 
(e.g., modestly recompensed public service), but 
not for one of poverty as such. Conversely, while 
traditional societies often recognized a category of 
culpable pauperism, they resisted the view that all 
poverty reflected personal inadequacy ( guilt, sin), 
a view that became characteristic of early capitalist 
society, especially in its Protestant versions. 

Poverty in industrial society. The industrial 
revolution opened a new era. The social category 
of the poor divided into the now familiar socio- 
economic classes, the urban poor tending to be 
assimilated conceptually into the “working class” 
or “proletariat,” although the rural poor tended 
(with some exceptions) to be less readily redefined. 
On the other hand, with the creation of a single 
world economy in the nineteenth century, entire 
national or racial groups increasingly came to be 
considered, or to consider themselves, as “poor” 
(and also as exploited, oppressed, or underprivi- 
leged) relative to the minority of “rich” peoples. 
With the mass emergence of new nations from 
former dependence in the mid-twentieth century, 
these international contrasts increasingly domi- 
nated the discussion of poverty, all the more so 


because they served as a measure of economic 
development. Moreover, the poor country was al- 
most invariably also the one in which the propor- 
tion of the poor in the population was higher and 
the degree of poverty was more acute than in the 
rich country. 

The era of industrialization also saw the rise of 
specific forms of the supralocal social organization 
of the poor, for both narrow and wide purposes. 
Among workers employed for wages (“proletarians”) 
the labor union established itself universally as the 
form of organization for the protection and ad- 
vancement of the standard of life, insofar as this 
depended on employment. No equally well-defined 
or uniform type of organization developed for equiv- 
alent purposes among the nonwage-earning poor. 
Until the increasingly systematic provision of wel- 
fare by the state, and of goods and services by 
economies oriented to the mass market, formal and 
informal community organizations of the poor de- 
veloped with a variety of functions, initially often 
on the basis of preindustrial institutions (kinship, 
religion, common geographic origin, etc.). Others 
were later improvised to meet new needs (mutual 
economic aid, education, public entertainment, and 
sport), sometimes under the wing of global mass 
movements of the poor, such as socialist parties. 
These (but not the labor unions) may be regarded 
in the main as phenomena of the transition to 4 
fully developed industrial society. However, they 
contributed to the formation of that complex of 
attitudes and behavior (“working-class culture, 
“culture of the poor,” and other subcultural phe- 
nomena) which reflected and still reflect the eco- 
nomic and social separation of the poor from the 
rest of society. 

Together with the labor union, the most charac- 
teristic new development in the era of industrial- 
ization was the emergence of political mass move- 
ments and parties committed to socialism, @ theory 
first elaborated to fit the situation of the industrial 
proletariat. Socialism also acquired a wider appe?» 
especially in “poor” countries anxious to overcome 
economic backwardness. Socialist movements n 
the broadest sense of the word became the major 
global form of social organization of the poor, Wi 
the object of abolishing poverty. They were often 
international in scope. : d 

Although the early phases of industrialism EN 
duced an unusually large and unmanageable pro! i 
lem of pauperism, the actual needs of the Pe 
played only a subordinate part in the fone F 
public policy until the decline of economic li a ae 
ism and the emergence of powerful organizatlo al 

ilize the politic 
of the poor or those prepared to utilize 


power of their numbers in elections and otherwise. 
Before this, no criterion of poverty was usually 
recognized other than destitution positively asking 
for relief, Although quantitative inquiries into the 
“state of the poor” have been made in Britain since 
the late eighteenth century, and on a firmer statis- 
tical base in industrializing countries since the 
1830s, no definition of a “poverty line” or survey 
of the proportion of citizens normally living below 
it that would be regarded as acceptable today oc- 
curred before the late nineteenth century, nor did 
any comprehensive public collection of unemploy- 
ment statistics. 

Public assistance in the 19th and 20th centuries. 
Liberal capitalism left the solution of pauperism to 
the spontaneous absorption of the able-bodied into 
employment by economic expansion, and the rest 
to self-help, mutual aid, charity, and ad hoc emer- 
gency measures ( preferably voluntary). It reserved 
public assistance for the irreducible residuum, or 
for catastrophic famine, increasingly rare in indus- 
trial, but not in underdeveloped, regions. The ob- 
ject of poor laws was primarily to assist the 
uninhibited working of the free economy, as by 
facilitating labor mobility, discouraging excessive 
population growth, etc., and to separate the re- 
siduum of involuntary and irremediable pauperism 
from the rest. The English Poor Law of 1834 was 
the prime example of such a policy, although mere 
abstention from ambitious public action was more 
common. In fact, and in spite of this bias toward 
abstention, economic conditions made it necessary 
even in the nineteenth century to establish or ex- 
tend public systems of relief in the growing areas 
of relative social disorganization, such as the cities 
and industrial regions. 

Beginning in the late nineteenth century, a more 
positive public policy developed, reflecting the 
abandonment of confidence in an essentially self- 
regulating capitalist economy, the growing politi- 
cal influence of the poor, and the appearance of 
planned socialist economies. Pauperism was reab- 
sorbed into the wider problem of providing all 
citizens with a minimum standard of life at all 
times, Since neither traditional social organization 
nor economic growth as such could ensure this, it 
was increasingly, although sometimes reluctantly, 
seen as the responsibility, in the main, of national 
central government. Where special poor laws ex- 
isted, they disappeared, as in Britain in 1929. In 
socialist countries, and with the great depression 
in advanced industrial countries, even such wider 
welfare policies tended to become part of still more 
comprehensive policies of economic management, 
e.g., policies of full employment. Backward regions 
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entering upon development in recent decades have 
therefore, even when adopting the patterns of capi- 
talist enterprise, rarely shown the nineteenth-cen- 
tury confidence in pure market forces. In such 
countries the adoption of ambitious policies of pub- 
lic welfare, simultaneously with economic develop- 
ment, has normally been a function of the political 
importance of the poor as veters or as revolution- 
aries, or of the attempt to forestall their revolu- 
tions. Paradoxically, therefore, the most socially 
conservative regimes have continued to show the 
greatest attachment to the policies toward pauper- 
ism of economic liberalism. 

Present attitudes toward poverty. The vast in- 
crease in wealth offered or achieved by modern 
technology, the growing role of social planning and 
management, and the political changes connected 
with both have led to a sharp devaluation of moral 
poverty. Poverty is no longer widely accepted as 
inevitable or desirable, and its abolition is univer- 
sally advocated. This implies not only the universal 
elevation of the minimum standard of material life 
to a level for which the actual standards in richer 
countries or among wealthier strata provide a guide, 
but also the abolition of the social inequalities 
inseparable from the concept of poverty. Govern- 
ments or bodies of opinion that do not publicly 
subscribe to these aims are now rare, However, the 
belief in the desirability of unlimited enrichment 
is by no means universal, although the belief in 
the abolition of poverty is. Those who voluntarily 
abstain from enrichment may continue to enjoy 
considerable respect, but rarely institutionalized 
status. Nevertheless, industrialization is marked by 
a fundamental alteration in the place of poverty in 
the value systems of societies. 


The measurement of poverty 

Increased public interest has since the late nine- 
teenth century led to a notable improvement in the 
quantitative information about poverty. Various rel- 
evant data (e.g., on incomes, cost of living, unem- 
ployment, consumer expenditure, housing) are now 
gathered as a matter of routine by many govern- 
ments and collated by international organizations, 
but adequate knowledge still demands periodic 
ad hoc surveys of the type pioneered by Booth and 
Rowntree before 1900 (see Booth et al. 1889-1891; 
see also Rowntree 1901). Measurement, moreover, 
poses problems both of technique and of definition. 

Material poverty. At the lowest level, the cri- 
terion of material poverty is an inability to achieve 
a minimum of physiological health and efficiency, 
and attempts to define a minimum nutritional 
standard (at present expressed mainly in calories, 
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with allowances for other aspects of the diet) have 
been made since the mid-nineteenth century. Their 
object, not always achieved, is to eliminate sub- 
jective and conventional elements in the assess- 
ment of basic poverty. Where the “poverty line” 
separates the hungry from the properly fed, as it 
still does in many parts of the world, such mini- 
mum standards have some value, although there is 
no absolute certainty as to what constitutes the 
physiological minimum. However, in economically 
developing and richer countries a conventional min- 
imum, tending to rise with time, must and always 
does determine poverty. Comparisons between 
richer and poorer countries and between poverty 
in one country at different times therefore become 
increasingly difficult. An objective minimum or an 
unchanging conventional standard measures not 
what is socially accepted as poverty in the richer 
country or at the later time, but only the difference 
between countries or points of time. The periodic 
“rediscovery” of poverty in countries whose stand- 
ards of living have markedly improved, as in Britain 
in the 1880s and in the United States and Britain 
in the early 1960s, is due largely to the lag in ad- 
justing standards to changing bases of assessment. 
Nonmaterial poverty. More serious difficulties 
arise in the measurement of nonmaterial aspects 
of poverty, that is, of the part played in it by the 
inferiority of rights, opportunities, and status of 
the poor and their sense of such inferiorities. Cer- 
tain of these aspects may be measured separately 
(for example, access to educational facilities and 
jobs), but the entire syndrome of frustration, re- 
sentment, and underprivilege has so far eluded 
quantification and comparison. The problem is 
complicated by the effects of social change or social 
disintegration on felt poverty. Such difficulties are 
illustrated by the inability of historians of British 
industrialization to agree on a generally acceptable 
weighting of material and nonmaterial factors in 
the poverty of the period they have studied. 
Measures of consumption. The most convenient 
measure of material poverty is through consump- 
tion, assessed directly or through income and ex- 
penditure. The latter has been found useful for 
comparisons, since Engel’s law, first formulated in 
1857, showed that—subject to reservations that are 
sometimes of major importance—the proportion 
of total expenditure on food, or more generally on 
basic needs, tends to vary inversely with income 
(Engel 1895). Thus, in about 1960 the mean per- 
centage of family expenditure on food ranged from 
25 to 30 (Canadian towns, white families in 
Northern Rhodesia) through 30-35 (northwest 


Europe: Sweden, Denmark, Netherlands, Great 
Britain), 50-55 (in the poorer countries of eastern 
Europe), over 60 (urban populations in parts of 
north Africa, Asia, and Latin America), to 70-80 
in rural and poorest urban strata in underdeveloped 
regions (International Labor Office 1963, tables 
22 and 23). Higher percentages occur in abnor- 
mally depressed zones, as during and after food 
shortages; lower percentages are found among the 
wealthy—for example, 15 per cent among the 
“rich” in the United States in 1935-1936 (U.S. 
National Resources Committee . . . 1939), Such 
juxtapositions allow us to make rough initial com- 
parisons between countries and social strata, but 
no more. 

Data on absolute per capita consumption also 
are often available, although without special in- 
quiries they are commonest in the crude form of 
“per capita availability” of certain commodities and 
facilities, which neglects or underestimates quali- 
tative differences and variations between social 
strata. Such data are therefore of limited value, 
although they are of some use in discussions of 
economic backwardness and growth, They show 
that the mean per capita food supply (in calories) 
in the poorest countries (south Asia, parts of the 
Far East and of Latin America) in the late 1950s 
was more than a third below that in the best-fed 
(North America, white Australasia, northwestern 
Europe, Argentina). Considerably wider disparities 
exist in the mean per capita availability of such 
elementary consumer goods as textiles. The same 
data can also be used to demonstrate such changes 
in consumption levels as the rise in mean per 
capita food supply in southern Europe during the 
1950s (Food and Agriculture Organization . : - 
1948; International Labor Office 1963). 

Income distribution. Data on the distribution 
of incomes and property provide some guidance to 
intranational economic inequality, but extreme cau- 
tion is needed in the use of official statistics on 
these subjects, especially in areas where incomes 
remain partly nonmonetary (see Titmuss 1962). 
Whatever the relevance of such inquiries to the 
measurement of poverty, they cannot replace ee 
cific inquiries into its extent. Since the major im- 
provement in living standards following the re- 
covery from World War 1, relatively few su 
investigations have been made in the industri 
countries, Estimates for the United States soe 
that about 20 per cent of the population is belo 
a “poverty line” appropriate to the current stani x 
of living in that country, including a high Ne). 
tion very much below it (Morgan et al. 1 


British estimates suggest a comparable figure of 
8-10 per cent, most (thanks to the more compre- 
hensive welfare provisions usual in western Eu- 
rope) only a little below the appropriate minimum 
standard (Townsend 1962), Comparisons of abso- 
lute minimum standards are possible with earlier 
industrial periods but difficult with preindustrial 
ones, although simple inspection normally reveals 
considerable improvement in the minimum in the 
past half-century. It is extremely difficult to make 
comparisons with periods before the twentieth cen- 
tury of the per cent of the population below 
changing conventional minimum standards, al- 
though in industrialized countries there has prob- 
ably also been improvement in this percentage. 
Comparisons of the effects of different types and 
levels of welfare provision are theoretically feas- 
ible, at least for the documented recent period, but 
the question has not attracted a great deal of 
attention. 


While the special problem of pauperism remains, 
and there is universal hostility toward poverty in 
most modern social value systems, the general 
problem of the abolition of poverty has increasingly 
merged with, and has often become the foundation 
of, the social and political policy of governments. 
A separate discussion of the abolition of poverty 
therefore goes beyond the limits of this article. 
Broadly speaking, in most countries of the world 
the solution to poverty is assumed to rest on eco- 
nomic development, although it is no longer gen- 
erally believed that this will in itself—and without 
the systematic, planned, and ambitious interven- 
tion of governments—eliminate a large residue of 
pauperism, Nor is it believed that development can 
be expected automatically to abolish, or perhaps 
even to diminish, the range of social and economic 
inequalities (see Paukert 1965). How far these can 
in fact be reduced or eliminated remains a thorny 
question. On the other hand there is general agree- 
ment that any notable degree of economic develop- 
ment will—although not necessarily in the short 
run or at all times—notably raise the average stand- 
ard of material consumption. There is also general 
agreement that material destitution, or poverty as 
defined in most preindustrial periods or regions, 


can be eliminated. 
E. J. HOBSBAWM 


[See also ConsumERS, article on CONSUMPTION LEVELS 


AND STANDARDS; ĪNCOME DISTRIBUTION, article on 
SIZE. Other relevant material may be found in Fam- 
INE; LABOR UNIONS.] 
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POWELL, JOHN WESLEY 


John Wesley Powell (1834-1902), American 
geologist and anthropologist, was born in Mount 
Morris, New York. The son of a Methodist preacher 
who moved first to Ohio, then to Wisconsin, and 
then to Illinois, Powell received a casual education. 
Although he attended Oberlin and Wheaton col- 
leges, he did not earn a degree. He taught in several 
common schools until the outbreak of the Civil 


War. He joined the Union army, serving more than 
three years in the artillery, and lost his right arm 
in the battle of Shiloh. Although promoted to 
colonel, Powell resigned as major and used this 
title the remainder of his life. 

Following the war Powell taught at Ilinois State 
Normal University, and during this period he or- 
ganized student field trips to Colorado to study 
geology and biology. He led two expeditions, one 
in 1869 and one in 1871/1872, down the Colorado 
River. This spectacular feat of exploration was 
the turning point in Powell’s career. In traversing 
the plateau country, he recognized the intimate 
relation between the land, the climate, and the 
people who occupied the land. His study of the 
semiarid lands and of the adjustments made by 
the Indians and Mormons who lived there em- 
phasized the essential character of the region: 
although the land is potentially irrigable, there is 
an insufficient supply of water. Powell believed that 
the aboriginal Indians and the Mormons had de- 
veloped social institutions and practices to adapt 
themselves to their environment. Because these 
lands were part of the public domain, Powell called 
upon the federal government to start land-utiliza- 
tion projects. His Report on the Lands of the Arid 
Region of the United States (1878) is considered 
one of the landmarks in conservation. 

From 1871 to 1879, Powell headed one of the 
four federal geological and geographical surveys 
of land in the public domain. These were replaced 
in 1879 by the U.S. Geological Survey. The Bureau 
of Ethnology was organized at the same time, and 
Powell became its first director, a position he held 
until his death. As an anthropologist, Powell is best 
known for An Introduction to the Study of Indian 
Languages (1877), in which he attempted a lin- 
guistic classification and set forth an interesting 
grouping of words by use, emotion, and purpose. 

Powell's greatest reputation as a scientist re- 
sulted from his studies of land forms and erosion. 
He advanced many original ideas in geology but 
was content to see them pursued by others. He T 
strongly influenced by the writings of Darwin bu 
believed that effective evolution does not apply to 
man, who by his intellect is able to divert or con- 
trol it. He wrote passionately about the misuse R 
man’s capacity to control his own destiny- v 

In 1881 Powell became director of the Geologiodt 
Survey, and he remained in this position be 3 
1892. This was an extremely influential post @ 
time when Congress was moving rapidly De 
regulatory field, Powell was successful in estal one 
ing, under the Geological Survey, an irrigation He 
vey which was intended to initiate the construc’ 


of extensive irrigation works. Congress curtailed 
the activities of the Irrigation Survey in 1885, but 
Powell continued with the mapping of water re- 
sources, and he trained specialists who later, in 
1902, staffed the Bureau of Reclamation, 

Powell lobbied diligently for the creation of a 
federal department of science with Cabinet rank 
and for concepts remarkably similar to those in- 
corporated eventually in the National Science Foun- 
dation. He believed that the federal government 
should be involved in the scientific and technologi- 
cal interests of the nation and patronize its institu- 
tions of research. He also believed in the social 
function of science and would not divorce it from 
the development of pure science. This was con- 
sistent with his belief that there is an inevitable 
and gradual process of concentration or centraliza- 
tion of authority and powers in all social insti- 
tutions. He thought that in modern society there 
should be a continuing combination of similar 
institutions (“corporations”), whether they be gov- 
ernmental, industrial, religious, or scientific. Para- 
doxically, he believed in extreme individualism. 

Late in life Powell assembled his diversified 
thoughts into a trilogy, only the first volume of 
which, Truth and Error (1898), was published. 
It contained a strange mixture of material and 
fared badly at the hands of reviewers, although it 
embodied many of Powell’s keen observations on 
man and evolution. 

It is difficult to assess Powell’s influence. He was 
a man of action, a skillful organizer and admin- 
istrator. Many regard him as one of the most suc- 
cessful of the government servants who have deter- 
mined the role of government in science. His 
personal influence upon his associates, for exam- 
ple on Lester Ward and especially on those in the 
Geological Survey, was remarkable. 

Many academic and professional honors were 
showered upon Powell, although he cared little for 
them. He died at his summer home at Haven, 
Maine, and was buried in Arlington National 


Cemetery. 
k WIiLLIaM C. DARRAH 


[Other relevant material may be found in aa 
TION; INDIANS, NORTH AMERICAN; Science, article 
ON SCIENCE-GOVERNMENT RELATIONS.] 
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POWER 


In approaching the study of politics through the 
analysis of power, one assumes, at a minimum, 
that relations of power are among the significant 
aspects of a political system, This assumption, and 
therefore the analysis of power, can be applied to 
any kind of political system, international, national, 
or local, to associations and groups of various kinds, 
such as the family, the hospital, and the business 
firm, and to historical developments. 

At one extreme, an analysis of power may simply 
postulate that power relations are one feature of 
politics among a number of others—but nonethe- 
less a sufficiently important feature to need em- 
phasis and description. At the other extreme, an 
analyst may hold that power distinguishes “politics” 
from other human activity; to analysts of this view 
“political science, as an empirical discipline, is the 
study of the shaping and sharing of power” (Lass- 
well & Kaplan 1950, p. xiv). 

In either case, the analyst takes it for granted 
that differences between political systems, or pro- 
found changes in the same society, can often be in- 
terpreted as differences in the way power is dis- 
tributed among individuals, groups, or other units. 
Power may be relatively concentrated or diffused; 
and the share of power held by different individ- 
uals, strata, classes, professional groups, ethnic, 
racial, or religious groups, etc., may be relatively 
great or small. The analysis of power is often con- 
cerned, therefore, with the identification of elites 
and leadership, the discovery of the ways in which 
power is allocated to different strata, relations 
among leaders and between leaders and nonleaders, 
and so forth. 

Although the approach to politics through the 
study of power relations is sometimes thought to 
postulate that everyone seeks power as the highest 
value, analysts of power generally reject this as- 
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sumption as psychologically untenable; the analysis 
of power does not logically imply any particular 
psychological assumptions. Sometimes critics also 
regard the analysis of power as implying that the 
pursuit of power is morally good or at any rate 
that it should not be condemned, But an analysis 
of power may be neutral as to values; or the analyst 
may be concerned with power, not to glorify it, but 
in order to modify the place it holds in human rela- 
tions and to increase the opportunities for dignity, 
respect, freedom, or other values (Jouvenel 1945; 
Lasswell & Kaplan 1950; Oppenheim 1961, chap- 
ters 8, 9). 

Indeed, it would be difficult to explain the extent 
to which political theorists for the past 25 centuries 
have been concerned with relations of power and 
authority were it not for the moral and practical 
significance of power to any person interested in 
political life, whether as observer or activist. Some 
understanding of power is usually thought to be 
indispensable for moral or ethical appraisals of 
political systems, From a very early time—certainly 
since Socrates, and probably before—men have 
been inclined to judge the relative desirability of 
different types of political systems by, among other 
characteristics, the relations of power and author- 
ity in these systems. In addition, intelligent action 
to bring about a result of some kind in a political 
system, such as a change in a law or a policy, a 
revolution, or a settlement of an international dis- 
pute, requires knowledge of how to produce or 
“cause” these results. In political action, as in other 
spheres of life, we try to produce the results we 
want by acting appropriately on the relevant causes. 
As we shall see, power relations can be viewed as 
causal relations of a particular kind. 

It therefore seems most unlikely that the analysis 
of power will disappear as an approach to the study 
of politics. However, the fact that this approach is 
important and relevant does not shield it from 
some serious difficulties. These have become par- 
ticularly manifest as the approach has been more 
earnestly and systematically employed. 

Origins 

The attempt to study and explain politics by 
analyzing relations of power is, in a loose sense, 
ancient. To Aristotle, differences in the location of 
power, authority, or rule among the citizens of a 
political society served as one criterion for differen- 
tiating among actual constitutions, and it entered 
into his distinction between good constitutions and 
bad ones [see ARISTOTLE]. With few exceptions 
(most notably Thomas Hobbes) political theorists 
did not press their investigations very far into cer- 


tain aspects of power that have seemed important 
to social scientists in the twentieth century [see 
HospEs]. For example, most political theorists took 
it for granted, as did Aristotle, that key terms like 
power, influence, authority, and rule (let us call 
them “power terms”) needed no great elaboration, 
presumably because the meaning of these words 
was clear to men of common sense. Even Machia- 
velli, who marks a decisive turning point from 
classical-normative to modern-empirical theory, 
did not consider political terms in general as par- 
ticularly technical. Moreover, he strongly preferred 
the concrete to the abstract. In his treatment of 
power relations Machiavelli frequently described a 
specific event as an example of a general principle; 
but often the general principle was only implied 
or barely alluded to; and he used a variety of un- 
defined terms such as imperio, forza, potente, and 
autorita [see MACHIAVELLI). 

From Aristotle to Hobbes political theorists were 
mainly concerned with power relations within a 
given community, But external relations even more 
than internal ones force attention to questions of 
relative power. The rise of the modern nation-state 
therefore compelled political theorists to recognize 
the saliency of power in politics, and particularly, 
of course, in international politics (Meinecke 1924). 

Thus political “realists” found it useful to define, 
distinguish, and interpret the state in terms of its 
power. Max Weber both reflected this tradition of 
“realism” and opened the way for new develop- 
ments in the analysis of power [see WEBER, Max]. 
“Power (Macht) is the probability that one actor 
within a social relationship will be in a position to 
carry out his own will despite resistance, regardless 
of the basis on which this probability rests” ( Weber 
[1922] 1957, p. 152). This definition permitted 
Weber to conclude that “the concept of power , 
highly comprehensive from the point of view 2 
sociology. All conceivable . . . combinations of cir- 
cumstances may put him [the actor] in a position 
to impose his will in a given situation” (p. 153). 
It follows that the state is not distinguishable from 
other associations merely because it rere = 
cial and uliarly important kind ©: be 
force. In Aao znd highly influential definition, 
Weber characterized the state as follows: A as 
pulsory political association with condgveni S z 
ization (politischer Anstaltsbetrieb) will ; aff 
a ‘state’ if and in so far as its administrative i = 
successfully upholds a claim to the yea? Ae 
the legitimate use of physical force in the ento: 
ment of its order” (p. 154). 

In his Erea typologies and his ana 
of political systems, however, Weber was less Ci 


cerned with power in general than with a special 
kind that he held to be unusually important— 
legitimate power, or authority. 

Later theorists, practically all of whom were di- 
rectly or indirectly influenced by Weber, expanded 
their objectives to include a fuller range of power 
relations. In the United States attempts to suggest 
or develop systematic and comprehensive theories 
of politics centering about power relations appeared 
in books by Catlin (1927; 1930), an important 
essay by Goldhamer and Shils (1939), and numer- 
ous works of the Chicago school—principally Mer- 
riam (1934), Lasswell (1936), and, in interna- 
tional politics, Morgenthau (1948). In the decade 
after World War 11 the ideas of the Chicago school 
were rapidly diffused throughout American politi- 
cal science, [See MERRIAM.] 


Elements in the analysis of power 


Power terms evidently cover a very broad cate- 
gory of human relations. Considerable effort and 
ingenuity have gone into schemes for classifying 
these relations into various types, labeled power, 
influence, authority, persuasion, dissuasion, in- 
ducement, coercion, compulsion, force, and so on, 
all of which we shall subsume under the collective 
label power terms. The great variety and hetero- 
geneity of these relations may, in fact, make it 
impossible—or at any rate not very fruitful—to 
develop general theories of power intended to cover 
them all. 

At the most general level, power terms in modern 
social science refer to subsets of relations among 
social units such that the behaviors of one or more 
units (the responsive units, R) depend in some 
circumstances on the behavior of other units (the 
controlling units, C). (In the following discussion, 
R will always symbolize the responsive or depend- 
ent unit, C the controlling unit. These symbols will 
be used throughout and will be substituted even 
in direct quotations where the authors themselves 
have used different letters.) By this broad defini- 
tion, then, power terms in the social sciences ex- 
clude relations with inanimate or even nonhuman 
objects; the control of a dog by his master or the 
power of a scientist over “nature” provided by a 
nuclear reactor would fall, by definition, in a dif- 
ferent realm of discourse. On the other hand, the 
definition could include the power of one nation to 
affect the actions of another by threatening to use 
a nuclear reactor as a bomb or by offering to trans- 
fer it by gift or sale. 

If power-terms include all relations of the kind 
just defined, then they spread very widely over 
the whole domain of human relations. In practice, 
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analysts of power usually confine their attention 
to smaller subsets. One such subset consists, for 
example, of relations in which “severe sanctions 

. . are expected to be used or are in fact applied 
to sustain a policy against opposition’—a subset 
that Lasswell and Kaplan call power (1950, pp. 74— 
75). However, there is no agreement on the com- 
mon characteristics of the various subsets covered 
by power terms, nor are different labels applied 
with the same meaning by different analysts. 

Despite disagreement on how the general con- 
cept is to be defined and limited, the variety of 
smaller subsets that different writers find interest- 
ing or important, and the total lack of a stand- 
ardized classification scheme and nomenclature, 
there is nonetheless some underlying unity in the 
various approaches to the analysis of power. In 
describing and explaining patterns of power, dif- 
ferent writers employ rather similar elements 
(compare Cartwright 1965), What follows is an 
attempt to clarify these common elements by ignor- 
ing many differences in terminology, treatment, 
and emphasis. 

Some descriptive characteristics. For purposes 
of exposition it is convenient to think of the analy- 
sis of power in terms of the familiar distinction 
between dependent and independent variables, The 
attempt to understand a political system may then 
be conceived of as an effort to describe certain 
characteristics of the system: the dependent vari- 
ables; and to explain why the system takes on these 
particular characteristics, by showing the effects 
on these characteristics of certain other factors: 
the independent variables. Some of the character- 
istics of a political system that analysts seek to 
explain are the magnitude of the power of the C’s 
with respect to the R’s, how this power is distributed 
in the system, and the scope, and domain, of control 
that different individuals or actors have, exercise, 
or are subject to. 

Magnitude. Political systems are often charac- 
terized explicitly or implicitly by the differences 
in the “amounts” of power (over the actions of the 
government or state) exercised by different indi- 
viduals, groups, or strata. The magnitude of C's 
power with respect to R is thought of as measur- 
able, in some sense, by at least an ordinal scale; 
frequently, indeed, a literal reading would imply 
that power is subject to measurement by an inter- 
val scale. How to compare and measure different 
magnitudes of power poses a major unsolved prob- 
lem; we shall return to it briefly later on. Mean- 
while, we shall accept the assumption of practically 
every political theorist for several thousand years, 
that it is possible to speak meaningfully of differ- 
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ent amounts of power. Thus a typical question in 
the analysis of a political system would be: Is con- 
trol over government highly concentrated or rel- 
atively diffused? 

Distribution. An ancient and conventional way 
of distinguishing among political systems is accord- 
ing to the way control over the government or the 
state is distributed to individuals or groups in the 
systems. Aristotle, for example, stated: “The proper 
application of the term ‘democracy’ is to a consti- 
tution in which the free-born and poor control the 
government—being at the same time a majority; 
and similarly the term ‘oligarchy’ is properly ap- 
plied to a constitution in which the rich and better- 
born control the government—being at the same 
time a minority” (Politics, Barker ed., p. 164). 
Control over government may be conceived as 
analogous to income, wealth, or property; and in 
the same way that income or wealth may be dis- 
tributed in different patterns, so too the distribution 
of power over government may vary from one soci- 
ety or historical period to another. One task of 
analysis, then, is to classify and describe the most 
common distributions and to account for the dif- 
ferent patterns. Typical questions would be: What 
are the characteristics of the C's and of the R's? 
How do the C's and R's compare in numbers? Do 
C’s and R's typically come from different classes, 
strata, regions, or other groups? What historical 
changes have occurred in the characteristics of 
C and R? 

Scope. What if C’s are sometimes not C’s, or 
C's sometimes R’s, or R’s sometimes C's? The pos- 
sibility cannot be ruled out that individuals or 
groups who are relatively powerful with respect to 
one kind of activity may be relatively weak with 
respect to other activities. Power need not be gen- 
eral; it may be specialized. In fact, in the absence 
of a single world ruler, some specialization is inev- 
itable; in any case, it is so commonplace that ana- 
lysts of power have frequently insisted that a state- 
ment about the power of an individual, group, state, 
or other actor is practically meaningless unless it 
specifies the power of actor C with respect to some 
class of R’s activities. Such a class of activities is 
sometimes called the range (Cartwright 1965) or 
the scope of C’s power (Lasswell & Kaplan 1950, 
p. 73). There is no generally accepted way of de- 
fining and classifying different scopes. However, 
a typical question about a political system would 
be: Is power generalized over many scopes, or is 
it specialized? If it is specialized, what are the 
characteristics of the C’s, the elites, in the different 
scopes? Is power specialized by individuals in the 
sense that C, and C, exercise power over different 


scopes, or is it also specialized by classes, social 
strata, skills, professions, or other categories? 

Domain. C’s power will be limited to certain 
individuals; the R’s over whom C has or exercises 
control constitute what is sometimes called the 
“domain,” or “extension,” of C's power (Lasswell & 
Kaplan 1950, p. 73; Harsanyi 1962a, p. 67). Typi- 
cal questions thus might be: Who are the R’s over 
whom C has control? What are their character- 
istics? How numerous are they? How do they differ 
in numbers or characteristics from the R’s not 
under C’s control? 

Given the absence of any standard unit of meas- 
ure for amounts, distributions, scopes, domains, 
and other aspects of power, and the variety of ways 
of describing these characteristics, it is not at all 
surprising that there is an abundance of schemes 
for classifying political systems according to some 
characteristic of power. Most such schemes use, 
implicitly or explicitly, the idea of a distribution of 
power over the behavior of government. The oldest, 
most famous, and most enduring of these is the 
distinction made by the Greeks between rule by 
one, the few, and the many (see Aristotle, Politics, 
Barker ed., pp. 110 ff.). Some variant of this 
scheme frequently reappears in modern analyses 
of power (e.g., Lasswell & Kaplan 1950, p. 218). 
Often, as with Aristotle himself, the distribution of 
power is combined with one or more other dimen- 
sions (e.g., Dahl 1963, p. 38). Rough dichotomous 
schemes are common. One based on “the degree 
of autonomy and interdependence of the several 
power holders” distinguishes two polar types, called 
autocracy and constitutionalism (Loewenstein 
1957, p. 29). American community studies have 
in recent years called attention to differences be- 
tween “pluralistic” systems and unified or highly 
stratified “power structures” [see CoMMUNITY, af- 
ticle on THE STUDY OF COMMUNITY POWER]. In one 
study that compares four communities the authors 
developed a more complex typology of power suu 
tures by combining a dimension of “distribution 
of political power among citizens” with the ye 
of convergence or divergence in the ideology 0 
leaders; the four types of power structures pro- 
duced by dichotomizing these two dimensions are 
in turn distinguished from regimes (Agger ee 
1964, pp. 73 ff.). 

Some explanatory characteristi de 
ferent types of political systems, how are iE 
differences among them to be explained? If, Fil 
example, control over government is sometimes i 
tributed to the many, often to the few, and g 
sionally to one dominant leader, how can as 
account for the differences? Obviously these 


cs. Given the dif- 


ancient, enduring, and highly complex problems; 
and there is slight agreement on the answers. 
However, some factors that are often emphasized 
in modern analysis can be distinguished. 
Resources. Differences in patterns or structures 
of power may be attributed primarily, mainly, or 
partly to the way in which “resources,” or “base 
values,” are distributed among the individuals, 
strata, classes, and groups in different communi- 
ties, countries, societies, and historical periods. 
This is an ancient, distinguished, widespread, and 
persuasive mode of explanation, used by Aristotle 
in Greece in the fourth century B.C., by James Har- 
rington in seventeenth-century England, by the 
fathers of the American constitution in the late 
eighteenth century, by Marx and Engels in the 
nineteenth century, and by a great many social 
scientists in the twentieth century. A central hy- 
pothesis in most of these theories is that the 
greater one’s resources, the greater one’s power. 
Although explanations of this kind do not always 
go beyond tautology (by defining power in terms 
of resources), logical circularity is certainly not 
inherent in this mode of explanation. However, 
there is no accepted way of classifying resources 
or bases. Harold Lasswell has constructed a com- 
prehensive scheme of eight base values which, al- 
though not necessarily exhaustive, are certainly 
inclusive; these are power (which can serve as a 
base for more power), respect, rectitude or moral 
standing, affection, well-being, wealth, skill, and 
enlightenment (Lasswell & Kaplan 1950, p. 87). 
Other writers choose more familiar categories to 
classify resources: for example, in trying to ac- 
count for the patterns of influence in one com- 
munity, the author described the patterns of social 
standing; the distribution of cash, credit, and 
wealth; access to legality, popularity, and control 
over jobs; and control over sources of information 
(Dahl 1961, pp. 229 ff.). ; 
Skill. Two individuals with access to approxi- 
mately the same resources may not exercise the 


same degree of power (over, let us say, govern- 
ment decisions). Indeed, it is a common observa- 
wealth 


tion that individuals of approximately equal 
or social status may differ greatly in power. To be 
sure, this might be accounted for by differences 
in access to other resources, such as the greater 
legality, bureaucratic knowledge, and public affec- 
tion that fall to any individual who is chosen, say, 
to be prime minister of Britain or president of the 
United States. Another factor, however, one given 
Particular prominence by Machiavelli, is political 
skill, Formally, skill could be treated as another 
resource, Nonetheless, it is generally thought to be 
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of critical importance in explaining differences in 
the power of different leaders—different presidents, 
for example, as in Neustadt's comparison of presi- 
dents Roosevelt, Truman, and Eisenhower (1960, 
pp. 152 ff.). However, despite many attempts at 
analysis, from Machiavelli to the present day, polit- 
ical skill has remained among the more elusive 
aspects in the analysis of power. 

Motivations. Two individuals with access to 
the same resources may exercise different degrees 
of power (with respect to some scope) because of 
different motivations: the one may use his re- 
sources to increase his power; the other may not. 
Moreover, since power is a relationship between 
C’s and R’s, the motivations not only of the C’s but 
also of the R’s are important. One person may 
worship authority, while another may defy it. A 
number of writers have explored various aspects 
of motivations involved in power relations (e.g., 
Lasswell 1930; Rogow & Lasswell 1963; Cartwright 
1959). 

Costs. Motivations can be related to resources 
by way of the economists’ language of cost—a 
factor introduced into the analysis of power by a 
mathematical economist (Harsanyi 1962a; 1962b), 
In order to control R, C may have to use some of 
his resources. Thus C’s supply of resources is likely 
to have a bearing on how far he is willing to go 
in trying to control R. And variations in C's re- 
sources are likely to produce variations in C's 
power. C’s opportunity costs in controlling R—that 
is, what C must forgo or give up in other oppor- 
tunities as a result of using some of his resources 
to control R—are less (other things being equal ) 
if he is rich in resources than if he is poor in 
resources. In concrete terms, to a rich man the 
sacrifice involved in a campaign contribution of 
$100 is negligible; to a poor man the sacrifice en- 
tailed in a contribution of $100 is heavy. C's will- 
ingness to use his resources to control R will also 
depend on the value to C of R’s response; the value 
of R’s response is, in turn, dependent in part on 
C’s motivations. The relationship may also be ex- 
amined from R’s point of view. R's opportunity 
costs consist of what he is then unable to do if he 
complies with C. In R’s case, as in C’s, his supply 
of resources and his motivations help determine 
his opportunity costs. Thus a power relation can 
be interpreted as a sort of transaction between C 


and R. 


Problems of research 

Like all other approaches to an understanding 
of complex social phenomena, the analysis of 
power is beset with problems. At a very general 
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level, attempts to analyze power share with many 
—perhaps most—other strategies of inquiry in the 
social sciences the familiar dilemma of rigor versus 
relevance, and the dilemma has led to familiar 
results. Attempts to meet high standards of logical 
rigor or empirical verification have produced some 
intriguing experiments and a good deal of effort 
to clarify concepts and logical relationships but not 
rounded and well-verified explanations of complex 
political systems in the real world. Conversely, at- 
tempts to arrive at a better understanding of the 
more concrete phenomena of political life and in- 
stitutions often sacrifice a good deal in rigor of 
logic and verification in order to provide more 
useful and reliable guides to the real world. 

There are, however, a number of more specific 
problems in the analysis of power, many of which 
have only been identified in the last few decades. 
Relevant work is quite recent and seeks (1) to 
clarify the central concepts, partly by expanding 
on the analogy between power relations and causal 
relations, (2) to specify particular subsets that are 
most interesting for social analysis, (3) to develop 
methods of measurement, and (4) to undertake 
empirical investigations of concrete political phe- 
nomena. 

Power and cause. The closest equivalent to the 
power relation is the causal relation. For the asser- 
tion “C has power over R,” one can substitute the 
assertion, “C’s behavior causes R’s behavior.” If one 
can define the causal relation, one can define in- 
fluence, power, or authority, and vice versa (Simon 
[1947-1956] 1957, p. 5). 

Since the language of cause is no longer common 
in the formal theoretical language of the natural 
Sciences, it might be argued that social scientists 
should also dispense with that language and that 
insofar as power is merely a term for a causal 
relation involving human beings, power-terms 
should simultaneously be dispensed with. But it 
seems rather unlikely that social scientists will, in 
fact, reject causal language. For the language of 
cause, like the language of power, is used to inter- 
pret situations in which there is the possibility that 
some event will intervene to change the order of 
other events. In medical research it is natural and 
meaningful to ask, Does cigarette smoking cause 
lung cancer and heart disease? In social situations 
the notion of cause is equally or even more appro- 
priate. What makes causal analysis important to 
us is our desire to act on causes in the real world 
in order to bring about effects—reducing death 
rates from lung cancer, passing a civil-rights bill 
through Congress, or preventing the outbreak of 
war. 

To interpret the terms power, influence, authority, 


etc., as instances of causal relations means, how- 
ever, that the attempt to detect true rather than 
spurious power relations must run into the same 
difficulties that have beset efforts to distinguish 
true from spurious causal relations. Some analysts 
have confronted the problem; others have noted it 
only to put it aside; most have ignored it entirely, 
perhaps on the assumption that if social scientists 
tried to solve the unsolved problems of philosophy 
they would never get around to the problems of the 
social sciences. Yet if power is analogous to cause 
—or if power relations are logically a subset of 
causal relations—then recent analyses of causality 
must have relevance to the analysis of power. 

In the first place, properties used to distinguish 
causation also serve to define power relations: 
covariation, temporal sequence, and asymmetry, 
for example. The appropriateness of these criteria 
has in fact been debated, not always conclusively, 
by various students of power (e.g., Simon [1947— 
1956] 1957, pp. 5, 11, 12, 66; Dahl 1957, p. 204; 
Cartwright 1959, p. 197; Oppenheim 1961, p. 104). 

Thus, the problem whether A can be said to 
cause B if A is a necessary condition for B, or a 
sufficient condition, or both necessary and suffi- 
cient, has also plagued the definition of power- 
terms. Some writers have explicitly stated or at 
least implied that relations of power mean that 
some action by C is a necessary condition for R’s 
response (Simon 1953, p. 504; March 1955, p. 435; 
Dahl 1957, p. 203). Oppenheim has argued, how- 
ever, that such definitions permit statements that 
run flatly counter to common sense; he holds 
that it would be more appropriate to require only 
that C’s action be sufficient to produce R’s re- 
sponse (1961, p. 41). Riker has suggested in turn 
that “the customary definition of power be revised 
. .. to reflect the necessary-and-sufficient condition 
theory of causality” (1964, p. 348). However, 
Blalock in his Causal Inferences in Non-experimen- 
tal Research has shown that defining cause in terms 
of necessary and sufficient conditions leads to great 
practical difficulties in research. “In real-life situ- 
ations we seldom encounter instances where B is 
present if and only if A is also present” (1964, 
p. 30); moreover, specifying necessary and suffi- 
cient conditions requires the researcher “to think 
always in terms of attributes and dichotomies,” 
whereas “there are most certainly a number of 
variables which are best conceived as continuously 
distributed, even though we may find it difficult to 
measure them operationally in terms of a specified 
unit of some kind” (p. 32). “The use of ‘necessary 
and sufficient’ terminology . . . may work well for 
the logician but not [for] the social scientist” (p. 34). 
Blalock’s criticism, and indeed his whole effort to 


explore problems of causal inference in nonexperi- 
mental research, are highly relevant to the analysis 
of power. 

Aside from these somewhat rarefied philosophi- 
cal and definitional questions, which many social 
scientists are prepared to abandon to metaphysi- 
cians or philosophers of science, the analogy be- 
tween power and cause argues that the problem of 
distinguishing cause from correlation, or true from 
spurious causation, is bound to carry over into the 
analysis of power. And indeed it does. The difficulty 
of distinguishing true from spurious power rela- 
tions has proved to be quite formidable. 

The most rigorous method of distinguishing true 
from spurious causation is, of course, experimenta- 
tion, and this would be the most rigorous method 
for distinguishing true from spurious power rela- 
tions, provided the proper experimental conditions 
were present. Unfortunately, however, as in many 
areas of the social sciences, so too in the analysis 
of power, experimental methods have so far been 
of limited value, and for similar reasons. In non- 
experimental situations the optimal requirements 
for identifying causal relations seem to be the 
existence of satisfactory interval measures, a large 
supply of good data employing these measures, 
and an exhaustive analysis of alternative ways of 
accounting for the observations (Blalock 1964). 
Unfortunately, in the analysis of power, existing 
methods of measurement are rather inadequate, 
the data are often inescapably crude and limited, 
a variety of simple alternative explanations seem 
to fit the data about equally well, and in any case 
the complexity of the relations requires extraordi- 
narily complex models, 

The shortage of relevant models of power may 
disappear in time. In fact, the causal analogue sug- 
gests that the development of a great array of 
carefully described alternative models to compare 
with observations is probably a prerequisite for 
further development in the analysis of power. 
Again, the analogy between power and cause read- 
ily reveals why this would seem to be the case. 
In trying to determine the cause of a phenomenon 


it is of course impossible to know whether all the 


relevant factors in the real world are actually 


controlled during an investigation. Consequently, 
it is never possible to demonstrate causality. 


usal inferences concerning the 
s, at least in the sense that 
ting inadequate models that 
ot consistent with the data. 


It is possible to make cai 
adequacy of causal model: 
we can proceed by elimina! 
make predictions that are ni 5 
e. [Such] causal models involve a) a finite set of 
explicitly defined variables, (2) certain assumptions 
about how these variables are interrelated causally, 


and (3) assumptions to the effect that outside varia- 
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bles, while operating, do not have confounding influ- 
ences that disturb the causal patterning among the 
variables explicitly being considered. (ibid., p. 62) 


If power relations are a subset of causal relations, 
these requirements would also be applicable in the 
analysis of power. 

In analyzing power, why have analysts so rarely 
attempted to describe, in rigorous language at any 
rate, the alternative causal models relevant to their 
inquiry? There seem to be several reasons. First, 
students of power have not always been wholly 
aware that distinguishing true from spurious power 
relations requires intellectual strategies at a rather 
high level of sophistication. Second, the crude qual- 
ity of the observations usually available in studying 
power may discourage efforts to construct elegant 
theoretical models. Third, until recent times the 
whole approach to power analysis was somewhat 
speculative: there were a good many impression- 
istic works but few systematic empirical studies of 
power relations. Of the empirical studies now avail- 
able most are investigations of power relations in 
American communities undertaken since 1950. 
These community studies have provoked a good 
deal of dispute over what are, in effect, alternative 
models of causation. So far, however, investigators 
have usually not described clearly the array of al- 
ternative models that might be proposed to explain 
their data, nor have they clearly specified the cri- 
teria they use for rejecting all the alternatives 
except the one they accept as their preferred 
explanation. 

Theories about power relations in various polit- 
ical systems are of course scattered through the 
writings of a number of analysts (e.g. Pareto 1916, 
volume 4; Mosca 1896, passim; Lasswell & Kaplan 
1950, chapters 9, 10; Mills 1956; Dahl 1961; Rossi 
1960; Polsby 1963; Parsons 19634; 1963b). But 
a straightforward presentation of an empirical the- 
ory of power relations in political systems is a 
rarity. A notable exception is offered by March's 
formulation of six models of social choice that in- 
volve, in some sense, relationships of power. 

The analogy between cause and power calls at- 
tention to one further point: any attempt to de- 
velop an empirical theory of power will run head- 
long into the fact that a causal chain has many 
links; that the links one specifies depend on what 
one wishes to explain; and that what one wishes to 
explain depends, in part, on the theory with which 
one begins. In causal analysis, it is usually 


. . . possible to insert a very large number of additional 
variables between any two supposedly directly related 
factors. We must stop somewhere and consider the 
theoretical system closed. Practically, we may choose 
to stop at the point where the additional variables are 
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either difficult or expensive to measure, or where they 
have not been associated with any operations at all... . 
A relationship that is direct in one theoretical system 
may be indirect in another, or it may even be taken as 
spurious. (Blalock 1964, p. 18) 


Some of the links that a power analyst may take 
as “effects” to be explained by searching for causes 
are the outcomes of specific decisions; the cur- 
rent values, attitudes, and expectations of decision 
makers; their earlier or more fundamental attitudes 
and values; the attitudes and values of other par- 
ticipants—or nonparticipants—whose participation 
is in some way significant; the processes of selec- 
tion, self-selection, recruitment, or entry by which 
decision makers arrive at their locations in the po- 
litical system; the rules of decision making, the 
structures, the constitutions. No doubt a “com- 
plete” explanation of power relations in a political 
system would try to account for all of these effects, 
and others. Yet this is an enormously ambitious 
task. Meanwhile, it is important to specify which 
effects are at the focus of an explanatory theory 
and which are not. A good deal of confusion, and 
no little controversy, are produced when different 
analysts focus on different links in the chain of 
power and causation without specifying clearly 
what effects they wish to explain; and a good deal 
of criticism of dubious relevance is produced by 
critics who hold that an investigator has focused 
on the “wrong” links or did not provide a “com- 
plete” explanation. 

Classifying types of power. Even though the 
analysis of power has not produced many rigorous 
causal models, it has spawned a profusion of 
schemes for classifying types of power relations 
(e.g., Parsons 1963a; 1963b; Oppenheim 1961; 
French & Raven 1959; Cartwright 1965). 

Among the characteristics most often singled out 
for attention are (1) legitimacy: the extent to 
which R feels normatively obliged to comply with 
C; (2) the nature of the sanctions; whether C uses 
rewards or deprivations, positive or negative sanc- 
tions; (3) the magnitude of the sanctions: extend- 
ing from severe coercion to no sanctions at all; 
(4) the means or channels employed: whether C 
controls R only by means of information that 
changes R’s intentions or by actually changing R’s 
situation or his environment of rewards and dep- 
rivations. These and other characteristics can be 
combined to yield many different types of power 
relations. 

As we have already indicated, no single classi- 
fication system prevails, and the names for the 
various categories are so completely unstandard- 
ized that what is labeled power in one scheme may 


be called coercion or influence in another. De- 
tached from empirical theories, these schemes are 
of doubtful value. In the abstract it is impossible 
to say why one classification system should be pre- 
ferred over another. 

Nonetheless, there are some subsets of power 
relations—types of power, as they are often called 
—that call attention to interesting problems of 
analysis and research. One of these is the distinc- 
tion between having and exercising power or influ- 
ence (Lasswell & Kaplan 1950, p. 71; Oppenheim 
1961, chapters 2, 3). This distinction is also in- 
volved in the way anticipated reactions function as 
a basis for influence and power (Friedrich 1963, 
chapter 11). 

To illustrate the problem by example, let us sup- 
pose that even in the absence of any previous com- 
munication from the president to Senator R, or 
indeed any previous action of any kind by the 
president, Senator R regularly votes now in a way 
he thinks will insure the president's favor later. 
The senator calculates that if he loses the next 
election, he may, as a result of the president's 
favorable attitude, be in line to receive a presi- 
dential appointment to a federal court. Thus, while 
Senator R’s voting behavior is oriented toward fu- 
ture rewards, expected or hoped for, his votes are 
not the result of any specific action by the 
president. 

If one holds that C cannot be a cause of R if C 
follows R in time, then no act of the incumbent 
president need be a cause of Senator R’s favorable 
vote, Obviously this does not mean that Senator R's 
actions are “uncaused.” The immediate determi- 
nant of his vote is his expectations. If we ask what 
“caused” his expectations, there are many possible 
answers. For example, he might have concluded 
that in American society if favors are extended to 
C, this makes it more likely that C will be indul- 
gent later on. Or he may have acquired from polit- 
ical lore the understanding that the general rule 
applies specifically to relations of senators and 
presidents. Thus, the causal chain recedes into the 
senator’s previous learning—but not necessarily to 
any specific past act of the incumbent president or 
any other president. 

This kind of phenomenon is commonplace, im- 
portant, and obviously relevant to the analysis of 
power. Yet some studies, critics have said, concen- 
trate on the exercise of power and fail to account 
for individuals or groups in the community who, 
though they do not exercise power, nonetheless 
have power, in the sense that many people try 
assiduously to anticipate their reactions (Bachrach 
& Baratz 1962). This failure may be a result of 


certain paradoxical aspects of having power that 
can make it an exceedingly difficult phenomenon 
to study. 

For in the limiting case of anticipated reactions, 
it appears, paradoxically, that it is not the presi- 
dent who controls the senator, but the senator who 
controls the president—i.e., it is the senator who, 
by his loyal behavior, induces the president to ap- 
point him to a federal court. Thus, it is not C who 
controls or even attempts to control R, but R who 
attempts to control C—and to the extent that R 
anticipates C’s reactions correctly, R does in fact 
control C. It is, then, not the king who controls 
the courtier but the courtier who controls the 
king. 

Now if we examine this paradox closely we 
quickly discover that it arises simply because we 
have tried to describe the relationship between king 
and courtier, president and senator, C and R by 
distinguishing only one aspect, namely, the exer- 
cise of power. The courtier does indeed exercise 
power over the king by successfully anticipating the 
reactions of the monarch and thereby gaining a 
duchy. But it was not this that we set out to ex- 
plain. For it is the king who has, holds, or possesses 
the capacity to confer that dukedom, and even 
though he does not exercise his power, he gains 
the willing compliance of the courtier. 

What is it, then, that distinguishes having power 
from exercising power? The distinction could hinge 
upon the presence or absence of a manifest inten- 
tion, We could define the exercise of power in such 
a way as to require C to manifest an intention to 
act in some way in the future, his action to be con- 
tingent on R’s behavior. By contrast, C might be 
said to have power when, though he does not mani- 
fest an intention, R imputes an intention to him 
and shapes his behavior to meet the imputed in- 
tention. If one were to accept this distinction, then 
in studying the exercise of power, one would have 
to examine not only R’s perceptions and responses 
but also C’s intentions and actions. In studying re- 
lationships in which C is thought to have power, 
even though he does not exercise it, one would in 
principle need only to study R’s perceptions, the 
intentions R imputes to C, and the bearing of these 
on R’s behavior. Carried to the extreme, then, this 
kind of analysis could lead to the discovery of as 
many different power structures in a political sys- 
tem as there are individuals who impute different 
intentions to other individuals, groups, Or strata in 
the system. 

The distinction betwee 
power could also turn on t 
in the relation between C an 


n having and exercising 
he directness involved 
d R and on the spe- 
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cificity of the actions. In the most direct relation- 
ship R’s response would be tripped off by a signal 
directly from C. In this case, C is exercising power. 
But some relationships are highly indirect; for ex- 
ample, C may modify R’s environment in a more 
or less lasting way, so that R continues to respond 
as C had intended, even though C makes no effort 
to control R. In these cases, one might say that 
although C does not exercise control over R, he 
does have control over R. There are a variety of 
these indirect, or “roundabout,” controls (Dahl & 
Lindblom 1953, pp. 110 ff.). 

Measuring power. Even more than with power 
terms themselves, notions of “more” or “less” pow- 
er were in classical theory left to the realm of 
common sense and intuition. Efforts to develop 
systematic measures of power date almost wholly 
from the 1950s. Of those, some are stated partly in 
mathematical formulas, some entirely in non- 
mathematical language. Since the essential fea- 
tures can be suggested without mathematics, we 
shall describe these measures in ordinary language. 
(The reader should consult the sources cited for 
the precise formulations. Most of the best-known 
measures are presented and discussed in Riker 
1964.) 

In a rough way, the various criteria for measur- 
ing power can be classified into three types: game- 
theoretical, Newtonian, and economic. 

Game-theoretical criteria. Shapley, a mathema- 
tician, and Shubik, an econometrician, have jointly 
formulated a “method for evaluating the distribu- 
tion of power in a committee system” (1954). This 
is intended to measure the power accruing to a 
voter where the outcome or decision is determined 
exclusively by voting. In these cases the rules pre- 
scribe what proportion of votes constitutes a win- 
ning proportion (eg a simple majority of all 
committee members). Thus each member has a 
certain abstract probability of casting the last vote 
that would be needed to complete a winning coali- 
tion, in other words to occupy 4 pivotal position 
with respect to the outcome. By adding his vote at 
this crucial juncture, a voter may be conceived of 


as having made a particularly decisive contribution 


to the outcome; thus, gaining his vote might have 
bers of a coali- 


considerable value to the other mem! 

tion that would lose without his vote. Shapley and 
Shubik proposed measuring the power of a voter 
by the probability that he would be the pivotal 
voter in a winning coalition, Because their measure 
is entirely limited to voting situations and excludes 
all outcomes other than the act of voting itself, the 
utility of the measure is limited to cases where 
most of the other familiar elements of political 
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life—various forms of persuasion, inducement, and 
coercion—are lacking. [See CoALITIONS.] 

Newtonian criteria. On the analogy of the meas- 
urement of force in classical mechanics, a number 
of analysts propose to measure power by the 
amount of change in R attributable to C. The 
greater the change in R, the greater the power of 
C; thus C, is said to exert more power than C, if 
C, induces more change in R, than C, induces in 
R, (or in some other R). Measures of this kind 
have been more frequently proposed than any other 
(Simon 1947-1956; March 1957; Dahl 1957; 1963, 
chapter 5; Cartwright 1959; Oppenheim 1961, 
chapter 8). 

“Change in R” is not, however, a single dimen- 
sion, since many different changes in R may be 
relevant. Some of the important dimensions of the 
“change in R” brought about by C that have been 
suggested for measuring the amount of C’s power 
are (1) the probability that R will comply; (2) 
the number of persons in R; (3) the number of 
distinct items, subjects, or values in R; (4) the 
amount of change in R’s position, attitudes, or 
psychological state; (5) the speed with which R 
changes; (6) the reduction in the size of the set 
of outcomes or behaviors available to R; and (7) 
the degree of R’s threatened or expected depri- 
vation. 

Economic criteria. Where the game-theoretical 
measure focuses on the pivotal position of C, and 
Newtonian measures on changes in R, a third pro- 
posal would include “costs” to both C and R in 
measuring C’s power. Harsanyi has argued that a 
complete measure of power should include (1) the 
opportunity costs to C of attempting to influence 
R, which Harsanyi calls the costs of C’s power, and 
(2) the opportunity costs to R of refusing to com- 
ply with C, which Harsanyi calls the strength of 
C’s power over R (1962a, pp. 68 ff.). The measure 
Harsanyi proposes is not inherently limited to the 
kinds of cost most familiar to economists but could 
be extended—at least in principle—to include psy- 
chological costs of all kinds. 

Designing operational definitions. Empirical 
studies discussed by Cartwright (1965), March 
(1965), and others, and particularly community 
studies, have called attention to the neglected prob- 
lem of designing acceptable operational definitions. 

The concepts and measures discussed in this 
article have not been clothed in operational lan- 
guage. It is not yet clear how many of them can be. 
Yet the researcher who seeks to observe, report, 
compare, and analyze power in the real world, in 
order to test a particular hypothesis or a broader 
theory, quickly discovers urgent need for opera- 


tionally defined terms. Research so far has called 
attention to three kinds of problems. First, the gap 
between concept and operational definition is gen- 
erally very great, so great, indeed, that it is not 
always possible to see what relation there is be- 
tween the operations and the abstract definition. 
Thus a critic is likely to conclude that the studies 
are, no doubt, reporting something in the real 
world, but he might question whether they are re- 
porting the phenomena we mean when we speak 
of power. Second, different operational measures 
do not seem to correlate with one another (March 
1956), which suggests that they may tap different 
aspects of power relations. Third, almost every 
measure proposed has engendered controversy 
over its validity. 

None of these results should be altogether sur- 
prising or even discouraging. For despite the fact 
that the attempt to understand political systems by 
analyzing power relations is ancient, the system- 
atic empirical study of power relations is remark- 
ably new. 

RoBERT A. DAHL 


[See also CommunrIry, article on THE STUDY OF COM- 
MUNITY POWER; POLITICAL SCIENCE; POLITICAL 
THEORY. Directly related are the entries AUTHORITY; 
BALANCE OF POWER; GOVERNMENT; MILITARY POW- 
ER POTENTIAL; POWER TRANSITION. Other relevant 
mdterial may be found in CAUSATION; COERCION; 
DECISION MAKING; INTERNATIONAL RELATIONS; Mo- 
NOPOLY; OLIGOPOLY; POLITICAL PROCESS; SOCIAL 
CONTROL.) 
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POWER TRANSITION 


The theory of power transition attempts to ac- 
count for the shifts of power and the causes of 
conflict among nations. According to this theory, 
the spread of industrialization to different nations 
at different times and at differing rates provides 
the key to understanding the fundamental patterns 
of contemporary international relations. 


Stages of power transition 

An industrializing nation undergoes a number 
of changes as it modernizes its economy. Typically, 
such a nation not only increases its wealth and 
its industrial strength but also grows in population 
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and improves the efficiency of its political insti- 
tutions. Since economic development, population 
size, social mobility, and political mobilization are 
among the major determinants of national power, 
an industrializing nation also increases its power, 
i.e., its ability to influence the behavior of other 
nations. It goes through a “power transition.” 

This power transition can for convenience be 
divided into three stages, although in reality the 
process is continuous. 

Potential power. First comes the “stage of po- 
tential power,” a preindustrial stage in which the 
population may be large or small and is often 
growing rapidly but in which the economy and the 
government are backward compared to more de- 
veloped nations. The economy is primarily sub- 
sistence agriculture. Productivity and living stand- 
ards are low, technical skills are few, and capital 
is extremely scarce. Governmental institutions are 
inefficient, and national unity is often, though not 
always, slight. Countries in this stage are often 
ruled by foreign conquerors or by small aristocra- 
cies; the common people participate little in na- 
tional government except to pay taxes. 

The human and material resources of such a 
nation are largely unorganized and only partly 
used; and the power of such a nation is slight 
compared to that of any industrial nation, although 
of course it may be greater than that of some other 
underdeveloped country. 

The power of a preindustrial nation is largely 
potential, to be realized when and if it modernizes 
its economy and its government. For a nation with 
a large population, however, the size of its poten- 
tial power may be great indeed. India, for example, 
by industrializing fully, would become one of the 
most powerful nations on earth; and other nations, 
recognizing this potential, grant India today some 
of the deference due to the power she may have 
tomorrow. 

Transitional growth. The second stage of power 
transition is the “stage of transitional growth in 
power.” During this stage the nation is in transi- 
tion from an agricultural to an industrial economy, 
and as it industrializes it grows in power. 

Fundamental changes take place during this 
stage. Economic modernization brings higher 
productivity, increased national income, and higher 
living standards. Political modernization brings a 
larger and more efficient government bureaucracy 
and increases the control of the central govern- 
ment over the nation. The general public is more 
affected by governmental action and participates 
more in governmental activities, and nationalistic 


sentiment often reaches a high pitch. Population 
size generally increases rapidly, for modern con- 
ditions reduce the death rate sharply. Industrial- 
ization, urbanization, secularization, and other re- 
lated changes alter the whole fabric of national 
life. 

Many of these changes have the effect of in- 
creasing the nation’s power, relative to both that 
of the other preindustrial nations it leaves behind 
and that of the already industrial nations it is 
beginning to catch up with. The speed of this gain 
in power and the degree to which it upsets the 
international community depend in large part 
upon the size of the nation and upon the speed 
with which it industrializes. 

The Soviet Union provides a good example of a 
nation in the stage of transitional growth in power, 
although it is now at the end of this stage. Its 
rapid industrialization and concomitant rise in 
power have changed the whole focus of interna- 
tional relations in the mid-twentieth century. 

Power maturity. The third stage of power tran- 
sition is the “stage of power maturity,” reached 
when a nation is highly industrial, as the United 
States and western Europe are today. Nations 
in this stage continue to change and to grow in 
wealth, efficiency, and size, but at a slower rate. 
At least, this has been the experience of the 
Western nations that have already reached power 
maturity. Presumably, the rate of economic ad- 
vance will also slacken in the Soviet Union and 
eventually in China and other nations as they 
reach this point, but only the future can supply 
proof of this. 

With power maturity the internal characteristics 
that give a nation power do not disappear, but 
in a race where everyone is running forward one 
may lose simply by slowing down. Power, after all, 
is relative, not absolute. Nations in the third stage 
lose relative power as other nations in the stage of 
transitional growth close the gap between them. 

The effects of automation may give a further 
burst of power to nations in the stage of power 
maturity and allow them to maintain their power 
superiority longer than would otherwise be the 
case, but in the end automation will destroy the 
nation-state and open the way to new and differ- 
ent forms of political organization. 


Effects on the distribution of power 

Had the entire world industrialized at the same 
time and at the same speed, there would have been 
great changes in international relations but no 
necessary major shifts in the distribution of power 


among nations. However, the industrial revolution, 
which began in England two hundred years ago 
and spread slowly through the West, has only re- 
cently swept into eastern Europe and Asia and has 
still to reach the majority of nations in the world. 

The result has been that first one nation and 
then another has experienced a sudden spurt in 
power, as in a race where one runner after another 
goes into a brief sprint. These sudden sprints keep 
upsetting the distribution of power in the world, 
threatening the established international order and 
disturbing world peace. Increased power is con- 
stantly passing into the hands of nations who use 
it to challenge the existing leaders of the inter- 
national community. 

At any given time the nations of the world tend 
to be organized into an “international order,” that 
is to say, a system of relationships that is fairly 
stabilized, with recognized leaders, a recognized 
distribution of power and wealth, and recognized 
rules of trade, diplomacy, and war. Sometimes, as 
during most of the nineteenth century, there is 
only one international order. At other times, as at 
present, there may be two or more competing in- 
ternational orders. 

The dominant international order is headed by 
the most powerful single nation on earth, formerly 
England, today the United States. In the years 
since the industrial revolution the rule of the domi- 
nant nation has been challenged by one newly 
industrialized nation after another. Sometimes the 
challenge has come from within the dominant in- 
ternational order, as when the United States took 
over world leadership from England. Sometimes it 
has come from the leader of a competing inter- 
national order, as in the cases of Nazi Germany 
and the Soviet Union. 

A recurring pattern can be seen in which new 
nations industrialize and experience an accom- 
panying rapid growth in power only to find them- 
selves dissatisfied with the place granted to them 
by the world leaders who industrialized ahead of 
them. When peaceful bids for a redistribution of 
wealth and power prove inadequate, past chal- 
lengers have turned to war. In the past one hun- 
dred years major wars have been started by 
challengers as they approached, but before they 
reached, equal power with those they challenged. 

Peace, then, is most assured when the dominant 
nation and its allies enjoy a huge preponder ane: 


of power over any possible challenger. War is most 
ger and its allies 


likely when the power of a challen| 
approaches equality with that of the world leaders 


who support the status quo. 
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Evaluation 


The major limitation of the concept of power 
transition is that it refers to a period extending 
roughly from 1750 to a time in the future (one 
may guess about 2050) when world-wide indus- 
trialization has been achieved, It does not apply 
to the years before 1750, when no nation was in- 
dustrial, nor does it apply to a future in which all 
nations will possess highly developed economies. 

The theory of the balance of power, on the other 
hand, may be more applicable to the preindustrial 
“dynastic” period, when there were many nations 
of roughly equivalent power, when nations were 
kings who could and did switch sides freely, and 
when nations increased their power primarily 
through clever diplomacy, alliances, and military 
conquests. Thus, each nation’s increase in power 
could be counterbalanced by similar international 
action on the part of its rivals. 

However, it is clear that differential industrial- 
ization has created vast differences in the power of 
nations since the industrial revolution. There no 
longer exist many nations of roughly equal power, 
and it is no longer possible to balance power by 
shifting alliances. In the last two hundred years 
the usual state of affairs has been a vast pre- 
ponderance of power in the hands of one leading 
nation. At most there have occasionally been two 
leading nations of almost equal power. Modern 
nations are not free to make and break alliances 
at will for power considerations (for example, to 
balance world power ), because economic and mili- 
tary interdependence have tied nations together 
into international orders whose membership they 
cannot leave without great domestic as well as 
international changes. 

Furthermore, balance of power situations, his- 
torically, have not aided the maintenance of peace. 
On the contrary, the greatest wars of modern his- 
tory have occurred precisely at times when a chal- 
lenging nation or coalition of nations has most 
nearly reached equal power with the leaders of the 
dominant international order. The great century 
of Pax Britannica from 1815 to 1914 amply illus- 
trates that peace comes with preponderant power, 
not with a balance of power. 

The theory of the power transition, unlike the 
theory of the balance of power, assumes that in- 
dustrial strength is one of the major determinants 
of a nation’s power and that a nation may there- 
fore increase its power greatly through internal 
changes in its economy, i.e., through industrializa- 
tion. By proceeding from this basic assumption 
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the theory of the power transition seems to explain 
some major developments in contemporary inter- 
national politics far better than the outdated 
formulation of balance of power. 

A. F. K. ORGANSKI 


[See also BALANCE OF POWER; INTERNATIONAL POLI- 
TICS; MILITARY POWER POTENTIAL; MODERNIZATION. 
Other relevant material may be found under INTER- 
NATIONAL RELATIONS; PEACE; WaR.] 
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PREDICTION 


In sociological writing the term “prediction” 
means a stated expectation about a given aspect 
of social behavior that may be verified by subse- 
quent observation. Within this general meaning 
the term is used in two principal senses: for de- 
ductions from known to unknown events within a 
conceptually static system and for statements about 
future outcomes based on recurring sequences of 
events, This article is largely restricted to the latter 
sense, although the former usage, which covers 
significant forms of logical reasoning, is widely 
prevalent. 

The estimate of a given variable from one or 
more concurrent variables as in regression analy- 
sis is conventionally referred to as a prediction. 
Similarly, the estimate of a population character- 
istic may be referred to as a prediction, although 
the sample from which the inference is drawn is 
not separated in time from the population that it 
represents. Still more common, especially in the 
writing on social systems, is the designation of the 
term y in the expression “If x, then y,” as a predic- 
tion, even though the x and y are often regarded 
as of simultaneous occurrence. Although such 
usage has been questioned on linguistic grounds, 
it is well established, and its currency is not likely 
to be affected by such arguments. 

In its second principal sense the term “predic- 
tion” refers to assertions about future outcomes 
based on the observed regularities among consecu- 
tive events of the past. This category contains its 
own distinction, depending on whether the state- 
ment holds for a single, concrete instance, with 
due regard for the accidents of time and place, or 
abstractly holds for any case in a class satisfying 


stated conditions. If the statement is concrete and 
necessarily bound to the calendar, it carries the 
label “forecast”; otherwise, when it is not so re- 
stricted, it carries the more general term “predic- 
tion.” According to this distinction, the expected 
volume of crime in the United States in the next 
calendar year would constitute a forecast, whereas 
the expected success of the individual on parole 
under specified conditions would be a prediction. 
Although this terminology is useful for distinguish- 
ing between special and general formulations, it 
has not been consistently applied, even in tech- 
nical writing, and many so-called predictions would 
have to be relabeled as forecasts if the distinction 
were to be strictly maintained. 

Prediction as a social process. At least some 
form of prediction, in the broadest sense of the 
term, is practiced on all levels of culture (Tylor 
1871, chapter 4). The contemporary emphasis in 
sociology, as described below, is thus consistent 
with traditional enterprise, answering to the same 
general purposes but differing in the process by 
which the foreknowledge is obtained. The social 
purpose of prediction, whether of physical or social 
events, is to secure a measure of control over what 
otherwise would be less manageable circumstances. 
The effects of such natural calamities as typhoons 
and floods may be mitigated, if not averted, by 
forehanded preparation; similarly, by stating the 
conditions under which a social upheaval can 
occur, steps may be taken to prevent the occurrence 
of one. Some of the Biblical prophecies were of 
this nature, urging the people to righteousness in 
order to avoid the wrath of God. At times such a 
statement may be more in the nature of a promise 
than a threat, setting forth the conditions to be 
met in order to achieve a desired objective, such 
as an annuity upon retirement. But however they 
differ in meaning, practically all predictions are 
potential instruments of social action, enabling the 
group either to facilitate a favorable outcome or 
to impede an unfavorable one. 

The process of sociological prediction. Although 
all predictions are alike in broad social purpose, 
they differ in the process of their formulation, 
which will be more or less scientific according to 
the nature of the underlying analysis. The process 
of sociological prediction has in varying degree 
those elements common to all scientific prediction: 
some theory of behavior from which deductions 
may be drawn and some factual evidence that is 
relevant to the propositions of the theory. Socio- 
logical prediction has arisen naturally from the 
concerns of sociology itself, both theoretical and 
empirical, and thus sociologists now take predic- 


tion of the forms and processes of social life as 
one of their principal tasks. This commitment has 
its roots in the writings of Auguste Comte and has 
been regularly affirmed by leading representatives 
of the discipline since that time. Max Weber held 
that the purpose of sociology is to predict the pat- 
terns of social interaction, and Albion Small, one 
of the founders of American sociology, took very 
much the same position (1916). 

Although sociological predictions ideally are to 
be drawn from theory, for the most part they have 
been little more than statistical projections based 
on compilations of empirical data within cate- 
gories of perhaps little theoretical significance. But 
such compilations in the form of time series and 
actuarial tables have had their bearing on theory. 
For example, the hypothesis of “cultural lag” (Og- 
burn 1922) was derived in part from the empirical 
growth curve of inventions, and Edwin H. Suther- 
land’s theory of “differential association” has been 
refined on the basis of parole prediction studies 
(see Glaser 1954). In this way the construction of 
statistical trends and experience tables and their 
corresponding projections have had some impact 
on theory, in both extending and recasting it. 
Nevertheless, it is not the statistical materials and 
their manipulation that give sociological prediction 
its special character, for comparable series and 
their analysis are part of the natural sciences; 
rather, it is the underlying categories from which 
sociological predictions are derived. 

Prediction research. Although many sociological 
investigations have a bearing on the predictability 
of social and cultural events, relatively few studies 
have had prediction as their primary goal. For the 
purposes of outlining these more specialized studies 
and citing examples, prediction research will be 
classified here according to whether its focus is the 
collective characteristics of the group or the char- 
acteristics of its constituent members. 

In those studies analyzing the collective aspects 
of the group, the prediction has in some cases 
extended over several classes of events, whereas 


in various others the prediction has been restricted 


to a single outcome. The prediction of a relatively 
best represented 


wide range of events is perhaps € 
by the work of William F. Ogburn, a consistent 
theme of which was the proposition that techno- 
logical trends of the past provide a useful key to 
cultural trends of the future. This idea was de- 
veloped in Ogburn’s work during the 1930s, which 
included reviews of selected social and economic 
trends in the United States from 1900 to 1930 
(President’s Research Committee . - - 1933) and 
a government report on social and economic con- 
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ditions affecting the rate of invention and the 
impact of invention on social life (U.S. National 
Resources Committee, Science Committee 1937), 
and it was later restated in The Social Effects of 
Aviation (Ogburn et al, 1946). 

Not all studies of trends and cycles have been 
at the societal level; some have been concerned. 
with the pattern of interaction and the sequence 
of its development within the small group. The 
work of Bales and his associates (summarized in 
Bales 1959) provides an example of this approach, 
and Bales’s writings contain an assessment of its 
potential for predictive knowledge. 

The prediction of a single social outcome may 
be illustrated by the election forecast, since its 
problems are well defined (Mosteller et al. 1949) 
and its operating procedures are well standardized. 
Such a forecast rests on a succession of carefully 
designed and drawn sample polls taken at regular 
intervals shortly before the election. Based on the 
trend of these results, with due allowance for 
sampling and measurement error, the percentage 
of the vote for each candidate is predicted, and 
thus the probable winning candidate can be named, 
The critical matters bearing on the accuracy of 
such prediction include the correspondence be- 
tween the sampled population and the population 
actually voting on election day as well as the sta- 
bility of the observed trend, at least through the 
day of the election. Notwithstanding these and re- 
lated difficulties, scientific polling agencies have 
been quite successful in predicting the results of 
political elections held in the United States since 
1952. Students of this process have noted that such 
predictions may affect the election itself. 

With minor exceptions, the prediction of indi- 
vidual behavior has been limited to those forms 
of personal adjustment whose variation is thought 
to be largely due to differences in social back- 
ground and circumstance: for example, adjust- 
ment in the armed forces (Star 1950), postwar 
adjustment (Cottrell 1949), adjustment on parole 
(Burgess 1928), and adjustment in marriage (Bur- 
gess & Cottrell 1939). 

The device by which the prediction of personal 
adjustment is usually effected is the experience 
table, which in principle is no different from the 
actuary’s life table that shows probabilities of 
death by age. Similarly, the table of social experi- 
ence gives the odds of success for the several sub- 
classes into which the population has been ar- 
ranged, and it thereby yields a prediction for the 
individual case; obviously, the more nearly the 
probabilities approach zero or one, the more accu- 
rate is the prediction for each person. This method 
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Table 1 — Frequency distribution of 1,000 parolees, by 
prediction score 


Number of men 


Prediction in each class Per cent violators 
score* interval of parole 
16-21 68 1.5 
14-15 140 2.2 

13 91 8.8 
12 106 15.1 
n 110 22.7 
10 88 34.1 
7-9 287 43.9 
5-6 85 67.1 
2-4 25 76.0 


* Score for eoch parolee is the number of factors for which he scored obove 
the group mean. 
Source: Adapted from Burgess 1928, p. 248. 


can be illustrated by a table from Burgess’ early 
study of factors determining success on parole (see 
Table 1); this table, an adaptation of the original, 
shows the percentage of failures (that is, parole 
violators) for each of the classes into which 1,000 
parolees had been grouped according to scores 
based on 21 factors correlated with outcome on 
parole. This table, which may be regarded as 
Stating a set of empirical probabilities, is char- 
acteristic of practically all studies seeking to pre- 
dict the behavior of the individual. 

Recent studies of social adjustment are some- 
what distinctive in their employment of refined 
Statistical methods to differentiate between suc- 
cesses and failures. An important innovation con- 
sists in their attention to the possible discrepancy 
between decisions maximizing predictive accuracy 
and those minimizing social cost. Ohlin (1951) 
discusses the relevance of prediction tables for 
parole selection and analyzes their differing pur- 
poses, and Cronbach and Gleser (1957) provide a 
comparable discussion of personnel selection based 
on psychological test scores. 


Techniques of social prediction 


The derivation of a prediction is usually accom- 
plished by general statistical methods, none of 
which is restricted in its application to social data. 
Despite their general familiarity, these methods 
will be briefly reviewed here, primarily to illustrate 
representative applications in social prediction. 
Although not all statistical predictions are derived 
in the same way, all share the requirement that 
both predictor and criterion variables be subject to 
reliable measurement, Except for this scant refer- 
ence, and despite its importance, the problem of 
measurement will be ignored in the following dis- 
cussion. 

Extrapolation. Roughly speaking, extrapolation 
is the process of predicting a variable from itself — 


for example, predicting the future growth of a 
population from its past growth. By this technique 
it is possible to obtain a succession of expected 
values that are arrayed in the future from least to 
most distant. Such expected values will materialize 
only if the underlying social process or causal sys- 
tem that the curve expresses is constant for the 
period over which the prediction extends. Predic- 
tion by extrapolation will be in error to the degree 
that a given process changes; accordingly, predic- 
tion by this method will generally be more accurate 
for shorter rather than longer durations. An ex- 
ample of extrapolation is provided by Hart’s pre- 
diction of life expectancy on the basis of the trend 
in life expectancy during the last 75 years (1954). 
[For other examples of extrapolation see PREDIC- 
TION AND FORECASTING, ECONOMIC; POPULATION, 
article on POPULATION GROWTH,] 

Correlation methods. Reduced to its lowest 
terms, prediction by correlation is based on meas- 
ured linkages between earlier and later events in 
a given sequence—for example, between scholastic 
performance in high school and that in college, 
between adjustment in childhood and that in mar- 
riage. It goes without saying that such obtained 
relationships will have no predictive utility beyond 
the specific population for which they hold. Thus, 
if the correlation between type of infant feeding 
and social maturity (to take a hypothetical ex- 
ample) obtains only in the middle class, the former 
category will be worthless as a predictor of social 
maturity in the lower class. 

In basing our prediction on correlated events, 
the guiding principle is that the errors of predic- 
tion must be minimized in some well-defined sense. 
To illustrate the application of this principle and 
also to mark points of entry into the pertinent 
literature, we will consider in barest detail the 
most common procedures for predicting a variable 
from a set of attributes or variables and those for 
predicting an attribute from a set of attributes or 
variables. When the criterion consists of more than 
a single element (Hotelling 1935) and/or when 
attributes and variates are employed together as 
predictors (Mannheim & Wilkins 1955), the pro- 
cedures will be more complicated but no different 
in principle. 5 

In predicting a variable from one or more varia- 
bles, the practice is to predict so that the sum of 
the squared errors around the fitted curve is a 
minimum. When the fitted curve is linear, as is 
usually the case, the product-moment coefficient 
of correlation r, or an adaptation of it (partial or 
multiple), serves to gauge the relative accuracy of 
comparable predictions. For example, a succession 


of similar studies of marriage obtained the follow- 
ing correlations between scores considered to be 
prognostic of marital adjustment and scores of 
actual adjustment in marriage: Burgess and Cot- 
trell, .51 (1939); Burgess and Wallin, .50 (1953); 
Terman, .54 (see Terman et al. 1938). 

In predicting a quantitative variable from one 
or more attributes, the rule is to determine the 
predicted values so as to maximize the sum of 
squares between groups and correspondingly to 
minimize the sum of squares within groups, as in 
the analysis of variance; hence, the coefficient of 
intraclass correlation, or an equivalent, may be 
taken as a measure of predictive accuracy. The fact 
that no instance of such a measure appears in the 
bibliography to this article is intended to suggest 
the possibly limited value of attributes for purposes 
of predicting quantitative variables. 

Where the prediction is from one or more at- 
tributes to a single attribute, the rule is to pre- 
dict repeatedly whichever attribute has the larg- 
est subclass frequency, or conditional probability 
(Guttman 1941). The accuracy of such prediction 
will vary according to the difference between the 
conditional and marginal probabilities. Since meas- 
ures of association such as C, T, or phi reflect that 
difference, they have been widely used to gauge 
the predictive accuracy of attributes. The examples 
given in Table 2, which are from one of the 
Gluecks’ early studies of recidivism, will serve to 
illustrate the form and usual magnitude of such 
coefficients. 

Finally, where the prediction is from one or 
more variables to a single attribute the preferred 
procedure is to derive from the variables a com- 
posite variable that will yield the least overlap 
among the several within-class distributions and 
hence the least error in prediction. When the com- 
posite variable is a linear function of its com- 
ponents, the procedure is termed a discriminant 
function (Fisher 1936). The experience table for 
predicting recidivism among Borstal (i.e., reform 
school) lads (Mannheim & Wilkins 1955) was 
based on this method and is indicative of both its 
value and limitations; Kirby (1954) has also used 
this method, in a parole prediction study. 

Table 2 — Correlation between recidivism and selected 
social traits 


Coefficient of 
Social troit contingency (C) 
Mental condition P 
Work habits 
'ork hal 3 


Economic responsibility 


Source: Adapted fror 
p. 135. 


m Glueck & Glueck 1937, 
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Markov chains. Although the Markov chain, as 
well as the general class of stochastic processes to 
which it belongs, is covered in most standard ref- 
erences on probability (Feller 1950-1966), its po- 
tentialities for sociological prediction have only 
recently been explored, and these efforts have been 
largely, if not exclusively, limited to the fitting of 
historical data to theoretical chains. Although such 
materials indicate whether conditions as observed 
at time t, might have been accurately predicted by 
Markov methods beginning at time t, they contain 
no demonstration that a given process will repeat 
itself indefinitely and therefore give no indication 
whether it may be confidently used for predictive 
purposes, This remark, however, should not be 
construed as a criticism of the studies cited illus- 
tratively below, which purported to be more sug- 
gestive than conclusive. 

The feasibility of predicting political attitudes 
has been considered by Anderson (1954) in a sec- 
ondary analysis of panel data. His particular con- 
cern was with the probability that a person would 
hold the same political opinion at the end of a se- 
quence that he held at the start. To determine 
whether such probabilities might be attained by 
Markovian methods, a comparison was drawn be- 
tween distributions based on opinions expressed 
by a panel of voters in each of the six months im- 
mediately preceding the U.S. presidential election 
of 1940 and those expected on the basis of proba- 
bility theory. Although the findings of this par- 
ticular analysis were inconclusive, they do hint at 
the potentialities of the Markov chain as a device 
for analyzing the process of attitude change and 
as a tool for social prediction. 

The correspondence between actual patterns of 
Jabor mobility and those obtained by treating the 
movement of workers as a Markov process has 
been examined by Blumen, Kogan, and McCarthy 
(1955). Although this investigation was concerned 
primarily with the dynamics of labor mobility, it 
has considerable relevance for prediction in its 
emphasis on statements expressing the probability 
that a worker in a given industrial group will be 
in that same group after k intervals of time. Apart 
from its substantive value for industrial sociology, 
this study is constructively important as a demon- 
stration of the potential utility of Markov theory 
in the prediction of mobility, both social and geo- 


graphical. 


Current problems 

During the past fifty years of American soci- 
ology, the problems, or problematics, of prediction 
have received at least as much attention as the 
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predictions themselves, which, as suggested in the 
foregoing discussion, have been relatively few in 
number and circumscribed in content. The range 
of these problems is reflected in the kinds of issues 
that have been regularly debated in the sociological 
journals. Some of these issues are briefly pre- 
sented here, 

Actuarial approach versus case method. The 
issue of the actuarial approach versus the case 
method has arisen in the prediction of personal 
adjustment. This issue has two parts: whether it 
is possible to frame a prediction solely from case 
materials in total disregard of all probabilities, and 
whether such prediction, when there is no explicit 
reference to actuarial materials, is more accurate 
than that based on statistical averages, or rates. 
Most serious students of prediction would answer 
“no” to the first question and “possibly under some 
circumstances” to the second. 

Prediction that claims to be wholly devoid of 
probabilities is regarded as logically impossible by 
some students (Social Science Research Council 
1941). Briefly put, their argument is: notwithstand- 
ing the claims of the predictor to the contrary, the 
predicted case will be treated as a class member 
or located in a risk table although that table may 
be wholly based on the experience of the fore- 
caster and may exist only in his mind. That the 
process is subjective and personal does not alter 
its essential nature as probabilistic; therefore—so 
runs the argument—all prediction is actuarial and, 
correspondingly, no prediction is wholly free of 
uncertainty. 

The second answer, “possibly under some cir- 
cumstances,” shows a recognition that predictions 
from extensive case materials may in some in- 
stances be more accurate than those based on 
group averages, by virtue of the analyst’s excep- 
tional ability to assign cases to risk categories 
having probabilities very close to zero or one. How- 
ever, such idiosyncratic accuracy embodies “pro- 
cedures” that cannot be readily codified and trans- 
mitted to the public, as would be required of 
scientific methods. Hence, prediction from case 
materials, no matter how accurate, must be re- 
garded as an expression of clinical insight and 
judgment rather than as the application of scien- 
tific law. 

Prediction as feedback. Despite the antiquity 
of the idea that a stated prediction may affect its 
own fulfillment (Popper 1957) and despite the 
relevance of this idea to social prediction, few em- 
pirical studies have sought to measure the influ- 
ence of predictions on subsequent events. However, 
an illustrative literature has emerged, including 


references to rumor, stereotyped expectation, the 
election forecast, and false prophecy (see espe- 
cially Merton 1948; Simon 1954; Festinger et al. 
1956). Students of voting behavior have noted the 
possible effects of a computer prediction of the 
election of specific candidates based on national 
election results in an earlier time zone on the pat- 
tern of voting in a later time zone. 

Such materials lend themselves to systematic 
classification according to whether the outcome is 
favorable or unfavorable and according to whether 
the effect of prediction is positive or negative. Thus, 
the prognosis that the patient will recover may 
generate either a confidence that will hasten re- 
covery or an overconfidence that will delay it; on 
the other hand, an unfavorable prognosis of chronic 
illness may create either an attitude of resignation 
or an attitude of defiance, with differing effects. 
Instances of this kind support the common opinion 
that by reason of the symbolic nature of human 
life, social prediction is reflexive and that in con- 
sequence the validity of social prediction is subject 
to greater uncertainty than physical prediction. 

Efficiency of social prediction. Broadly speak- 
ing, the concept of the efficiency of prediction 
refers to the accuracy of a given method of predic- 
tion relative to that of an alternative that is taken 
as a standard or norm (Reiss 1951). If, for exam- 
ple, the chosen standard produces 20 errors per 
100 trials and the alternative in question produces 
only 10, then the alternative may be said to be 
twice as efficient as the standard. 

Usually the predictions compared are that de- 
rived from the joint distribution of a criterion and 
one or more predictors and that derived from the 
distribution of the criterion alone. Thus, given a 
male delinquency rate of 20 per cent, the predic- 
tion of nondelinquency on repeated trials would 
carry an error rate of 20 per cent; if, after intro- 
ducing social class as a predictor, the error rate is 
reduced to 10 per cent, then the prediction from 
the two-way classification may be said to be twice 
as efficient as prediction from the single classifica- 
tion of children as delinquent or nondelinquent. 

By and large, the measured efficiency of social 
prediction has been relatively low. For example, 
in criminological studies, where the identification 
of the prospective delinquent or recidivist would be 
of considerable practical importance, the gains 
resulting from the introduction of information 
thought to bear on such behavior have been neg 
ligible (Schuessler 1954). To achieve substantial 
gains in efficiency it will be necessary to identify 
and measure those variables that are closely corre- 
lated with the criterion. 


Explanation versus prediction. Although ex- 
planation and prediction are not irreconcilable or 
even rival alternatives, there are differences of 
opinion about which of these should receive the 
greater emphasis. In general, this disagreement re- 
flects the difference between a preference for the- 
ory and an emphasis on factual research. 

In this disagreement, those placing greater em- 
phasis on explanation usually argue as follows: 
Predictions (hypotheses) may be deduced from a 
general explanation (theory), but a collection of 
statistical predictions, no matter how accurate, does 
not constitute a theory; hence, explanation should 
be the first order of business. Moreover, obtained 
statistical regularities may be an accident of time 
or place and hence an unstable basis for prediction, 
whereas a scientific law is universal and therefore 
an unerring source of prediction. According to this 
line of reasoning, the poor showing of prediction 
in sociology is a reflection of crudities in general 
theory. Thus, the development of a set of valid 
explanations is considered to be a prerequisite for 
the improvement of predictions. 

Those putting greater stress on prediction would 
probably not disagree with the premise of the fore- 
going argument, although they would emphasize 
the interplay between statistical association and 
theoretical explanation and the steady impact of 
the former on the latter. Furthermore, they would 
hold that if they are to frame hypotheses and to 
test them, they have no choice but to operate prag- 
matically within existing theory, reforming it as 
they proceed. Probably the rank and file of Amer- 
ican sociologists are “agnostics” with regard to this 
issue, excepting those whose major interest lies in 
the philosophy of social science. 

Limits of prediction. Another issue is the gen- 
eral question whether some classes of social events 
are inherently unpredictable. Although this ques- 
tion is valid, it necessarily admits of only a specu- 
lative answer. The judgment that the “future course 
of human affairs is unpredictable” (Toynbee 
1934-1961, vol. 12) has to do with the broad ques- 
tion of cultural history or evolution and has little 
to do with prediction as a specialty within sociol- 
ogy. In general, sociologists have been primarily 
concerned with problems of much smaller com- 
pass: land use, migration, suburban growth, fertil- 
ity rates of human populations, rates of assimila- 
tion, patterns of racial violence, and political 
movements, Nevertheless, the importance of pre- 
dicting social mutations as well as recurring 
phenomena has been increasingly empnesi s 
sociological writings (€-8- Moore 1964), and it is 


to be anticipated that this problem will be studied 
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empirically in the coming decades. Such factual 
studies will enable sociologists to set provisional 
limits to the range of social prediction (Bell 1965), 


In conclusion, it should be noted that prediction 
and sociology have always been closely linked: 
sociology grew out of a concern with prediction 
(Comte’s savoir pour prévoir) and has always had 
the securing of predictive knowledge as one of its 
express aims. However, relatively few empirical 
studies have been specifically concerned with pro- 
ducing it. Thus, the importance of recent predic- 
tion studies lies as much in the methodological 
understandings that have grown out of them as in 
the substantive findings themselves. As these un- 
derstandings become systematized and gain in cur- 
rency, a wider variety of factual investigations will 
probably appear as a sequel to the pioneering work 
of the last several decades. 

KARL F, SCHUESSLER 


[Directly related are the entries CAUSATION; LIFE TA- 
BLES; MARKOV CHAINS; PREDICTION AND FORECAST- 
ING, ECONOMIC. Other relevant material may be 
found in LINEAR HYPOTHESES; MULTIVARIATE ANAL- 
YSIS; PENOLOGY, article on PROBATION AND PAROLE; 
VorinG; and in the biographies of COMTE; OGBURN; 
SMALL; WEBER, MAX.] 


BIBLIOGRAPHY 

ANDERSON, T, W. 1954 Probability Models for Analyz- 
ing Time Changes in Attitudes, Pages 17-66 in Paul 
F, Lazarsfeld (editor), Mathematical Thinking in the 
Social Sciences, Glencoe, Il.: Free Press. 

Bares, Rosert F. 1959 Small-group Theory Research. 
Pages 293-305 in American Sociological Society, So- 
ciology Today, Edited by Robert K. Merton, Leonard 
Broom, and Leonard S. Cottrell. New York: Basic 
Books. 

Bert, DanwL 1965 Twelve Modes of Prediction. Pages 
96-127 in Julius Gould (editor), Penguin Survey of 
the Social Sciences 1965. Baltimore: Penguin. 

BLUMEN, Isavonx; Kocan, Marvin; and MCCARTHY, Pur 
1955 The Industrial Mobility of Labor as a Prob- 
ability Process. Cornell Studies in Industrial and 
Labor Relations, Vol. 6. Ithaca, N.Y.: Cornell Univ. 
Press. 

Borcarra, Encar F.; and Westory, CHARLES F. 1954 
The Prediction of Total Fertility. Milbank Memorial 
Fund Quarterly 32:383-419. 

BowERrMAN, CHARLES E. 1964 Prediction Studies. Pages 
215-246 in Harold T. Christensen (editor), Handbook 
of Marriage and the Family. Chicago: Rand MeNally. 

Burcess, Ernest W. 1928 Factors Determining Success 
or Failure on Parole. Pages 203-249 in Illinois, Com- 
mittee on Indeterminate-sentence Law and Parole, 
The Workings of the Indeterminate-sentence Law and 
the Parole System in Illinois. Springfield, T.: Divi- 
sion of Pardons and Paroles. 

Burcess, Ernest W.; and COTTRELL, Lronarp 8. 1939 
Predicting Success or Failure in Marriage. New York: 


Prentice-Hall. 


424 PREDICTION 


Burcess, ERNEST W.; and WALLIN, PauL 1953 Engage- 
ment and Marriage. Philadelphia: Lippincott. 

CANTRIL, HADLEY 1938 The Prediction of Social Events. 
Journal of Abnormal and Social Psychology 33:364— 
389. 

Cuapin, Francis S. (1947) 1955 Experimental Designs 
in Sociological Research. Rev. ed. New York: Harper. 

CLAUSEN, JoHN A. 1950 Studies of the Postwar Plans of 
Soldiers: A Problem in Prediction. Pages 568-708 in 
Samuel A. Stouffer et al., Measurement and Predic- 
tion. Studies in Social Psychology in World War II, 
Vol. 4. Princeton Univ. Press. 

Corrrett, LedNarp S. 1949 The Aftermath of Hostili- 
ties. Pages 549-595 in Samuel A, Stouffer et al., The 
American Soldier. Studies in Social Psychology in 
World War II, Vol. 2. Princeton Univ. Press. 

Cronsacu, LEE J.; and Greser, GoLDINE C. (1957) 1965 
Psychological Tests and Personnel Decisions, 2d ed. 
Urbana: Univ. of Illinois Press. 

DottarD, Joun 1948 Under What Conditions Do Opin- 
ions Predict Behavior? Public Opinion Quarterly 12: 
623-632. 

Duncan, Oris Duptey et al. 1953 Formal Devices for 
Making Selection Decisions. American Journal of So- 
ciology 58:573-584. 

FELLER, WILLIAM 1950-1966 An Introduction to Prob- 
ability Theory and Its Applications, 2 vols. New York: 
Wiley. + A second edition of the first volume was 
published in 1957. 

Festincer, LEON; RIECKEN, H. W.; and SCHACHTER, STAN- 
LEY 1956 When Prophecy Fails. Minneapolis: Univ. 
of Minnesota Press. 

Fisuer, R. A. 1936 The Use of Multiple Measurements 
in Taxonomic Problems. Annals of Eugenics 7:179- 
188. 

GLAsER, DanieL 1954 A Reconsideration of Some Parole 
Prediction Factors. American Sociological Review 
19:335-341. 

GLueEck, SHELDON; and GLUECK, ELEANOR 1937 Later 
Criminal Careers. New York: Commonwealth Fund. 

GLUECK, SHELDON; and GLUECK, ELEANOR 1959 Pre- 
dicting Delinquency and Crime. Cambridge, Mass.: 
Harvard Univ. Press. 

Goopman, Leo 1952 Generalizing the Problem of Pre- 
diction. American Sociological Review 17:609-612. 

Guttman, Louis 1941 Mathematical and Tabulation 
Techniques. Pages 251-364 in Social Science Research 
Council, Committee on Social Adjustment, The Pre- 
diction of Personal Adjustment, by Paul Horst et al. 
New York: The Council. 

Hart, Hornett 1954 Expectation of Life: Actual Ver- 
sus Predicted Trends. Social Forces 33:82-85. 
HoreLLING, Haroip 1935 The Most Predictable Crite- 

rion. Journal of Educational Psychology 26: 139-142. 

Kaptan, A.; Sxocsrap, A. L.; and GirsHICK, M. A. 1950 
The Prediction of Social and Technological Events. 
Public Opinion Quarterly 14:93-110. 

Krrpy, BERNARD C. 1954 Parole Prediction Using Mul- 
tiple Correlation. American Journal of Sociology 59: 
539-550, 

Lavin, Davin E. 1965 The Prediction of Academic Per- 
formance: A Theoretical Analysis and Review of Re- 
search. New York: Russell Sage Foundation. 

LAZARSFELD, Paur F.; and FRANZEN, RAyYMonD H. 1945 
Prediction of Political Behavior in America. American 
Sociological Review 10:261-273. 


McCormick, THomas C. 1952 Toward Causal Analysis 
in the Prediction of Attributes. American Sociological 
Review 17:35-44. 

MANNHEIM, HERMANN; and WiLkins, Leste T. 1955 
Prediction Methods in Relation to Borstal Training, 
London: H.M. Stationery Office. 

MEEHL, PauL E. 1954 Clinical Versus Statistical Pre- 
diction. Minneapolis: Univ. of Minnesota Press. 
Merton, Rosert K. (1948) 1957 The Self-fulfilling 
Prophecy. Pages 421-436 in Robert K. Merton, Social 
Theory and Social Structure. Rev. & enl. ed. Glencoe, 

Ill.: Free Fress. 

Moore, WILBERT E. 1964 Predicting Discontinuities in 
Social Change. American Sociological Review 29:331- 
338. 

MOSTELLER, FREDERICK et al. 1949 The Pre-election 
Polls of 1948. New York: Social Science Research 
Council. 

Ocpurn, WiLLIaM F. (1922) 1950 Social Change, With 
Respect to Culture and Original Nature. New edition 
with supplementary chapter. New York: Viking. 

Ocsurn, WiLLiaMm F. 1934 Studies in Prediction and 
the Distortion of Reality. Social Forces 13:224-229. 

Ocsurn, WiLLIaM F.; ApAMs, JEAN L,; GILFILLAN, S. C. 
1946 The Social Effects of Aviation. Boston: Hough- 
ton Mifflin. > Includes an extended discussion of the 
methodology of prediction and a selected bibliography. 

Onin, Lroyp E. 1951 Selection for Parole. New York: 
Russell Sage Foundation. 

Popper, Kart R. 1957 The Poverty of Historicism. 
Boston: Beacon, 

PRESIDENT'S RESEARCH COMMITTEE ON SociAL TRENDS 
1933 Recent Social Trends in the United States. 2 
vols. New York: McGraw-Hill. > Incorporates the re- 
sults of thirty studies of selected social and economic 
trends in the United States from 1900 to 1930. 

Reiss, ALBERT J. JR. 1951 The Accuracy, Efficiency, and 
Validity of a Prediction Instrument. American Journal 
of Sociology 56:552-561. 

Sawyer, Jack T. 1966 Measurement and Prediction, 
Clinical and Statistical. Psychological Bulletin 66: 
178-200. 

SCHUESSLER, Kart F. 1954 Parole Prediction: Its His- 
tory and Status. Journal of Criminal Law, Criminol- 
ogy, and Police Science 45:425-431. 

Simon, HERBERT A. (1954) 1957 Bandwagon and Un- 
derdog Effects of Election Predictions. Pages 79-87 in 
Herbert A. Simon, Models of Man: Social and Ra- 
tional; Mathematical Essays on Rational Human Be- 
havior in å Social Setting. New York: Wiley. 

SMALL, ALBION W. 1916 Fifty Years of Sociology in 
the United States (1865-1915). American Journal of 
Sociology 21:721-864, 

SoctaL SCIENCE RESEARCH COUNCIL, COMMITTEE ON So- 
CIAL ADJUSTMENT 1941 The Prediction of Personal 
Adjustment, by Paul Horst et al. New York: The 
Council. 

Sorokin, Prrmim A. (1941) 1962 Social and Cultural 
Dynamics. Volume 4: Basic Problems, Principles and 
Methods. Totowa, N.J.: Bedminster Press. 

Srar, SHIRLEY A. 1950 The Screening of Psychoneu- 
rotics in the Army: Technical Development of Tests. 
Pages 486-567 in Samuel A. Stouffer et al., Measure- 
ment and Prediction. Studies in Social Psychology in 
World War II, Vol. 4, Princeton Univ. Press. 

Terman, L. M. et al. 1938 Psychological Factors in 
Marital Happiness. New York: McGraw-Hill. 


PREDICTION AND FORECASTING, ECONOMIC 425 


THORNDIKE, EDWARD L. et al. 1934 Prediction of Voca- 
tional Success. New York: Commonwealth Fund. 
TOYNBEE, ARNOLD J. 1934-1961 A Study of History. 

12 vols. Oxford Univ. Press. 

TyLor, Epwarp B. (1871) 1958 Primitive Culture: Re- 
searches Into the Development of Mythology, Philos- 
ophy, Religion, Art and Custom. Volume 1: Origins 
of Culture. Gloucester, Mass.: Smith. 

U.S. NATIONAL RESOURCES COMMITTEE, SCIENCE COMMIT- 
TEE 1937 Technological Trends and National Pol- 
icy, Including the Social Implications of New Inven- 
tions. Washington: Government Printing Office. 

U.S. NATIONAL RESOURCES COMMITTEE, SCIENCE COMMIT- 
TEE 1938 The Problems of a Changing Population. 
Report of the Committee on Population Problems, 
Washington: Government Printing Office. 

WesTorr, CHARLES F.; Sacı, Pamir C.; and KELLY, E. 
LowELL 1958 Fertility Through Twenty Years of 
Marriage: A Study in Predictive Possibilities. Ameri- 
can Sociological Review 23:549-556. 


PREDICTION AND FORECASTING, 
ECONOMIC 


A forecast can be defined generally as a state- 
ment about an unknown and uncertain event— 
most often, but not necessarily, a future event. 
Such a statement may vary greatly in form and 
content: it can be qualitative or quantitative, con- 
ditional or unconditional, explicit or silent on the 
probabilities involved. A reasonable requirement, 
however, is that the forecast should be verifiable, at 
least in principle; trivial predictions that are so 
broad or vague that they could never be found in- 
correct merit no consideration, and the same ap- 
plies to predictions that are rendered meaningless 
by relying on entirely improbable assumptions or 
conditions. As for the “event” that is being predict- 
ed, it too is to be interpreted very broadly. Thus 
the forecast may refer to one particular or several 
interrelated situations (single versus multiple pre- 
dictions). It may identify a single value or a range 
of values likely to be assumed by a certain variable 
(point versus interval predictions). Various combi- 
nations of these categories are possible, and some 
are interesting; for example, an unconditional in- 
terval prediction can be viewed as a set of condi- 
tional point predictions (that is, the forecaster esti- 
mates the range of probable outcomes by setting 
limits to the variation in the underlying condi- 
tions). i 

In principle, the unknown event that is the a 
get of a prediction could pertain to the past or the 
present, but it is the future that is of primary con- 
cern to the forecaster. Information about the past 
and present is often incomplete and inadequate, 
but it is as a rule far richer and firmer than what- 


ever may be “knowable” about the future. In fact, 
inferences from the past are the sole basic source 
for the expectations of events that are beyond the 
forecaster’s control. And the quality of these ex- 
pectations (forecasts) is clearly a major determi- 
nant of the quality of plans and decisions that 
refer to factors over which the maker or user of 
the forecast does have substantial control, 
Economic forecasts refer to the economic aspects 
of unknown events. Looking into the future, these 
predictions may be classified into short-run (with 
spans or distances to the target period of up to one 
or two years), intermediate (two to five years), 
and long-term (relating to more persistent devel- 
opments or distant occurrences ). Forecasts in all 
these categories are made for purposes of business 
planning, of aiding economic policies of govern- 
ments, and of testing generalizations of economic 
theory. It is true that business forecasts deal largely 
with the near future, because this is both what is 
most needed and what stands a better chance of 
relative success in the reduction of avoidable busi- 
ness risks. Nonetheless, in certain areas, such as 
planning new industrial plant construction or 
acquisition of new businesses, prediction of rather 
long developments is needed, and lately business 
forecasters seem to have grown bolder in under- 
taking to project long trends in the economy. With 
the recent emphasis on growth objectives, long- 
range forecasts are also gaining ground as tools of 
governmental planning and decision making. 
Important generalizations of economic theory 
typically imply qualitative conditional predictions; 
for example, the “law of demand” predicts that a 
decline in the price of a good will lead consumers 
to purchase more of that good, In the history of 
economic thought one also finds another type of 
prediction, in which the author presents as a fore- 
cast of things to come what is essentially an em- 
pirical hypothesis based on assumptions that may 
only have been valid at the time or may be ques- 
tioned altogether. Predictions of secular develop- 
ments by the classical economists provide several 
major examples, such as the law of historically 
diminishing returns and the Malthusian population 
principle. (Marx's projections of a falling rate of 
profit and increasing pauperization and crises be- 
long in the same logical category.) History dealt 
harshly with some of these prognostications, while 
many others were left untested by events and 
must often be viewed as inconclusive or lacking in 
present interest. In any case, these so-called evo- 
lutionary laws and other predictions of such a 
general nature are not included in the subject 
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matter of this article, which will concentrate on 
more limited, specific, and, in particular, quanti- 
tative forecasts. 


Historical background 


It is clear from the preceding that prediction in 
the general sense has always been one of the prod- 
ucts of economic thought; indeed, the ideal aim of 
any scientific generalization is to establish regu- 
larities or relationships that would hold not only 
for the past but also for the future observations 
of the same phenomena. Specific quantitative pre- 
diction, however, is of a much more recent origin 
in economics, having had to await the development 
of empirically oriented research and its statistical 
and mathematical tools. Two contributions of en- 
during influence in this development were Ernst 
Engel’s analysis of cross-section data on workers’ 
household budgets, which appeared in 1883, and 
Clément Juglar’s study of time series data on prices 
and finance, which appeared in 1862 and intro- 
duced the idea of observable “cycles” in business 
activity. 

Growing interest in the persistent and disturb- 
ing phenomenon of business cycles gave consider- 
able impetus to the collection and analysis of a 
variety of economic time series. A succession of 
significant studies of business cycles, with fre- 
quent references to historical and statistical mate- 
rials, appeared between 1898 and 1925: works by 
Wicksell, Tugan-Baranovskii, Aftalion, Spiethoff, 
and Schumpeter in Europe, and by Mitchell in 
the United States. In the course of his later work 
on business cycles, Mitchell developed a strong 
(though healthily skeptical) concern about the 
possibilities of predicting the near-term fortunes 
of the economy. His 1938 paper on statistical in- 
dicators of cyclical revivals, written with Arthur F. 
Burns, initiated a series of several studies by the 
National Bureau of Economic Research (NBER), 
which produced tools that have recently been 
widely used in practical forecasting. [See the biog- 
raphy of MITCHELL.] 

Another flow of important contributions to the 
present-day techniques of economic forecasting 
had its source in the development of new methods 
of statistical inference, which were first applied in 
the physical and biological sciences and soon at- 
tracted the attention of those interested in social 
and economic data. Early illustrations of such work 
are found in Henry L. Moore's studies on economic 
cycles and forecasting, which appeared in 1914 
and 1917. Moore’s work stimulated the use of re- 
gression methods in forecasting prices and produc- 


tion of individual (particularly agricultural) com- 
modities. Irving Fisher’s major achievements in 
monetary economics, index numbers, the study of 
distributed lags, etc., have long been acknowledged 
as early models of what came to be known as 
econometric research. Other pioneers in this ap- 
proach include Paul H. Douglas (production func- 
tions, wages), Henry Schultz (demand functions), 
Ragnar Frisch (marginal utility measurement), 
Charles F. Roos (automobile and housing demand), 
and Jan Tinbergen (statistical tests of business 
cycle theories). Modern analysts and forecasters 
who use econometric models clearly owe a major 
debt to the work of these men. 

More directly concerned with forecasting of 
short-term changes in general business conditions 
were the efforts of Warren M. Persons (1931) and 
the Harvard University Committee on Economic 
Research to identify time series that tend to move 
cyclically and would therefore help to anticipate 
what was ahead for the economy. The result was 
the Harvard Index Chart, consisting of three 
curves: (A) speculation (stock prices); (B) busi- 
ness (wholesale prices, later bank debits); and 
(C) money market (short-term interest rates). The 
Harvard chart was first published in 1919 for the 
last decade before World War 1, then extended 
back to 1875; but the first extensive use of the 
underlying relationship was made by one of the 
early commercial business forecasting services, 
J. H. Brookmire, in 1911. The A-B-C sequence was 
found to have occurred with substantial regularity 
in the period before World War 1. In the 1920s, 
the three-curve barometer proved less consistently 
successful, and it was considerably modified and 
often used in combination with other statistical 
devices. The Harvard service did not survive the 
great depression, although the index was published 
periodically in the Review of Economic Statistics 
until 1941. However, a recent unpublished evalu- 
ation shows that the sequence underlying the 
Harvard ABC curves persisted over a long period, 
not only in the interwar years but also in the 
post-World War u years, The Brookmire—Harvard 
method, for all its shortcomings, deserves to be 
acknowledged as an early forerunner of the pres- 
ent indicator techniques and also as a highly in- 
fluential factor in the evolution of business cycle 
forecasting in Europe during the interwar period. 

For men of affairs, prediction in some (not nec- 
essarily explicit) form must have always been 
unavoidable, Most decisions made in business and 
many decisions made in government imply oF 
follow some forecasts of economic conditions. Eco- 
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nomic and business forecasting as a specialized 
activity, however, is a relatively recent phenome- 
non which made its appearance largely in the 
twentieth century and developed rapidly only after 
World War 1 and, particularly, after World War 1. 
In the latter postwar period forecasts have become 
both more abundant and more ambitious than ever 
before, Increasingly, forecasts of such compre- 
hensive aggregates as the gross national product 
(GNP) and its major components, industrial pro- 
duction, and total employment are being made in 
numerical form not only for the next year but often 
also over a sequence of short periods, say the next 
four or six quarters. The spread of such predictions 
was stimulated by two recent developments: the 
rise of active interest in the application of macro- 
economic theory, and the corresponding accumu- 
lation and improvement of aggregative data. The 
former can be attributed largely to the intensive 
work done by economists in the last thirty years— 
since Keynes’s General Theory—on problems in 
the determination of aggregate income, employ- 
ment, and the price level. The latter goes back to 
the development of concepts and data on national 
income accounts by Simon Kuznets and others at 
the NBER and in the government statistical agen- 
cies, Very recently, the rapid growth of electronic 
computer technology has greatly accelerated the 
rate at which economic data (the raw materials 
for the forecaster) are compiled and processed. 
The same factor also had some more direct effects 
—without the computer, for example, the large- 
scale econometric models could not have been pro- 
duced, and hence output of forecasts of the econo- 
metric variety would have been severely limited. 


Types and methods of forecasting 

It is instructive to classify forecasts by several 
different criteria. First, one may distinguish be- 
tween forecasts that are based solely on the past 
and current values of the variable to be predicted 
and forecasts that rely on postulated or observed 
relationships between the variable to be predicted 
and other variables. The former type of forecast 
will be referred to as an extrapolation. Second, the 
degree to which formalized methods are used es- 
tablishes at least in principle a whole gradation 
ranging from informal judgment forecasts to pre- 
dictions based on fully specified and strictly im- 
plemented econometric models. And third, one 
may distinguish between forecasts constructed by 
a single source, which could be an individual or 
a team (say, the staff of a business, Or government 
agency), and forecasts derived as weighted or un- 


weighted averages of different predictions made by 
a few or many individuals or organizations (the 
latter category includes opinion polls, surveys of 
businessmen’s anticipations, ete. ). 

These various classes of forecasts overlap and 
can be combined in various ways. For example, a 
forecast of next year’s GNP and its major com- 
ponents by a business economist may consist of 
any or all of the following ingredients: (1) extrap- 
olation, of some kind, of the past behavior of the 
given series; (2) relation of the series to be pre- 
dicted to known or estimated values of some other 
variables; (3) other external information consid- 
ered relevant, such as a survey of investment in- 
tentions or a government budget estimate; and 
(4) the judgment of the forecaster. Also, it should 
be noted that a group forecast, say an opinion poll, 
will incorporate as many different techniques as 
are used by the different respondents. Any single 
classification of forecasting methods that cuts 
across the different criteria is likely to have only 
limited application to actual forecasts, which 
are built much more often on a combination of 
methods than on one particular technique. 

Increasingly, starting with the early post-World 
War 1 forecasts, the analysis of factors affecting 
the course of the economy is being carried on in 
terms of the major components of GNP. [See Na- 
TIONAL INCOME AND PRODUCT accounts.] This 
framework is thought to have the advantages of 
(a) ensuring that none of the main components 
of “aggregate effective demand” will be overlooked 
in the forecast, since they are all represented in 
the expenditure categories of the GNP accounts; 
(b) steering the forecaster to think in terms of the 
basic determinants of spending decisions by con- 
sumers, business, government, and foreign buyers; 
and (c) providing some safeguards that the vari- 
ous parts of the forecast, being constructed accord- 
ing to the internally consistent GNP system, will 
not be inconsistent with each other. However, there 
are also some disadvantages in an exclusive reli- 
ance on data such as those for GNP, which are 
highly aggregative, subject to frequent and often 
substantial revisions, and available only annu- 
ally (with a relatively short dependable historical 
record) and quarterly (for the recent years). 

A few mechanical forecasting techniques that 
were once fairly popular can be quickly disposed 
of as having little scientific basis or empirical 
soundness. For example, the method that assumes 
periodic cycles of given duration founders on the 
fact that cycles in business activity are far from 
being strictly periodic. The device of equalizing 
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the areas below and above trend, or predicting 
that a business index will turn once it deviates 
from the “normal” by some critical amount, is un- 
likely to score well because trends are difficult 
to determine on a current basis and need not have 
the connotation of “normal” or “equilibrium” levels 
even when they are satisfactorily identified. 

Judgment, in the broad sense of the word, is, 
of course, a necessary ingredient of all types of 
prediction: the forecaster must “judge” what in- 
formation and methods of analysis to use and how 
to interpret and evaluate the results. Thus, judg- 
ment is bound to enter the forecasting process at 
various stages, but its proper role is to be a com- 
plement to, not a substitute for, a competent eco- 
nomic and statistical analysis, Informed judgment 
can go far in making forecasts more consistent 
and dependable, while pure guesswork can only 
rely on luck for success. It is well to recognize, 
however, that the ability to reach “good judgments” 
is not a well-defined, technical, transferable skill 
but rather a puzzling function of personal talent, 
experience, and training. 

Many economic forecasts, particularly from 
business sources, are not based on formal models 
and do not disclose the underlying assumptions 
and methods. Some are likely to be little more than 
products of intuition, yet most seem presently to 
take the specific form of numerical point forecasts. 
However, there is no general presumption that the 
informal judgmental forecasts are largely hunches; 
on the contrary, at least the better ones among 
them originate in the application of various ana- 
lytical techniques as well as judgment to diverse 
and substantial bodies of information. 

Judgmental inferences from data samples and 
from any other evidence that the forecaster may 
have and regard as pertinent involve probability 
distributions. It would be informative and helpful 
for the appraisal of predictions if forecasters stated 
the odds they attach to the expected outcomes 
(the practice is frequent in weather forecasts, for 
example). The step from interval to distribution 
forecasting is, in principle, short. However, prob- 
abilistic distribution predictions appear to be re- 
grettably rare in practical business and economic 
forecasting. 

After eliminating those forecasting procedures 
that have little relevance currently, the following 
seem to merit further discussion: (1) extrapola- 
tive techniques; (2) surveys of intentions or an- 
ticipations; (3) business cycle indicators; and 
(4) econometric models. Typically, forecasters are 
using one or another of these approaches, or more 
likely some combination of them, for the most part 


tempering their results with considerable doses of 
“judgment.” 

Extrapolations. The term “extrapolations” will 
be used here as a shorthand expression for “ex- 
trapolations of the past behavior of the series to 
be predicted.” (The term is often applied in a 
broader sense to include projections of relation- 
ships between different variables, but such fore- 
casts are more conveniently discussed under other 
headings—“econometric models” and to some ex- 
tent also “business cycle indicators.” To be sure, 
extrapolative elements can be combined with the 
others, as in an econometric model in which, say, 
x, is associated with both x; and y,.,.) Being re- 
stricted to the history of just one variable or proc- 
ess, extrapolations make only minimal or no use 
of economic theory, which deals largely with rela- 
tions between different factors. Technically, how- 
ever, extrapolations can vary a great deal, from 
very simple to very complicated forms. The sim- 
plest “naive models” project the last-known level 
or the last-known change in the series, that is, they 
assume that next period’s value of the series will 
equal this period’s value or that it will equal this 
period's value plus the change from the preceding 
to this period. Few, if any, forecasters would cast 
their predictions in the form of such crude ex- 
trapolations, but the naive models are useful as 
minimum standards against which to measure the 
performance of forecasts proper. 

Since trends are common in many economic time 
series because of the pervasive influence of growth 
in the economy, trend extrapolations usually pro- 
vide more effective predictions and are therefore 
more demanding as criteria for forecast evaluation. 
Particularly in application to long-term forecasts, 
trend fittings and projections are widely used. The 
trends are usually conceived as smooth (often but 
not always monotonic) functions of time; the 
methods of describing them vary greatly, from 
visual freehand projections and long-period mov- 
ing averages to diverse (for example, exponential, 
logistic) curves fitted by mathematical formulas. 
In the short-run context, the other typical com- 
ponents of economic time series become of primary 
importance, namely, the cyclical, seasonal, and 
purely “irregular” or random movements. (How- 
ever, for some series, trends are substantial even 
over short periods, and in such cases it is especially 
rewarding for the forecaster to approximate them 
well.) [See TIME SERIES.] 

Strictly periodic, repetitive fluctuations are, like 
persistent trends, relatively easy to extrapolate: 
stable seasonal movements would often be more 
or less of this type. Average ratios of raw (say, 
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monthly) data to smoothed values of the series 
representing mainly the longer-term movements (a 
centered 12-month moving average is the simplest 
example) are most commonly employed as a set 
of “seasonal indexes.” Extrapolations of these in- 
dexes then serve as forecasts of the seasonal move- 
ments. Since such movements are a major source 
of instability for business firms, their projection is 
particularly important in industrial and sales fore- 
casting. Complications arise when the seasonal 
patterns are not very pronounced and not very 
stable, hence difficult to isolate from other com- 
ponent movements, Forecasters whose interest is 
mainly in the other movements often try to work 
around the seasonal effects by predicting changes 
in the seasonally adjusted series. 

This leaves the cyclical and irregular compo- 
nents as the major objects of concern for the short- 
term forecaster. Looking forward, it is usually any- 
thing but easy to distinguish the cyclical from the 
random element in the movement of an economic 
time series, although retrospectively it is often 
possible to do so with fair results (by decompos- 
ing the time series and testing the residually ob- 
tained estimates of the irregular component for 
randomness ). 

The forecasting errors that are directly traceable 
to very short random movements must be accepted 
as unavoidable. The forecaster can hardly be ex- 
pected to predict an event generally regarded as 
unforeseeable, such as an outbreak of a war or a 
strike started without advance warning. However, 
although such “shocks” cannot themselves be pre- 
dicted individually with the tools of economics and 
statistics, their more significant effects on the 
economy are, of course, the proper concern of the 
forecaster. In probabilistic predictions, which aim 
at the distribution of unknown parameters and out- 
comes rather than at point forecasts of future 
events, the effects of shocks and other random 
errors would be taken into account as an important 
part of the system to be analyzed. The role Re 
random impulses in propagating business cycles 
has been given considerable attention in recent 
simulation studies of ageregative —economeli® 
models, [See SIMULATION, article on ECONOMIC 
PROCESSES. ] 
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changes in the given series, and extrapolations 
can help the forecaster in his task to the extent 
that they preserve these patterns and to the ex- 
tent that the patterns continue to apply. But, in 
regard to economic and related social events, the 
future seldom reproduces the past without signifi- 
cant modifications, and the historical “patterns” 
are often complex enough to elude efficient ex- 
trapolation. 

In particular, extrapolations are by and large 
incapable of signalizing the turning points in busi- 
ness. The turns in extrapolations will as a rule lag 
behind those in the actual values; the strength of 
a good projection lies almost entirely in that it 
may predict well the longer-term trends. This con- 
tributes to the fact that, along with the short ran- 
dom variations, it is the cyclical fluctuations, not 
the longer trends, that produce the greatest diffi- 
culties in short-term forecasting. These fluctuations 
are recurrent but nonperiodic; they vary greatly in 
duration and amplitude; and calling them cyclical 
should convey neither more nor less than that they 
reflect mainly the participation of the given eco- 
nomic factor in “the business cycle.” 

Important mathematical studies of smoothing 
and extrapolation of time series (by A. Kolmogoroff 
and by Norbert Wiener) appeared in the early 
1940s. The method of autocorrelation, or the ex- 
trapolation of a series by means of a correlation 
of the series with itself at different points of time, 
was added and related to the methods of trend pro- 
jection and harmonic analysis of the residuals from 
trend. It is an essential feature of dynamic process 
analysis in economics that variables at different 
points of time are functionally (usually stochasti- 
cally) related; this approach may result in differ- 
ence equations which generate their own solutions, 
as in the multiplier—accelerator models that yield 
a dependence of the current value of national in- 
come on the past values of national income. Thus, 
there is a strain of theoretical thought here that 
suggests the use of autoregressive extrapolation 
functions in aggregate income forecasting. 

More recent developments have led to applica- 
tion of such functions in the analysis of how ex- 
pectations are formed and how lagged adjustments 
are made. In particular, functions in which the 
weights decline geometrically as one goes back to 
progressively earlier past period values have been 
used in a variety of problems involving either ex- 
pectation or partial adaptation to change. [See 
DISTRIBUTED LAGS.] Forecasts derived from such 
exponentially declining weighted averages were 
found to have certain desirable properties for a 
class of autoregressive time series. But these are 
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“optimum” predictions only if the structure of the 
series is known to belong in the given class and 
only if the past values of the series are all that one 
has to go on. These conditions are seldom fulfilled 
or relevant. As one of the pioneers in econometric 
forecasting methods has observed: “Mathematical 
processing or analysis can never substitute for so- 
phistication. And until one understands the forces 
that built a particular structure in an economic 
time series and how these forces are currently 
changing, he cannot forecast with confidence even 
though by chance he scores a preponderance of 
successes” (Roos 1955, pp. 368-369). 

Surveys of anticipations or intentions. The col- 
lection and evaluation of expectational data for 
the U.S, economy did not develop on a large scale 
until after World War 11, but since then work in 
this area has proceeded at a rapid pace. The data 
relate to future consumer expenditures, planned 
or anticipated capital outlays of business firms, 
business expectations about “operating variables,” 
and government budget estimates. 

Reports dealing with purchases of household 
appliances and automobiles are published peri- 
odically by the Survey Research Center of the 
University of Michigan. They currently include 
quarterly measures of consumer attitudes and in- 
clinations to buy as well as annual financial data. 
Since 1959, a quarterly household survey has also 
been conducted by the Bureau of the Census. 

The surveys of business plans and anticipations 
relating to future expenditures on plant and equip- 
ment include one that is carried on annually and 
quarterly as a joint enterprise by the U.S. Depart- 
ment of Commerce and the Securities and Ex- 
change Commission (SEC) and another that is 
conducted annually by the McGraw-Hill Publishing 
Company. Since 1955, a quarterly survey of new 
and unspent capital appropriations has been con- 
ducted by the National Industrial Conference Board. 

The oldest of the current surveys of business- 
men’s expectations in the United States is the Rail- 
road Shippers’ Forecast, which has given quarterly 
anticipations for carloadings by commodity since 
1927. The Fortune magazine program, which 
started in the 1950s, includes surveys of “business 
expectations and mood,” retail sales, farm spend- 
ing, homebuilding, inventories, and capital goods 
production (all of these are now semiannual, ex- 
cept for the annual farm and the quarterly inven- 
tory surveys). Data on manufacturers’ sales ex- 
pectations have been gathered since 1948 in the 
course of the U.S. Department of Commerce— 
SEC survey of investment anticipations; quarterly 
figures on manufacturers’ sales and inventory ex- 


pectations are now published regularly by the De- 
partment of Commerce in a program initiated in 
1957. 

Two other sources of data on business expecta- 
tions about operating variables can be grouped 
together inasmuch as their output takes the special 
form of “diffusion indexes.” Such data indicate, for 
each successive forecast period, the percentage of 
respondents in the sample who expect either rises 
or declines or no change in the given variable. The 
Dun and Bradstreet surveys of manufacturers, 
wholesalers, and retailers cover employment, in- 
ventories, prices, new orders, sales, and profits; 
they started in the 1950s, are quarterly, and refer 
in each case to businessmen’s expectations for the 
impending six-month period. The monthly ques- 
tionnaire of the National Association of Purchasing 
Agents, first issued in 1947, is addressed to par- 
ticipating members, covers production, new orders, 
commodity prices, inventories, and employment, 
and asks how the month ahead is going to compare 
with the preceding month (whether it will be 
better, worse, or the same). 

Surveys of enterprise expectations have also 
spread in other countries, originating apparently 
with the IFO-Institute for Economic Research in 
Munich at the beginning of 1950. This institute 
sends monthly questionnaires to a large number 
of companies in West Germany and on the basis 
of replies from executives compiles diffusion data 
on the actual and expected directions of change in 
several important economic variables. By 1959, 
eight European countries, as well as Japan, South 
Africa, and Australia, adopted methods of entre- 
preneurial surveys similar to the IFO procedure. 

Business expectations have been classified into 
intentions (plans for action where the firm can 
make binding decisions), market anticipations (te- 
lating to the interplay between the firm’s actions 
and its environment), and outlook (expectations 
about conditions which the firm cannot signifi- 
cantly influence but which will affect the markets). 
To illustrate, new capital appropriations or plans 
regarding next year’s outlays for plant and equip- 
ment fall into the first category, as does the sched- 
uling over shorter periods of production and em- 
ployment. The firm’s sales forecast, which depends 
on customers’ reaction to the terms offered, adver- 
tising efforts, etc., belongs to the second category, 
as do the expectations concerning financing, in- 
ventories, and selling prices (for firms that are not 
able to set their own prices). Finally, the class 
labeled “outlook” includes forecasts of the general 
situation of the economy or industry. 5 

Consumer intentions to buy are in principle akin 
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to business plans to acquire productive resources. 
but in practice they are often more vague and atti- 
tudinal and usually less firmly budgeted. Govern- 
ment budget estimates also represent intentions: 
they document what the central administration 
would undertake to spend for diverse specified pur- 
poses, subject to approval by the legislature. In 
government, as well as in many large business 
companies, the process of forming “expectations” 
or forecasts is often highly decentralized and com- 
plex, as is indeed the related process of reaching 
decisions. 

The distinction between intentions, market an- 
ticipations, and outlook is of significance for the 
question of the predictive value of expectational 
data, It is plausible that accuracy will tend to be 
higher the greater the degree of control that those 
holding the expectation have over the variable con- 
cerned. This suggests that intentions should be on 
the whole more accurate than market anticipa- 
tions, and the outlook estimates should be the least 
accurate. There is some evidence consistent with 
this view, notably the fact that business anticipa- 
tions of plant and equipment expenditures have a 
much better forecasting record than business sales 
anticipations (as shown by the U.S. Department 
of Commerce-SEC sample surveys). However, 
there are other relevant factors which can modify 
such comparisons, such as the variabilities of the 
predicted series (forecasts of a very stable aggre- 
gate, classifiable as outlook, may be better than 
market anticipations for a variable which is highly 
volatile and therefore difficult to predict) and span 
of forecast (surveys looking far enough ahead, 
even for largely “controllable” variables, will be 
more in the nature of market anticipations and less 
of intentions, and they could well be less accurate 
than outlook surveys for the very near future). 

Clearly, expectations of all kinds always involve 
some degree of uncertainty. They are presumably 


based in part on historical evidence, such as ex- 


trapolation of past behavior of the given series 
d relations with other 


and inferences from observet 
series; but they are unlikely to incorporate only 
such evidence. As a result of expert insight or mere 
hunches, they may well include some additional 
information not contained in the patterns of the 
past. Hence, even where expectations are not very 
efficient when used alone as a direct forecast, they 
may still have a net predictive value as an ingredi- 
ent in a forecasting process that combines expecta- 
tional with other inputs. 

In long-run forecasting, informed judgment or 
expectations play a major role along with extrap- 
olative techniques. In large part, these forecasts 


are growth projections, which have been described 
as tools for exploring economic potentials. They 
aromor intended to provide predictions of actual 
conditions in a distant year but rather estimates of 
likely conditions under some specified assumptions 
regarded as more or less reasonable. Attempts are 
often made to allow for the uncertainties of the 
future by constructing alternative projections that 
assume several different paths of economic devel- 
opments within the range considered plausible. 
These forecasts are thus essentially concerned with 
trends of the economy at full employment. The 
variables projected are typically population, labor 
force, hours of work, and productivity on the sup- 
ply side; expenditures on the major GNP compo- 
nents on the demand side; income, saving, and in- 
vestment; and price level movements (which are 
often only implicitly considered, the projections 
being expressed in constant dollars). 

Business cycle indicators. Business cycle indi- 
cators, which are used in analyzing and forecasting 
short-term economic developments, are time series 
selected for the relative consistency of their timing 
at cyclical revivals and recessions (other criteria 
being the economic significance of the series in 
relation to business cycles, their statistical ade- 
quacy, historical conformity to general movements 
of the economy, smoothness, and currency). The 
series are selected from large collections of quar- 
terly and monthly economic series and then sub- 
jected to detailed analysis and repeated examina- 
tions of the quality of their performance as cyclical 
indicators. Such selections, based on studies of 
500-800 series and successive reviews of the re- 
sults, were made at the National Bureau of Eco- 
nomic Research in 1938 (by Mitchell and Burns), 
in 1950 and 1960 (by Geoffrey H. Moore), and in 
1965 (by Moore and Shiskin). The first list includ- 
ed only indicators of cyclical revivals; the later ones 
covered indicators of both revivals and recessions, 
classified into those that tend to lead the turns in 
general business activity, those that tend to coin- 
cide roughly with these turns, and those that tend 
to lag. Further revisions and extensions of the list 
have been prompted by the appearance of new and 
improvement of old data, the accumulation of 
knowledge about the behavior of the series and the 
processes they represent, and the great increase in 
efficiency with which time series can be processed 
and analyzed. These reviews resulted in many sig- 
nificant changes, but the core of the list has not 
been essentially altered as to its composition in 
terms of the represented processes. In the most 
recent list, this core consists of the following: 

Leading indicators (14 series )—average work 
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week in manufacturing, nonagricultural job place- 
ments, new building permits for private housing, 
net business formation, new orders for durable 
goods, contracts for plant and equipment; change 
in unfilled orders for durables, change in manufac- 
turers’ and trade inventories; industrial materials 
prices, stock prices, corporate profits after taxes, 
ratio of price to unit labor costs in manufacturing; 
change in consumer installment debt, liabilities of 
business failures. 

Roughly coincident indicators (8 series )—non- 
agricultural employment, unemployment rate, GNP 
in constant dollars, industrial production, personal 
income, retail sales, manufacturing and trade sales, 
wholesale price index. 

Lagging indicators (6 series)—long-duration 
unemployment, book value of manufacturers’ and 
trade inventories, labor cost per unit of output in 
manufacturing, business expenditures on new plant 
and equipment, bank rates on short-term business 
loans, commercial and industrial bank loans out- 
standing. 

While the selection is based mainly on historical 
evidence, it is also broadly supported by general 
economic considerations and logic. Thus the aggre- 
gative series on production, employment, and in- 
come measure approximately the general level of 
business or economic activity whose major fluctua- 
tions are defined as the business cycle; hence, these 
series could hardly fail to be “roughly coincident.” 
The leaders include series that anticipate produc- 
tion and employment, such as hours worked, job 
vacancies, and new orders and contracts. For ex- 
ample, an increase in demand calling for additional 
labor input is likely to be met first by lengthening 
the work week and only later, if still needed, by 
hiring new workers (the former adjustment is less 
binding and costly than the latter). New orders 
precede production and employment by sizable in- 
tervals for goods made largely in response to prior 
offers or commitments to buy. Most durable manu- 
factured goods belong in this category and, partic- 
ularly, the capital equipment items which are as 
a rule produced to fill advance orders; construction 
of industrial and commercial plant is similarly 
anticipated by building contracts. The execution of 
these new investment orders and contracts takes 
time, however, and so the expenditures on plant 
and equipment is a roughly coincident or slightly 
lagging series. 

Another type of sequence arises from the fact 
that a stock series often undergoes retardation be- 
fore reversal; hence the corresponding flow series 

(or rate of change in the stock) tends to turn ahead 
of the stock. Thus, inventory changes lead at busi- 


ness cycle turns, while total inventories lag. Still 
other sequences are recognized when downturns in 
some indicators are related to upturns in others, 
For example, the decline in inventories lags behind, 
and is a possible consequence of, the downturns in 
the comprehensive measures of economic activity 
(such as GNP, industrial production, and retail 
sales); but the downturn in inventories also leads, 
and may be contributing to, the later upturns in 
these and other series (as the need for the stocks 
to be ultimately replenished will stimulate orders 
and help to bring about the next business recov- 
ery). Such considerations suggest that the coin- 
ciders and laggers are not merely of value as con- 
firming indicators; some of them also play an 
active role as links in the continuous round of 
business cycle developments. 

In addition to the 28 series identified above, 
more than sixty other series are included in the 
full list of indicators of business expansions and 
contractions. Up-to-date charts and tabulations and 
various summary measures for the entire set of 
indicators have been published since 1961 by the 
Bureau of the Census in the monthly report Busi- 
ness Cycle Developments. The full list contains 
several related series for each type of economic 
process having significance for business cycle anal- 
ysis and forecasting. Most of the series are monthly 
(less than 20 per cent are quarterly), and more 
than half of those classified by timing are leaders. 
Change in money supply, a series with long leads 
and rather pronounced irregular component move- 
ments, deserves a special reference in view of the 
hypothesis that this variable is a fundamental fac- 
tor in initiating business downturns and upturns. 

The indicators are used mainly to reduce the lag 
in the recognition of cyclical turning points. The 
lead time provided by the indicators is, on the 
whole, short; often, especially when the economy 
reverses its course in a relatively abrupt manner, 
the best obtainable result seems to be to recognize 
the cyclical turning point at about the time it is 
reached. Even this is not a negligible achievement, 
however, since revivals and recessions are gener- 
ally not recognized as such until several months 
after they have occurred. 

The greatest difficulty in using indicators for 
forecasting purposes arises from the need to estab- 
lish on the current basis the direction in which 
these series are moving or the dates of their turn- 
ing points. This is because the trend and cyclical 
movements in many indicators are typically Overs 
laid and often obscured by other short-period varia- 
tions, partly of a seasonal but mainly of an erra- 
tic nature. Seasonal adjustment and subsequent 
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smoothing of the series can help to distinguish its 
cyclical from its shorter irregular movements, but 
these are essentially descriptive—historical proce- 
dures which are not very efficient and which cause 
losses in up-to-dateness when applied currently. 

The leading indicators, in particular, are highly 
sensitive to all kinds of short-term influences and 
not only to the forces making for the general cycli- 
cal movements. They have anticipated marked re- 
tardations in business activity as well as the major 
recessions and revivals. Skilled (or lucky) judg- 
ment of the user could sometimes succeed in dis- 
tinguishing between these different episodes, but 
no mechanical, replicable method of applying the 
indicators has been able to do so. 

Individual indicators occasionally fail to signalize 
the approach of a general business reversal, and 
their leads often vary a great deal in length from 
one recession or revival to another. The evidence of 
groups of indicators is considered to be on the 
whole more reliable than the evidence of any single 
indicator, Accordingly, the degree of consensus in 
the behavior of these series attracts considerable 
attention of business analysts and forecasters. Sev- 
eral measures of the consensus are in use, includ- 
ing diffusion and other composite indexes for 
groups of the indicator series. 

Econometric model forecasts. Aggregate econ- 
ometric models are systems of equations designed 
to represent the basic quantitative relationships 
among, and the behavior over time of, such major 
economic variables as national income and product, 
consumption, investment, employment, and price 
level, Such models are used for forecasting and 
also for other purposes (simulation of the likely 
effects of alternative fiscal and monetary policies, 
tests of hypotheses, etc. ). In recent years, intensive 
work in this area has been done in several uni- 
versities and government agencies in the United 
States and abroad, notably by Lawrence R. Klein 
and his associates. 

The equations in econometric models describe 
the behavior of consumers, producers, investors, 
and other groups of economic agents; they also 
describe the market characteristics, institutional 
conditions, and technological requirements that 
guide and constrain economic action, They include 
variables selected to represent important systematic 
factors entering each of these functions and relate 
these variables by means of statistical estimates of 
their net marginal effects. The estimation of these 
unknown values (called parameters and taken, as 
a rule, to be constant) is based on the assumption 
that all the major factors affecting the relationship 
have been properly identified, leaving only random 
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disturbances with expected values of zero. Ideally, 
the residual disturbance terms, which entail the 
net effect of all influences other than those of the 
specified explanatory variables, should be small, 
not serially correlated, and not associated with the 
systematic factors. In practice, the model builder 
hopes that the disturbances will be at least ap- 
proximately random and relatively small, that is, 
that economic theory or other insights will lead 
him to a sufficiently good specification of a few 
principal determinants in each of these relation- 
ships. 

In contrast to these “stochastic” equations, which 
are supposed to hold only approximately, the re- 
maining equations of the model are accounting 
“identities,” which are based on definitions and are 
therefore supposed to hold exactly., The stochastic 
equations and the identities together form a de- 
scription of the structure of the economy. 

The unknown variables that are to be determined 
by the model are called the jointly dependent or 
current endogenous variables; their number equals 
that of the equations in the complete system, To 
solve for these variables, each of them is expressed 
as a function of the estimated structural param- 
eters, the disturbances, and the predetermined var- 
iables. The predetermined variables are inputs re- 
quired by the model and consist of (a) the values 
of those variables (labeled exogenous) that are 
viewed as determined by factors outside the model; 
and (b) the lagged values of the endogenous var- 
iables, which are given by outside estimates or by 
the past operation of the system. It is the jointly 
determined estimates of the current endogenous 
variables, all of which are functions of the prede- 
termined variables with disturbances typically as- 
sumed to be zero, that represent the forecasting 
output or the “reduced form” of the model. 

If the predetermined variables are taken as 
given, say at their reported ex post values, fore- 
casts made from the reduced form will be condi- 
tional upon these data. “Unconditional” forecasts 
of the jointly dependent variables will be obtained 
when the unknown future values of the predeter- 
mined variables are themselves predicted ( which 
for the exogenous factors necessarily means predic- 
tion outside the model). A closely related distinc- 
tion is between alternative hypothetical forecasts 
(for example, of GNP next year, assuming 5 or 7 
or 10 per cent increases in government expend- 
itures) and the single preferred forecast (that gov- 
ernment expenditures will be up 7 per cent and 
GNP will be such and such). 

It follows that unconditional forecasts could have 
substantial errors because of wrong projections of 


434 PREDICTION AND FORECASTING, ECONOMIC 


exogenous variables, even if the specification and 
solution of the model were essentially correct. The 
accuracy of the conditional forecasts, on the other 
hand, depends (apart from any effects of errors in 
the data to which the equations are fitted) only on 
the errors that occur in the construction and solu- 
tion of the model. These may arise for several 
different reasons: (1) incorrect specification of the 
behavioral or other economic relationships, that is, 
failure to use the right explanatory variables in 
their proper form; (2) deficient methods of statisti- 
cal inference, for example, inability to measure 
well the separate effects of several closely interre- 
lated exogenous variables; (3) errors in the param- 
eter estimates resulting from sampling fluctuations, 
that is, from the presence of the disturbance terms 
which obscure the underlying relationships; and 
(4) discrepancies between predictions and realiza- 
tions that result from the assumption that the dis- 
turbance terms vanish—in any particular period 
these terms may differ from zero, even though their 
expected values are zero. 

Statistical inferences as to the probabilistic 
meaning of the parameter estimates, the goodness 
of fit to the data in the sample period, the presence 
or absence of autocorrelation in the disturbance 
terms, etc., can be used to evaluate the severity of 
the errors resulting from sampling variations and 
the inefficiency of the estimation methods em- 
ployed. As far as the appraisal of the model per se 
is concerned, however, misspecifications are clearly 
the decisive sources of error, and these are more 
difficult to detect and evaluate. Theoretical and 
other a priori considerations can tell us something 
about the correctness of some of a model's specifi- 
cations; but the correctness of the specifications 
for the forecast period and not just the sample 
period must be judged primarily from the quality 
of the conditional forecasts made with a model 
that proved satisfactory on the other statistical 
tests just mentioned, 

The unit time period varies from a quarter to a 
year in different models; for the purpose of analyz- 
ing and forecasting near-term developments, short 
unit periods are desirable, and aggregate econo- 
metric models have recently progressed from an- 
nual to quarterly units. Models also vary greatly 
in size and complexity. One view is that simple 
small-scale models can be sufficient for forecasting 
the broad course of the economy in the near future 
and indeed that they may be preferable in this role 
to large models which present greater opportunities 
for error by taxing heavily the present inadequate 
knowledge and data. But, while small models with 

as few as five equations have recently been pro- 


posed, the trend appears to be in the direction of 
ever larger and more complex systems. 

It should be noted that conditional forecasts of 
particular variables are sometimes obtained by 
econometricians from one-equation systems. If the 
equation is unlagged, it merely shifts the burden 
of prediction in that, to get an authentic ex ante 
forecast of the dependent variable, the independ- 
ent variable(s) must somehow be predicted outside 
the model. If the dependent variable is taken with 
a lag, it can be predicted from the equation inas- 
much as the earlier values of the explanatory fac- 
tors are known or treated as known. The single- 
equation approach is strictly applicable only to 
cause-effect relations where one variable, say x, 
depends on others, y,, y:,:::, but the y's do not 
depend on x. If the variables depend on each other, 
a multiequation model should be applied. 

An econometric forecasting model can in princi- 
ple be so constructed and annotated as to be avail- 
able for production of successive forecasts, replica- 
tion by users other than the authors of the model, 
and continuous inspection and testing. This possi- 
bility holds out the promise of scientific advance, 
but it is difficult to achieve in practice for complex 
models with large requirements in terms of data 
and methods, especially when the data and methods 
are themselves subject to frequent and substantial 
changes. Moreover, some econometric forecasters 
wish to use their models in a flexible manner, mod- 
ifying them repeatedly so as to take advantage of 
additional information. Thus, anticipatory data 
from surveys, indicators, etc., are introduced into 
the models as exogenous variables, or judgment 
about the probable effects of a recent event is used 
to alter the constant term in an individual equa- 
tion. Such modifications are often informal and 
sometimes unrecorded; they may (although they 
need not) improve any particular forecast, but they 
certainly increase the difficulty of replicating and 
evaluating the models. 

Input-output tables. Input-output analysis, 
which was developed by Wassily Leontief and first 
presented in 1941, involves a relatively detailed 
division of the commodity-producing sector of the 
economy into individual industries and estimation 
of the relations between these industries. In a statis- 
tical input-output table, each row shows the sales 
made by a given industry to every other industry, 
and each column shows what one industry pur- 
chased from every other industry. If the quantity 
of each input per unit of output is treated as 4 
structural constant (which implies constant returns 
to scale and, a particularly drastic assumption, 
absence of substitution among inputs when relative 
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prices change), then a system of linear equations 
can be set up describing the interdependence be- 
tween the outputs of the different industries by 
means of these “technical coefficients” of produc- 
tion. Such a model can be used to make conditional 
predictions of the values of industry outputs, given 
the estimates or projections of the “bill of goods,” 
that is, of purchases by the autonomous sectors— 
consumers, government, and foreign countries. 
[See INPUT-OUTPUT ANALYSIS.] 


Evaluaton of forecasts 


Despite the widespread and apparently increas- 
ing use of economic forecasts in business, govern- 
ment, and research, surprisingly little has been 
done to test these forecasts in a systematic and 
comprehensive way. Yet it is clear that forecasts 
must be properly evaluated if their makers are to 
learn from past errors and if their users are to be 
able to discriminate intelligently among the avail- 
able sources and methods. Without dependable as- 
sessments of forecasting accuracy, informed com- 
parisons of costs and returns associated with 
forecasting are clearly impossible. 

Conceivably, the effectiveness of forecasts could 
be such as to complicate seriously their evaluation. 
Forecasts may influence economic behavior and, 
in particular, the variables being predicted; to the 
extent that this happens, the forecasts may validate 
or invalidate themselves. For example, if almost 
everyone predicted better economic conditions in 
the period ahead, this very consensus of optimistic 
expectations could contribute to the stimulation of 
the economy. Or, conversely, if the government 
accepted the forecast that the economy is threat- 
ened by recession, it would probably adopt policies 
designed to avert or at least postpone the undesired 
outcome initially foreseen. 

It is easy to exaggerate or misjudge such feed- 
back effects. Of the limited theoretical work on the 
problem, some is highly abstract and speculative. 
The best result appears to negate the thesis that 
public announcements must necessarily invalidate 
an otherwise accurate prediction. Under some 
plausible assumptions ( notably that the predicted 
variable has a lower and an upper bound), if un- 
published forecasts can be accurate, so can pub- 
lished forecasts, because the reaction to them can 
conceptually be known and taken into account. 


Actually, the changes in GNP and other macro- 


variables that are predicted by different sources for 
any given period show sufficiently large dispersion 
to suggest that no single forecast is generally ac- 
cepted (the evidence comes from a recent NBER 
study of American forecasts referred to below). 


For groups of forecasters, average forecasts are in 
the long run typically more accurate than most of 
the forecasts of the individual members because of 
compensating errors among the latter. Consistently 
superior forecasts are evidently hard to find. Some 
predictions are much more influential than others 
(and the special significance of official government 
forecasts is widely recognized), but there is no 
single authoritative source that enjoys unques- 
tioned leadership. 

A careful appraisal of American business fore- 
casts presented by Cox (1929) was one of the few 
early studies in this area. The years following 
World War 1 produced a number of particularly 
unsatisfactory forecasts (reflecting expectations of 
serious unemployment), which became the subject 
of some instructive reviews. Christ (see Conference 
on Business Cycles... 1951) published a test of 
an early econometric model constructed by Klein. 
Essays concerned with the development and ap- 
praisal of different forecasting approaches were 
collected in several volumes by the National Bureau 
of Economic Research (Conference on Models of In- 
come Determination 1964; Conference on Research 
in Income and Wealth 1954; 1955; Universities- 
National Bureau Committee for Economic Re- 
search 1960; Moore 1961). In Europe, Henri Theil 
and his associates (1958) produced a major analy- 
sis of the methodology and quality of forecasts, 
directed to both business surveys and econometric 
models. A comprehensive study of accuracy and 
other properties of short-term forecasts of eco- 
nomic activity in the United States has been con- 
ducted since 1963 at the National Bureau of Eco- 
nomic Research, and several reports on methods 
and results of this evaluation have been prepared. 

Judging from the over-all results of most of these 
studies, the record of economic forecasters in gen- 
eral leaves a great deal to be desired, although it 
also includes some significant achievements and 
may be capable of further improvements. Accord- 
ing to the current NBER study, the annual GNP 
predictions for 1953-1963 made by some three 
hundred to four hundred forecasters (company 
staffs and groups of economists from various indus- 
tries, government, and academic institutions) had 
errors averaging $10 billion. Although this amounts 
to only about 2 per cent of the average level of 
GNP, the errors were big enough to make the differ- 
ence between a good and a bad business year. The 
average annual change in GNP in this period was 
approximately $22 billion. Hence, the errors were, 
according to absolute averages, not quite one-half 
the size of those errors that would be produced by 
assuming that next year's GNP will be the same 
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as last year’s (since the error in assuming no 
change is equal to the actual change). Had the 
forecasters assumed that GNP would advance next 
year by the average amount it had advanced in the 
preceding postwar years, the resulting average 
error would not have been greater than $12 billion. 
However, while it is true that in terms of the ab- 
solute average errors some forecasts of GNP have 
not been significantly better than simple trend 
extrapolations, the forecasts were typically superior 
in terms of correlations with actual changes. In 
fact, recent forecasts of aggregate economic ac- 
tivity in the coming year, whether measured by 
GNP or industrial production, have generally been 
more accurate than both the simple extrapolations 
of the last level or change and the considerably 
more effective models of trend projection and 
autoregression. 

Forecasters, then, were able to make a net pre- 
dictive contribution over and above what could be 
obtained by means of mechanical extrapolations 
alone. This is a favorable result that is by no means 
always attained—for example, simple trend projec- 
tions would have done better than the recorded fore- 
casts that envisaged a serious business depression 
in 1947-1948. It is not a major achievement, how- 
ever, since the course of the economy has been rela- 
tively smooth in recent years and therefore, on the 
whole, probably less difficult to predict than the 
developments in the earlier part of the postwar 
period. In the decade after 1953, there were fewer 
and smaller exogenous “shocks” of the type repre- 
sented, say, by the outbreak of the Korean War in 
1950. Also, the timing of recent business down- 
turns was early enough to make the presence of 
the recessions rather widely known by the ends of 
the peak years (1953, 1957, and 1960), and most 
forecasts are made at the end of the year. This, 
plus the presumption that the contractions would 
continue to be short, made predicting the direction 
of the year-to-year changes in aggregate economic 
activity relatively easy. 

By and large, the business economists’ forecasts 
covered by this analysis are informal, eclectic, and 
framed loosely in terms of the GNP accounts. The 
econometric model forecasts for GNP in the years 
since 1953 appear to have been about as accurate 
as the better business forecasts. It must be noted, 
however, that very few ex ante econometric fore- 
casts are available for such tests, since most of the 
models now in use have been constructed only in 
the last few years and their accuracy beyond the 
sample periods cannot as yet be evaluated with any 
confidence. The econometric forecasts that do lend 
themselves to this appraisal come mainly from a 
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series of closely related models used in a flexible 
manner, with many judgmental modifications, 

Many forecasts show a bias of underrating the 
growth of the economy; the declines are much less 
frequently underestimated than the increases, The 
charge that forecasters tend to be too cautious or 
conservative seems, in this sense, to be justified. 
Since such series as GNP or industrial production 
have strong upward trends, their future levels as 
well as their changes have been understated most 
of the time (underestimation of increases typically 
results in underestimation of the ensuing levels). 

Forecast errors are also affected by the cyclical 
characteristics of the forecast period. The levels of 
GNP and industrial production are underestimated 
most in the first year of expansion, when the 
increases in these series tend to be very large. Later 
in the expansions, when the increases are usually 
smaller, the levels are underestimated much less. 
In contractions, the predicted levels are as a rule 
too high, often because the downturn was missed 
and sometimes because the decline turned out to 
be larger than foreseen. 

The forecasts of total GNP are often substan- 
tially better, in the sense of having smaller per- 
centage change errors, than the forecasts of most 
major GNP expenditure components from the same 
source. Apparently, the over-all forecasts ben- 
efit from a partial cancellation of errors in fore- 
casts of the components. This is definitely pref- 
erable to the opposite case of positively correlated 
and mutually reinforcing errors, but gross inaccu- 
racies in the component forecasts are, of course, 
disturbing, even if these errors happen to be largely 
compensating. 

Errors in predicting percentage changes in per- 
sonal consumption expenditures are considerably 
smaller than those in corresponding forecasts of 
gross private domestic investment, while errors in 
predicting government spending are of intermedi- 
ate size. Consumption forecasts have suffered from 
a pronounced tendency toward underestimation. 
In contrast, changes in series that fluctuate more 
and grow less strongly, notably the components of 
investment, have been as often overestimated as 
underestimated. 

Although the errors of consumption forecasts 
are smaller than those for the other major GNP 
components when measured in deviations of per- 
centage changes, they are large relative to the 
errors of some extrapolations. The consumption 
series, including those for nondurable goods and 
services, are smoothly growing series that could 
have been predicted very well in recent years by 
simple trend projections; and, indeed, the average 
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errors of the latter have often been smaller than 
those of recorded consumption forecasts. 

Aggregation of short-term expectations or plans 
of business concerns about their outlays on plant 
and equipment, as developed in periodic intentions 
surveys, results in better predictions of total busi- 
ness capital expenditures than those made inde- 
pendently for the entire economy. This can be 
inferred from comparisons between investment 
forecasts made before and after the McGraw-Hill 
investment intentions survey and also from com- 
parisons involving the U.S, Department of Com- 
merce-SEC anticipations data. 

The average accuracy of short-term forecasts 
diminishes as they reach further into the future: 
the various series covered are all predicted better 
over the next three months than over the next six 
and better over six months than over nine or twelve. 
However, the errors increase less than proportion- 
ately to the extension of the span. Most extrapola- 
tions also tend to worsen with the lengthening of 
their span, and the simplest among them (the 
naive models) usually show the fastest deteriora- 
tion, While forecasts of three to nine months 
(which includes the annual forecasts whose aver- 
age spans are little more than six months) are 
generally superior to all kinds of extrapolations, 
the longer forecasts, which aim at targets 12 to 18 
months ahead, are often worse than some of the 
relatively efficient types of trend projections and 
autoregressive extrapolations. 

Conditional forecasts from formal models ap- 
pear to produce errors that are similarly related to 
the time span of prediction. Thus, a recent analysis 
of input-output tables for the Netherlands during 
the period 1949-1958 concluded that predictions 
based on this method were on the whole better than 
extrapolations when the tables used were less than 
two or three years older than the extrapolation 
data. The input-output forecasts of industry out- 
put values for three or more years ahead (given 
the actual final demand schedules) proved inferior 
to the results of some fairly simple mechanical 
extrapolations. 

Forecasts made frequently for sequences of two 
or more short intervals (for example, quarterly, for 
four successive quarters each) are more relevant 
for an appraisal of turning-point errors than are 
the annual forecasts, and they present a rather 


unfavorable picture. There are very few indications 
in the record of an ability to forecast turning dates 
ly were actual turns 


several months ahead. Not only ™ ; 
missed, but some turns were predicted that did not 


occur, Reports by observers of the economic scene 
in business and financial periodicals are consistent 


with this finding in that they commonly show sub- 
stantial lags in recognizing revivals and recessions. 
Business cycle indicators have at times demon- 
strably reduced these lags for competent users, but 
they also occasionally gave signals of retardations 
or minor declines that were misread as forewarn- 
ing recession. 

Multiperiod forecasts also provide data that lend 
themselves to an analysis of forecast revisions, that 
is, changes in predictions for a given target period 
made at dates successively closer to that period, 
These revisions do tend to improve the forecasts in 
most cases, and they appear to be related to errors 
in previous forecasts. Positive correlations between 
forecast revisions and errors of forecast have been 
interpreted as evidence of “adaptive behavior,” or 
“learning from past errors.” Such correlations, how- 
ever, could also be due to autoregressive elements 
in some forecasts, since they are implied in fixed- 
weight autoregressive extrapolations, 

Better statistical data and better utilization of 
the historical content of the predicted series could 
lead to significant improvements of the forecasts 
(as indicated, in particular, by the relative success 
of some types of extrapolations in predicting con- 
sumption and also price levels). Improvements in 
the record-keeping practices of forecasters are defi- 
nitely needed. The records should include the esti- 
mates of the level of the series at the time the fore- 
cast was made, as errors in these base values are 
as a rule substantial and their measurability is im- 
portant for the appraisal of forecasts. 

VICTOR ZARNOWITZ 


[Directly related are the entries BUSINESS CYCLES; 
ECONOMETRIC MODELS, AGGREGATE; SURVEY ANALY- 
sıs, article on APPLICATIONS IN ECONOMICS.| 
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PREJUDICE 


Otto Klineberg 


1. THE CONCEPT 
J. Milton Yinger 


tm. SOCIAL DISCRIMINATION 


I 
THE CONCEPT 


The English term “prejudice” and its equivalents 
in many other European languages (French 
préjugé; German Vorurteil; Portuguese preconceito) 
refer primarily to a prejudgment or @ preconcept 
reached before the relevant information has been 
collected or examined and therefore based on in- 
adequate or even imaginary evidence. In contem- 
porary social science this notion has been retained 
but is usually regarded as constituting only one 
aspect of the complex phenomenon of prejudice, 
namely, the conceptual, or cognitive, aspect—the 
ideas or opinions we have about those individuals 
or groups who are the objects of such prejudgment. 


(The term “stereotype” is usually applied to this 


aspect.) Prejudice also involves an attitude for or 


PREJUDICE: The Concept 439 


against, the ascription of a positive or negative 
value, an affective, or feeling, component. Usually 
there is in addition a readiness to express in action 
the judgments and feelings which we experience, 
to behave in a manner which reflects our accept- 
ance or rejection of others; this is the conative, or 
behavioral, aspect of prejudice. (The resulting ac- 
tions are also described as representing varying 
degrees of discrimination.) Prejudice may there- 
fore be defined as an unsubstantiated prejudgment 
of an individual or group, favorable or unfavorable 
in character, tending to action in a consonant 
direction. 

Social science research has joined with popular 
usage in introducing two limitations to this con- 
cept. In the first place, favorable prejudices, al- 
though they undoubtedly exist, have attracted rela- 
tively little attention, perhaps on the principle that 
they do good rather than harm, It might, however, 
legitimately be argued that even favorable preju- 
dices should be discouraged, since they too repre- 
sent unwarranted generalizations, often of an irra- 
tional nature. Second, although prejudice may èx- 
tend far and wide to apply to objects as disparate 
as trade-union leaders, women, or exotic foods, in 
practice it has been considered as dealing primarily 
—if not exclusively—with populations or ethnic 
groups distinguished by the possession of specific 
inherited physical characteristics (“race”), or by 
differences in language, religion, culture, national 
origin, or any combination of these. This article, 
therefore, will be primarily concerned with ethnic 
prejudice. The Italian anthropologist Tentori takes 
approximately this position when he defines preju- 
dice as the “negative perception of human groups 
culturally different from ourselves” (1962, p. 14); 
but the limitation to perception on the one hand 
and to cultural differences on the other makes the 
concept considerably narrower than that which is 
here proposed. 

The term “ethnic group” is used in preference to 
the more popular “race” as the object of prejudice, 
first because of the difficulty of adequately defining 
the latter term so that it may safely be applied to 
human populations, and second because of the 
even more important fact that the populations 
against whom prejudice may be directed do not 
usually satisfy the criteria of “race” proposed by 
physical anthropologists and geneticists. The phe- 
nomenon here being considered can be interpreted 
in terms of “race” in the case of the apartheid 
policy of the South African government, although 
even here the extension of the same attitudes and 
the same degree of discrimination to Africans, to 
the mixed “Coloureds,” and to Indians (usually 
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classed by anthropologists with Caucasians) indi- 
cates the lack of consistency in this respect. Amer- 
ican attitudes toward the Negro diverge from “race” 
relations in the true sense to the extent that persons 
who are genetically almost completely “white” may 
not be so considered if they have any degree of 
Negro admixture. Elsewhere other criteria of dif- 
ferentiation may play the dominant role. In Can- 
ada, Belgium, and India hostilities appear to follow 
mainly linguistic lines; in Malaysia the demarca- 
tion is due primarily to national origin, and com- 
parable situations are found in many other regions 
where immigration has been extensive. In Lebanon, 
Nigeria, and the Sudan, religious differences are 
also important, as they are in part at least in the 
case of anti-Semitism; in Brazil and Mexico cul- 
tural (and class) factors predominate, so that the 
word indio is restricted to those who have re- 
mained culturally Indian, and there appears to be 
complete acceptance of those Indians who have 
become “acculturated.” To speak of “race relations” 
in such contexts is clearly incorrect, and use of the 
expression “sociological races” to cover these cases 
—because the groups concerned are treated as if 
they were “races”—simply perpetuates an unfor- 
tunate and misleading terminology. 

In his valuable treatment of the nature of preju- 
dice, G. W. Allport states that “Prejudgments be- 
come prejudices only if they are not reversible when 
exposed to new knowledge” (1954, p. 9). This cri- 
terion of prejudice appears to go too far. Granted 
that prejudice is difficult to remove, it surely can- 
not be assumed that if and when it has been 
removed by new knowledge it could not really have 
been prejudice in the first place. The question of 
the use of information to reduce prejudice will be 
discussed below as part of the wider problem of 
determining what methods and techniques are most 
effective in this connection. 

It would be difficult to overestimate the impor- 
tance of prejudice as a problem for the social sci- 
ences. The hostility which prejudice engenders and 
the discrimination to which it may lead on the part 
of the dominant population toward other ethnic 
groups or minorities have caused so much human 
damage (the Nazi period represents perhaps the 
most drastic example) that it is hardly surprising 
that so many specialists in these disciplines have 
directed their energies toward understanding and 
control of this form of social pathology. Particu- 
larly but by no means exclusively in the United 
States, the problem has not only aroused general 
concern but has also been seen as a research chal- 
lenge by psychologists, psychiatrists, sociologists, 

anthropologists, political scientists, lawyers, edu- 


cators, and others whose professional activities are 
concerned with human behavior. This has been not 
only out of humanitarian motives but also out of 
conviction that the very life of a community may 
be at stake. The concern has been deepened by the 
realization that what happens inside a country may 
also have significant international repercussions. 
The apartheid policy has implications far outside 
the confines of South Africa; the American treat- 
ment of Negroes affects the image and the leader- 
ship of the United States throughout the world. 

Nor is prejudice a monopoly of the whites. The 
emergence of new nations has to a certain degree 
brought with it what has sometimes been called 
racism in reverse, an antiwhite attitude on the part 
of darker-skinned peoples. The philosophy of négri- 
tude, developed by French-speaking Africans and 
West Indians, has stressed the values inherent in 
African culture, but it has by no means always 
been free of aggressive overtones (Thomas 1963). 
Prejudice and ethnic hostilities constitute a major 
danger to peace both within a nation and among 
nations. 


The causes of prejudice 


The fact that prejudice is so widespread has led 
to a popular belief that it is inevitable and univer- 
sal. Even among social scientists the view has oc- 
casionally been expressed that human nature 
involves a “dislike of the unlike,” although what is 
“unlike” has never been adequately defined; or it 
has been suggested that “in-group” feelings are in- 
variably accompanied by dislike of the “out-group,” 
but this, too, has never been demonstrated. A 
German social psychologist, Hofstiitter, has suggest- 
ed (1954) that one must accept the fact that preju- 
dice against members of other groups represents 4 
“normal” phenomenon of human social life and 
that no one is free from this attitude. This appears 
to be an extreme and unjustified conclusion. Indi- 
viduals and groups vary so tremendously in the 
extent to which they show prejudice that any at- 
tempt to explain it in terms of a universal human 
nature fails to carry conviction. The full acceptance 
of Orientals by most whites in Hawaii and their 
relative rejection in California or British Columbia 
is difficult to reconcile with such a theory. The 
further fact that the most intimate degrees of asso- 
ciation, including intermarriage or miscegenation, 
have occurred to such an extent that most anthro- 
pologists deny the existence of “pure races” argues 
that prejudice cannot be universal. Finally, its ab- 
sence in young children, even though they may 
acquire it relatively early in life from their social 
environment, argues that learning rather than na- 


ture plays the dominant role in its development 
(Harding et al. 1954). 

Derivations from psychoanalysis. One view of 
the universality of prejudice seems to derive from 
an erroneous interpretation of psychoanalytic the- 
ory. This theory, particularly in its orthodox form, 
regards hostility or aggression (Freud’s Thanatos) 
as instinctive and universal; prejudice would then 
be simply one manifestation of this instinct. Not all 
psychoanalysts would accept this formulation, but 
even those who do would add that although ag- 
gression must manifest itself in some form, there 
is no one form (for example, prejudice) which 
must be regarded as inevitable. There is still con- 
siderable argument as to whether hostile aggression 
is universal, but in any case it can be expressed in 
so many different ways that inference to the uni- 
versality of prejudice remains exceedingly doubtful. 

Frustration-aggression theory. A variant of 
this view is found in the well-known frustration— 
aggression theory (Dollard et al. 1939), which in 
its original formulation argued that frustration 
always leads to aggression and that aggression is 
always due to frustration. An impressive array of 
clinical and experimental evidence was marshaled 
in favor of this position, and there can be no doubt 
that it has contributed to an understanding of the 
problem, The fact remains, however, that even if 
aggression is reactive rather than primary, the 
same objection holds—namely, that no one mani- 
festation of aggression is necessarily implied and 
that the specific phenomenon of prejudice requires 
some further explanation. There is, therefore, circu- 
larity in the argument which holds that hostility 
must be expressed and that when a group like the 
Negroes is indicated by society as a legitimate ob- 
ject for such hostility, it can then be safely directed 
by the whites in this accepted fashion. The circular- 
ity occurs by taking for granted the very phenom- 
enon that the theory sets out to explain: if society 
indicates that hostility may with impunity be di- 
rected against the Negro, that hostility must have 
been there to start with, It is not surprising that in 
later formulations of the theory the role of learn- 
ing is given a prominent place (Miller & PE 
1941) in determining the specific nature of the 
behavior that follows frustration. vet 

Prejudice as a learned behavior. Prejudice may 
certainly be learned; it is through (mainly unChr 
scious) learning that a child acquires and eg 
rates the prejudices prevalent in his society. hae 
search has shown that in the early years there 1s 
close relation between the ethnic attitudes of par- 
ents and children; somewhat later the correspond- 
ence is rather between teachers and their pupils. 
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The representation of minority groups in the mass 
media may also play a part in reinforcing, if not 
in creating, the current attitudes. Most significant, 
in all probability, is the role of social factors and 
institutions which emphasize lines of demarcation 
between ethnic groups—segregation, whether de 
jure as in South Africa or de facto as in many 
North American cities, in connection with housing 
and education; “exclusive” clubs or resorts; 
churches limited to one ethnic group; restrictions 
on enrollment in certain colleges or universities; 
limitations on advancement to executive positions 
in industry; and discrimination in employment gen- 
erally. These and similar phenomena serve as con- 
stant reminders that they are not like us. It is in 
this sense that discrimination causes prejudice as 
well as the other way around. As Myrdal pointed 
out in connection with the American Negro, the 
relation is a circular one: “White prejudice and 
discrimination keep the Negro low in standards of 
living, health, education, manners and morals. 
This, in turn, gives support to White prejudice. 
White prejudice and Negro standards thus mutual- 
ly ‘cause’ each other . . .” (1944, p. 75). 
Unimportance of contact. One form of learning 
turns out to be less important in causing prejudice 
or hostile attitudes than is usually believed, namely, 
that resulting from actual contact with other ethnic 
groups. It may of course happen that an extremely 
dramatic or traumatic experience—such as being 
attacked on a lonely street or being saved from 
drowning by a member of a particular ethnic group 
—may result in a generalization regarding the 
characteristics of the group concerned; it may also 
happen that repeated experiences of a similar na- 
ture may create, and even appear to justify, such 
a generalized attitude. Research has shown, how- 
ever, that such experiences are not necessary for 
prejudices to develop. In one study, students 
showed a high degree of “social distance” from 
Turks, even though most of them had never seen 
a Turk. Other students rejected imaginary groups, 
such as the Wallonians, Danireans, and Pirenians, 
even though no one could have had any unpleasant 
experiences with any representatives of these non- 
existent populations (Hartley 1946). Contact and 
experience may cause prejudice to develop, but 
they probably play no important role in many cases. 
Role of language. It has been suggested that 
prejudices may also be learned through the lin- 
guistic habits of a community. The common asso- 
ciation in many parts of the world between Swhite 
and purity or honor (“that is white of you") and 
between “black” and dirt or evil (“he has a black 
heart”) may create attitudes that are difficult to 


442 PREJUDICE: The Concept 


overcome. Less clear in their impact are expressions 
such as “to jew him down”; “nigger in the wood- 
pile”; the Chinese reference to Europeans as 
“foreign devils”; the French ivre comme un po- 
lonais; or the Italian fare il portoghese, said when 
one enters a streetcar or a theater without paying. 

Roots of the tradition. All of these aspects of 
learning are important, but they are limited in their 
implications to the acquisition of attitudes already 
current in the community. They help us to under- 
stand the development of prejudice in children as 
they become “socialized” and absorb the prevailing 
cultural traditions. They leave open the question 
(as in connection. with the frustration—-aggression 
hypothesis discussed above) as to how the tradition 
of prejudice arose in the first place. 

Fear of the stranger. Ethnological accounts of 
preliterate peoples have indicated with some fre- 
quency a fear of the stranger and the development 
of hostility as a consequence. This is by no means 
universal, however; Hooton even considered it rela- 
tively rare. He wrote: “Primitive people are prob- 
ably not race conscious to the deplorable or laud- 
able extent which is characteristic of civilized pop- 
ulations. I mean that they are rather naively free 
from race prejudice until they have learned it from 
bitter experience. The American Indian was ready 
to take the European literally to his arms until he 
found out that a civilized embrace was inevitably 
throttling” (1937, p. 143). 

Historical factors, In the form in which preju- 
dice is found in contemporary cultures, many dif- 
ferent contributing influences may be recognized, 
all of which have a relatively long history. From 
the viewpoint of the whites, the facts of slavery 
and colonization must at least have reinforced— 
if they did not create—the notion of a racial hier- 
archy, with the darker peoples occupying an in- 
ferior position. Sometimes this went so far, as in 
the case of certain Spanish writers during the con- 
quest of America, as to deny to American Indians 
the same humanity as that of their conquerors; 
more mildly, but with somewhat the same conse- 
quences, the British spoke of the “white man’s 
burden” and the French of their mission civilisa- 
trice, often expressed with the utmost sincerity but 
clearly implying the superiority of white culture 
and white people. 

Historical factors are of great importance in this 
connection, and the contribution of the historian 
is needed at various points in this analysis. In some 
parts of the world relations between ethnic groups 
have taken a much more favorable turn, and preju- 
dice has played only a minor role. In Hawaii, for 
example, Adams (1937) indicates that when white 


men served as advisers to the king, they were 
occasionally honored by receiving permission to 
marry ladies of the court. This set a pattern of 
ethnic friendliness which was later extended to 
other groups as well. It became impossible to set 
up any strict “racial” line because so many mem- 
bers of prominent families had intermarried. In 
the case of Brazil, where there is some degree of 
prejudice that tends to follow class lines but only 
a fraction of that found elsewhere, Freyre (1933) 
suggests a number of reasons for the relatively 
more friendly attitude. In the first place, the Portu- 
guese who settled in Brazil had had centuries of 
contact with darker-skinned conquerors, namely 
the Moors, which predisposed them to a friendly 
and even respectful attitude. They developed an 
attraction for the “enchanting Mooress” which was 
extended to other women of darker complexion, 
thus encouraging intermarriage with Indian and 
later with Negro women. This was facilitated by 
the fact that Portuguese men migrated to Brazil for 
the most part without their families, in contrast to 
the settlers in most of North America. In addition, 
the fact that Brazil liberated its slaves peaceably 
and not as a consequence of civil war undoubtedly 
contributed to an earlier and smoother transition 
to a new relationship. Clearly, the history of ethnic 
contacts within a particular country helps to ac- 
count for the pattern of acceptance or rejection 
prevalent today. 

Religious factors. In the case of one variety of 
prejudice which in recent times has been exhibited 
in the most virulent and extreme form, namely 
anti-Semitism, it seems clear that in its origins 
religious considerations have played a dominant 
role. (Negro slavery, too, has been justified by an 
appeal to the Bible.) The story of the New Testa- 
ment as told to succeeding generations of children 
has left an imprint of ancient Jewish wickedness 
which has frequently been extended to the Jewish 
group as a whole (Glock & Stark 1966). This is 
true to some extent even in religious teaching to- 
day, although many attempts are being made to 
present a less biased and more objective picture 
(Olson 1963). [See ANTI-SEMITISM.] 

Religious identification may be relatively less 
significant to most people now than it was in the 
Middle Ages, but the climate of opinion created by 
past teaching is removed with difficulty even when 
circumstances have changed. Here, as elsewhere, 
the strength of “social lag’—the continuance of 
institutions and traditions even when they are no 
longer appropriate—is not to be discounted. 

Nationalism. Another significant factor asso- 
ciated with prejudice is the growth of nationalism 


and of feelings of national identity. Huxley and 
Haddon wrote: “A ‘nation’ has been cynically but 
not inaptly defined as a ‘society united by a com- 
mon error as to its origin and a common aversion 
to its neighbours’ ” (1936, p. 5). This aversion may 
be directed to different objects, and as has already 
been indicated social identity may be attached to 
a variety of characteristics, including language, 
religion, or any other symbol of demarcation. 
National identification is glorified as a means to 
social solidarity, to participation in a common 
enterprise, and to “belonging.” Unfortunately, how- 
ever, a “healthy” nationalism easily moves into an 
exaggerated chauvinism which is not only for “us” 
but against “them.” Considerable research has in- 
dicated that this kind of hypernationalism is usu- 
ally accompanied by ethnocentric prejudice 
(Adorno et al. 1950). Why it has developed in 
certain regions and at certain times, and why it 
has taken one form rather than another, are ques- 
tions difficult to answer. 

Economic factors. The analysis of the causes 
of prejudice has so far stressed the historical proc- 
ess related to ideas of “race,” religion, and national- 
ism and to the manner in which the resulting pat- 
terns of prejudice and hostility are “taught” to 
individuals. Research clearly indicates, however, 
that learning occurs most readily and most effi- 
ciently when it is motivated and when it is accom- 
panied by certain satisfactions which reinforce the 
learning process. In the case of prejudice, perhaps 
the most important reinforcement comes from the 
gains that appear to result, the practical ends to 
which prejudice and discrimination may lead. 
Among these ends, it is highly probable that eco- 
nomic gain plays a dominant role; for some theo- 
rists (Marxists or economic determinists) it con- 
stitutes the single significant factor. Prejudice and 
discrimination enable the dominant group to main- 
tain others in a state of subservience, to exploit 
them, to treat them as slaves or serfs, to reduce their 
power to compete on equal terms for jobs, and to 
keep them “in their place.” Gains and advantages 
other than economic may also enter, however. There 
may be a gain in status or prestige, permitting the 
humblest member of the dominant group to feel 
superior to the most successful among those who 
belong to the minority or to rejoice, through iden- 
tification with his nation, in the subjugation of 
others through colonization, even when the conse- 
quence is economic loss rather than profit. There 
may be a gain from the point of view of self-image 
when, in times of misfortune or adversity, the 
blame can be placed on outsiders who sea ns 
scapegoats; it is they, not we, who are responsible. 
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Finally, less important today than in the past, there 
may be a sexual gain, when men of the dominant 
group have access to women of the minority, often 
with severely enforced taboos against the reverse 
relationship. 

Rationalization. One of the striking aspects of 
the use of prejudice and discrimination as means 
toward these practical goals is that the underlying 
motive is so rarely recognized. Psychoanalysis has 
made us all familiar with the mechanism of ration- 
alization, which in this context is characterized by 
the tendency to persuade ourselves that our actions 
stem from the loftiest ethical and even religious 
considerations rather than from anything as base 
as the desire for gain. This was briefly mentioned 
above in connection with the “white man’s burden” 
and similar formulations. They, whom we keep in 
an inferior position, are happier than they would be 
otherwise; they, whom we persecute because of 
their beliefs, can be saved only if they accept the 
true (that is to say, our) religion; they, whom we 
destroy, are planning to destroy us, and we are 
simply exercising the right to protect ourselves. 
It is arguments like these, presented in all sincer- 
ity, which so often in the past, and not so rarely 
in the present, have given to men the conviction 
that what they are doing is somehow noble and 
beautiful and that, in Hooton’s telling phrase, they 
“can rape in righteousness and murder in magna- 
nimity” (1937, p. 151). 

Personality structure. The fact remains that 
under the same cultural conditions, surrounded by 
the same institutions and tempted by the same 
desire for gain, some people show prejudice and 
others do not. This has suggested that the key to 
the development of prejudice may be found in per- 
sonality, and an extensive amount of research has 
been directed toward this aspect of the problem. 
The most important single study is undoubtedly 
The Authoritarian Personality (Adorno et al, 1950). 
One major finding was a verification of the results 
of earlier research indicating that prejudices tend 
to go together: those individuals who disliked Jews 
were also likely to dislike Negroes, Mexicans, and 
“foreigners” in general. The use of a series of 
scales or questionnaires demonstrated a positive 
correlation between ethnocentrism and (4) politico~ 
economic conservatism, (b) chauvinism (labeled 
“pseudopatriotism”), and (c) “fascism” as meas- 
ured by the F-scale. The F-scale has been widely 
used in subsequent research, and some serious 
methodological questions have been raised about 
its application—for example, the failure to take 
into account the tendency of certain subjects to 
answer the questions in an “acquiescent” direction, 
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the fact that higher education and socioeconomic 
status are negatively correlated with the scale, and 
so on. On the whole however, the F-scale is consid- 
ered to have made a valuable contribution toward 
the understanding of the prejudiced personality. 
It has also been applied with interesting results in 
countries other than the United States (Christie & 
Cook 1958). 

The ethnocentric personality. More intensive 
analysis of prejudiced individuals, conducted as 
part of the same study, has indicated the following 
pattern of characteristics: the ethnocentric person 
is a conformist; he sees the world as menacing and 
unfriendly; he exalts his own group; he is funda- 
mentally anxious and insecure; he blames others 
for his own faults and misfortunes; he appears to 
worship his parents but has strong repressed hostil- 
ity toward them; he divides the world into the weak 
and the strong, has a well-developed concern for 
status, and is willing to obey those above him in 
the status hierarchy but demands obedience from 
those below. (Some aspects of this description sug- 
gest that the prejudiced person is pathological. This 
interpretation is justified in some cases but cer- 
tainly not in all; it is only when authoritarianism 
is extreme that the inference of psychological mal- 
adjustment becomes reasonable. ) 

A major criticism of this important study is its 
relative neglect of social and cultural factors. Re- 
search has shown that groups may be high in 
ethnocentrism if their culture includes lines of 
demarcation strictly enforced by custom or by law, 
as in South Africa or the southern United States, 
without being particularly “authoritarian” (Petti- 
grew 1958). The fact remains that personality does 
exert an important influence, even though it must 
be seen as acting in conjunction with the historical 
and sociological factors mentioned above. Prejudice 
is multidimensional (Klineberg 1964; Williams 
1964). Different individuals may develop prejudice 
for different reasons and frequently for more than 
one reason. Nor should these various possible 
causes be considered as independent: they interact, 
are interrelated, and influence one another, The 
search for a single, all-embracing origin for preju- 
dice is chimerical. [See PERSONALITY, POLITICAL, 
article on CONSERVATISM AND RADICALISM.] 


Varieties of prejudice 

It follows that prejudice is not a unitary phenom- 
enon and that it will take varying forms in differ- 
ent individuals. Socially and psychologically atti- 
tudes differ depending on whether they are the 
result of deep-seated personality characteristics, 
sometimes of a pathological nature, or of a trau- 


matic experience, or whether they simply represent 
conformity to an established social norm. No ade- 
quate typology of forms of prejudice is yet avail- 
able, and since there will always be intervening 
and transitional varieties, perhaps no such typology 
will ever be fully acceptable. 

Earlier it was indicated that three aspects of 
prejudice must be distinguished in the definition: 
the cognitive or ideational, the affective, and the 
conative or behavioral. The frequent lack of con- 
sistency among these three aspects or components 
has suggested one typology of attitudes (including 
prejudice). Katz and Stotland (1959), for exam- 
ple, distinguish between (1) affective associations, 
(2) intellectualized attitudes, and (3) action-orient- 
ed attitudes. In the case of the first two, no accurate 
prediction regarding behavior is possible; in the 
third, both the cognitive and affective elements 
may be absent. In addition there are (4) balanced 
attitudes, which show consistency among the three 
components. This is helpful as far as it goes, but 
it leaves out a number of the dimensions identified 
above as contributing to the development of preju- 
dice and consequently to the forms which it takes 
in different individuals. 


The effects of prejudice 


In his discussion of the place of the American 
Negro—or, as he prefers to say, the Negro Amer- 
ican—in contemporary society, one social psychol- 
ogist (Pettigrew 1964) speaks of the role and its 
burdens. There is much evidence that these burdens 
are varied and heavy. They are revealed in a pat- 
tern of objective life conditions that include con- 
siderable poverty and overcrowding, a shorter life 
expectancy, poorer education, inferior facilities for 
recreation, more family disorganization, and other 
related characteristics described in the Dark Ghetto 
(Clark 1965). On the subjective side, it is difficult 
to overestimate the effects on personality of belong- 
ing to a group which is generally regarded as in- 
ferior and so treated. A series of investigations has 
revealed the frequency with which Negro children 
show their preference for white over black (as, for 
example, in the dolls they choose to play with) and 
the emotional shock which may accompany an 
experience that requires them to become openly 
aware of their own skin color. Although this shock 
may become somewhat attenuated with the years, 
the damage done to the self-image and self-esteem 
of such children must be viewed as exceedingly 
serious (Clark 1955; Pettigrew 1964). It is pre- 
cisely this damage that was referred to in the 
unanimous Supreme Court decision of May 17, 
1954 as one justification for finding that enforced 


school segregation violated the rights of Negro 
children and was therefore contrary to the princi- 
ples of the American constitution. (A study in 
Sao Paulo, Brazil, indicated a moderate preference 
by Negro children for white dolls, but with none 
of the emotional reactions so frequent among chil- 
dren in the United States; see Ginsberg 1955.) 

The expression “self-hatred” has occasionally 
been applied to the reaction of Negroes, Jews, and 
other minorities who attempt in one form or an- 
other to reject their ethnic identity. In its most 
extreme form, accompanied by a dislike of every 
reminder of such identity and by hostility which 
echoes that shown by the dominant group, the 
term “self-hatred” may possibly be applicable, ex- 
cept that it is the group rather than the self that 
is hated, There is a whole range of reactions, how- 
ever, which may result from being identified with 
a group that is stigmatized as not only different but 
inferior; it is inappropriate to speak of “hatred” in 
the case of those who may more legitimately be 
described as reaching for identification with the 
larger community. The phenomenon is too complex 
to be adequately described as “self-hatred,” but the 
underlying psychological processes relating to re- 
jection undoubtedly constitute great hazards to 
normal personality development. The relative fre- 
quency of Negro aggressive crime and the occa- 
sional outbursts of group violence are not too diffi- 
cult to understand against this background of 
rejection. 

The effects of prejudice on the dominant group 
are also clear, although more indirect. If one cri- 
terion of mental health is an adequate perception 
of reality (Jahoda 1958), free from distortions due 
to needs and wishes and including sensitivity in the 
understanding of other people, then prejudice ob- 
viously interferes with mental health. In any case, 
the economic and social waste consequent upon 
prejudice has a harmful effect on the whole com- 
munity, majority and minority alike. The apparent 
gains resulting from prejudice and discussed above 
among its causes are more than offset by its real 


costs (Rose 1951). 


The reduction of prejudice ne 
The problem of the reduction of prejudice is 
part of the whole issue of attitude change and 
therefore involves the techniques of persuasion a 
propaganda, the effects of the mass media and 0! 
education, and other related phenomena. The pres- 
ent discussion will be limited to certain aspects 
which touch directly on prejudice, even though t 
must be recognized that in many cases these an 
to be seen against the background of attitu 
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change in general. [See ATTITUDES, article on ATTI- 
TUDE CHANGE; PERSUASION. | 

Effects of information. A great deal of thought 
and a substantial amount of research have been 
directed toward the question of whether informa- 
tion about minority groups contributes toward the 
reduction of prejudice. Since, as was indicated 
earlier, prejudice contains a cognitive component, 
it is reasonable to expect that improving the accu- 
racy of that component should have a salutary 
effect. On the other hand, it is argued that purely 
informational campaigns will necessarily fail, since 
the target audience will select, accept, ignore, or 
even distort the meaning of the available content 
in order to keep it consonant with preconceptions. 
This is more likely to occur when the content is 
presented through the mass media and is somewhat 
less likely in the case of a “captive audience,” al- 
though even in the latter case there will by no means 
be a passive acceptance of what is offered. Re- 
search in this area is equivocal in its findings but 
appears to justify the conclusion that information 
is indeed useful to a limited degree (Allport 1954; 
Klineberg 1950). It has been suggested that al- 
though cultural differences among ethnic groups 
should not be neglected in the information present- 
ed, particular stress should be laid on cultural simi- 
larities and on the common aspects of human 
experience. 

Scientific evidence. It has also been urged that 
special attention be given to the position, held by 
the overwhelming majority of biological and social 
scientists, that there is no acceptable proof of the 
inherent genetic inferiority of any ethnic group. 
Myrdal (1944) has called for an “educational 
offensive” to reduce the gap between the scientific 
and the popular position in this regard. There has 
in fact been a marked change in this direction in 
recent years. A trend study of public opinion in the 
United States analyzed answers to the question “In 
general, do you think that Negroes are as intelli- 
gent as white people—that is, can they learn things 
just as well if they are given the same education 
and training?” In 1942, 50 per cent of Northern 
whites answered “yes”; in 1963, the figure had 
risen to 80 per cent. In the South, 21 per cent said 
yes in 1942 and 60 per cent in 1963. This degree 
of change may well be described as revolutionary 
(Hyman & Sheatsley 1964). 

Effects of religious training. A special problem 
is posed by the relation of religious education to 
prejudice. The paradoxical situation arises that 
although in many areas the leaders in the attack 
on prejudice have a religious motivation or are 
identified with religion as an institution, in general 
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there appears to be slightly more prejudice among 
those who are “religious” than among those who 
are not (Allport 1954; Adorno et al. 1950). This 
probably indicates that religion means different 
things to different people—in certain cases a true 
involvement, in others an outward, superficial ex- 
pression. The problem for religious educators is to 
instill loyalty to one’s faith while also emphasizing 
the brotherhood of man. It is not surprising that in 
the light of this difficult task the results of such 
education have so far been equivocal as far as 
prejudice is concerned. [See RELIGION, article on 
PSYCHOLOGICAL ASPECTS.] 

Effects of contact and cooperation. Contact be- 
tween members of different ethnic communities 
has also had a positive effect under certain condi- 
tions. The earlier, rather naive expectation that if 
only people came to know members of other ethnic 
groups friendlier relations would automatically re- 
sult has given way to a more sophisticated under- 
standing of the factors involved. It seems clear, 
first, that best results are obtained when contact 
occurs between persons of equal status; no amount 
of contact between a white boss and Negro workers 
will alter the prevailing stereotype or prepare the 
way for better understanding. When status is equal, 
on the other hand, a salutary effect may usually be 
expected. This was demonstrated in a series of 
investigations of attitude change consequent upon 
integrated housing in a number of Northern com- 
munities, A second important condition is that the 
two groups work together toward the realization of 
a common goal and that they depend upon each 
other for its realization. This was the case in the 
U.S. Army during the Korean War. When the two 
ethnic groups were mixed in the same military out- 
fits, the attitudes of white soldiers and noncommis- 
sioned officers toward Negroes became much more 
favorable. 

It is important to note that in these cases the 
contact was not deliberately chosen or even neces- 
sarily welcomed; it was imposed in the one case by 
housing authorities and in the other by the military 
command. The participants were presented with 
a fait accompli, It appears that this procedure 
works successfully in other situations as well, in- 
cluding school desegration, the hiring of Negro 
sales personnel in department stores, and so forth 
(Saenger 1953). It may at first sight seem para- 
doxical that in a democratic society such tech- 
niques should work in Spite of the expression of 
contrary opinion. One could hazard a guess that 
the fait accompli is effective in those cases in which 
the attitudes are held with some degree of ambiv- 


alence, so that those involved are almost as willing 
to be pushed in the liberal as in the reactionary 
direction. The opposition to acceptance of Negroes 
in these situations is not so strong as to inhibit 
adaptation to a new status quo when that is intro- 
duced by respected authorities. 

The value of creating opportunities for two hos- 
tile groups to work together in a cooperative enter- 
prise has also been demonstrated in a series of 
important experiments ( Oklahoma, University of 
-.. 1961). An investigation in a boys’ camp showed 
that the introduction of common or “superordinate” 
goals which could be attained only through joint 
activity, in which each group depended on the 
other for success, had a significant effect in reduc- 
ing hostility. It seems highly probable that similar 
techniques would work on a wider scale and not 
only within the microcosm of the experimental sit- 
uation. What is not entirely clear is what would 
happen if the superordinate goal could not be 
reached. Would each group blame the other, with 
a consequent increase in tension? Or would the 
sympathy produced as a result of the common en- 
terprise improve relations even in the case of 
failure? Further research is required in order to 
answer these questions. 

Sociometry and psychotherapy. In the case of 
conflicts between small groups or individuals, cer- 
tain techniques used in group dynamics and soci- 
ometry have frequently been found to be effective. 
Among these, taking the role of the other—putting 
oneself in the “enemy's” place and attempting to 
see the problem from his viewpoint—represents a 
mechanism which may contribute to better under- 
standing. A number of investigations, some of them 
specifically related to prejudice, indicate the value 
of this approach (Krech et al. [1948] 1962, pp. 
259-261). 

Much has also been written about prejudice as 
a form of pathology and therefore not susceptible 
to cure except by the methods of psychotherapy 
and psychoanalysis. Such methods are probably the 
only ones that could reach the extreme bigot; but 
they can hardly be regarded as very practical, since 
bigots are unlikely to consider themselves ill and 
in need of treatment by a psychiatrist. It seems 
highly probable, however, that a system of child 
care which produces secure and mentally healthy 
individuals would reduce the frequency of bigots 
in the population, 

Institutional change. Earlier it was indicated 
that prejudice and discrimination are causally re- 
lated in circular fashion, each contributing to the 
origin and growth of the other. It would follow that 


prejudice can be reduced through an attack on dis- 
crimination and that a change in institutions would 
inevitably mean, in time, a change in attitudes. In 
the United States, at least, from this point of view 
the legal approach has constituted the major source 
of improvement in the position of the Negro, affect- 
ing his access to more adequate education, hous- 
ing, transportation, employment, recreation, and 
political rights. The frequently heard objection “you 
can’t legislate against prejudice” becomes irrelevant, 
since one can legislate against discrimination, one 
of the causes of prejudice; in the case of decisions 
by the courts, the same effect can be produced not 
by new legislation but by so interpreting existing 
laws as to safeguard human rights, Not only in the 
United States but in many other countries as well, 
the rights of minorities have been expressly pro- 
tected in national constitutions, and the full weight 
of the United Nations and of its specialized agen- 
cies, such as UNESCO, has been thrown into the 
campaign for the reduction of prejudice and dis- 
crimination, through both education and the law. 

Institutions change as the result of many factors 
not necessarily legal in character, The recent his- 
tory of the United States has been marked by a 
rapid transformation through the pressure of “sit- 
ins,” nonviolent resistance, protest meetings, a 
march on Washington, the impact of the interna- 
tional scene, and even by outbreaks of violence 
which, although they must be deplored, may still 
function as reminders of “the unfinished business 
of democracy.” Not least in its influence has been 
the development of a new climate of opinion and 
a reaffirmation of the democratic belief in the 
rights of all human beings. 


These occurrences do not mean that the problem 
of prejudice has been solved. The social lag re- 
ferred to above will mean a continuation of the 
prejudiced attitudes of many long after the law 
and the collective conscience have declared them 
obsolete. There is, however, some ground for op- 
timism as far as the United States is concerned. To 
the question “Do you think white students and 
Negro students should go to the same schools or 
to separate schools?”, a representative sample of 
American whites in 1942 included fewer than a 
third who favored school integration; in 1956 ap- 
proximately one half favored it; in 1963, three- 
fourths. In the South, the proportion was 2 per 
cent in 1942, 14 per cent in 1956, and 30 per cent 
in 1963 (Hyman & Sheatsley 1964). The trend is 
clear. There is reason to hope that it will continue 
and that it is symbolic of a slow but certain change 
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in attitude. There is hope, too, in the fact that 
“authoritarians” are conformist, If prejudice be- 
comes unfashionable, even the hard core of resist- 
ance to change may give way to progress, 


OTTO KLINEBERG 


[Directly related is the entry STEREOTYPES. Other rele- 
vant material may be found in AGGRESSION; ATTI- 
TUDES; CONCEPT FORMATION; ETHNIC GROUPS.) 
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I 
SOCIAL DISCRIMINATION 


In a neutral sense “discrimination” means simply 
“the drawing of a distinction.” The criteria on 
which a distinction is based, however, may range 
from those widely accepted in a society as valid 
and legitimate to those generally regarded as in- 
valid and inappropriate. Thus, in purely dictionary 
meanings, discrimination is “a faculty of nicely 
distinguishing; acute discernment,” and it is also 
“an unfair or injurious distinction.” A given act can 


be labeled as one or the other, not by its intrinsic 
nature, but only by referring to given standards. 

Types of discrimination. With reference to hu- 
man interaction we can think of the dictionary 
meanings of discrimination given above as falling 
along a continuum. Although no sharp lines can 
be drawn, it is useful to divide this continuum into 
three sectors on the basis of the extent to which 
norms are shared. Thus, at one end of the con- 
tinuum is the type of differential treatment that 
is based on generally accepted standards of excel- 
lence or appropriateness (e.g., a legal distinction 
between child and adult, variation in privilege 
among castes in a stable caste system, or a rule of 
seniority that gives precedence to those with longer 
records of service), The second type is discrimina- 
tion that is deemed invalid—that is, based on un- 
acceptable criteria—by many members of a society 
because it violates primary customs and laws yet 
is regarded as acceptable by significant subgroups 
and supported by secondary norms (e.g., racial 
segregation in the United States or anti-Semitism 
in the Soviet Union). It is in this sense that 
Williams used the term: “Discrimination may be 
said to exist to the degree that individuals of a 
given group who are otherwise formally qualified 
are not treated in conformity with these nominally 
universal institutionalized codes” (1947, p. 39). 
Third, there are the distinctions drawn by individ- 
uals in violation of the established standards, cus- 
toms, or laws, without subgroup support and lack- 
ing even secondary normative sanction (e.g., 
favored treatment of one student by a teacher). 

In this article we shall deal with discrimination 
only in the second sense. One might call it “social 
discrimination,” in contrast with “normative dis- 
crimination” (legal or customary application of 
standards) on the one hand and “individual dis- 
crimination” (unsupported acts of differentiation ) 
on the other. 

Why should the same word be used to cover what 
appear to be such diverse facts? Part of the expla- 
nation can be found in the study of social change. 
Because needs and values change, distinctions ac- 
cepted in law and custom at one time and place 
may become unacceptable in a different context. 
Since individuals and areas experience the change 
at different rates, what has become unacceptable 
to some remains acceptable to others. This is well 
illustrated by developments in race relations in the 
United States. In 1875 few persons believed that 
differential treatment of Negroes was discrimina- 
tory. The legal and constitutional norms were am- 
biguous, and custom largely supported segregation 
and other acts of differentiation. In 1896 some of 


the ambiguity was removed from the legal norms 
when the Supreme Court declared that segregation 
of public accommodations was legal only if facili- 
ties were equal. For nearly half a century, however, 
custom prevailed over law, and in most of the 
country segregation and inequality of treatment 
were the rule. During this period the situation was 
gradually redefined by the passage of civil rights 
laws in northern states, further court decisions de- 
manding equality, and a growing belief that dis- 
tinctions drawn on the basis of race were undesir- 
able and immoral. In terms of our classification, 
racial discrimination began to shift from the nor- 
mative to the social variety. By the 1960s further 
Supreme Court decisions, declaring even “separate 
but equal” facilities unacceptable, federal civil 
rights laws, and the growth of a substantial na- 
tional consensus opposed to the use of race as a 
criterion of choice had established a nominally uni- 
versal institutionalized norm relative to race rela- 
tions 

Although the process of change continues, racial 
discrimination in the United States has not yet 
shifted to the third type: it still has subgroup sup- 
port and justification from secondary norms; it is 
not an individual act of differentiation. Critical to 
the whole idea of social discrimination, it should 
be emphasized, is the fact that it is embedded in 
social structures and sustained by group practices 
even though it violates the dominant standards of 
the society, When the drawing of “an unfair or 
injurious distinction” has become a matter of scat- 
tered, individual acts unsupported by group mem- 
berships and standards, it loses a number of the 
important qualities of social discrimination: ran- 
dom individual acts of discrimination tend to scat- 
ter throughout the population; they do not pile up 
on particular groups, creating self-perpetuating 
justifications from the responses of those continu- 
ally discriminated against. Individual discrimina- 
tion does not split a society into two or more con- 
tending groups that defend different standards of 
morality and justice, and it is not passed along 
through normal processes of socialization. In the 
life of a society, therefore, a move along the range 
from social toward individual patterns of discrim- 
ination is of great significance. 

Social peer (hereafter simply called 
discrimination) is, then, the persistent application 
of criteria that are arbitrary, irrelevant, OF wes 
by dominant standards, with the result that some 
persons receive an undue advantage ano a 
although equally qualified, suffer an ae 
penalty, When comparing those grnonare apia 
taged with those who are disadvantaged, one can 
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speak of discrimination as “the unequal treatment 
of equals,” for example, the application of quotas 
for university admission on the basis of group 
membership rather than individual qualification or 
the determination of eligibility for a job on grounds 
of race or religion instead of competence. When 
only those who are being discriminated against are 
considered, it is also useful to reverse this phrase 
and to speak of the “equal treatment of unequals”: 
the members of disprivileged groups are treated 
alike despite variation in their competence, train- 
ing, or other personal characteristics. 

Discrimination is an analytic concept, not a 
moral term. Some manifestations of violence or 
persecution may be so established in custom that 
they are best understood as a different kind of 
phenomenon, despite the moral repugnance with 
which those who do not accept the custom may 
view the facts. Or to put the matter differently, 
discrimination acquires moral relevance when its 
consequences are measured against stated values. 
This statement does not mean, however, that there 
is no basis of choice between those who say that 
a given discriminatory practice is good and those 
who say that it is bad, The use of some criteria 
of discrimination may contradict fundamental 
values of a society and weaken its unity. For ex- 
ample, a democracy based on equality before the 
law and other universalistic norms is weakened 
by discrimination on racial, religious, or ethnic 
grounds. Thus, one can say that certain practices 
are morally incongruent with societal well-being in 
a democracy. 

The relation of discrimination to prejudice. 
Among the many questions one might raise in the 
study of discrimination, four seem of special im- 
portance: What is its relationship to prejudice? 
What criteria are used to determine those to be 
discriminated against? What forms does the dis- 
crimination take? What are its effects on the ob- 
jects of discrimination? 

Prejudice can be defined as an inner tendency 
to respond to persons on the basis of their group 
membership; it is a rigid, emotional prejudgment 
that gives an individual confidence that he knows 
all about a person when he knows his membership 
in a symbolically important group. This tendency 
may or may not express itself in discriminatory 
behavior on the part of a person, depending upon 
the role he is playing, what others around him are 
saying and doing, what he believes he will gain or 
lose by discrimination, and other factors. Each in- 
dividual has many attitudes, only some of which 
will be expressed in a given situation. Thus, his 
prejudice may be inhibited, deflected, or manifest- 
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ed, depending upon the context. In contrast, a per- 
son lacking in prejudice may nevertheless discrim- 
inate if he believes that thereby he may win an 
advantage or protect himself against loss; he may 
discriminate as an expression of his solidarity with 
the group with which he is identified at the 
moment. 

Patterns of discrimination. The criteria used 
for setting apart those against whom discrimination 
is directed vary widely, The growth of nations to 
include persons of diverse origins and the mobility 
of men in the modern world have brought individ- 
uals of many religions, races, national origins, and 
native languages into close contact. Which of the 
differences is symbolically important depends to 
a large degree on the history of the contact and 
the basic values of dominant groups. Discrimina- 
tion against Jews in many countries, most disas- 
trously in Germany (see Hilberg 1961), has been 
based in part on their status as a religious minority 
but has gained in strength by the convergence of 
several symbols. Thus, many peasants saw the 
Jews as city dwellers, ruling groups saw them as 
threatening radicals, and the middle classes saw 
them as powerful competitors. In the United States 
the economic importance of slavery and the vio- 
lence associated with its elimination were impor- 
tant factors in making race a basis for discrimina- 
tion. In India the disprivileges resting upon caste 
status are now legally and constitutionally pro- 
scribed, but some religious norms and traditions 
support continuing patterns of discrimination. In 
France racial and religious differences are rela- 
tively unimportant, but the person unskilled in the 
French language and poorly assimilated into 
French culture is seriously handicapped. In some 
situations cultural differences between groups are 
sufficiently sharp so that it is not clear whether 
one is dealing with a case of discrimination with- 
in a society or a case of cultural conflict between 
subsocieties, as in the case of the restrictions im- 
posed on the Tamils by the Singhalese of Ceylon. 

In addition to discrimination based upon reli- 
gion, race, national origin, and culture, lines of dis- 
tinction may be drawn by age (a person is “too 
young” or “too old,” despite qualifications for a 
position), by sex, by occupation, or by class (see 
International Labor Office 1962). One may even 
be discriminated against on grounds of his ances- 
tor’s occupation or class, as shown by the segrega- 
tion of the eta in Japan or by the preference given 
in many situations to a person from an “old” upper- 
class family over the nouveau riche. 

The forms of discrimination are as various as 
the criteria on which they are based. Perhaps the 


three most important types are restriction on social 
mobility (denial of opportunity), restriction on 
physical mobility (segregation), and barriers to the 
acquisition of full self-respect and intrapsychic har- 
mony (such as demands for arbitrary status def- 
erence). These often occur together, of course, and 
are mutually reinforcing, but they may be found 
separately. One sees them in the extreme form in 
South Africa, where nonwhites are barred from po- 
litical participation, educational opportunities, and 
many jobs; are residentially segregated; and are 
compelled to carry status-defining passes. Much of 
this discrimination is “legal” in South Africa, in 
the strict sense of the term; but since more than 
three-quarters of the population cannot participate 
in the definition of legality and protest its validity, 
the activities there are appropriately described as 
social discrimination rather than normative dis- 
crimination. 

Discrimination as a system of social relation- 
ships is affected by the responses of the individuals 
and groups experiencing it. In the first instance 
it “vitiates the power and knowledge of its victim” 
(Antonovsky 1960), thus creating tendencies by 
which still further discrimination is justified. Those 
who are given poor schools, poor jobs, and poor op- 
portunities for advancement may demonstrate low 
morale, low skill, and high resentment, which then 
become the “causes” of further discrimination. 
Such self-fulfilling prophecies or vicious circles 
(Maclver 1948; Myrdal 1944) help to explain the 
tenacity of many patterns of social discrimination. 

Since even the definition of discrimination is still 
problematic, efforts to scale and measure it and to 
record precisely the conditions under which various 
forms occur have been scarce. Fortunately, this 
situation is beginning to change. Careful studies of 
the economic consequences of discrimination (e.g 
Becker 1957; Krueger 1963), of the exact patterns 
of segregation (Taeuber & Taeuber 1965; Williams 
1964), and of the salience of various forms of dis- 
crimination for different groups (Killian & Grigg 
1961), as well as more formal conceptualizations 
of discrimination (Blalock 1960), are indicators of 
a growing precision in discussions of this topic. 

The international perspective. In the modern 
era discrimination has become an international 
problem. At the very time when many patterns of 
discrimination are brought under severe challenge 
within societies, these patterns are affecting Te 
lationships between societies. Achievement of a 
merely national consensus on what is fair and just 
proves to be inadequate at a time when the fates 
of nations are bound so closely together. Should 
women have equal rights in Iraq? Should all reli- 


gions be allowed free expression in Spain? Should 
the advantages of hereditary social classes in South 
America be reduced? Should Australia modify its 
all-white immigration laws? Should persons of 
Mexican descent receive equitable treatment in the 
United States? Should persons of European descent 
hold full citizenship in the newly independent Afri- 
can states? Societies are being judged on such 
questions more and more on the basis of “outside” 
criteria; there are the beginnings of an interna- 
tional set of standards in the light of which many 
practices that carry powerful (if not universal) 
support within nations have become controversial. 
Viewed from this international perspective, dis- 
crimination is on the increase in the sense that 
formerly customary practices are being defined as 
arbitrary and unfair, Stated in terms of the con- 
tinuum suggested above, many acts are being 
shifted from normative to social discrimination as 
international contacts and standards grow in im- 
portance. Perhaps the separate societies are going 
through a necessary and inevitable process out of 
which will emerge more nearly universal standards 
in a world abruptly made small. 

J. MILTON YINGER 


[See also CASTE; DISCRIMINATION, ECONOMIC; MINORI- 
TIES; RACE RELATIONS; SEGREGATION. ] 
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PRESIDENTIAL GOVERNMENT 


The term “presidential government” came into 
use in English journalism before the American 
Civil War as a shorthand expression for the gov- 
ernmental system of the former colonies, the 
United States. Walter Bagehot so used it in The 
English Constitution (1865-1867), begun while 
Lincoln was still president, to draw the contrast 
with his own term for the British system, “cabi- 
net government.” A generation later, in America, 
Woodrow Wilson assayed Bagehot’s “realism” and 
deliberately rejected Bagehot’s term for the United 
States; Wilson called his own book Congressional 
Government (1885). Time and American develop- 
ments, however, have vindicated Bagehot’s usage 
rather than Wilson’s. On both sides of the Atlantic 
the term “presidential government” is commonly 
employed today to characterize the American sys- 
tem. It is sometimes used loosely, as a generic 
term applying to all governments with elective 
chief executives styled “president.” This looser 
usage is of doubtful value analytically, for reasons 
that will be discussed below. 


Centrality of the president 

The government of the United States is “presi- 
dential” in the sense that its presidency occupies 
the vital, central place among public institutions 
at the national level. This office serves at once as 
the central source of judgment and initiative and 
as the only object of national elections in a gov- 
ernment legitimated by popular sovereignty. An 
‘American president also is the embodiment of sov- 
ereignty in external relations. He is, besides, the 
government's chief spokesman in internal relations 
with interest groups and citizens at large. A presi- 
dent of the United States both reigns and rules. 

This key position is the product partly of con- 
stitutional provisions, partly of accreted precedents 
and modern practice. The constitution of 1787 
conferred upon the president a number of positions 
of advantage in the governmental system. First 
among these is security of tenure for a four-year 
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period with removal only by congressional im- 
peachment, a cumbersome procedure which has 
only been attempted once, in 1867. Second is in- 
direct popular election through the medium of the 
electoral college. Third is command of the armed 
forces and the conduct of diplomacy, traditional 
royal prerogatives vested in the presidency by 
former British subjects. Fourth is an assortment 
of specific rights or duties bearing on the conduct 
of public administration, most important of which 
is the right to name department heads and the 
duty to “take care that the laws be faithfully exe- 
cuted.” Fifth is a limited prerogative in legislation: 
the qualified veto power and the right to recom- 
mend. 

Building on these constitutional foundations, 
personalities in office and surrounding circum- 
stances have united to enlarge the presidency, re- 
shaping and increasing its positions of advantage 
relative to other institutions in the system, The 
contemporary presidency is the product of a long 
accretion. 

In the first century under the constitution the 
practice of three presidents particularly shaped the 
character of the presidential office. Washington 
lent it something of his personal prestige and care- 
fully set precedents to mark it as the highest place 
in government. His precedents were frequently 
ignored, but the impact of his tenure was never 
wholly lost. Jackson dramatized the office’s inde- 
pendent popular connection, and his renomination 
by a national convention of state parties (with 
electors pledged to the convention’s nominee) 
proved a lasting and important innovation, There- 
after the nominating process made a president's 
election virtually direct, gave him an independent 
claim upon the nation, and provided him with an 
independent power base in politics—the national 
party. Lincoln, faced by civil war, invented the 
“war power,” juxtaposing his position as com- 
mander in chief with his duty under the “take care” 
clause and his constitutionally prescribed oath of 
office. This assertion of inherent prerogative was 
largely sustained by the courts and ever since has 
made the president something of a “constitutional 
dictator” in times of declared war. 

Since the turn of the twentieth century enlarge- 
ment of the presidency has continued and acceler- 
ated. In an age of mass communication Wilson 
and the two Roosevelts effectively asserted the 
office’s primacy as news source and as national 
spokesman. In the period when the United States 
reached a position of world power and then super- 
power these presidents effectively imparted mod- 
ern meaning to the office’s prerogatives in defense 


and diplomacy. During the great depression of the 
1930s, Franklin Roosevelt played a role in guiding 
the economy (including labor-management rela- 
tions), a function that has been associated with 
the office ever since. He also took the first decisive 
steps to institutionalize the presidency, equipping 
it with staff resources of its own, both in the White 
House proper and in the executive office of the 
president—a move associated with the report of 
his Committee on Administrative Management, the 
so-called Brownlow report (U.S. Presidents Com- 
mittee... 1937). He and his successor innovated 
further by developing initiative in legislation to 
the point where, under Truman, a comprehensive, 
detailed “program of the president” became an 
annual feature of the legislative process, setting 
the congressional agenda. Under Eisenhower a 
routine role in implementing that agenda was 
recognized and regularized, with a staff for “legis- 
lative liaison” established at the White House to 
coordinate executive-branch pressure on Congress. 

Also, in the two decades after World War 1— 
the first decades of the cold war and of nuclear 
weapons—Truman, Eisenhower, and Kennedy, in 
turn, developed modern doctrine and procedure for 
the presidential use of force abroad without resort 
to declared war or the “war power.” Eisenhower 
and Kennedy, confronting racial conflicts and 
breakdowns in law enforcement, began the devel- 
opment of modern doctrine and procedure for the 
presidential use of force at home consistent with 
a mandate long since given by the courts to pre- 
serve internal peace in the United States. 

Taken together, these positions of advantage 
make the presidency indispensable to the effective 
performance of all other governmental institutions. 
The national parties find their raison d'être in 
presidential nominations and campaigns; were 
there no presidency to unite them, federalism 
would fragment them, if indeed they existed at all. 
The national legislature depends upon the presi- 
dency for agendas and also for external pressure 
to enhance the weight of its internal party leader- 
ship. The national bureaucracy depends upon the 
White House for authoritative judgments and po- 
litical support in struggles between departments 
or with Congress. The national courts depend upon 
the White House to sustain and spur executive en- 
forcement. The Washington press corps looks to 
it for news. The national organizations of assorted 
private interests use the presidency’s aid (or Op- 
position) to magnify their voices and to spur 
their memberships. State and local governments 
depend upon the presidency for preservation, ul- 
timately, of local law and order. Apart from insti- 


tutions Americans in general look toward their 
president (“government” personified) whenever 
private troubles seem to have a public source. 


Separateness of Congress 


But the constitution, which established the foun- 
dations for this office, also established as a funda- 
mental principle that its positions of advantage 
should be matched by those of other national insti- 
tutions, notably Congress. This is the principle of 
“separated powers’—more accurately, separated 
institutions sharing powers—a principle intended 
to insure Americans against the fates that had be- 
fallen Englishmen in the colonial period: parlia- 
mentary usurpation of the crown, quasi-military 
dictatorship, and royal manipulation of Parlia- 
ment. Consciously or not, American constitution 
makers sought the balance that the English Whigs 
had precariously achieved in the reign of William 
and Mary. And even while the English were de- 
parting from that model, these constitution makers 
so successfully pursued it that enlargement of the 
presidency has produced no loss of “balance” from 
a still distinctly separated Congress. On the con- 
trary, although history has altered form and terms, 
congressional positions of advantage continue to 
confront and check a president's advantages. 

The constitution granted one or both houses of 
Congress a share in every aspect of the presidency’s 
powers. The conduct of defense was qualified by 
making force levels and funds dependent on con- 
gressional enactment and by reserving to Con- 
gress the right to declare war. The conduct of 
diplomacy was qualified by reserving to the Senate 
“advice and consent” on treaties. The conduct of 
administration was severely limited by making 
money, authority, and departmental structure sub- 
ject to specific legislation—with presidential ap- 
pointees subject to Senate confirmation. In the 
sphere of legislation Congress was paramount, 
checked only by the presidential veto, which itself 
was subject to an overriding two-thirds vote of the 
two houses. 

These substantial shares of powers were vested 
in legislators who were meant to be effectually 
separated from the president. Senators and mem- 
bers of the House of Representatives were given 
the advantage of fixed tenure, but with terms dif- 
ferent in length from the chief executive's: six 
years for the upper house, two years for the lower. 
They were advantaged further by popular election 
no less direct than his: congressmen were elected 
by citizens in state-created districts; senators, by 
state legislatures (a right since transferred to citi- 
zens in states). In practical application these slepi 


PRESIDENTIAL GOVERNMENT 453 


toral arrangements powerfully reinforce the formal 
independence of Congress and were responsible, 
above all, for the achievement of the separation of 
powers. 

It is the politics of nomination that has kept 
Congress effectually separated from the White 
House. The constitution prescribed differing elec- 
torates for senators, representatives, and presi- 
dents. But it was silent on the means by which 
the candidates were to be chosen and presented 
to electors. And it left the regulation of elections 
to the states. In consequence, this gap was filled 
by private organizations, political parties, operat- 
ing in the legal framework of each state to make 
the nominations for those offices. Senate and 
House seats as well as local posts were subject to 
election inside states, under rules set by state legis- 
lators, who themselves had to be nominated. Ac- 
cordingly, the party organizations and their nomi- 
nating processes grew up inside the states. For 
most intents and purposes these have remained 
distinctly separated by state boundaries. 

Since Jackson’s presidency nominations for the 
White House have been arranged by confederal 
conventions of state party organizations, looking 
toward simultaneous elections in all states of party- 
pledged electors, followed by party votes in the 
electoral colleges. But nominations for the Senate 
and the House were differently arranged, at dif- 
ferent times, by different men, for differing elec- 
toral purposes. Once elected by their different sets 
of voters, the candidates who had been chosen in 
these different ways were separated by much more 
than constitutional prescription. Even when they 
shared a party label, they were separated by the 
terms of their employment and survival. So it has 
remained. 

In the early 1960s the U.S. Supreme Court 
ruled unconstitutional the districting of legislative 
bodies, state and national alike, that failed to meet 


the test of equal representation for each voter in 


a state. Revision of state legislative districts and 
also of the districts for the House of Representa- 
tives may have a marked effect on nominating 
politics inside the states and may make presiden- 
tial and congressional nominations less disparate. 
If so, the separation between the presidency and 
Congress should begin to shrink. But this is spec- 


ulative. 


Modern presidential-congressional relations 

As a separated institution Congress has con- 
tinued to share powers with the presidency even 
while the latter’s own advantages were growing. 
In some respects, indeed. congressional advantages 
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have also grown. The roles originally envisaged 
for the two have changed considerably, especially 
since Franklin Roosevelt's time, but Congress still 
is able to contend in many spheres with the ag- 
grandized presidency. 

Since World War 11, Congress has been least 
advantaged in defense and in those aspects of di- 
plomacy related to the use of force. Buttressed by 
technology, a president's advantages have over- 
shadowed congressional prerogatives: the right to 
declare war has lost most of its meaning; the right 
to finance has not produced departures of much 
moment from White House proposals; the consent 
to treaties has not curbed White House agreements 
or initiatives from day to day, In other aspects of 
diplomacy, however, especially where economic aid 
comes into play, the need for funds in foreign re- 
lations has enlarged the reach of both houses of 
Congress. By a variety of legislative devices both 
houses maintain considerable control over details 
of foreign policy, and by conventions of “biparti- 
sanship” leading legislators have considerable voice 
in White House deliberations. 

In spheres of domestic policy Congress has in- 
creasingly relied on the White House to frame 
issues of the day and suggest solutions. The ini- 
tiative in legislation has passed to the presidency. 
However, legislative action remains very much a 
matter of congressional discretion, subject to presi- 
dential influence perhaps, but not under White 
House control. Except in moments of grave na- 
tional emergency akin to general war Congress has 
continuously asserted its capacity to alter, block, 
or disregard proposed bills from the president. 
White House influence is exercised, for the most 
part, through legislative leaders whose political de- 
pendence on a president is never absolute and 
rarely controlling. Even his own partisans will owe 
him almost nothing for their nominations and will 
often doubt his usefulness in their elections. His 
influence with them is made of less substantial 
stuff; popular prestige, access to publicity, party 
sentiment, personal favors, patronage, and “pork.” 
These frequently cannot suffice in controversial 
cases. 

Congress, moreover, is increasingly adept at in- 
tervening in details of public administration. While 
legislative initiative shifts toward the White House, 
administrative “oversight” shifts toward the Capi- 
tol—a curious reversal of the literary theory of 
the constitution. Congressional prerogatives to au- 


and Senate rights of supervision over departmental 


work. Since the 1930s the enormous growth of a 
national bureaucracy has spurred both houses to 
press these rights. They now are exercised unevenly 
but sometimes in great depth, especially at the 
margins of a president's concerns, where compe- 
tition from the White House offers fewest obstacles 
to congressional intervention. 

The bureaucracy, in consequence, serves more 
than one master, A product of the separation be- 
tween Congress and the presidency has been a 
separation of officialdom from both. The govern- 
ment departments are dependent upon both, sub- 
ject in part to both, hence wholly subordinate to 
neither, Least of all are they subordinate to one 
another. “Collective responsibility” is not a mean» 
ingful concept in the United States, as applied 
either to department heads or to their civil servants, 

The presidency’s ultimate advantage over Con- 
gress is the latter's lack of unity. Congressional 
advantages in legislation and administration might 
suffice to tilt the balance toward the Capitol if 
Congress, as a unit, coordinated and applied them. 
But it does not. Its powers and advantages vis-à-vis 
the White House are dispersed among standing 
committees of the two congressional bodies, the 
Senate and the House. Powers and advantages 
repose for the most part in committee work, tra- 
ditionally dominated by seniority leaders, These 
seniors become dominant by virtue of repeated 
nomination and election in home districts, not, €x- 
cept pro forma, by virtue of congressional party col- 
leagues. Seniority is the governing principle pre- 
cisely because ties of party, being home-centered, 
cannot sustain alternative arrangements. Com- 
mittee leaders are but lightly linked by the floor 
leaders of each house, who owe their places to a 
party vote. And the two houses have no common 
leadership whatever (except as party leaders in 
each house are brought together by their partys 
president). 

Congress remains “a broken mirror,” as Wilson 
once described it, and this has proved a fatal flaw 
in the assertion of “congressional government. 
Twice in American history Congress has pre 
empted the center of the governmental stage, 
dominating policy and operations to the virtual 
exclusion of the White House. This happened in 
the decades between Jefferson and Jackson 
again in the decades after the Civil War. On both 
occasions institutional disunity kept Congress from 
consolidating its pre-eminence and paved the way 
for a resurgent presidency, responsive to new cir- 
cumstances. 

The circumstances of this century have not been 
kind to legislatures anywhere. The United States 


is no exception. Preeminence has passed beyond 
the reach of Congress, Disunity now tempers its 
resistance, but unity could scarcely give it assured 
domination, The has now evolved too 
far, in circumstances that permit of no retreat. 
Even so, and even with continuing fragmentation, 
Congress still possesses enough independence and 
advantages to block executive preeminence. In this 
it stands alone among the parliaments of the in- 
dustrialized world, Congress remains what the 
constitution made it; a separated Institution shar- 
ing powers with the presidency. 

Presidential government in the United States Is 
thus to be distinguished by four features; the cen- 


ing and administrative management; the auton: 
omy of a bureaucracy that is at once responsible 
to both; the absence of a party te sufficient to 
unite both. Federalism has contributed 

to these results 09 as Sena oer 
terpreted by independent courts, which 

come another separated institution sharing legis- 
lative powers, 


Other presidential systems 


“Presidential government” as & 
invites confusion and is analytically 
for with the possible exception 
the Philippines the above features 
ingly combined in the United 
true that many Latin American 
comparable constitutional forms, 
these nations, notably Mexico, 
dered operable and viable by virtue of a 
ent feature: the preponderant political 
where, the operative feature tends to be 
bargaining, or military “guidance, 
dictatorship, or some combination 
crudely has the same effect of 
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at the end of such a road for Britain. Provided that 
the government kept a working majority in Parlia- 
ment, the substitution of prime minister for cabinet 
would center on him a role beyond the reach of an 
American president. 

Much the same can be said of other cabinet 
systems that may seem to verge on being “presi- 
dentialized"; none resembles “presidential govern- 
ment” as it is known in the United States up to the 
present time. 

RICHARD E. NEUSTADT 


[See also CONSTITUTIONS AND CONSTITUTIONALISM; 
FEDERALISM; GOVERNMENT; LEADERSHIP; PARLIA- 
MENTARY GOVERNMENT; POLITICAL EXECUTIVE; PUB- 
LIC POLICY. Other relevant material may be found in 
ADMINISTRATION; CIVIL SERVICE; COMMISSIONS, 
GOVERNMENT; CONSTITUTIONAL LAW; JUDICIAL 
PROCESS; LEGISLATION; PARTIES, POLITICAL; REPRE- 
SENTATION; WELFARE STATE; and in the biographies 
of BAGEHOT; Bryce; HAMILTON, ALEXANDER; JEF- 


FERSON; MADISON; TOCQUEVILLE; WILLOUGHBY; 
WIson.]} 
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PRICES 


For the general considerations underlying the 
determination of individual prices, see DEMAND 
AND SUPPLY and UTILITY. The first article pre- 
sented below discusses problems of price setting 
in noncompetitive markets. See also MoNoPpoLy 
and OLIGOPOLY. For price determination in com- 
petitive markets, see COMPETITION. The second 
article discusses the role of prices in a free market 
economy and government regulation of prices. 
For closely related elements of government regula- 
tion, see RESALE PRICE MAINTENANCE and REGU- 
LATION OF INDUSTRY. The third article discusses 
historical aspects of the movement of the general 
level of prices. For analytical treatment of general 
price level movement, see INFLATION AND DEFLA- 
TION; for problems of measurement, see INDEX 
NUMBERS. 
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1. PRICING POLICIES 
u. PRICE CONTROL AND RATIONING 
im. PRICE HISTORY 


I 
PRICING POLICIES 


In the strict theory of perfect competition, price 
policy has no role. Prices are assumed to be de- 
termined by an impersonal and automatic market 
mechanism that operates to adjust prices so that 
quantities demanded and supplied at the given 
price are equal. Price policy is therefore inherently 
associated with imperfect competition, even though 
in some cases the competition, though imperfect, 
is sufficiently active to impose very narrow con- 
straints on the price policies that individual firms 
can follow without disaster. 


Where only a single product is involved, with 


a single price, distinguishable pricing policies in- 
clude marginal-cost pricing, average-cost or full- 
cost pricing, uniform monopoly pricing, and stay- 
out pricing. Of these, only marginal-cost pricing 
corresponds to the full rigor of competition; it is 
usually considered to be the price that, theoretically 
at least, is most conducive to efficient allocation of 
resources. It is generally the most volatile of the 
pricing methods, resulting in extremely low prices 
at times of excess capacity, particularly in highly 
capital-intensive lines, and correspondingly high 
prices in periods of shortage. This volatility has 
made marginal-cost pricing quite unpopular, as 
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compared with average-cost or full-cost pricing. 
Average-cost, or full-cost, pricing is, in a sense, 
the modern equivalent of the medieval just price, 
in that it gives priority to the goals of stability and 
equity between buyer and seller rather than to 
economic efficiency. Monopoly pricing, at the other 
extreme, represents giving full rein to the cupidity 
of the seller. While stay-out pricing, i.e., pricing 
calculated to discourage the entry of potential 
competitors, is in essence merely a prudential 
variant of monopoly pricing and thus is in theory 
subject to similar opprobrium, it usually involves 
less flagrant exploitation of purchasers and is, in 
practice, not often referred to by this name. In- 
deed, much of what is termed “full-cost pricing,” 
but with capital charges and rents often included 
in costs at fairly generous levels, is in reality a 
form of stay-out pricing in that the firm's own 
costs are often a fairly good indicator of the costs 
that a potential competitor might face. 


Marginal-cost pricing 

Where heavy fixed costs are involved in the 
production of nonstorable goods and services, 
fluctuation of demand or supply or both gives rise 
to drastic fluctuations in marginal cost, and one 
would theoretically expect that competitive pres- 
sures or a desire to promote efficiency would pro- 
duce corresponding variations in price. Actually, 
inertia, tradition, and administrative constraints 
have combined to keep such variation to a mini- 
mum. In resort areas, seasonal variation in hotel 
rates is common, particularly where competition 
is keen, but comparable variation in rates in non- 
resort areas is rare, Although downward deviation 
from a “standard” rate in slack seasons seems gen- 
erally acceptable, upward variation in periods of 
specially heavy demand (for example, during fairs 
and festivals) is generally considered to be un- 
ethical and in some jurisdictions is unlawful. 
Nevertheless, even where the nominal rates remain 
the same, some variation occurs in the effective 
rates. Price differentials for different accommoda- 
tions may not vary commensurately with their at- 
tractiveness, so that as demand increases, late- 
comers must pay substantially higher rates for only 
slightly better accommodations, or, as in the case of 
Pullman upper berths, must be content with inferior 
accommodations at a rate differential at which 
they would ordinarily go begging. Variation of box 
office prices on short notice has been inhibited by 
excise tax regulations, and where speculators have 
stepped in to adjust demand and supply, they have 
often been subjected to legal penalties. 

Sometimes price actually varies perversely with 
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the intensity of demand, a practice rationalized in 
some instances by an appeal to the notion of a 
discount for quantity purchases and in others by 
average-cost calculations involving inappropriate 
cost allocations. Thus commutation fares used pri- 
marily for the regular rush-hour journey to work 
are usually priced far below the one-way fares paid 
by casual riders, who are much less heavily con- 
centrated in the rush-hour periods, and are often 
priced below even the special reduced excursion 
fares offered for nonrush-hour trips. Similar con- 
cessions are often offered for bridge tolls; in 
neither case can the size of the concessions be 
justified by savings in collection costs. 

Injection of “value of service” concepts, in con- 
junction with nominal uniformity and historical 
continuity, sometimes leads to bizarre and costly 
rate patterns. For example, long-distance telephone 
rates in the United States are generally based on 
airline distances, requiring a costly look-up pro- 
cedure to assess the charge for each call; a toll 
schedule based largely on direct-dialing code zones 
would be both more in line with costs and less 
expensive to apply. Again, evening rates apply ac- 
cording to uniform nominal local time at the point 
of origin. This sounds like an even-handed pro- 
cedure, but it results in a Boston caller being able 
to call a Seattle friend at the reduced rates and 
reach him at a mutually convenient evening hour, 
whereas Seattle may not call Boston at the night 
rates without having to rouse the Boston party out 
of bed. The staggering mass and complexity of 
American freight rates is notorious. 

Even where substantial differentiation between 
peak and off-peak rates occurs, as in long-distance 
telephone service, it usually falls far short of the 
degree of differentiation that would lead to ef- 
ficient utilization of the facilities. Where the 
capacity limit is sharp, with no deterioration of 
quality of service short of capacity and no pos- 
sibility of supply exceeding capacity in the short 
run, the efficiency, or marginal-cost, rule is clear: 
as long as capacity is not fully utilized, price should 
reflect only operating costs with no contribution 
to capacity costs; when capacity would be exceeded 
at this price, the price should be adjusted upward 
until demand is restrained to the capacity level. 
Under constant returns to scale, when the capacity 
charges thus collected just defray capacity costs, 
capacity is appropriate. When capacity charges de- 
termined in this way exceed capacity costs, the 
installation of additional capacity is indicated, 
while in the reverse case existing capacity is ex- 
cessive, 


Carried to its logical limit, such a price policy 


would cause sharp variations in price not only 
between the peak and off-peak periods of a re- 
curring pattern, but also between one year and 
the next. This would be particularly true where 
mistakes are made in the prediction of demand, 
or where it is necessary or economical to add to 
capacity in fairly large units. In some cases the 
results of following such a prescription literally 
might appear quite inequitable. Thus when a new 
transit facility is developed in one sector of a city, 
users of other more crowded facilities in other 
quarters may feel sufficiently discriminated against 
because the users of the new facility are (for the 
time being at least) relatively less crowded with- 
out, in addition, their being favored with lower 
fares. Again, when a large new unit such as a 
bridge is financed by tolls, these tolls are often 
imposed immediately upon completion, when traffic 
is insufficient to cause congestion, and are removed 
when the cost of the unit has been amortized and 
when congestion is sometimes beginning to build 
up. 
Indeed, it is often felt that variations in short- 
run marginal cost are so sharp and erratic as to 
have no relevance for price policy. In many cases 
the marginal cost that would be relevant to setting 
prices so as to guide decisions toward efficient 
utilization of facilities would reflect not costs im- 
posed on the supplier but costs imposed on fellow 
users, In the short run a surge of traffic on a transit 
line, a road, or a telephone system may have no 
discernible effect on the costs of the operating 
agency, but will have as its cost a deterioration of 
the quality of the services rendered through in- 
creased crowding of riders, increased delay to 
motorists, or increased probability that a call can- 
not be completed promptly because the circuits 
are all busy. If marginal cost is measured in these 
terms, the fluctuations may still be great, but they 
will not be quite so erratic as a strict allocation 
of capacity costs might indicate. Here again, an 
appropriate level of capacity will have been reached 
when the cost of additional capacity would just 
balance the value of the improvement of service 
that the added capacity would provide for the 
actual level of traffic. This cost, in turn, given 
reasonable continuity of the production function, 
will equal the charges assessable on a volume of 
traffic equal to the increment that the added capac: 
ity would make it possible to carry at the original 
level of service quality. 

However, it is seldom possible to apply this cri- 
terion directly, since in most cases prices are not 
in fact made to vary as widely as even this in- 
terpretation of marginal cost would suggest. Aside 


from popular prejudice against extreme variations 
in rates as being unfair or discriminatory, a ra- 
tional motivation can be attributed to regulated 
public utilities that would make them reluctant to 
curb peak use quite this strongly. A utility com- 
pany is often assured of a “fair” rate of return 
on its investment, provided only that a case can 
be made that this investment is needed in provid- 
ing the service, and some margin is usually allowed 
between the actual cost of capital to the utility and 
this fair rate of return. In such cases the interests 
of the management, both as representatives of the 
stockholders and in terms of their own prestige and 
position of control, will be to maximize the amount 
of investment that is justifiable. This can be done 
by setting rates so as to maximize the net revenue 
derived from off-peak service. If peak rates must 
then be set below cost in order not to exceed the 
over-all rate of return allowable, a higher peak 
demand will result and will justify a larger capital 
investment than would be justified if rates reflected 
relative marginal costs. One can expect, accord- 
ingly, that only rarely will rates vary as widely as 
costs, 


Administered and reactive prices 


Another differentiation among price policies is 
that between predetermined, or “administered,” 
prices and “reactive,” or market-determined, prices. 
Administered prices tend to be more stable and 
predictable, thus providing a basis for planning by 
both buyer and seller that may be felt to lead to 
more efficient results. If prices are set some time 
in advance, potential buyers have an opportunity 
to make plans on the basis of a known price, and 
at the same time may be protected from certain 
types of losses that might occur if they made com- 
mitments without being assured of the prices that 
would prevail when they came to carry them out. 
On the other hand, setting prices in advance may 
lead to premature commitment by parties who are 
thus protected against the effects of future de- 
velopments, and may jnhibit taking advantage of 
opportunities that develop unexpectedly. 

In principle, a distinction can be made between 
the amount by which prices fluctuate and the lead 
time between the establishment of the price and 
the date of the eventual transaction. It is possible 
to have an elaborate and sharply changing pattern 
of price variation settled upon long in advance 
or to have prices that are set at the last minute 
fluctuate relatively little. Nevertheless, the tendency 
is for the preset prices to fluctuate less than the 


reactive prices. A : 
From an efficiency standpoint, whether the price 
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should be predetermined or reactive depends to a 
considerable extent on the point in time at which 
the affected decision has to be made and on the 
information upon which it is based. A price may 
be thought of as a signal from the price setter to 
an affected decision maker that is intended to con- 
vey information and provide incentive in the direc- 
tion of efficient choice. If, when a commitment is 
to be made by the decision maker (for example, 
whether to install gas- or oil-fired heating equip- 
ment), the price setter has more relevant informa- 
tion (about the relative future scarcities of oil and 
gas), it may be conducive to better choices to have 
a firm price for the future transaction (the pur- 
chase of fuel) set at or prior to this time. In other 
cases the purchaser as decision maker may have 
more up-to-date and immediate knowledge of the 
prospects than the price-setting seller has, or 
there may be substantial costs involved in the price- 
setting seller's acquiring a corresponding degree 
of knowledge, converting this into price informa- 
tion, and transmitting this information to actual 
and potential buyers (as in the case of variations 
in demand due to weather conditions or special 
events). In such cases the determination of prices 
according to criteria agreed upon in advance but 
dependent on demand and supply conditions as 
experienced may be more conducive to efficient 
allocation of resources, Similar considerations ap- 
ply, mutatis mutandis, to the relatively rare cases 
in which the buyer is the effective price setter, 
The principal objection to ex post pricing is 
that it subjects the parties to uncertainty as to the 
price that will eventually be paid. In some cases 
this is serious enough to be a major consideration, 
For example, there is a reluctance to offer gas rates 
reflecting low current costs of incremental gas use 
for house heating where it is possible that the 
costs may rise in the future as the capacity of 
existing mains becomes fully utilized or supplies 
of natural gas become more difficult to obtain, It 
would, of course, be possible to offer the current 
low rate and to avert overinvestment in gas ap- 
pliances by adequately notifying purchasers of the 
likelihood of future price increases. However, given 
the strong promotional orientation of most cus- 
tomer relations, it might be difficult to insure that 
an unbiased appraisal of future prospects would 
be conveyed to customers. This difficulty could be 
met by means of long-term contracts covering the 
purchase of a given amount of service at a suitably 
varied sequence of rates. If the customer is even- 
tually allowed to use more or less than the stipu- 
lated amount of service, with the difference ad- 
justed on the basis of ex post rates, undesirable 
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constraints on actual use would be avoided. In ef- 
fect, the customer protects himself by a kind of 
futures contract against the income consequences 
of adverse price changes without precluding ap- 
propriate substitutions; however, the administrative 
problem of adapting such contracts to the needs 
of individual consumers in a context of utility 
regulation appears formidable. 

Actually, fuel clauses in electric power contracts 
are a mild form of reactive pricing; these serve, 
however, more to protect the utility against fuel 
cost increases than to influence consumption pat- 
terns appreciably. 

In some cases the strength of the commitment 
in advance to the use of the service at a given time 
is slight for a substantial part of the demand while 
the possibilities for effective short-notice reactive 
pricing are good. For example, it is quite easy to in- 
form telephone users when they place a call what 
the current rate is. An actual practice approaching 
this is that of having different rates for different de- 
grees of priority, the caller being able to ascertain 
from the operator the expected delay for the various 
categories. Taking an example from electric power, 
the same load signal used by Electricité de France 
to switch rates according to time of day could be 
used to vary rates on a reactive basis, simultane- 
ously encouraging the switching on and off of de- 
ferrable demands such as water heaters and re- 
frigerators, One or more “emergency” steps in a 
reactive price structure can serve as a substitute 
for arbitrary physical load shedding and can aid 
in preventing or mitigating the effects of major 
power failures. 

But whatever the analytical attractiveness of 
fluctuating and reactive prices, consumers are often 
found to have a marked preference for prede- 
termined and even flat rates. Thus, for residential 
telephone service it is found that customers often 
seem to prefer a flat monthly rate to a charge 
varying with use, even in instances where the bill 
for the amount of service actually used would be 
lower. That flat rate service saves the expense of 
recording and billing for local calls is in some 
cases an adequate justification on the cost side for 
the flat rate. However, in addition to and inde- 
pendently of this, on the demand side customers 
appear to be willing to pay extra, on the average, 
for assurance that they will not be faced with an 
unexpectedly high bill as a result of a spate of 
high calling activity. In some cases this attitude 
may arise from a budgeting situation in which a 
fixed income is heavily mortgaged for relatively 
fixed outlays, but it also appears to indicate a more 
general desire to be free from worry about the 


economic consequences of actions. If so, many 
would regard this as a legitimate preference of 
consumers to be given appropriate weight in de- 
cisions about price structure. 

In some markets the price to consumers is made 
reactive through the intervention of speculators. 
Where the supplier of a service feels constrained 
to adopt a fairly rigid price policy, whether through 
custom, excise tax regulations, or a fear that price 
concessions might be thought to indicate an in- 
ferior quality of product, as with theater seats, 
speculators often find it profitable to pre-empt the 
units most in demand and resell them at higher 
prices. To the extent that demand is thus diverted 
from those less eager to those more eager for the 
service, as measured by the price they are willing 
to pay, speculators can be said to enhance the 
value of the service rendered and, hence, to be 
productive. However, because of the disorganized 
and often clandestine nature of their operations, 
their costs may often exceed the value of the en- 
hancement, so that their net productivity becomes 
negative. The possibility that a well-coordinated 
and efficient market in reserved seats for sporting 
events, theatrical performances, long-distance air 
travel, and the like might prove highly productive 
is not to be ignored. Or such reservations might be 
sold originally on a basis that would simulate the 
action of such a market: at any given date in ad- 
vance of the performance, reservations would be 
sold at prices varying according to the relation 
between the proportion of the seats of the given 
category still unsold and the time remaining to 
the date of the performance. This is, in effect, a 
form of marginal-cost pricing, marginal cost being 
in this case the expected value as of the current 
instant of the seat to the potential purchaser even- 
tually displaced as a result of the current sale 
(adjusted, possibly, for changes in the identity of 
other “purchasers). While marginal cost in this 
sense cannot be defined independently of the price 
policy in the succeeding interval, there will in gen- 
eral exist a price policy such that the price at any 
given instant corresponds to expected margin: 
cost computed in terms of the specified price policy. 
Even though this definition is implicit rather than 
straightforward, it is still capable of being evaluated 
to an acceptable level of precision. It is, in effect, 
merely a formal specification of what would be 
expected to take place in an efficient speculators 
market. x 

Another form of speculative pricing occurs in 
the successive auctioning of highly similar items 
and in the letting of contracts by sealed tenders, 
where decisions on how to bid often depend, 


under the usual procedures, on speculative evalu- 
ations of the bidding strategy of others. The effi- 
ciency of such procedures can often be enhanced 
and the results made more nearly Pareto-optimal 
by departing from the traditional procedures. For 
example, similar items can be auctioned simulta- 
neously rather than sequentially, or it can be stipu- 
lated that the contract being bid for will be awarded 
to the lowest bidder but at a price equal to the bid 
of the second lowest bidder. This latter procedure 
sounds as though it would result in a needlessly 
high price, However, when allowance is made for 
the effect of the change in procedure on the level 
of the bids that rational bidders would make, it is 
by no means clear in which direction the price will 
be affected on the average. What is clear is that 
the improved procedures enhance the likelihood 
that the items being sold will be bought by those 
valuing them most highly and that the contract 
will be let to the bidder expecting to be able to per- 
form it at the lowest cost. 


Discriminatory pricing 

In the absence of economies of scale, marginal- 
cost pricing consistently carried out would yield 
revenues sufficient to cover total costs, at least 
where the level of investment has been appropriate. 
Most of the more difficult problems of price struc- 
ture occur, however, in a context of substantial 
economies of scale, in which case, under a strict 
application of the marginal-cost rule, revenues 
would fall short of total costs by an “intramarginal 
residue.” Earlier advocates of marginal-cost pricing 
tended to assume that the appropriate procedure 
here is to cover this intramarginal residue by sub- 
sidies financed out of general tax revenues. But 
with growing governmental budgets has come in- 
creased recognition that substantial increments of 
tax revenues cannot ordinarily be obtained without 
incurring additional administrative and compliance 
costs and interfering with the efficient allocation 
of resources in other ways. This has led to ac- 
ceptance of the “principle of the second best,” which 
states that it is desirable, as Jong as the marginal- 
cost price cannot be maintained in every market, 
to have prices that generally deviate from marginal 
cost in complex ways that depend on the com- 
plementarity and substitution interrelationships 
among the various commodities and services. 

This in turn implies acceptance of discriminatory 
pricing. In the simplest case, where the market 
demands for the several output categories are sub- 
stantially independent, a rough rule is that the 
ratio of the cost (in terms of misallocation of 
resources) of curtailing consumption by one unit 
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to the net revenue so obtained shall be the same 
in the various markets. The misallocation can be 
measured by the difference between the marginal 
cost of production of the unit and its value to the 
consumer as measured by the market price he pays. 
The net revenue is the excess of the marginal cost 
saved by producing one less unit over the marginal 
revenue given up, marginal revenue being in turn 
the price of the last unit being curtailed minus 
the gain from increasing the price of all preceding 
units. The implication of this rule is that the pro- 
portion by which marginal cost falls short of price 
is to be made inversely proportional to the elasticity 
of demand. It is to be noted, however, that while 
this sounds fine in principle, elasticity of demand 
is notoriously difficult to determine in practice. 
This principle reintroduces, in a more restricted 
form, the discrimination often stigmatized as 
“charging what the traffic will bear.” 

Where products or services are close substitutes, 
a different consideration comes into play: the de- 
sirability of keeping the price relationship between 
the two goods or services in line with the relation- 
ship of marginal costs, so that a customer making 
a choice between the two will be faced with a price 
differential that properly reflects the difference in 
marginal cost. This poses a special problem where 
the competing products or services are produced 
under sharply differing degrees of economy of 
scale, as when an all-rail route competes with a 
route making use of coastwise shipping. The all- 
rail route may well be unable to attract the share 
of the total traffic that can most economically be 
moved along it unless this route can quote espe- 
cially favorable rates that will make little or no 
contribution to covering’ the intramarginal cost 
residue, or unless rates for the water movement 
are kept artificially high by taxes or by high rates 
for the complementary rail segments of the route. 
If the water route rates are kept high, this may, 
on the other hand, unduly inhibit the movement 
of traffic by water even where an all-rail movement 
is entirely out of the running. Competition among 
the varying modes is indeed one factor that has 
contributed greatly to the complexity of freight 
tariffs. 

Where the degree of economy of scale is more 
nearly the same, as when two rail routes compete, 
the problem tends to be less one of introducing 
needed discrimination than one of avoiding dis- 
crimination arising from cutthroat competition. In 
the absence of regulation, in extreme cases rates 
between competing termini have been brought sub- 
stantially below rates between intermediate points. 
Such rate structures obviously interfere with the 
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efficient location of industry; and there is, accord- 
ingly, a tendency to try to curb this kind of price 
competition within a single mode of transportation 
while opening the door a little wider for com- 
petition between modes. One result has been the 
adoption of flat blanket rates for entire areas or 
zones, in part to simplify rates generally but also 
as a way of making it easier to recognize and in- 
hibit attempts to undercut competitors by rate con- 
cessions. The possibility of competitive improve- 
ment in quality remains, and is at times carried 
to wasteful excess; but improvement in quality 
is a slower and more difficult process, less likely 
to result in a major impairment of over-all net 
revenues. Blanket rates, however, often mask sig- 
nificant differences in marginal cost for trans- 
portation to or from different points within the 
blanketed area, or by different routes to the same 
point, and thus in their turn lead to inefficient 
routing of traffic and location of activity. 

Highway competition has been a strong factor 
tending to break down discrimination based on 
commodity classification where rate differentials 
were based on the supposed elasticity of demand 
rather than on characteristics of the commodity 
related to cost of transport, such as density or sus- 
ceptibility to damage. To avoid losing highly profit- 
able traffic to highway trucks, railroads have been 
forced to reduce premium rates formerly charged 
on high-value goods and to place greater emphasis 
on “all commodity” rates, particularly in the case 
of shipment in containers and in highway trailers 
carried on flatcars. The highway competition prob- 
lem is further complicated by the fact that trucks 
share the burden of the intramarginal residue of 
highway costs with other highway users. 

Aside from the need to stabilize competition, 
geographical uniformity of rates, even in the face 
of significant differences in cost, has great appeal 
to management on grounds of administrative sim- 
plicity and to the general public on grounds of 
equity. Indeed, such uniformity often masks more 
or less deliberate discrimination. Thus rural elec- 
tric service or postal service at uniform rates may 
reflect a more or less conscious desire to redis- 
tribute income in favor of agriculture. 

Another type of discrimination in the guise of 
uniformity arises when delivered prices are quoted 
on a uniform basis, with the seller absorbing dif- 
ferences in transportation cost, rather than on a 
“mill net” basis. In some cases this is merely a 
matter of ignoring, for the sake of convenience, 
costs that would make only a minor difference in 
the total bill. But the practice often goes further 
than this and constitutes a recognition that sales 


to more distant customers are subject to stronger 
competition from other suppliers than sales to 
nearby customers. This discrimination has been 
carried further still in “basing point” schemes, in 
which the delivered price includes freight, not 
from the shipping mill but from an established 
basing point that may approximate the location of 
a competitors mill. In international trade, the 
practice of selling to a more remote market at a 
lower net realization than is obtained from do- 
mestic customers is referred to as “dumping” and 
is often strongly resisted by the customer country 
through tariff and other measures. 

Where the use of a system supplying services 
involves some form of physical connection or semi- 
permanent relationship between customer and 
seller, additional opportunity arises for discrimina- 
tory pricing through various forms of “multipart 
tariff.” To be sure, where there are costs associated 
with the connection as such, it is often difficult 
to distinguish the discriminatory aspects of such 
schedules from the cost-related aspects. In elec- 
tricity supply, for example, there is good justifica- 
tion, on strictly marginal-cost principles, for a 
“customer charge” related to the installed capacity 
of the connection to the system and covering the 
cost of this connection plus meter-reading and 
billing costs. In addition, the user would pay en- 
ergy charges, which may vary according to time 
of day and season of the year, and charges for 
“wattless KVA” drawn by motors and other in- 
ductive devices, which may also vary by time of 
day, but not in the same way as the energy 
charge. Where time-of-day rates are deemed im- 
practical, because of high metering costs or for 
other reasons, recourse may be had to step rates, 
which charge for successive blocks of monthly en- 
ergy consumption at successively lower rates. 
Attempts have been made to justify this practice 
on the ground that customers using only energy 
in the first blocks draw a relatively large propor- 
tion of their energy in the peak periods. A more 
realistic appraisal seems to be that this represents 
mainly a discriminatory exploitation of differences 
in the elasticities of demand for the various blocks 
of power. Among domestic users, this discrimina- 
tion operates regressively in favor of the larger 
customers, who tend to have the higher incomes. 

It has been suggested, indeed, that the size of 
the blocks should be varied according to the size 
of the premises served, but this practice has not 
proved popular. Not only would it tend to correct 
the regressive impact of the more usual type of 
schedule, but it might, properly adjusted, put 
nearly all customers at a point on the schedule 


where they could obtain additional energy at mar- 
ginal cost, so that the efficiency conditions could 
be met much more closely. However, if the total 
charge varies with the size of the premises, this 
converts the rate schedule into a charge on space 
at the margin and leads to an uneconomical stint- 
ing in the use of space instead of uneconomical 
stinting in the use of power. 

On a strictly cost basis, however, there is con- 
siderable justification for a charge based on lot 
area or street frontage, representing the cost of 
carrying the conduits past the property. Efficiency 
considerations suggest that such a charge could 
properly be assessed against any occupant of land 
within the service area, regardless of whether the 
specific service is taken, on the ground that occu- 
pancy of space in an area in which the service 
is provided increases the cost of providing the 
service to those who need it. While charges of this 
type are sometimes used for water supply, they 
have not been much used for other utility services, 
although in some instances developers of new resi- 
dential areas have been required to provide or pay 
for a portion of the utility distribution plant. 

The direct effect of such charges on the alloca- 
tion of resources is likely to be somewhat inde- 
pendent of the exact method of computing the 
charge, since any variations from one method to 
another would probably be capitalized in the value 
of the affected land, Indeed, the effect of such 
charges would in the long run not be very differ- 
ent from giving the agency providing the service 
a general subsidy financed by land taxes (as dis- 
tinct from taxes on improvements ) imposed on the 
property in the service area. The crucial element 
is that the amount of the tax or charge be inde- 
pendent of improvements or changes in the nature 
of the use of the property. In some jurisdictions, 
indeed, liability for property tax is made con- 
ditional on the availability of specified municipal 
services, such as garbage collection, although in 
most cases the tax is assessed on improvements 
as well as on land, There would, in principle, be 


no need to restrict such charges to cases where 
the distribution system comprises fixed capital 
‘ould be 


plant: mail and parcel delivery service W 
cases in point. 


Even in the absence of an individual physical 


connection, or where rate blocks of uniform size 
would be too arbitrary, a form of multipart pric- 
ing is sometimes effected by a variant of the 
“requirements” contract, in which the shipper, in 
return for a more favorable set of rates, agrees to 
ship or receive not less than a given percentage 
of his total shipments via the specified carrier or 
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mode, usually rail. Such bahntreu rates differ in 
motive from the block rates for electric power, 
however, in that they usually are aimed at divert- 
ing traffic from competing modes rather than at 
increasing the total volume of traffic moved. 

Complex as pricing policies already are, modern 
technology is opening new areas for pricing as a 
means of enhancing efficiency. Until recently it 
has been considered impractical to make specific 
charges, for example, for the use of city streets, 
and from time immemorial the attractiveness of 
the city core has resulted in the growth of traffic 
until the costs of congestion have inhibited further 
increase. The obvious wastefulness of this con- 
gestion has led to recent attempts in England and 
elsewhere to devise methods of assessing charges 
on the users of the congested streets that would 
bring home to them the costs that they impose 
on others by their presence in the congested area, 
and thus would restrain use to that which is suffi- 
ciently urgent to warrant occupancy of the scarce 
space. In this way it is hoped that the economic 
productivity of the core street area will be en- 
hanced much as the enclosure of common agri- 
cultural land led to an increase in agricultural 
productivity. 

Even when prices are not actually charged, 
“shadow prices” are being increasingly used as a 
management tool in large complex organizations 
as a means of decentralizing the decision-making 
process, Such uses have been given considerable 
impetus by the fact that similar “shadow prices” 
emerge as an integral part of the process of solv- 
ing linear programming problems, These shadow 
prices have the property that when the various 
activities being analyzed are evaluated in terms of 
them, all activities carried on at positive levels in 
an optimum solution will show a zero profit, no 
activity will show a net gain, and in general, activi- 
ties that cannot be a part of any optimal solution 
will show a loss. In this and other ways, even 
socialist and centrally planned economies are find- 
ing increasingly that proper pricing policies can 
play a crucial role in the efficient organization of 


an economy. 
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LLI 
PRICE CONTROL AND RATIONING 


Freedom from personal control by government 
is deeply imbedded in the ideology of enterprise- 
market economies. This freedom as it applies to 
the individual market decisions which determine 
prices and purchases in such economies finds 
strong functional support in the economizing role 
played by free prices. In the Western democracies, 
and notably in the United States, government con- 
trol of industrial and commercial prices and gov- 
ernment rationing of goods have traditionally been 
resorted to only in periods of war or near-war. 


The economy of the United States, although char- 
acterized by considerable government activity and 
influence, is essentially an enterprise-market econ- 
omy, the logic and operation of which call upon 
prices to play a central economizing role. It is of 
the very essence of this role that prices be free to 
respond to, and continually to reflect, the myriad 
choices and decisions of individual consumers and 
producers as these are meshed together in the 
market, and, in turn, continually to guide these 
choices and decisions. 

Role of price in enterprise-market economies. 
Buyers, as they bid in free markets for the con- 
sumer-good offerings of sellers, sit in review on 
the decisions which producer-sellers have made 
as to the uses of the economy's scarce resources. 
The prices placed on consumer goods represent 
market evaluations of these goods and, hence, of 
resources put to these uses. Producer-sellers in 
their turn have bid in free markets for these re- 
sources (factors of production). The prices (costs 
of production) which they have paid in hopeful 
anticipation of later favorable judgments by con- 
sumers represent market evaluations of the re- 
sources. 

When the price which emerges in the market 
for a consumer good is below its cost of produc- 
tion as determined in intermediate and factor 
markets, it means a pecuniary loss for the pro- 
ducer-seller. More fundamentally it bespeaks the 
market judgment of consumers that the economy's 
resources have been misused—resources of given 
value have been misdirected into uses of lower 
value, On the other hand, a price above cost of 
production means a gain for the producer-seller 
and a market verdict by consumers vindicating the 
producer's resource decision—resources of given 
value have been transformed into goods of higher 
value. The gains, prospectively and as realized, ena 
courage resource decisions in line with society's 
desires as reflected in free markets; the losses deter 
and prompt corrections of decisions which mis- 
allocate resources. 

The prices worked out in the markets for the 
basic factors of production—land, labor, capital, 
and enterprise—constitute not only the basic costs 
of production but also the incomes of all who par- 
ticipate in production, the incomes which enable 
them in their capacities as consumers to bid effec- 
tively for goods to satisfy their desires. 

The constant interplay of prices, as they respond 
to and influence demands and anticipations within 
and between the markets for consumer goods, in- 
termediate goods, and basic factors of production, 
is the enterprise-market economy's mechanism for 


directing the employment of its scarce, valuable 
resources and for distributing the fruits of produc- 
tion among those who participate in production. 
It is the process by which such an economy 
economizes. It is not surprising, then, that in econ- 
omies or sectors of economies where prices tradi- 
tionally are cast in the leading economizing role, 
doubts are raised by suggestions for government 
controls designed to limit the free market interplay 
of prices or to substitute government for prices and 
the market in the performance of the economizing 
function. 

Price control in enterprise-market economies. 
Principal reliance by enterprise-market economies 
upon free markets and free prices to perform the 
economizing function has not meant exclusive re- 
liance upon these forces, even in the absence of 
war or preparation for war. The classic instance 
of generally accepted peacetime price control oc- 
curs in the case of public utilities, where, because 
of peculiar economic and physical features of 
the industry, competition cannot be relied upon 
to produce socially acceptable results. Almost uni- 
versally, where public utility services are provided 
by private companies, monopoly is established by 
public authority, and service and price are sub- 
jected to government regulation [see REGULATION 
OF INDUSTRY]. 

Elsewhere in the economy, too, in particular 
areas at particular times, dissatisfied groups have 
succeeded in having the economizing results of 
free market processes altered by the introduction 
of government controls of output or prices or both: 
agricultural price supports, petroleum prorating, 
and resale price maintenance are examples. In 
transportation a most confused mélange of regu- 
lated prices and market prices has developed. At 
the moment the future of price determination in 
the transportation industries is most uncertain [see 
AGRICULTURE, article on PRICE AND INCOME POLI- 
CIES; RESALE PRICE MAINTENANCE; TRANSPORTA- 
TION, article on ECONOMIC ASPECTS]. 

All enterprise-market economies have substan- 
tially altered the patterns of income distribution 
produced by the free pricing processes of their 
markets—by tax measures designed to take more 
from the rich than from the poor and by the free 
provision of public goods which if available only 
on the market could not be bought widely by per- 
sons of low income. Increasingly, governments 
have taken responsibility for spurring their lagging 
economies into fuller use of resources and the 
Production of increased national income—by re- 
Sort to tax, debt management, and public expendi- 
ture programs, all of which are intended to effect 
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their results through market prices. Concern over 
peacetime inflation has stimulated government 
activity designed to keep the general level of prices 
in check. This typically takes the form of broad 
fiscal and monetary (including credit) measures. 
But more recently informal government pressures, 
such as admonitions to be “responsible” addressed 
to industry and labor leaders, have been applied 
to prices and wages in particular industries. In 
these instances, control of prices and wages is 
explicitly disavowed, but, of course, government 
action here is meaningless unless specific prices 
and wages are in fact affected. 

Despite these instances of direct or indirect 
price control by government, it remains true that 
in the American peacetime economy prices are 
largely left free to work themselves out in markets 
at all levéls and so perform, with greater or lesser 
success and acceptance, their appointed economiz- 
ing role. There are, indeed, substantial misgivings 
in the minds of many about the performance of 
market prices as guides to resource use and as 
determinants of factor income—growing out of 
anterior misgivings as to the actual effectiveness of 
market competition in eliciting responsive prices. 
Action prompted by these misgivings, however, 
takes the form of government programs to restore, 
rehabilitate, and maintain competition rather than 
to operate directly on the prices themselves. 

Against this background, we may turn to a 
consideration of the very special circumstances 
under which even enterprise-market economies 
undertake widespread programs of price control 
and rationing, the nature of the programs on which 
they embark, and some of the problems which they 
encounter. 


Price control in wartime 

Price control and control of the distribution of 
consumer goods (rationing) are parts of over-all 
government programs to bring about widespread 
mobilization of economic resources in time of war 
or preparation for war. They are designed both to 
help in mobilization and to protect the economy 
from certain adverse effects of mobilization. Price 
control and rationing rarely make an appearance 
except in the company of other war-mobilization 
controls, and they can be understood and ap- 
praised realistically only in this context. 

Total, all-out war demands all-out controls, in- 
cluding the allocation of resources and materials, 
the control of specific prices, and the rationing of 
specific goods. Preparation for war (“defense”) 
calls for a program of controls appropriate to the 
scale of mobilization being undertaken, a program 
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which may or may not include full-blown alloca- 
tion, price control, and rationing. Both the dangers 
of inflation and the need for direct controls as 
instruments for mobilizing resources are less in 
defense periods than in time of war, and the 
dangers of (and potentially to) these controls are 
greater. This is particularly true if it may be antici- 
pated that the defense period will lengthen into 
an endless era of cold war. Formal direct con- 
trols, which are virtually inevitable in wartime, are 
decidedly optional as measures of long-drawn- 
out preparation for wars which may or may not 
materialize, 

Total war calls for total economic mobilization 
—a rapid, massive shift of manpower, equipment, 
and materials from the uses of peacetime to the 
purposes of war. It demands the utmost in total 
production despite the transfer of millions of men 
from fields and factories to military service, and, 
notably, it demands a cutback in the production 
of consumer goods in favor of vast new programs 
for the production of military goods and services. 
These shifts in resources cannot be made on the 
scale and at the speed demanded if reliance is had 
solely upon the ordinary processes of the market. 
Men will not leave civilian employment by the 
millions to join the armed services in response to 
considerations of economic gain. Plants will not 
be converted en masse and overnight from the pro- 
duction of consumer goods to the production of 
goods for war (with all that is entailed in new 
construction, retooling, and the disruption of pro- 
duction and sales contacts and channels) by the 
precarious directions and questionable lures of 
early wartime markets. Economic responses would 
at best be no more certain than the diffused eco- 
nomic calculations on which they were based, and 
the prosecution of wars cannot wait upon the out- 
come of speculation, titillation, and gentle persua- 
sion. Sterner and more certain measures are called 
for. Economic incentives may still be employed, 
but they must be supplemented by specially de- 
signed interventions and by commands. Govern- 
ment limitations on consumer buying and on non- 
war uses of resources, and government direction 
of the use of men, materials, and equipment must 
take the place of or supplement government bid- 
ding in the market in competition with civilian 
producers and consumers. 

A declaration of war is a declaration that the 
individual economic preferences of the citizens of 
a country are to be subordinated to what is essen- 
tially a single national economic purpose—mobi- 
lization of the economy to win the war. Tremen- 
dous reallocations are necessary, and time is of 


the essence. It is of the nature of war that com- 
mands which must be obeyed take the place of 
inducements which individuals are free to accept 
or ignore and, further, that the operation of com- 
mands must not be weakened and diffused by the 
activities of individual buyers and sellers exercis- 
ing personal preferences in the market. Markets 
simply cannot provide the unified direction, speed, 
and sweep which war demands of an economy. 
Commands will frequently blunder, and imbal- 
ances and unnecessary inconveniences will con- 
stantly occur, but the main drive of mobilization 
cannot safely be entrusted to any mechanism other 
than centralized command. 

Feasibility. A major contention of those who 
look upon wartime price control and rationing 
with disfavor born of something more than sheer 
ideology or purely personal irritation is that these 
direct controls cannot be made to work—that only 
confusion and breakdowns can result. Experience 
indicates, however, that for periods running for 
at least three or four years, under crisis conditions 
and admittedly in greater degree for some goods 
in some markets than for other goods, direct con- 
trols can, indeed, be made to work. It is clear, of 
course, that “frozen” prices cannot perform the 
allocative role prescribed for prices by enterprise- 
market economies. But allocation of materials and 
rationing are present to perform this task in time 
of war—devices specifically designed to achieve 
the special allocative ends which wars require both 
in the military and civilian sectors of the economy. 
Those responsible for wartime controls are fully 
conscious of the allocative role of prices in time 
of peace and the need, if wartime prices are to 
be fixed, to establish specific measures to provide 
for allocation. But it is fair to suggest that they 
are also concerned to establish allocative measures 
in their own right and on their own merits as nec- 
essary under wartime conditions—even if prices 
were not to be controlled. Materials allocation and 
rationing are, to be sure, supplements to price 
control; they also have a wartime raison d'être of 
their own. 

The imposition of price control upon the price 
structure of a modern industrial economy is in 
itself not as great a shock as some might antici- 
pate. Peacetime markets as we know them are 
shot through with conditions, conventions, and 
customs which make it entirely possible to impose 
government controls under the circumstances of 
war without great fear that a highly sensitive, 
marvelously articulated mechanism will, by the 
very fact of intervention, be destroyed and without 
great apprehension that the mechanism simply 
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cannot be made to function under other than its 
own internal forces and controls. To be sure, some 
markets are more amenable than others to unusual 
external controls whether of prices or distribution, 
and equal success cannot be expected of controls 
imposed in all markets under all circumstances. 
Controls need to be tailored to conditions—and in 
some instances should not be imposed. 

However, in many industries and markets, sellers 
consist of (or are led by) a few large firms. What 
we know of these markets suggests that prices are 
made more by the personal decisions of the few 
than by impersonal forces of the market. Such 
prices can be controlled by government with much 
greater ease and with much less impairment of 
any impersonal allocative function they may be 
thought to perform than would be the case if in 
peacetime they were the product of more com- 
pletely competitive processes. We have been re- 
minded by J. K. Galbraith that it “is relatively easy 
to fix prices that are already fixed” (1952, p. 17). 
In World War 1, the American economy, operat- 
ing under a network of direct controls calculated 
to mobilize, redirect, allocate, and distribute re- 
sources and goods, changed its course drastically 
from peace production to war production and con- 
sumption, and picked up speed and productive 
power in the process. 

Necessity. Inflation is a normal consequence of 
large-scale mobilization. Total national product 
and, hence, total national money income may be 
expected to reach new highs, but much of the 
product (anywhere from 40 to 60 per cent) must 
be for military rather than civilian use. Military 
demands added to the demands of newly enriched 
consumers for goods and, hence, for the means 
and materials of production can result only in 
greatly elevated prices, unless countermeasures 
are taken. Simply stated: higher than usual per- 
Sonal incomes in pursuit of a lower than usual 
Output of personal goods can be balanced in the 
market only at new, higher price levels. 

However, consumer demands can be partially 
curbed. Much personal income can and will be 
extracted from civilian pockets by increases in 
wartime taxes and by vigorously conducted sales 
of wartime government bonds to individuals and 
businesses, As an academic matter, enough pur- 
chasing power in the aggregate could be trans- 
ferred by these measures from the civilian popu- 
lation to match the transfer of resources from 
Civilian to government military uses and, hence, to 
permit the balancing of civilian demand with the 
lowered output of civilian goods with no increase 
in the general level of prices. 

In the actual event, however, this will not be 
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done and for good reason. Government borrowing 
from individuals and businesses is a fruitful 
source of wartime funds, but at best it will produce 
far less than the amounts required, and its pro- 
ceeds are uncertain. If the government is to ob- 
tain by taxation the rest of the vast sums it needs, 
it must levy its taxes where, at the moment, the 
money is located. Any such tax program, geared 
to the tremendous demands of an all-out military 
effort, would necessarily violate all canons of 
equity in taxation. It would, as well, ruin any plans 
the government might have to employ economic 
incentives as part of its mobilization program. 

Granted that governments could and should 
undertake to support wars by greater dependence 
upon taxes than they typically find to be politi- 
cally feasible, it is generally conceded that they 
are wise not to go all the way in adopting a pay- 
as-you-go program of war finance. It is even more 
certain that they will not in fact go all the way. 
Deficit financing, in greater or lesser measure, 
with its attendant threat of inflation is one of the 
realities of war [see FiscaL POLICY, article on 
WARTIME FISCAL POLICY]. 

The fact is that if inflation is to be prevented— 
even if only suppressed or contained—broad in- 
direct measures, such as taxation, public borrow- 
ing from individuals and firms, and monetary and 
credit controls, will have to be supplemented by 
direct government control of specific prices. This 
would still be true even if the so-called inflationary 
gap could be largely wiped out by taxes and bond 
sales. Panic buying and speculative buying of stra- 
tegic materials and key commodities could produce 
skyrocketing prices and profits, which could create 
bottlenecks and threaten the success of economic 
mobilization. These increases could lead in turn to 
spiraling wage demands and further price in- 
creases both in the immediately affected and, in 
other commodities. This would be certain to occur 
in industries where the ineffectiveness of compe- 
tition leaves much room for discretion in the set- 
ting of prices and wages—where increases in both 
could easily be passed on. Latent savings and 
credit could feed the fire, and no government con- 
cerned with winning a war could avoid further 
deficit financing. As Donald Wallace put the 
matter, “In full war, the question is not whether 
inflation can be stopped without use of direct 
controls but whether inflation can be checked even 
with the greatest practicable use of all controls, 
direct and indirect” (1953, p. 16). 

Supporters of wartime price control are quite 
willing to concede that its effect is to suppress 
inflation and not fully to destroy its underlying 
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causes, but they argue that suppressed inflation 
is to be preferred to open inflation, that the excess 
demand contained by price control itself serves 
a useful wartime function, and finally that sup- 
pressed inflation can be rendered “noninflationary” 
by the use of proper fiscal, credit, and control 
measures during a reasonable liquidation period 
following the war. It is pointed out that the price 
controls which suppress inflation operate on par- 
ticular prices so that price-wage spirals will not 
be touched off and the war economy will thus be 
immunized from a particularly virulent and de- 
structive form of inflation. Excess demand, stored 
up behind the controls guarding the prices of con- 
sumer goods, far from indicating weakness or lack 
of sophistication in the application of fiscal and 
price controls, should be regarded, if properly ad- 
ministered, as a most useful tool for the develop- 
ment of maximum economic—military potential. 
It is “a practical way of adapting modern capi- 
talism [by facilitating the absorption and employ- 
ment of great increases in the labor force and in 
the use of labor]—a capitalism characterized by 
oligopoly in product markets and strong unions in 
factor markets—to the wartime imperative that all 
possible resources be employed and if possible 
under approximately stable conditions of prices 
and costs” (Galbraith 1952, p, 34). Restraint in 
the removal of price controls following the cessa- 
tion of hostilities, and the employment of budget 
surpluses coupled with restrictions on consumer 
borrowing and new investment, could thin out and 
gradually reduce inflationary pressures to the point 
of elimination. 

Rationing. Rationing of consumer goods is the 
natural ally of price control. It will certainly be 
called up for wartime service. All-out war is bound 
to bring at least limited rationing, and if price 
controls are widely extended, rationing, too, will 
spread. In peacetime enterprise-market economies, 
rationing (the distribution of goods) is a func- 
tion of price; in war economies price control and 
rationing provide each other with much-needed 
support. The case for both rests on the limits that 
war places on the aggregate supply “of consumer 
goods, while aggregate consumer purchasing power 
continues to rise, as well as on the fact of particu- 
lar shortages in particular markets. Rationing can 
be made to check consumer demand and thus to 
eliminate pressure on prices. It can contribute to 
the equitable distribution of essential goods in 
short supply. (Note that under war conditions our 
notions of equity undergo something of a change.) 
Rationing can direct particular scarce goods where 
they are most needed for war purposes, and it is 
a natural complement to and support for govern- 


ment orders that limit consumer use of scarce ma- 
terials. Experience in World War 1 demonstrated 
the need for rationing for all of these purposes in 
the case of such commodities as rubber, petroleum, 
metals, sugar, and meat and other foods (see 
Wallace 1953, pp. 215 ff.). Rationing was particu- 
larly necessary as a support to price control in 
those highly competitive markets where price con- 
trol was subjected to great pressures. 


Policy and administration 


The introduction and operation of price control 
and rationing involve important policy decisions 
as to timing, coverage, and type of control and as 
to administrative machinery and personnel. 

Policy. A price or prices may be “frozen” at the 
level of a given base date or during a given base 
period. This is the most dramatic type of price 
action, and it may be employed for this very reason 
in the presence of an immediately threatened in- 
crease or spiral of prices. It presents many diffi- 
culties, both of compliance and enforcement, most 
of which grow out of lack of knowledge by sellers 
and buyers of actual base prices. It always em- 
bodies inequities that must be corrected over time, 
and it leaves open the problem of new commod- 
ities and new types or styles. Legal maximum 
prices may be calculated by formulas, a useful 
procedure in the case of special commodities and 
a necessary procedure where new goods or styles 
are involved. Again, however, buyers are not in 
possession of the information necessary for really 
effective enforcement. Margins or markups fall 
into this category, and these may or may not be 
effective in holding prices, depending upon the 
character of the calculation of the cost base to 
which they are applied. For compliance and en- 
forcement, no other type of ceiling can equal pub- 
lished or publicly displayed dollar-and-cents prices, 
and if price control programs are long continued, 
such lists are bound to become more prevalent. 

Selection of the goods to which price control is 
to be applied presents a difficult problem. Certain 
goods are natural candidates for control because 
of spot-price situations (speculative or panic buy- 
ing), because price control is needed as an aid 
to production control, or because the goods (cost- 
of-living or “wage” items) are likely to contribute 
to price-wage spirals, As suggested above, it is 
easier to control the prices of standard goods pro- 
duced and sold under conditions of oligopoly, where 
sellers are few and peacetime prices are “admin- 
istered,” than the prices of goods which are sub- 
ject to frequent changes in type and style or which 
are produced and sold by a multitude of small 
firms, which typically “follow the market.” The 


case for control of the prices of luxury (“non- 
wage”) goods rests not so much on concern for 
their users or because of their direct effect on in- 
flation as on the likelihood that high prices, prof- 
its, and wages in these industries will attract re- 
sources from, and instigate inflationary demands 
in, other industries. 

Considerations relevant to control in particular 
instances are frequently in direct conflict. The need 
for control of food prices is great, but so are the 
difficulties of control. One consideration that is 
likely to impress price controllers as the control 
program continues is that all prices are interre- 
lated. While this fact does not dictate an all-or- 
nothing coverage, it points up the need to be on 
the lookout for unanticipated effects and is a 
powerful force moving for an extension of controls, 
once instituted. Woe to the controllers, for example, 
who fail to keep the price of hogs in mind when 
setting the price of corn. 

It is necessary to establish price standards if 
control is to involve more than a freezing of past 
prices, and even frozen prices have to be adjusted 
to reflect individual circumstances and unique con- 
ditions. Prices must be set de novo for new goods 
and goods offered by new sellers. And prices once 
fixed may need to be changed because of considera- 
tion of equity, expediency, or changed conditions. 
It is fashionable to contend that it is prices not 
profits that are being controlled; but effect on 
profits cannot be avoided, and there is much to be 
said for tying price control directly and openly to, 
say, percentage return on net worth. It is inevitable 
as the program grows that particular prices be 
tailored to particular, identified purposes—the re- 
moval of disparities and inequities, the bolstering 
of physical controls, the meeting of shortages. In 
some instances involving strategic materials, care- 
fully tailored and differential prices may have to 
be supplemented by subsidies to insure needed pro- 
duction, The function and effects of prices are 
nowhere more clearly apparent than in a program 
of price control once gotten under way. 

Rationing. Rationing, too, calls for decisions 
on types and coverage. The intricacies and the 
sheer physical task of rationing are, if anything, 
greater than in the case of price control. A strong 
case can be made on paper for the wide extension 
of rationing. The realities of the task, however, 
counsel the restriction of rationing to those in- 
stances where great shortages of very important 
commodities are involved or where the greatest 
importance attaches to the placing of particular 
goods in particular hands. Certificate rationing 
Provides a ticket for each allotment, awarded (as 
in the case of tires in World War 11) to persons 
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on the basis of their specifically determined need 
for the commodity in relation to the war effort. 
Coupon rationing entitles the coupon holder to buy 
a stated quantity of a given good (for example, 
sugar) at any time during a given period. Point 
rationing covers items which are more or less sub- 
stitutes for each other (for example, different 
foods), and it entitles the holder of point coupons 
to distribute his limited purchases as he pleases 
over a range of goods. This device makes it pos- 
sible to change point values as supplies and de- 
mands change and hence to adjust distribution to 
changing conditions. 

Consumers are bound to be disappointed at the 
limited amounts they are permitted to buy legally, 
and the temptation is great for rationing officials 
to overextend their issuance of certificates, cou- 
pons, or points. Nothing, of course, could be more 
damaging to a rationing program, and few things 
could be more directly damaging to civilian morale 
and to the war effort. Coupons as a condition of 
buying are irritating enough; coupons “without 
honor” compound the irritation. 

Direct control of specific prices is, quite under- 
standably, not popular with sellers, both because 
of its effect on revenue and the irritations which 
are produced inevitably by its imposition and ad- 
ministration. Rationing of consumer goods is, if 
anything, even more unpopular with sellers and 
consumers. The latter, almost universally, see its 
restrictive rather than its “fair share” aspect; it 
appears to them to be rationing rather than the 
war that is responsible for their inability to buy 
all of any good they are willing to pay for. 

In time, as these controls become more wide- 
spread and come to be associated in the public 
mind with all of the inconveniences of war, the 
unpopularity fans out generally over the popula- 
tion and deepens in intensity. There can be no 
doubt that it is easier and more comfortable not 
to be controlled than to the controlled, just as it 
is easier and simpler not to control than to control. 
Paradoxically, if the controls prove to be reasonably 
successful in containing inflation, with the result 
that the dangers of inflation fade from public con- 
sciousness, the need for controls seems less urgent. 
Their very efficacy undermines public support for 
their continuance. 

Administration. The task of setting up and 
operating the machinery required to control most 
prices at all levels and to ration a considerable 
array of products is staggering. Since these are not 
regular, continuing functions, the staff must be 
specially recruited, and the members are bound to 
be amateurs—learning, it is hoped, as they per- 
form. Some members can be drawn from industry 
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and business, and there is a widespread misconcep- 
tion that because industrial and business prices are 
involved, industrial and business personnel are 
peculiarly suited to the task. Quite the contrary 
is true. Industrial and business knowledge and 
advice are essential, but price fixing in the context 
of government control is quite different in con- 
ception, objective, and practice from the making 
of prices within a business firm. Businessmen, 
however conscientious, have much to unlearn be- 
fore they can qualify as government price con- 
trollers. University economists come to the task 
better prepared than most, but they suffer from 
the allegation, sometimes justified, that they are 
unrealistic crusaders. The fact is that there is no 
professional career in an enterprise-market econ- 
omy for the price controller and rationer, and, 
hence, there is no standby staff. 

Other administrative problems, too, are almost 
overwhelming: control by Washington versus con- 
trol by the field offices; simple regulations in the 
interest of understanding and compliance versus 
detailed regulations in the interest of precise tailor- 
ing to fit individual situations; toughness and 
rigidity versus sympathetic understanding and flex- 
ibility, These and a myriad other issues are always 
present—and, in actual cases that make a dif- 
ference, not just in a broad philosophical setting. 
And congressmen, whose constituents “don’t like 
it,” are omnipresent and insistent. In due course, 
controllers as well as those controlled will have 
had their fill. 

Increasingly it is borne in on everyone that 
there is in fact a basic difference between being 
controlled, commanded, and even abused by the 
cold, heartless, impersonal forces of the market 
and by the personal orders (however understand- 
ing and “correct”) of government officials and that 
at this stage in history democratic peoples seem 
to prefer unidentified systems to identified officials 
as economic arbiters. As price control and rationing 
proceed, the scales on which the public in democ- 
racies weighs the disadvantages of inflation against 
the irritations of controls are almost certain even- 
tually to swing in favor of “kick out the controls.” 
But it is equally certain that at another time and 
with the advent of another national emergency 
requiring sweeping mobilization of the economy 
and threatening serious inflation, direct controls 
will be eagerly and confidently adopted again. 


BEN W. Lewis 
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m 
PRICE HISTORY 


Although movements of commodity prices can 
be measured in earlier periods than can any other 
phenomenon of general interest to social scientists, 
we have no satisfactory records of prices in antiq- 
uity. Sporadic quotations for a few commodities, 
such as grain, metals, and livestock, in different 
Places give rough ideas, which are valuable for 
some purposes, of the relative prices of a few arti- 
cles and of some violent changes in the price level. 
(Examples of such violent changes are those which 
the treasure obtained in Alexander’s conquests in- 
duced in Greece in the fourth century B.C. and 
which the debasement of the coinage caused in 
Rome in the third and fourth centuries.) But there 
are no series of commodity prices that will permit 
us to measure changes in their average for any 
important locality or period in ancient times. 

Medieval price and wage movements. Although 
More numerous and reliable than those for antiq- 
uity, the medieval price data now available lack 
the regularity, continuity, homogeneity, and diver- 
sity required for the satisfactory measurement of 
the price level before about 1350. The next oldest 
Continuous quantitative economic data are money 
wages, or the price of labor. Despite their obvious 
Importance, wage data have been neglected by some 
Price historians and used with less skill than com- 
Modity price series by many others. Nevertheless, 
we can trace the movements of money wages in 
Many leading economic areas since about 1500, 
and we can gain a fairly accurate impression of 
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major changes in money wages from about the time 
of the Black Death (1348-1349), The wage data 
for earlier periods are generally fragmentary and 
sometimes questionable. But when proper use is 
made of the incomparably rich and virtually un- 
touched account books in Italian archives, particu- 
larly the Archivio di Stato in Florence and other 
depositories in Tuscany, our knowledge of prices 
and wages in the Middle Ages will be vastly en- 
riched and extended. 

The preservation of manuscripts that provide 
the raw material for historical price and wage sta- 
tistics in one country after another—differing 
widely in institutions, economic development, and 
culture—can hardly have been accidental. What 
was preserved was, and has continued to be, impor- 
tant. Good as they are, however, in comparison with 
other historical statistics, even the best historical 
price and wage series are never good enough to 
satisfy a scholar using them in studying any major 
problem. Furthermore, the usefulness of the most 
adequate and reliable price and wage data is usu- 
ally limited by the lack of other statistical and non- 
statistical material that alone can reveal their true 
meaning. 

Index numbers for 50-year periods compiled by 
Sir William Beveridge (1939) from fragmentary 
quotations for six commodities show that English 
prices were about a third higher in 1250-1299 
than in 1200-1249 and about 15 per cent higher 
in 1300-1349 than in 1250-1299. Although he 
believed that the upward movement of prices was 
violent, Beveridge was commendably skeptical con- 
cerning his results. If the scattered prices thrown 
together by Vicomte d’Avenel (1894-1926) from 
different areas of France are reliable, the price level 
was about 14 per cent higher in 1291-1300 and 
about a third higher in 1301-1350 than in 1201- 
1225. Discontinuous quotations (not worth pub- 
lishing) that I have compiled for a few staple com- 
modities for Aragon and Navarre in 1201-1275 
show an upward trend in line with the trend in 
France. Prices for 12 articles I gathered for Na- 
varre suggest that the price level rose about 25 per 
cent from 1284-1289 to 1309-1310 but moved 
horizontally in the next quarter century. Prices for 
14 articles I compiled for Aragon indicate that the 
price level was about 38 per cent lower in 1309- 
1310 than in 1276-1277, when money was bad and 
harvests worse. The index for 1350 was about 6 per 
cent lower than in 1300. But the underlying data 
are so scant and questionable that all these esti- 
mates of movements of the price level should be 
regarded as “prehistorical.” 

Index numbers of prices in Navarre for 1351— 
1380, based on extraordinarily extensive and com- 
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plete series for such an early period, doubled in 
terms of money of account and rose 30 per cent in 
terms of fine gold. The quinquennial average of 
gold prices collapsed in 1381-1385 and had dropped 
7 per cent more by 1441-1445, when the series 
end. Prices in money of account rose 50 per cent 
from 1381-1385 to 1431-1435 and fell 7 per cent 
in the next ten years. Inadequate data for Aragon 
suggest that prices in money of account doubled 
in 1351-1380, as they did in Navarre, and declined 
17 per cent from 1381-1385 to 1396-1400, The 
price indexes in money of account averaged about 
10 per cent lower in 1426-1450 than in 1401- 
1425. The average rose about 3 per cent in 1451— 
1475; but although the quinquennial indexes 
moved horizontally in 1476-1500, the average for 
the entire period was about 13 per cent lower than 
in 1451-1475, The trend of prices in the fifteenth 
century was downward also in Valencia. Appar- 
ently prices in silver averaged about 14 per cent 
higher in France in the third quarter of the four- 
teenth century than in the second. Prices fell about 
one-fourth from 1376-1400 to 1401—1425, and the 
average for the next quarter century declined about 
9 per cent (see Hamilton 1936), In the second half 
of the fifteenth century, French prices averaged 
about 24 per cent less than in the first half. The 
trend of prices in France in the fifteenth century 
did not differ greatly from the trends in Aragon 
and Valencia, and the decline seems to have been 
somewhat greater in Alsace. In the third quarter 
of the fourteenth century, wheat prices in England 
—a poor gauge of movements of the price level, 
but the best we have—rose much more than prices 
in France but a great deal less than prices in Ara- 
gon or Navarre. During the next hundred years the 
trend of English wheat prices seems not to have 
deviated greatly from the price trends in Aragon, 
France, and Valencia. The long decline was appar- 
ently followed by a horizontal movement in the last 
quarter of the fifteenth century. But, as in Aragon, 
there is no clear evidence of a rising trend before 
1500, 

Wage rates have been published for Navarre for 
10 grades of labor in the building trades in 1346- 
1400 and 11 grades in 1401-1450. Wage indexes 
in Navarre, where the Black Death was extremely 
severe and was followed by secondary epidemics, 
soared nearly 60 per cent in 1349 and continued to 
rise irregularly, but sharply, for three decades. 
From 1346-1350 to 1376-1380 the quinquennial 
average of wage indexes almost trebled. By the end 
of the century they had almost quintupled, and by 
the middle of the fifteenth century, when the series 
end, they had sextupled. If we divide money wage 


rates by food prices to obtain a crude index of real 
wages, we find that the quinquennial average dou- 
bled in the half century from 1351-1355 to 1401- 
1405. In the next two decades real wages fell about 
one-fifth, but the net increase in 1351-1445 was 
more than two-thirds. Crude index numbers of real 
wages for Aragon suggest that real wages were 
about one-sixth higher in the last quarter of the 
fifteenth century than in the first quarter. 

For Valencia we have a remarkable series of 
salaries of 22 classes of workers in 1351-1450 and 
25 classes in 1451-1500, together with the wages 
of 11 grades of labor in the building trades in 
1392-1450 and 12 grades in 1451-1500. Salaries 
rose about 25 per cent from 1355 to 1370 and 
remained stable for 20 years. They dropped 7 per 
cent in 1399, but rose 20 per cent by 1418, and 
remained practically unchanged for 70 years. Sal- 
aries then dropped 8 per cent in 1489 and held this 
level through 1500, Wage rates fluctuated more 
than salaries, but their trends were rather similar. 

Apparently the wages of most types of labor in 
England rose sharply in the first two decades or 
more after the Black Death, in spite of vigorous 
government opposition to wage increases. The rates 
for the lowest grades of workers may have risen as 
much as in Aragon, and possibly even as much as 
in Navarre. Since food prices did not rise substan- 
tially, real wages did. Thorold Rogers tells us that 
although some of the gain was lost, wages never fell 
to the old rate and that they were stable at a high 
level throughout the fifteenth century, when prices 
were falling. He regarded “the fifteenth century 
and the first quarter of the sixteenth” as the “golden 
age of the English labourer, if we interpret the 
wages [which] he earned by the cost of the neces- 
saries of life” ([1884] 1890, pp. 326, 233 ff.). The 
fragmentary data available suggest that wages also 
rose substantially in France after the Black Death. 

It is often taken for granted that medieval kings 
robbed their subjects, disturbed economic life, and 
impeded progress through frequent and unprinci- 
pled debasement of the coinage and alteration of 
its value in money of account. Yet the long-term 
movement of commodity prices was much more 
stable in the areas for which we have reliable data 
in the last two centuries of the Middle Ages than 
in any country of the Western world during the 
nineteenth and twentieth centuries. If medieval 
rulers had not sought to relieve distress and replen- 
ish royal treasuries during fifteenth-century depres- 
sions by debasing the coinage or marking it up in 
money of account, prices would have fallen still 
more. This might have lowered profit margins, de- 
terred savings, and limited progress. For example, in 


the second half of the fifteenth century Valencian 
prices fell only about 6 per cent in money of ac- 
count and a little more than 20 per cent in gold. 
In the same period prices in Aragon fell less than 
10 per cent in money of account and more than 
35 per cent in gold (see Hamilton 1936). 

It would be extremely interesting to know 
whether the alternation of debasement and markup 
of the coinage in periods of falling prices and of 
restoration and markdown in periods of rising 
prices, which tended to stabilize price trends, did 
not also intensify short-term instability by causing 
changes in velocity opposite to the announced or 
expected changes in the money-of-account rating or 
the specie content of the coinage. For example, in 
Spain not only did merchants overstock goods, but 
women as unbusinesslike as nuns overbought com- 
modities as perishable as eggs in an effort to avoid 
losses resulting from royal markups of coins (see 
Hamilton 1934; 1947). 

The Black Death and succeeding epidemics in 
the next three decades had substantially reduced 
the population of western Europe and, conse- 
quently, the total production of goods when the 
revival of gold coinage in England, Flanders, Ger- 
many, Castile, and Aragon significantly increased 
the money supply. Hence the price level moved 
briskly upward. In the last two decades of the four- 
teenth century and throughout the fifteenth a con- 
siderable growth in population, total production, 
and the proportion of goods exchanged for money 
lowered commodity prices. The steady loss of specie 
to the Far East in exchange for spices and luxury 
products also exerted downward pressure on the 
price level. 

Price and wage history after 1500. Although de- 
fective in many respects, price and wage series 
after 1500 are far more satisfactory than previous 
series. The data supplied by the archives of Spain 
are outstanding in quantity and quality. Early in 
the sixteenth century the trend of prices turned 
upward in Spain, first and most rapidly in Anda- 
lusia, and rose for a hundred years, with practically 
all troughs and peaks above the preceding ones. 
Andalusian prices more than doubled in the first 
half of the sixteenth century and more than quin- 
tupled by its close. Prices increased fourfold in 
New Castile, the region closest to Andalusia, and 
34 fold in Old Castile-Leon and in Valencia. On 
the average, in the four regions prices in silver, 
then the same as money of account, quadrupled in 
the sixteenth century. Despite inflation of the vel- 
lon coinage (at first copper with a little silver, and 
then pure copper), which became the money of 
account after about 1603, the trend of prices in 
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Andalusia, New Castile, and Old Castile-—Leon was 
rather horizontal in 1601-1625; Valencian prices 
in silver declined slightly. Although there were vio- 
lent increases and declines in vellon prices for the 
remainder of the century, the trend of silver prices 
was moderately downward. 

From statistics collected by Thorold Rogers for 
England and d’Avenel for France, Wiebe (1895) 
constructed decennial index numbers of prices and 
wages for England and 25-year indexes for France 
during the sixteenth and seventeenth centuries. 
These indexes indicate the direction and approxi- 
mate magnitude of long-term movements, but are 
particularly weak in timing changes. Neither Henri 
Hauser, the latest historian of French prices 
(1936), nor William Beveridge, the latest and most 
distinguished historian of English prices (1939), 
has supplied us with index numbers. Hauser’s data 
are not extensive or complete enough to measure 
the price level, and trial index numbers based on 
Beveridge’s data for 1500-1700 have not shown 
trends differing significantly from those derived 
from Rogers. 

The decennial price indexes for England were 
about 6 per cent higher in 1511-1520 than in 
1501-1510, more than a third higher by 1551— 
1560, and about 150 per cent higher in 1593-1602. 
According to d’Avenel, French prices began rising 
during the reign of Louis xm, 1498-1512. The 
French indexes in silver for each quarter of the 
sixteenth century were higher than for the pre- 
ceding one, and the index for the last quarter was 
150 per cent higher than for the first. Hence, it 
seems that silver prices rose more in France than 
in England during the sixteenth century. The 
decennial average of English prices continued up- 
ward in the first half of the seventeenth century, 
reaching a level in 1643-1652 about 3} times as 
high as in 1501-1510, The trend of English prices 
was fairly flat in the second half of the seventeenth 
century, with the lowest average in 1653-1662 and 
the highest in 1673-1682. 

If we combine the four Spanish regions for 
which we have data and make 25-year indexes for 
comparison with France, we find that in each 
quarter of the sixteenth century the advance over 
the preceding one was greater in Spain. Comparing 
decennial indexes for Spain with those for England 
shows that while English prices were rising 150 
per cent from 1501-1510 to 1593-1602, Spanish 
prices were rising more than 200 per cent. By 
1601-1610, when decennial silver prices for Spain 
reached their apogee during the Price Revolution, 
they were 3.4 times higher than a hundred years 
before. English prices reached their zenith during 
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the Price Revolution in 1643-1652, when they 
were 3.5 times the 1501-1510 level. Spanish prices 
rose a great deal more than English prices in the 
sixteenth century, and the Spanish peak came 
forty years earlier (see Hamilton 1934). 

While prices in England were rising 150 per 
cent in the sixteenth century, wages were appar- 
ently rising only 30 per cent. While prices in 
France were rising 150 per cent from the first to 
the last quarter of the sixteenth century, wages 
were rising only 25 per cent. In 1676-1700 French 
prices were twice as high as in 1521-1525, and 
wages were only a third higher. During the seven- 
teenth century the price-wage ratio had become 
slightly less beneficial to employers and less harm- 
ful to workers, but the change was probably too 
small for any worker or employer to notice during 
his lifetime. By the end of the seventeenth century 
English wages were 150 per cent higher than in 
1501-1510, while prices were 250 per cent higher. 
The gap between prices and wages remained wide, 
although it was far smaller than in 1600. Appar- 
ently wages in Alsace lagged farther behind prices 
during the Price Revolution than in England but 
less than in France. Wage indexes for Lvov, 
Poland, on a 1521-1525 base stood lower than 
price indexes in every decade from 1525 to 1700, 
but the gap seems to have been great only in 1551- 
1560 and in 1651-1700. Charts of raw data pre- 
sented by Elsas (1936-1949) indicate that wages 
of two grades of labor lagged behind prices of rye 
at Augsburg, Munich, and Würzburg from about 
1540 to 1630. 

The chief cause of the Price Revolution of the 
sixteenth and seventeenth centuries was the great 
increase in the money supply. Higher output of 
silver in Europe, particularly in Germany, begin- 
ning in the last quarter of the fifteenth century, 
arrested the downward trend of prices and, along 
with imports of gold from the Antilles, caused 
prices to turn upward early in the sixteenth cen- 
tury. But it was the explosive rise in silver produc- 
tion after the conquests of Mexico and Peru; the 
discovery of the fabulous mines of Zacatecas. 
Guanajuato, and Potosí; and the introduction of the 
mercury amalgamation process of mining in the 
middle of the sixteenth century that generated 
the Price Revolution. The sharp rise in prices began 
earlier and was more violent in Andalusia, where 
all the bullion legally imported into Europe from 
the Spanish colonies landed, than in any other area 
in Europe for which we have satisfactory data. It 
was in New Castile, the region closest to Andalusia, 
that the Price Revolution began next and was sec- 


ond in magnitude. In fact, it was higher prices in 
Spain than elsewhere and, to a much lesser extent, 
royal expenditures for war, diplomacy, and the ad- 
ministration of the European empire, that distrib- 
uted the precious metals among other countries, 
Of course, some bullion leaked into other kingdoms 
directly from the New World. The lag of French 
and English prices behind Spanish prices and the 
similar ultimate rise in each country indicates that 
specie from the New World was the leading factor 
in the price upheaval in all three countries (see 
Hamilton 1934). 

We know approximately how much specie came 
into Europe legally from the Hispanic colonies in 
1503-1660, while the Price Revolution was in prog- 
ress; but we do not know how much was smuggled 
into Spain and other kingdoms or what the total 
money supply was in about 1500, But there are 
strong reasons to believe that the increase in the 
money supply during the Price Revolution was 
much greater percentagewise than the rise in 
prices. Rather than seeking ancillary causes of the 
Price Revolution, as most writers (beginning with 
Jean Bodin and including the present one) have 
done, one needs to explain the failure of prices to 
keep pace with the increasing stock of precious 
metals. Some of the new bullion was neutralized 
by the increased use of gold and silver for non- 
monetary purposes as they became relatively 
cheaper through rising commodity prices. After the 
discovery of the Cape route to the East, the age-old 
flow of bullion from the West to the East was 
intensified. Once in the East, much of the bullion 
was hoarded or used ornamentally, so that the con- 
tinuous inflow of specie did not raise Eastern prices 
enough to reverse the flow. But these outlets did 
not prevent unprecedented additions to the Euro- 
pean monetary stock. A major counterpoise to the 
rise in the money supply was the increased flow of 
goods onto markets, resulting from great techno- 
logical advances and a large increase in popula- 
tion. Conversion of produce rents into money pay- 
ments, a shift from wages partially in kind to 
money wages, and a decline in barter also tended 
to counteract the effect on prices of the increase 
in gold and silver money. And with real national 
incomes and wealth increasing, the velocity of cir- 
culation of money presumably declined. 

During the first third of the eighteenth century, 
the trend of prices in most countries was moder- 
ately downward, and wages moved horizontally. A 
marked increase in the output of silver in Mexico 
and (though much less significant) of gold in 
Brazil arrested the secular decline of prices about 


1735. These factors, with assistance from wars and 
from deposit and note creations by banks, forced 
prices upward about 35 per cent in London—ac- 
cording to indexes I have computed from data in 
Beveridge (1939)—and 65 per cent in New Castile 
and Holland in 1750-1790. In all three areas 
prices approximately doubled from 1735 to 1800 
under the same forces, and prices in Philadelphia 
rose about 125 per cent. While prices were rising 
35 per cent in London, the wages of artisans and 
unskilled laborers in the building trades, which 
seem to have been representative for most grades 
of labor, advanced only 15 per cent, according to 
wage indexes I have computed from series in Gil- 
boy (1934), since Beveridge supplies no wage data. 
While prices were doubling, wages rose only 20 
per cent. The price series compiled by Labrousse 
for France (1933) shows a rise of 64 per cent from 
1726-1741 to 1784-1789, and his wage series 
shows a rise of only 22 per cent. Although prices 
in New Castile rose somewhat more in 1750-1790 
than in France from 1726-1741 to 1784-1789, 
wages rose a little less. 

The strategic role of changes in the supply of 
silver money in Spanish price movements is dem- 
onstrated by the falling prices during the war with 
France and England in 1719-1720 as well as dur- 
ing the War of the Austrian Succession and by 
stable prices during the 13 months Spain was in 
the Seven Years’ War. During these wars specie 
flowed out and British sea power obstructed im- 
ports from the New World. When the suspension 
of hostilities permitted bullion impounded during 
these conflicts to flow into Spain, prices rose. Since 
paper money filled the void in 1779-1783, prices 
increased; but they rose more in the first two years 
of peace, when specie pent up in America arrived, 
than in four years of war and paper-money infla- 
tion (see Hamilton 1947). 

The end of the 18th century brought two hyper- 
inflations caused by the overissue of paper cur- 
rency. Under the assignats and mandats in the 
1790s France had the only hyperinflation she has 
ever experienced. In Philadelphia prices rose two 
hundredfold from the outbreak of the Revolution 
in 1776 to the battle of Yorktown in 1781, and 
“not worth a Continental” still reminds Americans 
of their Revolutionary currency. 

Nineteenth- and twentieth-century price move- 
ments. That prices were remarkably stable in 
England and the United States in the nineteenth 
century is a myth accepted by many great econo- 
Mists. They were, as compared with the twentieth 
century, but not as compared with the five cen- 
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turies that had gone before. Prices in Philadelphia 
rose nearly 50 per cent from 1801 to 1814, and 
from then until 1847—1850 they fell nearly 60 per 
cent. From 1816, when data begin, to 1847-1850 
prices in Cincinnati dropped 58 per cent. From 
1818 to 1847-1850 prices in England fell 50 per 
cent in money of account and nearly as much in 
gold. During this period money wages declined 
very little in England and rose slightly in the 
United States. The chief factors in the severe down- 
ward trend of prices were (1) postwar readjust- 
ment on an unprecedented scale, including the re- 
turn to a metallic standard at prewar parity, (2) a 
decline by half in the output of precious metals in 
Latin America because of the wars for independ- 
ence and political instability under the new govern- 
ments, and (3) a great increase in the production 
of goods resulting from improved technology, revo- 
lutionary changes in transportation, soaring popu- 
lation, and the exploitation of vast new areas with 
rich natural resources. 

Serious short-term price declines also occurred. 
From the fourth quarter of 1836 to the fourth 
quarter of 1842 British prices fell about 22 per 
cent. From January 1817 to January 1822 prices 
fell 24 per cent in Philadelphia and nearly 60 per 
cent in Cincinnati, where raw materials were a 
much larger component of the index numbers. 
From the first quarter of 1837 to the first quarter 
of 1843 prices dropped 25 per cent in Philadelphia 
and 58 per cent in Cincinnati. 

From 1847-1850 to 1873 prices rose nearly 40 
per cent in England, France, and Germany, and 
nearly 60 per cent in the United States. The Ameri- 
can Civil War, with its paper money aftermath, 
and the Franco-Prussian War both exerted upward 
pressure on price levels. But the chief cause of the 
upheaval was the great increase in gold production 
following the discovery of rich gold fields in Cali- 
fornia in 1848 and in Australia in 1851. Since a 
proposed shift from the gold standard was hotly 
debated in England in the 1850s, and actually 
occurred in Holland in 1851, to combat expected 
gold inflation, presumably an increased velocity of 
circulation intensified the effect of a rising money 
supply. 

History came as near to repeating itself in 1873— 
1896, by duplicating the price movements of 
1815-1850, as it ever does. Prices in gold fell 41 
per cent in France, 42 per cent in England, and 
43 per cent (49 per cent in money of account) in 
the United States. The annual rate of decline was 
even greater than in the earlier period, and the 
causes were strikingly similar. The decline was 


476 PRICES: Price History 


due largely to (1) widespread demonetization of 
silver in the 1870s, (2) a falling cumulative rate 
of growth in the gold stock, (3) exploding popula- 
tion, and (4) increased production of goods re- 
sulting from remarkable advances in technology 
and transportation and from the utilization of 
important new natural resources. 

Alaskan and South African gold discoveries and 
chemical improvements in the process of extract- 
ing gold from ore reversed the steep downward 
trend of prices. From 1897 to 1914 they rose about 
one-third in England; four-tenths in France and 
Germany; and one-half in Canada, Russia, and the 
United States. World War 1 sharply accelerated the 
upswing. Never before in modern times had there 
been such war expenditures, fiscal deficits, and 
monetary inflation or such disruption of trade, 
mobilization, and reduction in the supply of civilian 
goods. And never before had there been such a 
violent and universal rise of prices in both belliger- 
ent and neutral countries. An acute shortage of 
goods, the urgent need to reconstruct devastated 
areas, and low market rates of interest extended 
the advance of prices in most countries for about 
two years after the armistice. From the outbreak 
of war to their peak, prices advanced about 120 
per cent in Spain, 130 per cent in the United 
States (as in the Civil War of 1861-1865), 145 
per cent in Canada, 160 per cent in Japan, 205 per 
cent in Great Britain, 275 per cent in the Nether- 
lands, and 410 per cent in France. 

Almost everywhere prices collapsed in late 1920 
and early 1921, but they were unusually stable in 
most places in 1922-1929. Hyperinflation in Ger- 
many reduced the value of the mark to zero from 
1919 to 1923, but prices moved horizontally from 
the currency reform in 1924 to 1929. 

From the last quarter of 1929 to 1932 there oc- 
curred one of the most severe and widespread falls 
in commodity prices in history. Apparently no 
country was spared. India and the Netherlands, 
with decreases of 36 per cent and 46 per cent re- 
Spectively, were among those suffering the greatest 
declines, In France, Italy, Great Britain, the United 
States, and Canada prices fell from 28 to 32 per 
cent. As in all cyclical downswings, prices of agri- 
cultural products and other raw materials fell 
much farther than prices of finished consumer 
goods. Countries and sections of countries that pre- 
dominantly produced raw materials were critically 
injured. Laborers suffered massive unemployment. 
Although the trend of prices was upward after 
1933, unemployment and agricultural distress 
lasted until the outbreak of World War 11. 


Since much of the literature in money and bank- 
ing and labor statistics is concerned with the price 
level and the cost of living in the last 25 years, 
price history may logically end with the eve of 
World War u. 

EARL J. HAMILTON 


[See also AGRICULTURE, article on HISTORY; INDEX 
NUMBERS, article on PRACTICAL APPLICATIONS.] 
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PRIMITIVE ART 


The concept of “primitive art” is partly a figment 
of the romantic imagination of the Western art 
world, A major problem of this field lies in the 
attempt to move from an ethnocentric preoccupa- 
tion with primitive art to basic, anthropologically 
based studies of variations of the art of the world’s 
peoples. Leonhard Adam (1940, p. 30 in the 1949 
edition ) said that the mere foreignness of form and 
content of the various primitive arts serve to link 
them together in our mind for purposes of art 
criticism but that this linking is extraneous to the 
works themselves, being more a part of our atti- 
tudes toward them. The serious study of primitive 
art may be said to consist of attempts to delimit, 
classify, and understand the art of various peoples 
of the world in such a way that links other than 
those formed from our Western attitudes may be 
found between them. 

The dictionary definition of the word “primitive” 
stresses the original, first, or root stages of develop- 
ment, and thus use of the term “primitive art” fre- 
quently connotes “early” art. However, most of 
what is generally considered as primitive art origi- 
nated in the recent past in ethnographically known 
societies. Most of the primitive art objects in mu- 
seums and private collections are contemporary 
with such modern European art movements as im- 
pressionism and subsequent developments, but pre- 
historic art, archeologically recovered and inter- 
preted, is also primitive art. Although still small, 
the corpus of prehistoric primitive art will probably 
increase. 
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The peculiarity involved in calling recent and 
still-living peoples “primitive” and in knowing that 
the word can also have a historical meaning is a 
major source of discontent with the use of the term 
“primitive art” (see Haselberger 1961 for argu- 
ments on this topic and for consideration of alter- 
nate terms). But however controversial the term 
may be, it does refer to art objects and data cen- 
tered on the art of various indigenous peoples of 
Oceania, the Americas, and Africa and on the pre- 
historic art of precivilized peoples. It has been 
suggested that “primitive art” be retained as a viable 
term but that the entity to which it refers be care- 
fully delimited and defined. Primitive art is the art 
of societies that are typologically, rather than 
chronologically, primitive. 

There seems to be no common denominator of 
form in primitive art. Adam (1940) and Firth 
(1951, pp. 155-182) make the points that prim- 
itive art is not uniform and that it has a great 
diversity of themes and styles, but that these styles 
and the differences between them are imposed 
upon the materials by Western observers. Firth 
finds that there has been little success in relating 
style of art to type of society and in relating dif- 
ferences of form and style to geographical or 
social differences (ibid., p. 170). 

Nonetheless, there are elements of social struc- 
ture that are common to all the societies from which 
the art comes, Primitive society as a type of society 
from which civilization developed was described 
by Redfield (1953, pp. 6-15, 22). In Redfield’s 
terms primitive societies are those whose communi- 
ties are small, isolated, homogeneous, nonliterate, 
and lacking in full-time spe ialists. Art in primitive 
societies is produced by artists who work for an 
audience or public whose members know each 
other and share values. The relative isolation of 
such societies insulates the artists from the influ- 
ence of foreign styles. The absence of literacy mag- 
nifies the importance of art, which in nonliterate 
societies often carries burdens of communication 
otherwise the concern of written documents, monu- 
ment inscriptions, and the like. But most impor- 
tant, the art of primitive societies is the product of 
artists who are not specialists. 

They seem to be semispecialists, in that they have 
learned to carve and paint and to do so better than 
other members of their society, but they still must 
participate in everyday economic activities in order 
to subsist. This contrasts with the situation of 
artists working in more complex societies, where 
full specialization occurs. Which are the societies 
from which the art objects now in museums and 
private collections come? 


In Oceania, the primitive societies are those in- 
digenous to Australia, Melanesia, Micronesia, Poly- 
nesia, and Malaysia whose cultures have not been 
significantly influenced by the extensions of Asian 
civilization into the Pacific area. In the Americas, 
they are those Indian societies of North and South 
America that were not influenced by the Andean 
and Mesoamerican civilizations. In sub-Saharan 
Africa, they are various indigenous peoples who 
were not influenced by extensions of Euro—Asian 
civilizations or by the several indigenous Negro 
states and kingdoms of the western Sudan, the 
Guinea coast, and the Congo region. In Asia (exclu- 
sive of the great Chinese and Indian civilizations), 
the Siberian tribes, the Ainu, central Asian nomads, 
and some of the marginal aboriginal peoples of 
south China and mainland southeast Asia may be 
considered as primitive societies. In Europe, paleo- 
lithic cave and portable art may generally be con- 
sidered primitive art and so may that of neolithic 
peoples, except that certain neolithic cultures may 
be considered the beginning stages of European 
civilization. 

The points in time at which specific primitive 
peoples came into contact with civilized European, 
Asian, and African societies varied, and so the 
change from the isolation of primitive society to 
inclusion in the political and economic affairs of 
large, stratified states occurred at different times in 
different parts of the world. Utilizing social criteria 
for the classification of primitive art eliminates 
much ambiguity. 

Anthropologists have often been interested, al- 
though marginally, in the art of the peoples they 
study. In late-nineteenth- and early-twentieth-cen- 
tury ethnographic studies, treatment of art went 
along with general concern with material culture. 
As more and more energy was diverted from studies 
of material culture to studies of kinship, social 
structure, political and economic systems, and 
value systems, this emphasis declined. Recognition 
was given to the conservative qualities of primitive 
art: it looks ever backward to what has come be- 
fore, to antiquity, and in so doing, it incorporates 
glimpses of the past. Moreover, art is artifact, con- 
scious and deliberate, so that locked into every art 
object are consciously determined images and 
forms, symbolic of ideas, events, and personages. 

The basic ethnographic collections of material 
culture and art in the anthropological museums 
were made during the early and mid-nineteenth 
century through the first two or three decades of 
the twentieth. Private collections were also formed 
primarily during this period, some earlier, but they 
have been steadily absorbed by the museums. The 


primitive art objects now in the museums origi- 
nated partly as curiosities brought back by trav- 
elers, colonial officials, and missionaries and partly 
through systematic collections by ethnologists and 
archeologists. They were displayed and sometimes 
studied as part of the cultures of the exotic peoples 
of the world. 

It was in this period that the European modern 
art movement, although it had been evolving since 
the Renaissance, began to change very rapidly. 
Canons of romanticism and naturalism toppled, and 
Western art moved through a dizzying succession 
of styles toward increasing abstraction of form with 
a corresponding de-emphasis of explicit content. 
Schools, factions, and cliques arose and contended 
over the differing styles, each being put forward 
as the inheritor of the mainstream of European 
artistic tradition. 

During the period from 1900 to 1910, as Western 
artists, art dealers, critics, collectors, and historians 
sought justification and rationalization of the 
radical new art forms, they began to make voyages 
of discovery, not to Oceania and Africa but to the 
exhibition halls of the anthropological museums 
where collections from those areas and from the 
Americas were readily available. The traditional 
arts of the primitive peoples were hailed as new 
discoveries, and their stylizations of form were 
pointed out as support for the radically stylized 
modern art. Primitive art was hailed as the brother 
of modern art. 

Some of the forms of primitive art, notably Afri- 
can sculpture, found their way by this route into 
the works of Western artists such as Picasso and 
Braque, Art dealers traded in African, American, 
and Oceanic primitive art, and collectors purchased 
(at low prices) these “masterpieces” of primitive 
art, Interpreters of primitive art arose, not from the 
ranks of anthropologists but from the art world, 
and they emphasized form. There was little con- 
cern with contextual meaning or the relationship 
between the art objects and the culture from which 
they came. Meanings were read into the objects, 
regardless of historical justice. 

Unquestionably the Western art world has fo- 
Cused great interest upon primitive art and attract- 
ed the attention of scholars. However, these non- 
anthropologically oriented art commentators have 
done little to increase understanding of primitive 
art. At the same time, unfortunately, the anthro- 
Pologists have moved away from consideration of 
art, so that although there is some mention of art 
in ethnological monographs, it is rarely detailed 
enough to provide for significant analysis. 

There have been pioneering studies, notably by 
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anthropologists with orientations toward art his- 
tory, but they have not been very penetrating. The 
usual combination of library and museum research 
is simply not sufficient; what is needed is art-orient- 
ed ethnographic field research, This field research 
must be carried out in reference to, and with 
inquiry about, specific objects and classes of objects 
presently in museums. This, of course, is the often- 
sounded call to study a living people before their 
culture changes, but for many parts of the world it 
is still a valid exhortation and will continue to be 
for some time to come. 

However, unlike the kind of study that has been 
called for in the past, these ethnological studies 
could build upon the foundation laid by the vast 
museum collections of the world, most of which 
are catalogued but not adequately surveyed and 
cross-referenced. When the collected art of a given 
people can be dealt with exhaustively, it will then 
be possible to conduct field research that will help 
to classify such materials into style areas. Most 
museum collections contain a large number of 
specimens that are incompletely catalogued and 
either are without specific provenience or have 
erroneously attributed places of origin. A crucial 
task to be accomplished for previously collected 
specimens of primitive art is to extend attribution 
of provenience as accurately as possible. Ethno- 
historical research could supplement study of mu- 
seum collections by concentrating on the historical 
conditions under which the collections were made 
and on the sociocultural contexts in which the art 
itself was made and used. This kind of study would 
take into account the complex sampling of objects 
from different time periods and different subareas 
of the studied regions represented by museum col- 
lections. 

Studies of the sociocultural contexts of specific 
art objects, forms, and styles should focus carefully 
upon the art as part of a larger system. In other 
words, specialized, anthropologically based study 
of art is needed, which would seek data about art 
and its contexts from a number of different socie- 
ties and strive to place such studies in a compar- 
able conceptual framework. A facet of the anthro- 
pological study of art in its social context would 
be that the phenomenor. could be dealt with in its 
own right and as actually found, rather than ethno- 
centrically, or as primarily in relationship to the 
art of Western society. Previous ethnocentric 
studies have warped our view of primitive art, so 
that its concepts and forms always seem to be 
exotic variations on our own art. Western con- 
cepts of art for art’s sake, the marginal position of 
artists in society, and the emphasis on a conscious, 
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verbalized aesthetic criticism are often brought up 
in considering the art of a primitive society, as 
though these were universal features of art. Actu- 
ally these may well turn out to be very special 
aspects of Western art and perhaps of the art of 
other civilized societies. 

General archeological research will obviously in- 
crease in areas whose prehistory is now virtually 
unknown, such.as Oceania and sub-Saharan Africa. 
As the boundaries of ethnographically known prim- 
itive art are extended into prehistory, it is possible 
that presently known styles and areas will take on 
different significance. Indeed the very notion of 
primitive art as a field of study may give way to 
an amalgamation and synthesis of traditional art 
history and the study of primitive art, so that com- 
parable regional histories of art will be established. 
Perhaps from such a base, increased knowledge of 
the common denominators of all art will become 
possible. 

PuILuip H. LEWIS 
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PRIMITIVE LAW 

See JUDICIAL PROCESS, article on COMPARATIVE 
ASPECTS; LAw, article on LAW AND LEGAL INSTITU- 
TIONS; POLITICAL ANTHROPOLOGY; SANCTIONS; and 
the biography of MAINE. 


PRIMITIVE SOCIETY 

See ANTHROPOLOGY; CULTURE; EVOLUTION; 
HUNTING AND GATHERING; INDIANS, NORTH AMER- 
ICAN; TRIBAL SOCIETY. 


PRIMITIVE WARFARE 
See under War. 


PRINCIPAL COMPONENTS 
See FACTOR ANALYSIS, article on STATISTICAL 
ASPECTS. 


PRISONERS OF WAR 
See INTERNMENT AND CUSTODY. 


PRISONS 
See INTERNMENT AND CUSTODY; PENOLOGY; and 
the biographies of BEcCARIA and BENTHAM. 


PRIVACY 


Privacy is a concept related to solitude, secrecy, 
and autonomy, but it is not synonymous with these 
terms; for beyond the purely descriptive aspects 
of privacy as isolation from the company, the curi- 
osity, and the influence of others, privacy implies 
a normative element: the right to exclusive control 
of access to private realms. The philosophical prob- 
lem of private knowledge, whether and in what 
sense knowledge and experience may be idiosyn- 
cratic and fundamentally incommunicable to 
others, will not be discussed here (see Ayer 1959). 
Rather, after some comments on the right to pri- 


vacy and the basis for contemporary interest in the 
subject, this article will consider some of the de- 
terminants and indicators of privacy, the elements 
of a functional analysis of privacy, and finally, the 
law of privacy. 

A right to privacy is recognized both in law and 
in common parlance; but different legal systems 
emphasize different aspects, and customs related 
to privacy differ greatly from culture to culture, 
from social system to social system, and from sit- 
uation to situation. Many of the claims to the right 
to privacy are difficult to distinguish from other 
claims to rights of the personality, from claims 
to respect for personal integrity, and from claims 
against interference by government and other ex- 
ternal agents. It has been said that a free society 
is governed by the principle that “there are frontiers 
not artificially drawn, within which men should be 
inviolable, these frontiers being defined in terms 
of rules so long and widely accepted that their ob- 
servance has entered into the very conception of 
what it is to be a normal human being” (Berlin 
1958, p. 51). The frontiers mentioned are “not arti- 
ficially drawn” because they are recognized in a 
given culture as legitimate boundaries of the per- 
sonality. More generally, they are recognized as 
identity boundaries that define the culturally recog- 
nized entity—be it personality, status, collectivity, 
or institution—whose “space of free movement” 
(Lewin [1936] 1948, p. 5), or domain of autono- 
mous activity, or privacy is at issue. 


Determinants and indicators of privacy 


Among the material facts and relatively fixed 
social conditions that strongly affect patterns of 
privacy must be reckoned population density, rates 
of interaction, patterns of residence, the division 
of labor, and the nature of family and other social 
relationships. Comparative anthropological data do 
not seem to exist in a form that crystallizes out 
the aspects relevant to questions of privacy, nor 
have correlations of these aspects with other social 
and cultural elements been given any extensive 
study. The topics mentioned here, accordingly, sug- 
gest only the range of matter that appears relevant 
and deserves further study. 

Urban society of necessity has privacy standards 
and privacy problems different from those of rural 
society. Through the variety of experiences open 
to the urban dweller and the multiplicity of sub- 
groups to which he may be tied, he becomes more 
distinctive, more isolated, and more aware of his 
existence and his rights as a distinct person. In the 
urban setting, individual honor supersedes family 
or community honor, and individual privacy gains 
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legitimacy. As Maine (1871) pointed out, the legit- 
imacy of group interests historically precedes the 
claims of individual interests. Thus the nature of 
privacy has been changing for groups as well as 
for individuals. 

Private property may be looked upon as a mani- 
festation of the concept of individuality in a given 
culture. In Western industrial societies, the concept 
of property is undergoing fundamental changes as 
an increasing fraction of the wealth of each nation 
is distributed by its government. The transition 
from “privilege” to “right” (e.g., to receive licenses, 
franchises, grants, or welfare benefits) constitutes 
a change in the region over which individuals or 
organizations may expect to have autonomous con- 
trol, and thus surely changes the region over which 
privacy may be claimed. For example, if support 
from a welfare department is seen as a right, much 
of what is now done in welfare client investigations 
would surely be considered an invasion of the 
client’s privacy (Reich 1964; 1966). In general, 
major social changes imply changes in identity 
boundaries and domains of privacy. 

Residence patterns and family structure put 
bounds on patterns of interaction and thus on pat- 
terns of privacy (Chombart de Lauwe 1961; 1965). 
Conversely, privacy norms will dictate some aspects 
of residence patterns and will exert a strong influ- 
ence on the process of individual development 
within the family. But no set of external conditions 
completely determines patterns of privacy. Even in 
the most crowded situations one occasionally finds 
mechanisms that give the individual privacy by 
institutionalizing places and occasions that grant 
the individual sanctuary from approaches by others. 
On the other hand, the relatively isolated city 
apartment dweller may find that he has very little 
privacy because of the thinness of walls, the loca- 
tion of doors and windows, the accessibility provid- 
ed by mail and telephone, and the hordes of service 
people who collect refuse, read meters, and inspect 
or “service” the machinery with which the modern 
apartment is filled, not to mention salesmen, inter- 
viewers, and others who have developed the skill 
of approaching people for one purpose or another. 

Distance is perhaps the most obvious and exter- 
nal characteristic of social interaction that may be 
taken as an indicator of exclusiveness or intimacy, 
of respect or contempt (Simmel [1908] 1958, pp. 
320-324). An “undue closeness,” that is, one which 
violates exclusiveness or intimacy norms, clearly 
represents an invasion of privacy. Hall (1963) has 
coined the term “proxemics” to denote the study of 
the distance kept between people of different back- 
grounds in different situations. The strong feelings 
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aroused by cultural differences in regard to habits 
of intimate contact indicate the social and psycho- 
logical importance of distance. For example, that 
men should hold hands when walking together 
seems very strange to the European but is quite 
acceptable to the Arab. Similarly strange to the 
northern European and North American city 
dweller are customs such as conversing at a dis- 
tance so short that one feels the warmth of the 
other's body and the spray from his mouth; the 
sharing of half-chewed material (e.g., the betel 
wads of the Tikopia); the sharing of eating uten- 
sils; the chewing of the child’s food by the mother 
(Hall 1959), Indeed, some scholars have suggested 
(e.g., Lee 1959) that the large differences in habits 
in different cultures regarding the constancy and 
frequency of body contact between mother and 
child have a bearing on differences in character 
development. 

Concealment, or avoidance of observation, is 
another action from which privacy norms may be 
inferred. In anthropological compendia concealed 
behavior is listed primarily in terms of religion and 
magic, on the one hand, and modesty patterns, on 
the other. Recalling Durkheim's definition of the 
sacred (1912, pp. 56, 236-237 in the 1961 edi- 
tion), it becomes quite clear how much privacy 
shares with sacredness, The sacred is set apart, 
isolated, untouchable except by special people with 
Special dispensations, and it inspires a respect 
which demands that a distance be kept from it. 
Without stretching common meanings, it can be 
said that the right to privacy asserts the sacredness 
of the person, Modesty is usually discussed in rela- 
tion to biological events and functions, and anthro- 
pologists have found some regularities in the social 
patterning of these. For bathing and elimination, 
segregation of the sexes is common; for eating, 
such segregation is much less common. Sexual 
intercourse is most commonly hidden in some way, 
although customs demanding rather little conceal- 
ment or none at all, especially on ceremonial oc- 
casions, have been reported (see, for example, 
Ford & Beach 1951, as well as Westermarck 1889). 
Childbirth, illness, death, and religious rites are 
events at which only individuals in particular 
statuses or in particular relationships are allowed 
or expected to be present, The analysis of such 
patterns in terms of their consequences for social 
systems—for example, that occasional exclusive- 
ness affirms the solidarity of the exclusive group 
—appears in many cases to be more successful 
than the analysis of the motivation of the partici- 
pants. Yet, what is perhaps the dominant reason 
for seeking privacy, namely, the desire to be insu- 


lated from observation, is intimately related to 
motives of avoiding criticism, punishment, or the 
discomfort of feeling inhibited. 

Social and technological developments have 
greatly increased the possibility of observing the 
activities of individuals and groups, of disturbing 
their equanimity or internal balance, and of influ- 
encing or controlling their behavior. Such possibil- 
ities of encroachment upon privacy inevitably 
become realities unless effective social control 
mechanisms are developed, Law provides one such 
mechanism; but in a period of rapid social change, 
the heterogeneity of social norms and beliefs leaves 
much uncertainty and instability of behavior in 
those areas of action which law does not seek to 
control or fails to control. 

Technology provides the material potential for 
electronic spying and high-speed record keeping 
and information retrieval; for psychological testing 
and psychiatric interviews; for the use of drugs 
and other physiological agents, as well as methods 
of physical deprivation, irritation, and coercion 
which, contrary to the victim’s wishes and interests, 
may unveil the secrets of his mind; and for the 
manipulation of groups to make people confess or 
turn informers. The chances that these methods 
will be used are vastly increased if ideologies are 
at hand to legitimate such use. 

When ideologies, political or religious, demand 
orthodoxy of all citizens, or when confessions of 
guilt or the punishment of persons, irrespective of 
proof of guilt, are considered justifiable on adminis- 
trative or educational grounds, spying and inform- 
ing, forced confession, and coercive persuasion will 
flourish. Ideologies of religious or political “purity,” 
of “moral community” or of “mental health,” can be 
and have been employed to legitimate the use of 
psychological techniques for the manipulation of 
behavior in flagrant disregard of time-honored con- 
cepts of personal integrity (Lifton 1961; Schein 
et al. 1961; Szasz 1963). Thus, in the current crisis 
in the relations between psychiatry and law one 
may observe a monumental obfuscation in the no- 
tions of personal responsibility and mental health, 
of therapy and punishment, of personal integrity 
and the individual’s welfare, as these terms are de- 
fined by authorities in these fields (Szasz 1963). 


Functional analysis of privacy 

The basic social-psychological mechanism that 
motivates seclusion is expressed by the following 
proposition: The greater the (perceived) probability 
that an action will be observed, the greater the 
probability that the action will be in compliance 
with the perceived social norms of the observer. 


Observation is a crucial part of social control (Mer- 
ton [1949] 1957, pp. 336-357; Zetterberg 1957; 
Hopkins 1964); hence, privacy norms put limits 
upon the degree of social control. 

Opportunity for privacy facilitates spontaneity. 
For example, in the therapeutic professional—client 
relationship confidentiality implies not only that the 
client will be protected from external sanctions but 
also that the professional will not exert the usual 
sanctions for deviance. -Thus the client may feel 
free to tell what the professional may need to know, 
or what the client may need to tell, if he is to be 
helped, The experiments in stimulating group crea- 
tivity that have been called “brainstorming” have, 
with limited success, attempted to institutionalize 
the withholding of sanctions (Haefele 1962, pp. 
155-156), But the most usual method of attempt- 
ing to minimize the inhibitions of workers whose 
creativity is being tapped is to grant them a maxi- 
mum of privacy. 

Imperfect consensus and integration of norms 
and values are found in all but the smallest, sim- 
plest, and most homogeneous societies; conse- 
quently, in most societies a degree of deviance from 
legitimate expectations of some role partners is 
unavoidable. Privacy may therefore be seen as 
having the function of protecting people against 
the psychological conflict that would be generated 
by unremitting observation, In a large-scale, plu- 
ralistic society such protection is especially neces- 
sary. 

Privacy insulates the individual from social pres- 
sures to compliance and generally reduces the need 
to act in ways that are likely to gain the approval 
of others, Gouldner and Peterson (1962, pp. 44— 
45) have made the distinction between “self- 
esteem”—the positive self-evaluation gained from 
others’ approval, by “consensual validation”—and 
“self-regard’”—the positive self-evaluation attained 
through recognition of one’s distinctness from 
others, through “conflictual validation.” Privacy not 
only reduces the need for consensual validation 
but also provides an opportunity for the process 
of individuation, which depends on conflictual val- 
idation, Thus the development of autonomy and 
personal responsibility in children and adolescents 
Tequires opportunities for them to act autono- 
mously, in spite of the possibility that their actions 
may be contrary to the dominant norms, Privacy 
and tolerance would seem to be alternative condi- 
tions for such opportunities. In general, the less 
individuals can be relied upon to have internalized 
norms, the greater will be the external social con- 
trol. Where both distrust and external controls 
become particularly severe, various forms of social 
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and psychological pathology may be expected. Se- 
vere efforts to interfere with a child’s attempts to 
develop autonomy have been reported to be one 
component of the etiology of schizophrenia (Len- 
nard et al. 1965). 

The internalization of norms may be considered 
a functional alternative to external social control. 
Hence, the greater the internalization of norms, 
the greater the privacy granted and, therefore, the 
greater the development of privacy norms. 

Other things being equal, the longer the sociali- 
zation process, the greater the likelihood that rele- 
vant norms are internalized and that subsequent 
supervision is minimal. Physicians and other pro- 
fessionals who have a relatively long educational 
experience are subsequently given much responsi- 
bility with only sporadic supervision. Privacy may 
also be seen as a reward for responsibility. Studies 
of industrial workers in the United States have 
shown rather consistently that close supervision, 
that is, supervision with short intervals between 
supervisory contact, and hence with little privacy, 
results in poor morale and low productivity (Whyte 
1961). In a study of industrial work in England, 
it was found that the normal interval between 
supervisory contacts was an excellent measure of 
the responsibility exercised by a worker and could 
be used as a basis for wage scales (Jaques 1956). 

This correlation between responsibility and pri- 
vacy illustrates a more general correlation between 
social rank and privacy. Certainly, the higher the 
individual's socioeconomic rank, the more likely 
it is that his physical amenities and interaction 
patterns can be arranged so as to maximize pri- 
vacy; and it is equally true that the higher the 
socioeconomic rank, the greater the individual's 
power to defend his rights, including privacy, 
against both neighbors and agents of the commu- 
nity or state. Furthermore, social distance, exclu- 
siveness, and thus privacy, as expressions of self- 
regard, are correlated with social rank. This 
correlation between social rank and privacy is 
emphasized by the problem of securing privacy for 
celebrities and public servants whose visibility is 
especially high. 

The privacy of collectivities. Many instructive 
examples of the social structure of privacy situa- 
tions can be found in the privacy patterns of collec- 
tivities. The family is a collectivity that demands 
respect and privacy; in some societies—for exam- 
ple, contemporary Japan and Greece, as well as in 
most of the developing societies—the claims of the 
family or larger group far outbalance the rights of 
the individual. One may expect to find this to be 
the case wherever family or group honor is the 
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ultimate criterion by which the individual is 
judged. Aristocracies provide examples, but so do 
such groups as the Mafia. 

Family types may be classified by the degree of 
freedom with which they grant access to nonmem- 
bers. “Open” families readily admit members from 
neighboring families; their doors are not locked, 
and their members freely mind each other’s busi- 
ness, including, for example, the disciplining of 
children, Little secrecy is possible, and mutual so- 
cial control is at a maximum. This pattern is likely 
to occur only if the community has a relatively 
impermeable privacy boundary: the families are 
open, but outsiders from beyond the community 
are kept out. The “closed” family, at the other end 
of the continuum, protects its privacy by dwellings 
and other technical arrangements that preserve 
distance and exclusiveness, but above all by norms 
that prescribe social distance and formality in so- 
cial relations. Since there is much less mutual 
social control, greater variety may be expected to 
develop among closed families, as well as greater 
individuality within them. Thus the privacy of the 
closed family in the open community appears to 
have the function of generating and preserving the 
diversity of personalities and values within a given 
society. 

The following generalization may tentatively be 
deduced from this case and other examples to 
follow: In collectivities with strong privacy bound- 
aries, subcollectivities will tend to be relatively 
open; in open collectivities, subcollectivities will 
tend to be closed. The “open society” contains 
closed social, political, religious, economic, and 
other organizations. They are closed, at least, to 
political influence and investigation by the govern- 
ment; and those organizations which are in a com- 
petitive relationship (e.g., political parties) are also 
closed to each other in certain ways. In contrast, 
closed totalitarian societies either do not permit 
organizations to develop within them or demand 
great permeability of all organizational boundaries 
to governmental observation and infiuence—to the 
extent that family members are encouraged to in- 
form on each other. Many situations—familial, 
therapeutic, conspiratorial, educational—combine 
strong external privacy boundaries with demands 
for openness on the part of the individuals who are 
the subunits. Such openness provides an opportu- 
nity for intensive socialization, for the internaliza- 
tion of the norms and values of the collectivity, 
and, as a special case, for the development of that 
trustworthy identification with the collectivity 
which is called loyalty. 

Loyalty is a prerequisite for privacy in a collec- 


tivity that feels itself threatened. Above all, the 
loyalty of those in leadership and other “sensitive” 
positions must be assumed, unless it is possible to 
subject them to constant surveillance by other 
members of the collectivity, whose loyalty would, 
in turn, have to be assumed. On a national scale, 
differences in the social structuring of sentiments 
can have very important implications for general 
climates of trust and the concomitant state of pri- 
vacy. Great Britain and the United States provide 
an interesting contrast (Shils 1956; Lewin 1936; 
Hyman 1963). Traditionally, privacy standards for 
both the individual and the government are much 
higher in Britain than in the United States. While 
it would be difficult to say whether, in fact, loyalty 
is greater in one country than in the other, the 
common identification with dominant national in- 
stitutions is made more visible in Britain, for ex- 
ample, by recurring expressions of the generally 
shared attitude of deference toward the crown, the 
government, and those higher in the stratification 
system in general. Thus there is a recurrent ex- 
pression of consensus invoking the common history 
of a relatively homogeneous population and focused 
upon the leadership of the country; and since the 
leadership group reflects the same consensus in 
its public utterances, there is a greater continuity 
of sentiment and trust. Since the American public 
lacks such recurrent expressions of consensus, it 
has more easily fallen prey to jingoistic dema- 
gogues, and this tendency has resulted, at various 
times, in widespread suspicion of “treason in high 
places” and rampant loyalty investigations that 
have trampled privacy underfoot. [See LOYALTY.] 


The law of privacy 


Laws regarding the right of privacy were rela- 
tively late in arriving on the scene. While a general 
law of private personality, rooted in Roman law, 
found its way into numerous Continental codes, 
judges have preferred to link the right to privacy 
with property rights, and its violation with specific 
torts, such as libel or slander, copyright infringe- 
ment, breach of contract, trespass, and assault and 
battery. This has been particularly true of Anglo— 
American courts; indeed, common-law jurispru- 
dence regarding the right of privacy dates back no 
farther than 1890, when Warren and Brandeis 
(1890) published a famous article on this topic. 
In England there is still no actionable invasion 
of privacy unless property rights have been violat- 
ed or reputation has been injured. 

In the United States a general right to privacy 
was first explicitly affirmed by the Supreme Court 
as recently as 1965, in Griswold et al. v. Connecti- 


cut (381 U.S. 530). This decision invalidated a 
Connecticut law that prohibited the use of contra- 
ceptives even by married couples. Yet the dissent- 
ing opinion of Justice Potter Stewart questioned the 
legal basis of the decision: “What provision of the 
Constitution, then, does make this state law inval- 
id? The Court says it is the right of privacy ‘created 
by several fundamental constitutional guarantees.’ 
With all deference, I can find no such general right 
of privacy in the Bill of Rights, in any other part 
of the Constitution, or in any case ever before de- 
cided by this Court.” In fact, the majority opinion, 
written by Justice Douglas, relied upon “a right of 
privacy older than the Bill of Rights—older than 
our political parties, older than our school system.” 
In both the concurring and the dissenting opinions, 
implied references to natural law were called into 
question. But it cannot be questioned that funda- 
mental values were invoked when the opinion of 
the Court mentioned that the marriage relationship 
is “intimate to the degree of being sacred.” It is 
quite literally Durkheim's idea of the sacred that 
is involved here. An appeal to such basic and gen- 
eral values would not have been necessary if this 
decision could have been based on more immedi- 
ate precedents. 

Invasions of privacy. Invasions of privacy rec- 
ognized up to 1965 in the U.S. law courts have 
been categorized by Prosser (1960, pp. 389 ff.) 
into “four distinct kinds of invasion of four differ- 
ent interests of the plaintiff, which are tied together 
by the common name, but otherwise have almost 
nothing in common except that each represents an 
interference with the right of the plaintiff... ‘to 
be let alone.’” A somewhat different view has been 
expressed by Bloustein (1964), who has argued 
brilliantly that this common element is the central 
issue in the law of privacy. It has been said that 
the law of privacy developed as a parasite upon 
laws about specific classes of torts (Harper & 
James 1956, pp. 677-679). However, Bloustein 
pointed out that court decisions, judicial opinions, 
and the most important scholarly sources make 
clear the principle that, beyond the particular torts 
to which privacy cases have been tied, any invasion 
of privacy constitutes an offense against the rights 
of the personality—against individuality, dignity, 
and freedom. 

The following discussion of Prosser’s four classes 
of torts will serve to illustrate the specific wrongs 
that Anglo-American courts have connected with 
the concept of privacy. The first category is con- 
cerned with intrusion upon the plaintiff's seclu- 
sion or solitude or into his private affairs. Exposure 
of one’s private doings to public view, to Peeping 
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Toms, to eavesdropping and wire tapping, to open 
pursuit, shadowing, trailing, or public surveillance 
is seen as disturbing one’s equanimity. And this 
fact, rather than any possible harm or loss subse- 
quent to the intrusion, is the basis of the claim that 
the right to privacy has been invaded. “It appears 
obvious,” wrote Prosser (1960, p. 392), “that the 
interest protected by this branch of the tort is pri- 
marily a mental one. It has been useful chiefly to 
fill in the gaps left by trespass, nuisance, the inten- 
tional infliction of mental distress, and whatever 
remedies there may be for the invasion of constitu- 
tional rights.” However, Bloustein pointed out a 
critical distinction made by Warren and Brandeis 
when they asserted that mere injury to feelings is 
without legal remedy. Invasion of privacy is thus 
“a legal injuria,” or an “act wrongful in itself”; it is 
not “mental suffering” that is taken into account in 
reckoning damages for a legal injury. In common 
parlance the term “privacy” is very often employed 
in connection with what legally may come exactly 
under the categories of trespass, nuisance, or the 
infliction of mental distress, although under U.S. 
law the offense in question may not involve any 
legal question in regard to privacy. 

The second category of invasions of privacy is 
concerned with public disclosure of embarrassing 
private facts about the plaintiff. This was the prin- 
cipal topic of the article by Warren and Brandeis, 
upon which most of the Anglo-American legal dis- 
cussions of privacy have been based. Warren and 
Brandeis pointed out that, in contrast with the law 
of defamation, which deals only with damage to 
reputation by lowering a person in the estimation 
of his fellows, the right to privacy grants protection 
against actions that would outrage a person’s own 
feeling, irrespective of his reputation. Thus, to 
publish another's name or picture is an invasion of 
his right to privacy if it is done without his consent 
and if it cannot be justified as a matter of public 
interest, as in the case of the actions of a public 
figure. The truth of the disclosed matter is no 
defense, as it may be under the law of defamation. 

The third category deals with publicity which 
places a person in a false light in the public eye. 
The principal cases in this category deal with the 
publication by the police of information about an 
individual which may lead the public to infer that 
he has committed delinquencies of which he has 
not been convicted. Impersonating someone and 
forgery of another's signature also belong to this 
class of privacy invasions, which, in contrast with 
the other three classes, depends on the falsity of 
the fiction that is publicized. 

Finally, Prosser’s fourth category concerns ap- 
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propriation, for the defendant's advantage, of the 
plaintiff's name or likeness. In contrast with the 
three other classes of privacy invasions recognized 
by the U.S. law, this one involves the use of the 
advantage gained through the offensive actions. It 
seems to focus more on the proprietary aspect than 
the others. What it shares most crucially with the 
other recognized privacy invasions is that it consti- 
tutes an attack upon the individual's identity. 

Privacy law under civil codes. In countries 
under civil codes privacy law is often better devel- 
oped than in common-law countries. In civil codes, 
laws against many specific privacy invasions can 
be found under the most diverse chapters of law. 
Correspondence by letter, telegram, and telephone, 
as well as personal information given to public 
officials or professionals, tend to be explicitly pro- 
tected. And each code has its peculiar prohibitions, 
for example, against “intrusion by illicit means 
into another's secrets,” or against “injury to per- 
sonal relationships owing to the willful or negligent 
act of another.” General categories of privacy viola- 
tion, such as the “injury to personal relationships” 
just mentioned (which occurs in the Swiss code), 
are fairly common, although varied in content. On 
the Continent the recognition and protection of the 
right of privacy is based, in large part, on the con- 
cept of the injuria delict derived from the Roman 
law. The concept of Persénlichkeitsrecht and the 
idea of a tort contra bonos mores have also done 
much to strengthen judicial protection of personal 
privacy (Weeks 1963, p. 501). 

Finally, it should be noted that the Draft Cove- 
nant of Civil and Political Rights as adopted at the 
fifteenth session of the General Assembly of the 
United Nations, in 1960, recognizes a right to pri- 
vacy in article 17: “1. No one shall be subjected to 
arbitrary or unlawful . . . attacks on his honour and 
reputation, 2. Everyone has the right to the pro- 
tection of the law against such interference or at- 
tacks” (United Nations . . . 1961). 

The limits of privacy. The legal protection of 
privacy has inherent limitations, since legal reme- 
dies of privacy invasions may bring with them 
further exposure of the matter that was to be kept 
private. It remains to be seen whether the recog- 
nition of a general right to privacy by the U.S. 
Supreme Court in Griswold v. Connecticut will in- 
deed open the gates to a vast amount of litigation 
about privacy, a fear often expressed in past dis- 
cussions of possible developments in the law of 
privacy. 

A matter closely related to privacy (although not 
subsumed under the law of privacy) that is likely 
to be the source of some litigation is the subject 


of confessions made by citizens accused of a crime. 
Supreme Court decisions in regard to confessions 
(Escobedo v. Illinois [378 U.S. 478]; Miranda v. 
Arizona [384 U.S, 436]) are clearly based on the 
fact that certain policemen and other custodians of 
prisoners have not sufficiently respected the indi- 
vidual’s right not to reveal what he does not want 
to reveal. This issue points to another essential 
difficulty of enforcing the legal protection of pri- 
vacy: violations of privacy often are injuries in- 
flicted by relatively large and powerful forces upon 
the smallest and weakest element in society, the 
individual, who may be poor, uneducated, and a 
member of a minority group. Thus the protection 
of privacy requires not only a degree of consensus 
in the total population about the rights of the indi- 
vidual, and adequate laws to recognize these rights, 
but also considerable effort by those who exercise 
influence and wield power, governmental or other- 
wise, to enforce the laws and to encourage com- 
pliance with the more general social norms of re- 
spect for the individual. 


The emphasis that has been placed here on the 
right of privacy should not obscure the fact that 
governments and other collectivities have legiti- 
mate concerns with private aspects of their mem- 
bers’ lives. Laws establish only guidelines for the 
competition between the autonomy of the individ- 
ual and his government; the understanding of that 
competition requires further study of the functions 
of privacy for individuals, for collectivities, and for 
the relationships between the two. 

ARNOLD SIMMEL 


[See also CENsorsHIP; ETHICS, article on ETHICAL 
ISSUES IN THE SOCIAL SCIENCES; LOYALTY; ORGANI- 
ZATIONS, article on ORGANIZATIONAL INTELLIGENCE. 
Other relevant material may be found in BRAIN- 
WASHING; CONSTITUTIONAL LAW; FREEDOM; Hu- 
MAN RIGHTS; INTERVIEWING, article on SOCIAL RE- 
SEARCH; and in the biographies of LEWIN; SIMMEL.] 
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I 
FORMAL PROBABILITY 


This article deals with the mathematical side of 
probability, the calculus of probabilities, as it has 
sometimes been called. While there is sometimes 
disagreement on the philosophy and interpretation 


488 PROBABILITY: Formal Probability 


of probability, there is rather general agreement 
on its formal structure. Of course, when using the 
calculus of probabilities in connection with proba- 
bilistic models or statistical investigations, the 
social scientist must decide what interpretation to 
associate with his probabilistic statements. 


The axiomatic approach 


The sample space. Mathematically the most 
satisfactory approach to probability is an axiomatic 
one. Although a rigorous description of such an 
approach is beyond the scope of this article, the 
basic ideas are simple. The framework for any 
probabilistic investigation is a sample space, a set 
whose elements, or points, represent the possible 
outcomes of the “experiment” under consideration. 
An event, sometimes called a chance event, is rep- 
resented by a subset of the sample space. For ex- 
ample, in a guessing-sequence experiment, test 
subjects are instructed to write down sequences of 
numbers, with each number chosen from among 
the digits 1, 2, 3. For sequences of length 2, an 
appropriate sample space is given by the nine num- 
ber pairs (1,1), (1,2), (2,1), (1,3), (2,2), (3,1), 
(2,3), (3,2), (3,3). In this sample space the sub- 
set consisting of the three pairs (1,1), (2,2), (3,3) 
stands for the event a run, and the subset consist- 
ing of the pairs (1,3), (2,2), (3,1) stands for the 
event sum equals 4, For convenience of reference, 
these two events will be denoted more briefly by 
Rand F, 

The algebra of events. It is helpful to intro- 
duce some concepts and notation frorn the algebra 
of events. Events (or sets) are denoted by letters 
A,B, ---, with or without subscripts. The event 
consisting of every experimental outcome—that is, 
the sample space itself—is denoted by S. The 
event not A is denoted by A° (alternate notations 
sometimes used include A, CA, and A’), the event 
A and B by AB (or A N B or A A B), the event A or 
B by A+B (or A UB or A V B), with correspond- 
ing definitions for more than two events. The or 
in A or B is to be interpreted in the inclusive sense, 
implying either A or B alone or both A and B. Two 
events A and B are said to be disjoint (or mutually 
exclusive) if AB = , where $= S° is the impossi- 
ble event, an event consisting of none of the ex- 
perimental outcomes. The events A + B and AB are 
often called, respectively, the union and the inter- 
section of A and B, and the event A‘ is often called 
the negation of A. The events A and A“ are said to 
be complementary events. Figure 1 is a pictorial 
representation of two events ( sets), A and B, by 
means of a Venn diagram. The sample space, S 


Figure 1 — Venn diagram 


is represented by a rectangle; events A and B par- 
tition S into four disjoint parts labeled according 
to their description in terms of A, A“, B, and Br’. 
Thus, for example, A°B“ is the part of the sample 
space that belongs both to A° and B“. By definition, 
A + B= AB" + A°B + AB. 

The axioms. In an axiomatic treatment, the 
probability, P(A), of the event A is a number that 
satisfies certain axioms. The following three axioms 
are basic. They may be heuristically explained in 
terms of little weights attached to each point, with 
the total of all weights one pound and with the 
probability of an event taken as the sum of the 
weights attached to the points of that event. 


(i) P(A) 20, 
(ii) P(S) =1, 
(iii) P(A +B) = P(A) + P(B), if AB = ¢. 


Thus, no probability should be negative, and the 
maximum value of any probability should be 1. 
This second requirement represents an arbitrary 
though useful normalization. Finally, the proba- 
bility of the union of two disjoint events should be 
the sum of the individual probabilities. It turns out 
that for sample spaces containing only a finite 
number of sample points no additional assumptions 
need be made to permit a rigorous development of 
a probability calculus. In sample spaces with in- 
finitely many sample points the above axiom sys- 
tem is incomplete; in particular, (iii) requires 
extension to denumerably many events. Mathe- 
matically more serious is the possibility that there 
may be events to which no probability can be as- 
signed without violating the axioms; practically, 
this possibility is unimportant. 

It should be noted that the axiomatic approach 
assumes only that some probability P(A) is associ- 
ated with an event A. The axioms do not say how 
the probability is to be determined in a given case. 
Any probability assignment that does not contra- 
dict the axioms is acceptable. 


The problem of assigning probabilities is par- 
ticularly simple in sample spaces containing a finite 
number of points, and the weight analogy applies 
particularly neatly in this case. It is then only 
necessary to assign probabilities to the individual 
sample points (the elementary events), making 
sure that the sum of all such probabilities is one. 
According to axiom (iii) it follows that the proba- 
bility of an arbitrary event, A, is equal to the sum 
of the probabilities of the sample points in A. In 
particular, if there are n points in the sample space 
and if it is decided to assign the same probability, 
1/n, to each one of them, then the probability of 
an event A is equal to m,(1/n) = m,/n, where m, 
is the number of sample points in A. For example, 
in the guessing-sequence experiment the same 
probability, 4 = .111, may be assigned to each one 
of the nine sample points if it is assumed that each 
number written down by a test subject is the result 
of a mental selection of one of three well-shuffled 
cards marked 1, 2, and 3, each successive selection 
being uninfluenced by earlier selections. This model 
will be called the dice model, since it is also ap- 
propriate for the rolling of two unconnected and 
unbiased “three-sided” dice. An actual guessing- 
sequence experiment in which 200 test subjects 
participated produced the results given in Table 1. 
For example, 13 of the 200 subjects produced se- 
quences in which both digits were 1. Table 2 gives 
the proportion of cases in each category (for ex- 
ample, 13/200 = .065). Inspection reveals impor- 
tant deviations from the theoretical dice model, 
deviations that are statistically significant [see 
COUNTED DATA; HYPOTHESIS TESTING]. For illus- 
trative purposes an empirical model can be set up 
in which the entries in Table 2 serve as basic proba- 
bilities. In the dice model P(R) = P(F) = }=3, 
while in the empirical model P,(R) = .065 + .065 
+ .045=.175 and P,(F) =.130+ .065 + .160 = 


Table 1 — Guessed digit pairs 


SECOND DIGIT 
1 2 3 Total 
1 13 27 32 72 
FIRST 
Dier Ê 25 13 29 67 
3 26 26 9 61 
Total 64 66 70 200 


Source: Private communication from 
Frederick Mosteller. 
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Table 2 — Proportions of subjects guessing each digit 


pair 
SECOND DIGIT 
1 2 3 Total 
1 065 135.160 -360 
FIRST 
DIGIT 2 125 065 145 1335 
3 130° 130 045 -305 


Total  .320 -330 .350 1.000 


Source: Private communication from 
Frederick Mosteller. 


.355. The subscript “e” distinguishes probabilities 
in the empirical model from those in the dice 
model. 


Some basic theorems and definitions 


Probabilities of complementary events. The 
probability of the event not A is 1 minus the proba- 
bility of the event A, P(A*)=1— P(A); in par- 
ticular, the probability of the impossible event is 
zero, P() = 0. This result may seem self-evident, 
but logically it is a consequence of the axioms. 

Addition theorem. If A and B are any two 
events, the probability of the event A + B is equal 
to the sum of the separate probabilities of events 
A and B minus the-probability of the event AB, 
P(A+B)=P(A) + P(B) — P(AB). For example, 
P(R+F)=4+3-—}=4= .556, and P.(R +F) = 
.175 + .355 — .065 = .465. If A and B are disjoint, 
P(AB) = 0, so the addition theorem is then just a 
restatement of axiom (iii). 

Conditional probability. If P(A) is not zero, 
the conditional probability of the event B given the 
event A is defined as 


P(AB) 

P(A) © 

The meaning of this quantity is most easily in- 
terpreted in sample spaces with equally likely 
outcomes. Then P(BJA) = (149/2)/(mu/n) = 
m,,/m,, Where Msn is the number of sample points 
in AB. If A is considered a new sample space and 
new probabilities, 1/m,, are assigned to each of 
the m, sample points in A, the probability of the 
event B in this new sample space is ma;/ms= 
P(BJA), since-mas of the m, points in A are in B. 
The interpretation of P(B|A) as the probability of 
B in the reduced sample space A is valid generally. 
Figure 2 shows the two ways of looking at con- 
ditional probability. The events AB and A can be 


a) P(BIA) = 
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S 


Figure 2 — Conditional probability P(B! A) 


considered in the sample space S, as in the first part 
of Figure 2. It is also possible to consider the event 
B in the sample space A, as in the second part of 
Figure 2. For example, P,(R|F) = P.(RF)/P.(F) = 
.065/.355 = .183, 

The multiplication theorem. Equation (1) can 
be rewritten as P(AB) = P(A)P(B|A). This is the 
multiplication theorem for probabilities: The prob- 
ability of the event AB is equal to the product of 
the probability of A and the conditional probability 
of B given A. The multiplication theorem is easily 
extended to more than two events. Thus, for three 
events A, B, C, P(ABC) = P(A)P(B|A)P(C|AB). 

Independent events. In general, the conditional 
probability P(B|A) differs from the probability 
P(B). Thus, in general, knowledge of whether A 
has occurred will change the evaluation of the prob- 
ability of B. If the occurrence of A leaves the 
probability of B unchanged, that is, if P(B|A) = 
P(B), B is said to be independent of A. It then 
follows that A is also independent of B, or simply 
that A and B are independent. A useful way of de- 
fining the independence of A and B is 
(2) P(AB) = P(A)P(B); 
that is, A and B are independent just when the 
probability of the joint occurrence of A and B is 
equal to the product of the individual probabilities. 
For example, in the dice model P(RF) =}= 
P(R)P(F), while in the empirical model P.(RF) 
= .065 # P.(R)P.(F). Thus the events R and F are 


independent in the dice model but dependent in 
the empirical model. 


The extension of the concept of independence 
from two to more than two events produces certain 
complications. For two independent events, equa- 
tion (2) remains true if either A or B or both are 
replaced by the corresponding negations A“ and B°, 
Put differently, if A and B are independent, so are 
A and B°, A‘ and B, A® and B°. If the concept of in- 
dependence of three or more events is to imply 
a corresponding result—and such a requirement 
seems desirable—the formal definition of independ- 
ence becomes cumbersome: The events A,, °, 
A, are said to be independent if the probability of 
the joint occurrence of any number of them equals 
the product of the corresponding individual prob- 
abilities. Thus, for the independence of three 
events A, B, C, all of the following four condi- 
tions must be satisfied: P(ABC) = P(A)P(B)P(C), 
P(AB) = P(A)P(B), P(AC) = P(A)P(C), P(BC) 
= P(B)P(C). 

Bayes’ formula. Let A be an event that can 
occur only in conjunction with one of k mutually 
exclusive and exhaustive events H,, +++, Hx. If A 
is observed, the probability that it occurs in con- 


junction with H; (i= 1, +++, k) is equal to 
P(H;|A) 
pe P(HiA) 
(3) ~ P(H,A) + +++ + P(H;A) 
PCH, )P(A\H;) 


= PCH, OPAH) + «+: + PHAPA 


where P(H;) is the prior probability of the event 
H; , the probability of H; before it is known whether 
A occurs, and P(A|H;) is the (conditional) proba- 
bility of the event A given H;, so that the denomi- 
nator of (3) is P(A). The probability (3) is called 
the posterior probability of H;. It is important to 
note that application of (3) requires knowledge of 
prior probabilities. For example, suppose that each 
of the 200 number pairs of the guessing-sequence 
experiment has been written on a separate card, In 
addition, two three-sided dice have been rolled 
200 times and the results noted on additional 
cards. One of the 400 cards is selected at random 
and found to contain a run. The probability that 
this card comes from the guessing sequence is 
(4x .175)/[(4x .175) + (4x 4)]=.344. Thus the 
appearance of a run changes the prior probability 
.500 to the posterior probability .344, reflecting the 
much smaller probability of a run in the empirical 
model than in the dice model. 

Combinatorial formulas. The task of counting 
the number of points in the sample space S and in 
subsets of S can often be carried out more eco- 
nomically with the help of a few formulas from 
combinatorial analysis. A permutation is an ar- 


rangement of some of the objects in a set in 
which order is relevant. A combination is a selec- 
tion in which order is irrelevant. Given n different 
objects, m <n of them can be selected in 


n\ _ n! 
(n) m! (n-m)! 
_ n(n—1)-:: (n—m+1) 
m(m—1)-:-1 


ways, when no attention is paid to the order. The 
symbol (2) (called a binomial coefficient and some- 
times written C») is read as “the number of com- 
binations of n things taken m at a time.” For a 
positive integer r, r! (read “r-factorial”) stands for 
the product of the first r positive integers, r! = 
r(r—1) «++ 1. By definition, 0! = 1. It is possible 
to arrange m out of n different objects in (n)» = 
mi(")=n(n—1)-++: (n—m+1)=n!I/(n—-m)! 
ways. The symbol (n)m is read as “the number of 
permutations of n things taken m at a time.” If 
the n objects are not all different—in particular, 
if there are n, of a first kind, n, of a second kind, 
and so on up to m of a kth kind such that n = n, + 


‘++ + m—then there are n!/(n,! --- nų!) ways in 
which all n objects can be permuted, For k=2 
this expression reduces to (7,) = (7,). See Niven 


(1965) and Riordan (1958) for combinatorial 
problems. 


Random variables 


Definition. In many chance experiments the 
chief interest relates to numerical information fur- 
nished by the experiment. A numerical quantity 
whose value is determined by the outcome of a 
chance experiment—mathematically, a single- 
valued function defined for every point of the 
sample space—is called a random variable (abbre- 
viated r.v.), The remainder of this article deals 
with such variables. For reasons of mathematical 
simplicity, the discussion will be in terms of r.v.’s 
that take only a finite number of values. Actually, 
with suitable modifications the results are valid 
generally. (Some details are given below.) It is 
customary to denote r.v.’s by capitals such as X, Y, 
Z. For example, consider the sample space con- 
sisting of the n students in a college, each student 
having the same probability, 1/n, of being selected 
for an “experiment.” A possible r.v., X, is the IQ 
Score of a student. A second r.v., Y, is a student's 
weight to the nearest five pounds. A third r.v., Z, 
may take only the value 1 or 2, depending on 
whether a student is male or female. The example 
shows that many r.v.’s can be defined on the same 
sample space. 

As a further example, consider the r.v. W, equal 
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Table 3 — Relation between sample points and values 
of a random variable 


Values of random 


Sample space voriable, W 
ay a2 03 Y 3) 4 
ea, ea, es “3a 4 5 
Gliese sea" 4 8 6 


to the sum of the two numbers in a guessing 
sequence of length 2. Table 3 shows the relation- 
ship between sample points and values of W. To 
each point in S there corresponds exactly one value 
of W. Generally, the reverse is not true. 

While it is possible to study the calculus of prob- 
abilities without specifically defining r.v.’s, formal 
introduction of such a concept greatly clarifies 
basic ideas and simplifies notation. If the sample 
space S in Table 3 actually refers to successive rolls 
of a three-sided die, possibly with unequal prob- 
abilities for the three sides, it is natural to consider 
only the sum of the two rolls as the event of interest 
[see SuFFiIcrENCy]. This can be done by defining 
separately the five events sum equals 2, sum equals 
8, ---, sum equals 6 or by considering vaguely a 
“wandering” variable that is able to take the values 
2 or 3 or --- or 6. The r.v. W defined above and il- 
lustrated in Table 3 combines both approaches in a 
simple and unambiguous way. Note that the event 
F can be expressed as the event W = 4. 

The advantages of a formal concept are even 
more pronounced when it is desirable to consider 
two or more measurements jointly. Thus, in the 
earlier example an investigator might not be inter- 
ested in considering weight and sex of students 
each by itself but as they relate to one another. This 
is accomplished by considering for every point in 
the sample space the number pair (Y,Z) where 
Y and Z are the r.v.’s defined earlier. Perhaps most 
important, the concept of r.v. permits more con- 
cise probability statements than the ones associ- 
ated with the basic sample space. 

Frequency function of a random variable. Let 
X be a random variable with possible values 
Xı, >*t, Xz. Define a function f(x) such that for 
x=x; (i=1,---,k), f(x;) is the probability of 
the event that the basic experiment results in an 
outcome for which the r.v. X takes the value 
x;, f(x;) = P(X = x;). Clearly, f(x,) + --+ + fa) = 1. 
The function f(x) defined in this way is called the 
frequency function of X. For example, in the dice 
model the frequency function of W is 


w|2 3 74 56 
fw) + tah 4 4 
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Mean and yariance of a random yariable. The 
expected value of the r.v. X, which is denoted by 
E(X) (or by EX or px or simply p if there is no 
ambiguity ), is defined as the weighted average E(X) 
=x,f(x,) + +++ + axf(x,). E(X) is also called the 
mean of X. More generally, if H(x) is a function of 
x, the expected value of the r.v. H(X) is given by 


E[H(X)] = H(x1)f(x:) + -++ + H( xx) f(%x) 
= Prix 


The function H(x) = (x —)? is of particular in- 
terest and usefulness. Its expected value is called 
the variance of X and is denoted by varX (or o% 
or o*), varX = E(X — x)? = (x, — p) f(x) + + 
(x, — u)? f(x). The positive square root of the vari- 
ance is called the standard deviation and is denoted 
by s.d.X (or oy or o). 

If the r.v. Y is a linear function of the r.v. X, Y = 
a+ bX, where a and b are constants, then py = a + 
buy, o} = b?o$, and o, = |b\o,, where |b| denotes 
the numerical value of b without regard to sign. 

The mean and variance of a r.v. are two impor- 
tant characteristics. Their true theoretical signifi- 
cance emerges in connection with such advanced 
theorems as the law of large numbers and the 
central limit theorem (see below). The useful addi- 
tive property of means and variances is also stated 
below. On an elementary level, the mean and the 
variance are useful descriptive or summary meas- 
ures [see STATISTICS, DESCRIPTIVE, article on LOCA- 
TION AND DISPERSION]. 

The mean is but one of the “averages” that can 
be computed from the values x,, ---, x; of the r.v. 
Another average is the median, Med, defined by 
the two inequalities P(X < Med) >4 and P(X > 
Med) > }. Thus the median—which may not be 
uniquely defined—is a number that cuts in half, as 
nearly as possible, the frequency function of X. 

The standard deviation is a measure of vari- 
ability or spread around the mean, a small standard 
deviation indicating little variability among the pos- 
sible values of the r.v., a large standard deviation 
indicating considerable variability. This rather vague 
statement is made more precise by the Bienaymé- 
Chebyshev inequality. This inequality establishes a 
connection between the size of the standard devia- 
tion and the concentration of probability in inter- 
vals centered at the mean. 

The Bienaymé-Chebyshev inequality. Let ô be 
an arbitrary positive constant. The probability that 

a r.v. X takes a value that deviates from its mean 
by less than ô standard deviations is at least 
1— 1/8, P(|X — p| < ôr) > 1 — 1/82. This is one 
of the forms of the Bienaymé—Chebyshev inequality 
(often called the Chebyshev inequality). Often the 


(4) 


complementary result is more interesting. The prob- 
ability that a r.v. X takes a value that deviates from 
its mean by at least 6 standard deviations is at 
most 1/6*, P(|X — | > r) < 1/8*. Although there 
are r.v.’s for which the inequality becomes an 
equality, for most r.v.’s occurring in practice, the 
probability of large deviations from the mean is 
considerably smaller than the upper limit indicated 
by the inequality. An example is given below. The 
Chebyshev inequality illustrates the fact that for 
probabilistic purposes the standard deviation is the 
natural unit of measurement. 

The binomial distribution. Consider an experi- 
ment that has only two possible outcomes, called 
success and failure. The word trial will be used to 
denote a single performance of such an experiment. 
A possible sample space for describing the results 
of n trials consists of all possible sequences Of 
length n of the type FFSSS -:: F, where S stands 
for success and F for failure. A natural r.v. defined 
on this sample space is the total number X of 
successes [see SUFFICIENCY]. If successive trials are 
independent and the probability of success in any 
given trial is a constant p, then 


(5) f(x) =P(K=x)=(%) pa —p)™, 


w= '0)1, ++", n 


The r.v. X is called the binomial r.v. and the fre- 
quency function (5) the binomial distribution. 
[See DISTRIBUTIONS, STATISTICAL for a discussion 
of the binomial distributions and other distributions 
mentioned below.] The expected or mean value of 
X is np and the standard deviation is Vnp(1 ~ P). 
According to the Chebyshev inequality the upper . 
limit for the probability that a r.v. deviates from 
its mean by two standard deviations or more is .25. 
For the binomial variable with n = 50 and p= 4, 
for example, the exact probability is only .033, con- 
siderably smaller than the Chebyshev limit. 

Joint frequency functions. Let X and Y be two 
r.v.’s defined on the same sample space. Let X take 
the values x,,-+-,x,, and let Y take the values 
Yı, +, Ya. What then is the probability of the 
joint event X = x; and Y = y; ? The function f(x,y) 
such that .f(x;, yj) = P(X = x; and Y=y;) (i= 1, 
-++,k; j=1,-*+,h) is called the bivariate fre- 
quency function of X and Y. In terms of f(x,y) the 
marginal frequency function of X is given by 
f(x) = P(X =x) = DA f(x;,y;) and, similarly, 
the marginal frequency function of Y by g(y;) = 
P(Y =y,) = Eta f(x,,y;)- Marginal distributions 
are used to make probability statements that in- 
volve only one of the variables without regard to the 
value of the other variable. A different situa- 


tion arises if the value of one of the variables 
becomes known. In that case, probability state- 
ments involving the other variable should be con- 
ditional on what is known. If in (1), A and B 
are defined as the events X = x; and Y = y; , respec- 
tively, the conditional frequency function of the 
r.v. Y, given that X has the value x;, is found to be 
g(yi|%1) = PCY = yi|X = x) = f(x,y) /F (xi), with 
a corresponding expression for the conditional fre- 
quency function of X given Y. For example, in the 
empirical model for the guessing-sequence experi- 
ment, associate a r.v. X with the first guess and a 
r.v. Y with the second guess. Then f(x,y) = P. (the 
first guess is x and the second guess is y), x,y= 
1,2,3. In Table 2 the last column on the right repre- 
sents the marginal distribution of X and the last row 
on the bottom the marginal distribution of Y. It is 
noteworthy that these marginal distributions do 
not differ very much from the marginal distribu- 
tions for the dice model in which all probabilities 
equal 4, As an example of a conditional probability, 
note that g(1|1) =P,(Y=1|X=1) = .065/.360 = 
181, a result that differs considerably from the 
corresponding probability 4 for the dice model. 
The two r.v.’s X and Y are said to be independent 
(or independently distributed) if the joint fre- 
quency function of X and Y can be written as the 
product of the two marginal frequency functions, 


(6) F(x.) = f: )g y). 


If X and Y are independent r.v.’s, knowledge of the 
value of one of the variables, provided that f(x,y) 
is known, furnishes no information about the other 
variable, since if (6) is true, the conditional fre- 
' quency functions equal the marginal frequency 
functions, f(x|y) = f(x) and g(y|x) = g(y), what- 
ever the value of the conditioning variable. The 
tvs X and Y in the guessing-sequence example 
are independent in the dice model and dependent 
in the empirigal model. 
In addition to the means px and py and the vari- 
ances of and o% of the marginal frequency func- 
tions one defines the covariance, 


E(X — py) (Y — py) 
È E- a)l — orf) 


izi j= 


E(XY) — px, 


Oxy 


Il 


Ii 


I 


and the correlation coefficient, p = oxy/ox0%. If X 
and Y are independent, their covariance and, con- 
Sequently, their correlation are zero. The reverse, 
lOwever, is not necessarily true; two r.v.’s, X and 
Y, can be uncorrelated (that is, can have correla- 
tion coefficient zero) without being independent. 
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The covariance of two r.v.’s is a measure of the 
“co-variability” of the two variables about their re- 
spective means; a positive covariance indicates a 
tendency of the two variables to deviate in the same 
direction, while a negative covariance indicates a 
tendency to deviate in opposite directions. Although 
the covariance does not depend on the zero points 
of the scales in which X and Y are measured, it 
does depend on the units of measurement. By di- 
viding the covariance by the product of the stand- 
ard deviations a normalization is introduced making 
the resulting correlation coefficient independent of 
the units of measurement as well. (If, for example, 
X and Y represent temperature measurements, the 
correlation between X and Y is the same whether 
temperatures are measured as degrees Fahrenheit 
or as degrees centigrade.) The concepts of covari- 
ance and correlation are closely tied to linear as- 
sociation. One may have very strong (even com- 
plete) nonlinear association and very small (even 
zero) correlation. 

The concepts discussed in this section generalize 
from two to more than two variables. 


Sums of random variables 


Mean and variance of a sum of random variables. 
Let X,,--:,X, be a set of n r.v.’s with means 
E(X;) =p, variances E(X;—p,)?=07, and co- 
variances E(X; — pi)(X;- mj) =o% (i7=1,°°°, 
n; i#j). Some of the most fruitful studies in the 
calculus of probabilities are concerned with the 
properties of sums of the type Z = ¢,X, + +++ + Cary 
where the c; are given constants. In this article 
only some of the simpler, although nevertheless 
highly important, results will be stated. 

The mean and variance of Z are py, = Cif, + 
+++ Capin and of = Cio} + +++ + Choy + 2c,c.012 
+ 2e,Co1y + ++ + WCyACyTn-1,n. For the remainder 
assume that the r.v.’s X,,---,X, are independ- 
ently and identically distributed. This is the mathe- 
matical model assumed for many statistical investi- 
gations. The common mean of the r.v.’s X,, +++, Xn 
is denoted by u and the common variance by o°. 
Of particular interest are the two sums S, = X, + 
+++ 4X, and X= (X,++::+X,)/n=S,/n, (The 
binomial r.v. X is of the form S, if a r.v. X; is as- 
sociated with the ith trial, X; taking the value 1 or 
0, depending on whether the ith trial results in 
success or failure.) For the sum S,, w= np, o}, = 
no®, and for X, pz = p, o$ =0°/n. 

The law of large numbers. If Chebyshev’s in- 
equality is applied to X, then for arbitrarily small 
positive 5 


5 o 
P(IR—p| <5) > 1— apr 
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By choosing n sufficiently large, the right side can 
be made to differ from 1 by as little as desired. It 
follows that the probability that X deviates from y 
by more than some arbitrarily small positive quan- 
tity ô can be made as small as desired. In particular, 
in the case of the binomial variable X, the proba- 
bility that the observed success ratio, X = X/n, 
deviates from the probability, p, of success in a 
single trial by more than 6 can be made arbitrarily 
small by performing a sufficiently large number of 
trials. This is the simplest version of the celebrated 
law of large numbers, more commonly known—and 
misinterpreted—as the “law of averages.” The law 
of large numbers does not imply that the observed 
number of successes X necessarily deviates little 
from the expected number of successes np, only that 
the relative frequency of success X/n is close to p. 
Nor does the law of large numbers imply that, given 
RX >p after n trials, the probability of success on 
subsequent trials is small in order to compensate for 
an excess of successes among the first n trials. 
Nature “averages out” by swamping, not by fluctu- 
ating. 

In more advanced treatments this law is called the 
weak law of large numbers, to distinguish it from a 
stronger form. 

The central limit theorem. The law of large 
numbers has more theoretical than practical sig- 
nificance, since it does not furnish precise or even 
approximate probabilities in any given situation. 
Such information is, however, provided by the cen- 
tral limit theorem: As n increases indefinitely, the 
distribution function of the standardized variable 
(Sn — Hsn) /Ts, = (Sn — nu )/ Vno? converges to the 
so-called standard normal distribution. (A general 
discussion of the normal distribution is given be- 
low. ) For practical purposes the stated result means 
that for large n the probability that S, takes a value 
between two numbers a <b can be obtained ap- 
proximately as the area under the standard normal 
curve between the two points (a — ng )/ Vno and 
(b — nu)/ Vno”. In particular, the probability that 
in a binomial experiment the number of successes is 
at least k, and at most k, (where k, and k, are in- 
tegers) is approximately equal to the area under 
the normal curve between the limits (k, — 4 — np)/ 
vnp(I =p) and (k, +3—np)/Vnp(1 =p), pro- 
vided n is sufficiently large. Here a continuity cor- 
rection of 4 has been used in order to improve the 

approximation. For most practical purposes n may 
be assumed to be “sufficiently large” if np(1 — p)is 
at least 3. 

The central limit theorem occupies a basic posi- 
tion not only in theory but also in application. The 
sample observations x,, :-- , x, drawn by the statis- 
tician may be looked upon as realizations of n jointly 


distributed random variables X,, ---, X,. It is cus- 
tomary to refer to a function of sample observations 
as a statistic. From this point of view, a statistic is a 
r.v. and its distribution function is called the sam- 
pling distribution of the statistic, The problem of 
determining the sampling distributions of statistics 
of interest to the statistician is one of the important 
problems of the calculus of probabilities. The central 
limit theorem states that under very general con- 
ditions the sampling distribution of the statistic S, 
can, for sufficiently large samples, be approximated 
in a suitable manner by the normal distribution. 

More complicated versions of the law of large 
numbers and the central limit theorem exist for the 
case of r.v.’s that are not identically distributed and 
are even dependent to some extent. 


A more general view 


A more general view of random variables and 
their distributions will now be presented. 

Discrete and continuous random variables. For 
reasons of mathematical simplicity the discussion 
so far has been in terms of r.v.’s that take only a 
finite number of values. Actually this limitation was 
used explicitly only when giving such definitions as 
that of an expected value, Theorems like Cheby- 
shev’s inequality, the law of large numbers, and the 
central limit theorem were formulated without men- 
tion of a finite number of values, Indeed, they are 
true for very general r.v.’s. The remainder of this 
article will be concerned with such r.v.’s. Of neces- 
sity the mathematical tools have to be of a more 
advanced nature. (For infinite sample spaces, there 
arises the need for a concept called measurability in 
the discussion of events and of random variables. 
For simplicity such discussion is omitted here. ) 

By definition, a r.v. X is'a single-valued function 
defined on a sample space. For every number x 
(-» < x < %), the probability that X takes a value 
that is smaller than or equal to x can be determined. 
Let F(x) denote this probability considered as a 
function of x, F(x) = P(X <x). F(x) is called the 
(cumulative) distribution function of the r.v. X. The 
following properties of a distribution function are 
consequences of the definition: F(—%) = 0; F( oy = 
1; F(x) is monotonically nondecreasing, that is, 
F(x,) < F(x.) if x,<x.. Furthermore, F(x) ~ 
F(x,) = P(x, <X <x). Such a function may be 
continuous or discontinuous. If discontinuous, it has 
at most a denumerable number of discontinuities, at 
each of which F(x) has a simple jump, or saltus. 
The height of this jump is equal to the probability 
with which the r.v. X takes the value x where the 
discontinuity occurs. At the same time, if F(x) 
is continuous at x, the probability of the event X = ¥ 
is zero. 


Let F(x) be discontinuous with discontinuities 
occurring at the points x,, X2, *-*, Xn, °*:. (If there 
are only a finite number of discontinuities, denote 
their number by n.) Let the size of the jump occur- 
ring atx = x; (i= 1,2, --+,m, --+) beequal to f(x;). 
A particularly simple case occurs if f(x,) + f(x.) + 
+++ + f(X) + +++ = 1. In this case F(x) is a “step 
function” and x,, X2, +++, Xa, **- are the only values 
taken by the r.v. X. Such a r.v. is said to be discrete. 
Clearly the r.v.’s considered earlier are discrete r.v.’s 
with a finite number of values. As before, call f(x) 
the frequency function of the r.v. X. In terms of 
f(x), F(x) is given by F(x) =Df(x;:), where 
—« <x < and the summation extends over all 
xı that are smaller than or equal to x. 

If F(x) is continuous for all x, X is said to be a 
continuous r.v. Consider the case where there exists 
a function f(x) such that F(x) = FECE) dt. (In sta- 
tistical applications this restriction is of little im- 
portance.) The function f(x) = dF(x)/dx is called 
the density function of the continuous r.v. X. Clear- 
ly f(x) > 0 and ff(t) dt = 1. Furthermore, F(x) — 


F(x,) = f(t) dt = P(x, <X <x.) = P(x, SX <x.) 


= P(x, < X < x,) = P(x, < X < x), since for a con- 
tinuous r.v. X, P(X = x) = 0 for every x. It follows 
that for a continuous r.v. X with density function 
f(x) the probability that X takes a value between 
two numbers x, and x, is given by the area between 
the curve representing f(x) and the x-axis and 
bounded by the ordinates at x, and x, (Figure 3). 
Although r.v.’s that are part continuous and part 
discrete occur, they will not be considered here. 


F(x) 


Ea Xs 


Figure 3 — P(x, < X < x,) as area 


For two continuous r.v.’s defined on the same 
sample space, stipulate the existence of a bivariate 
density function f(x,y) such that 

41 V3 
P(x, <X <x, and y, SY <y:) = J [fan dxdy. 
7 Wy 
In terms of f(x,y), marginal and conditional den- 
Sity functions can be defined as in the case of two 
discrete variables. 
_ Let H(x) be a function of x. An interesting and 
Important problem is concerned with the distribu- 
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tion of the derived random variable H(X). Here 
only the expected value of H(X) will be discussed. 
E[H(X)] can be expressed in terms of the distribu- 
tion function F(x) by means of the Stieltjes integral 


(7) E[H(X)] =[H(x) dF(x). 


For discrete r.v’s, (7) reduces to E[H(X)]= 
SH (x: )f(xi). If X has a density function f(x), 
(7) becomes E[H(X)] = fH(x)f(x)dx. One im- 


-> 


portant new factor arises that was not present in 
(4), the question of existence. In (7) the expected 
value exists if and only if the corresponding sum or 
integral converges absolutely. This condition means 
that in the discrete case, for example, E[H(X)] 
exists just when Ð |H(x;)|f(x:) converges. 

If H(x) =x*, k=1,2,---, the corresponding 
expected value is called the kth moment about the 
origin and is denoted by pi, mi, = E(X*). Of par- 
ticular interest is the first moment, u’, which was 
written earlier as u. The kth central moment, px, 
is defined as p, = E(X — p)! = pi, — kph, £ + 
(—1)*u*. The first central moment is zero. The sec- 
ond central moment is the variance, denoted by o°, 
a? = E(X — p)? = p’ — p*. The moments are of in- 
terest because of the information that they provide 
about the distribution function. Thus the Chebyshev 
inequality shows the kind of information provided 
by the first two moments. Additional moments 
provide more and more precise information. Fi- 
nally, in many circumstances, knowledge of the 
moments of all orders uniquely determines the 
distribution function. 

Generating functions. Rather than compute 
moments from their definitions, it is often simpler 
to make use of a generating function. The moment- 
generating function, M(u), is defined as the ex- 
pected value of the random variable e“*, where u 
is a real variable. The characteristic function is 
defined as the expected value of e*“*, where u again 
is real and i= V~]. The characteristic function 
has the advantage that it always exists. The mo- 
ment-generating function exists only for r.v.’s that 
have moments of all orders. For k = 1, 2,---, the 
kth moment can be found as the kth derivative of 
M(u) evaluated at u=O0, yi,=d'M(u)/du',.,. 
A corresponding result holds for characteristic 
functions. 

While the moment-generating property of the 
moment-generating function is useful, its main 
significance arises from the uniqueness theorem. 
A moment-generating function uniquely deter- 
mines its distribution function, and it is often 
easier to find the moment-generating function of 
a r.v. than its distribution function. As an example, 
consider the r.v. Z=X+/Y, where X and Y are 
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two independently distributed r.v.’s. It follows from 
the definition of a moment-generating function 
that the moment-generating function of Z is the 
product of the moment-generating functions of X 
and Y. No such simple relationship exists between 
the distribution function of Z and those of X and Y. 
However, once the moment-generating function of 
Z is known, it is theoretically possible to determine 
its distribution function, These results also hold for 
characteristic functions. 

The Poisson and normal distributions. In con- 
clusion two examples will be given—one involving 
a discrete r.v., the other a continuous r.v. When 
considering random events occurring in time, one 
is often interested in the total number of occur- 
rences in an interval of given length. An example 
is the number of suicides occurring in a community 
in a year's time. Then a discrete r.v. X with possible 
values 0, 1, 2, --- can be defined. Often an appro- 
priate mathematical model is given by the Poisson 
distribution, according to which 


Fx)=P(X=x)=e9%, x =0,1,2--, 
where À is a characteristic of the type of random 
event considered. The moment-generating func- 
tion of X is M(w) = exp[A(e“ — 1)], where exp w 
stands for e”. Then p = à = a°, Thus À is the mean 
number of occurrences in the given time interval 
and, at the same time, is also the variance of the 
number of occurrences. Furthermore, the sum of 
two independent Poisson variables is again a Pois- 
son variable whose parameter À is the sum of the 
parameters of the two independent variables. 

The Poisson distribution serves as an excellent 
approximation for binomial probabilities, if the 
probability of success, p, is small. More exactly, 
if np is set equal to A, the binomial probability in 
(5) can be approximated by e-\*/x!, provided n is 
sufficiently large. This approximation is particu- 
larly useful when only the product np is known 
but not the values of n and p separately. 

A continuous r.v. with density function f(x) = 
exp [—(x — a)*/2b*]/ \/27b* is said to have the 
normal distribution with parameters a and b > 0. 
The moment-generating function of such a variable 
is exp (au + 4b*u*), from which p =a, o? = b, It 
is therefore customary to write the density as 


1 
F) = Faas OP [Le z w]. 
A particular normal density function is obtained 
by setting » = 0 and o*=1. This simple function 
is called the unit (or standard) normal density 
function; it appeared above in the discussion of 
the central limit theorem. The sum of two or more 
independent normal variables is again normally 


distributed with mean and variance equal to the 
sums of means and variances, respectively. 

The normal distribution is often used as a mathe- 
matical model to describe populations such as that 
of scores on a test. Arguments in support of the 
normality assumption are customarily based on 
the central limit theorem. Thus it is argued that 
the value of a given measurement is determined 
by a large number of factors. It is less frequently 
realized that reference to the central limit theorem 
implies also that factors act in an additive fashion. 
Nevertheless, experience shows that the degree of 
nonnormality occurring in practice is often so small 
that the assumption of actual normality does not 
lead to erroneous conclusions. 


GOTTFRIED E, NOETHER 


[For a discussion of the various distributions mentioned 
in the text, see also DISTRIBUTIONS, STATISTICAL.) 
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LLI 
INTERPRETATIONS 


Many disputes—what are they about? There are 
myriad different views on probability, and disputes 
about them have been going on and increasing for 
a long time. Before outlining the principal ques- 
tions, and the main attitudes toward them, we note 
a seeming contradiction; it may be said with equal 
truth that the different interpretations alter in no 
substantial way the contents and applications of 


the theory of probability and, yet, that they utterly 
alter everything. It is important to have in mind 
precisely what changes and what does not. 

Nothing changes for the mathematical theory 
[see PROBABILITY, article on FORMAL PROBABILITY]. 
Thus a mathematician not conceptually interested 
in probability can do unanimously acceptable work 
on its theory, starting from a merely axiomatic 
basis. And often nothing changes even in practical 
applications, where the same arguments are likely 
to be accepted by everyone, if expressed in a suf- 
ficiently acritical way (and if the validity of the 
particular application is not disputed because of 
preconceptions inherent in one view or another). 

For example, suppose someone says that he 
attaches the probability one sixth to an ace at his 
next throw of a die. If asked what he means, he 
may well agree with statements expressed roughly 
thus: he considers $1 the fair insurance premium 
against a risk of $6 to which he might be exposed 
by occurrence of the ace; the six faces are equally 
likely and only one is favorable; it may be expected 
that every face will appear in about 4 of the trials 
in the long run; he has observed a frequency of } 
in the past and adopts this value as the probability 
for the next trial; and so on. Little background is 
needed to see that each of these rough statements 
admits several interpretations (or none at all, if 
one balks at insufficient specification). Moreover, 
only one of the statements can express the very 
idea, or definition, of probability according to this 
person’s language, while the others would be ac- 
cepted by him, if at all, as consequences of the 
definition and of some theorems or special addi- 
tional assumptions for particular cases. It would be 
a most harmful misappraisal to conclude that the 
differences in interpretation are meaningless ex- 
cept for pedantic hairsplitters or, even worse, that 
they do not matter at all (as when the same 
geometry is constructed from equivalent sets of 
axioms beginning with different choices of the 
Primitive notions). The various views not only 
endow the same formal statement with completely 
different meanings, but a particular view also 
usually rejects some statements as meaningless, 
thereby restricting the validity of the theory to a 
narrowed domain, where the holders of that view 
feel more secure, Then, to replace the rejected 
Parts, expedients aiming at suitable reinterpreta- 
tion are often invented, which, naturally, are only 
misinterpretations for the adherents of other views. 


A bit of history 


The beginnings. It is an ambitious task even 
to make clear the distinctions and connections be- 
tween the various schools of thought, so we re- 
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nounce any attempt to enter far into their historical 
vicissitudes. A sketch of the main lines of evolution 
should be enough to give perspective. [See STATIS- 
Tics, article on THE HISTORY OF STATISTICAL 
METHOD, ] 

In an early period (roughly 1650-1800), the 
mathematical theory of probability had its begin- 
nings and an extraordinarily rapid and fruitful 
growth. Not only were the fundamental tools and 
problems acquired, but also the cornerstones of 
some very modern edifices were laid: among others. 
the principle of utility maximization, by Daniel 
Bernoulli in 1738; the probabilistic approach to in- 
ductive reasoning and behavior, by Thomas Bayes 
in 1763; and even the minimax principle of game 
theory, by de Waldegrave in 1712 (Guilbaud 
1961). But interest, in those days, focused on 
seemingly more concrete problems, such as card 
games; conceptual questions were merely fore- 
shadowed, not investigated critically; utility theory 
remained unfruitful; Bayes’ principle was mislead- 
ingly linked with Bayes’ postulate of uniform initial 
distribution; de Waldegrave’s idea went unnoticed 
[see BAYESIAN INFERENCE; GAME THEORY]. 

It happened thus that some applications of prob- 
ability to new fields, such as judicial decisions, 
were bold and careless, that the Bayesian approach 
was often misused, and that ambiguity in interpre- 
tation became acute in some contexts. Particularly 
troublesome was the meaning of “equally likely,” 
when—with Laplace in 1814—this notion came 
forward as the basis of an ostensible definition of 
probability. 

A confining criticism. A bitter criticism arose 
that was prompt to cut away all possible causes of 
trouble rather than to analyze and recover sound 
underlying ideas. This attitude was dominant in 
the nineteenth century and is still strong. Concern- 
ing “equal probabilities,” it has long been debated 
whether they ought to be based on “perfect knowl- 
edge that all the relevant circumstances are the 
same” or simply on “ignorance of any relevant cir- 
cumstance that is different,” whatever these ex- 
pressions themselves may mean. To illustrate, is 
the probability of heads 3 for a single toss only if 
we know that the coin is perfect, or even if it may 
not be but we are not informed which face happens 
to be favored? The terms objective and subjective 
—now used to distinguish two fundamentally dif- 
ferent natures that probability might be understood 
to have—first appeared in connection with these 
particular, not very well specified, meanings of 
“equally probable.” 

It is apparent how narrow the field of probability 
becomes if restricted to cases arising from sym- 
metric partitions. Sometimes, it was said, applica- 
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tions outside this field could be allowed by “analogy,” 
but without any real effort to explain how far (and 
why, and in what sense) such an extension would 
be valid. 

Authors chiefly interested in statistical problems 
were led to another confining approach that hinges 
on that property of probability most pertinent to 
their field, namely, the link with frequency, for 
example, Venn (1866), or the limit of frequency, 
for example, Von Mises (1928). In any such the- 
ory, the limitation of the field of probability is less 
severe, in a sense, but is vaguely determined—or 
altogether undetermined if, as I believe, such theo- 
ries are unavoidably circular. 

A liberating criticism. There is today a vigor- 
ous revival of the current of thought (mentioned 
above in the subsection “The beginnings”) that 
could not find its true development in the eight- 
eenth century for want of full consciousness of its 
own implications. According to this outlook, an 
attempt to replace the familiar intuitive notion of 
probability by any, necessarily unsuccessful, imita- 
tion is by no means required, or even admissible. 
On the contrary, it suffices to make this intuitive 
notion neat and clear, ready to be used openly for 
what it is. 

The most deliberate contributions to this pro- 
gram were along two convergent lines, one based 
on what may be called admissibility, by Frank P. 
Ramsey in 1926 (see 1923-1928) and Leonard J. 
Savage (1954), and one on coherence, or con- 
sistency, by Bruno de Finetti (1930) and B. O. 
Koopman (1940a; 1940b). In addition, the impact 
of authors supporting views concordant only in 
part with this one (like John Maynard Keynes 
1921; Emile Borel 1924; and Harold Jeffreys 1939), 
as well as the impact of many concomitant circum- 
stances, was no less effective. We can but list the 
following principal ones: the development of some- 
what related theories (games, as by John von Neu- 
mann and Oskar Morgenstern 1944 [see GAME 
THEORY]; decisions, Abraham Wald 1950 [see DE- 
CISION THEORY]; logical investigations, Rudolf Car- 
nap 1950; 1952); the detection of shortcomings in 
objectivistic, or frequentistic, statistics; the applica- 
tions of probability to problems in economics and 
operations research; the survival of the “old-new” 
ideas in some spheres where common sense and 
practical needs were not satisfied by other theories, 
as in engineering (Fry 1934; Molina 1931), in actu- 
arial science, and in credibility theory (Bailey 
1950); and so on. Above all, the less rigid concep- 
tion of scientific thought following the decline of 
rationalistic and deterministic dogmatism facili- 
tated acceptance of the idea that a theory of uncer- 
tainty should find its own way. 


The objective and the subjective 


In logic. To make the step to probability easier, 
let us start with logic. The pertinent logic is that of 
sentences—more precisely, of sentences about the 
outside world, that is, sentences supposed to have 
some verifiable meaning in the outside world. We 
call such a sentence or the fact that it asserts—a 
harmless formal ambiguity—an event. 

(There is a usage current in which event denotes 
what would here be called a class of events that 
are somehow homogeneous or similar, and indi- 
vidual event, or trial, denotes what is here called an 
event. The alternative usage is favored by some 
who emphasize frequencies in such classes of 
events. The nomenclature adopted here is the more 
flexible; while not precluding discussion of classes 
of events, it raises no questions about just what 
classes of events, if any, play a special role.) 

Some examples of events—that is, of sentences, 
assertions, or facts—are these: A = Australia wins 
the Davis Cup in 1960. B =the same for 1961. 
C = the same for 1960 and 1961. 

What can be said of an event, objectively? Ob- 
jectively it is either true or false (tertium non 
datur), irrespective of whether its truth or falsity 
is known to us or of the reason for our pos- 
sible ignorance (such as that the facts are in the 
future, that we have not been informed, that we 
forget, and so on). However, a third term, inde- 
terminate, is sometimes used to denote events that 
depend on future facts, or, in a narrower sense, on 
future facts other than those considered to be fully 
controlled by deterministic physical laws. In this 
sense, event A was indeterminate until the deciding 
ball of the 1960 Cup finals, and B and C were in- 
determinate until 1961, when they became true. 
Yet, in the strictest logical terminology, they are 
true in the atemporal sense; or, with reference to 
time, they were and will be true forever, irrespec- 
tive of the time-space point of a possible observer. 

Subjectively—for some person, at a given mo- 
ment—an event may be certain, impossible, or 
dubious. But mistakes are not excluded; certain 
(or impossible) does not necessarily imply true (or 
false). 

In logic restricted to the subjective aspect, that 
is, ignoring or disregarding reality, one can say 
only whether a person’s assertions are consistent 
or not; including the objective aspect, one may be 
able to add that they are correct, unmistaken, or mis- 
taken. In the Davis Cup example (if A means not-A, 
and a that A is dubious), then ABC, Abc, aBe, abc, 
abC, ABC, ABC, ABC, AbC, aBC are the only consist- 
ent assertions. For example, AbC is not, for there is 
no possible doubt about B if C is certain. Taking 


reality into account (namely, that A, B, C are actu- 
ally all true), only the first, ABC, is correct; the 
next three are unmistaken, for no false event is 
considered as certain nor any true one as impos- 
sible; the remaining assertions are mistaken. 

From logic to probability. Probability is the ex- 
pedient devised to overcome the insufficiency of the 
coarse logical classification. To fill the questionable 
gap between true and false, instead of the generic 
term indeterminate, a continuous range of objec- 
tive probabilities may be contemplated. And the 
unquestionable gap between certain and impos- 
sible, the dubious, is split into a continuous range 
of degrees of doubt, or degrees of belief, which are 
subjective probabilities. 

The existence of subjective probabilities and of 
some kind of reasoning thereby is a fact of daily 
psychological experience; it cannot be denied, al- 
though it may be considered by some to be proba- 
bilistically uninteresting. (In the most radical 
negation its relation to the true probability theory 
is compared to that between the energy of a per- 
son’s will and energy as a physical notion.) To 
exploit this common experience, we need only a 
device to measure, and hence to define effectively, 
subjective probabilities in numerical terms and 
criteria to build up a theory. We consider three 
kinds of theories about subjective probability, SP, 
SC, and SR, which aim, respectively, to character- 
ize psychological, consistent, and rational behavior 
under uncertainty. 

Since the meaningfulness of objective proba- 
bilities is at least as questionable as the notion of 
indeterminateness, it is essential to ponder the vari- 
ous bases proposed for objective probability and 
the forces impelling many people to feel it im- 
portant. Here, too, we shall consider three kinds 
of theories of objective probability, OS, OF, and 
OL, based, respectively, on symmetry (or equi- 
probable cases), on frequency, and on the limit 
of frequency. 

Some remarks. The above classification into six 
kinds of theories is a matter of convenience. Some 
theories that do not fit the scheme must be men- 
tioned separately. In the axiomatic approach, for 
example, probability has, by convention, the nature 
of a measure as defined by Kolmogorov ([1933] 
1956, p. 2) and discussed by Savage (1954, 
P. 33); applications are made without specifying 
the meaning of probability or alluding to any of the 
interpretations discussed here. In other theories, 
Probabilities may be noncomparable, and hence 
nonnumerical; there are such variants not only of 
SP but also of SC (Smith 1961) or even SR, as by 
Keynes, 

Different theories are not necessarily incom- 
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patible. For instance, SC should be, and sometimes 
is, accepted by a supporter of SR as a preliminary 
weaker construction—as projective is preliminary 
to Euclidean geometry. Even less natural associa- 
tions are current: Carnap admits two distinct 
notions, prob, and prob,, that correspond to SR (or 
SC?) and OF (or OL?); if, like many psycholo- 
gists, one stops at SP or at its variant with non- 
comparability (Ellsberg 1961), judging SC to be 
unreasonably stringent, one is often inclined to 
accept much more stringent assumptions, such as 
OS, in particular fields; and so on. 

The order we shall follow is from the least to the 
most restrictive theory. In a sense, each requires 
the preceding ones together with further restric- 
tions, Beginning with the subjectivistic interpreta- 
tion, we have a good tool for investigating the 
special objectivistic constructions; they can in fact 
be imbedded into it, not only in form but also in 
substance. For the opinions a person holds as ob- 
jectively right will of course also be adopted by 
him among those he considers subjectively right 
for himself. 

Toward a subjectivistic definition. There is no 
difficulty in finding methods that attach to a sub- 
jective probability its numerical value in the usual 
scale from 0 to 1; what is difficult is weighing the 
pros and cons of various essentially equivalent 
measuring devices. Verbalistic comparisons or as- 
sertions are simple, but not suitable, at least at 
first, when we must grasp the real meaning from 
the measuring device. The device must therefore 
reveal preferences, by replies or by actions. The 
first way, by replies, is still somewhat verbalistic; 
the second is truly behavioristic but may be vitiated 
insofar as actions are often unpondered or dictated 
by caprice or by elusive side effects. 

Obviously, the latter inconveniences are particu- 
Jarly troublesome in measuring subjective entities. 
It must be considered, for example, whether some 
opponent exists who may perhaps have, or be able 
to obtain, more information or be inclined to cheat. 
Such trouble is not, however, peculiar to proba- 
bility. Measuring any physical quantity gives rise 
to the same sort of difficulties when its definition is 
based on a sufficiently realistic idealization of a 
measurement device. 

Roughly speaking, the value p, given by a person 
for the probability P(E) of an event E, means the 
price he is just willing to pay for a unit amount of 
money conditional on E’s being true. That is, the 
preferences on the basis of which he is willing to 
behave (side effects being eliminated) are deter- 
mined, with respect to gains or losses depending 
on E, by assuming that an amount S conditional on 
E is evaluated at pS. That must be asserted only 
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for sufficiently small amounts; the general ap- 
proach should deal in the same way with utility, a 
concept that, for want of space, is not discussed 
here [see UTILITY]. 

To construct a device that obliges a person to 
reveal his true opinions, it suffices to offer him a 
choice among a set of decisions, each entailing 
specified gains or losses according to the outcome 
of the event (or events) considered; such a set can 
be so arranged that any choice corresponds to a 
definite probability evaluation. The device can also 
be so simply constructed as to permit an easy dem- 
onstration that coherence requires the evaluation 
to obey the usual laws of the calculus of probability. 


The subjectivistic approach 


If we accept such a definition, the fundamental 
question of what should be meant by a theory of 
probability now arises. There seems to be agree- 
ment that it must lead to the usual relations be- 
tween probabilities, to the rules of the probability 
calculus, and also to rules of behavior based on 
any evaluation of probabilities (still excluding de- 
cisions about substantial amounts of money, which 
require utilities), The three kinds of theory an- 
nounced earlier differ in that they are looking for: 
a theory of actual behavior (SP); a theory of co- 
herent behavior (SC); a theory of rational behavior 
(SR)—in other words, they are looking either for 
a theory of behavior under uncertainty as it is or 
for a theory of behavior as it ought to be, either in 
the weaker sense of avoiding contradictions or in 
the stronger sense of choosing the “correct” proba- 
bility evaluations. 

Subjective probability, psychological (SP). No 
one can deny the existence of actual behavior or 
the interest in investigating it in men, children, 
rats, and so on. Such experimental studies yield 
only descriptive theories, which cannot be expected 
to conform to the ordinary mathematics of proba- 
bility. A descriptive theory may exist whether a 
corresponding normative one does or not. For ex- 
ample, when studying tastes, there are no ques- 
tions of which tastes are intrinsically true or false. 
When studying responses to problems of arith- 
metic or logic, it is meaningful to distinguish, and 
important to investigate, whether the answers are 
correct and what mistakes are made. 

One can be content to stop with the study of SP, 
accepting no normative theory—which SC, SR, OS, 
OF, and OL are, in a more or less complete sense. 
But one cannot object to a normative theory on the 
grounds that it does not conform to the actual be- 
havior of men or rats or children. A normative 
theory states what behavior is good or bad. We 


may question whether any normative theory exists 
or whether a given one has any claims to be ac- 
cepted. These questions have nothing to do with 
whether any beings do in fact behave according to 
the conclusions of the theory. It cannot therefore 
be confirmed or rejected on the basis of observa- 
tional data, which can on the contrary say only 
whether there is more or less urgent need to teach 
people how to behave consistently or rationally. 

Subjective probability, consistent (SC). The 
probability evaluations over any set of events what- 
ever can be mathematically separated into the 
classes of those that are coherent and those that 
are not. Coherence means that no bet resulting in 
certain loss is considered acceptable; coherence is 
equivalent to admissibility, according to which no 
decision is preferable to another that, in every case, 
gives as good an outcome [see DECISION THEORY]. 
This is meant here with reference to maximizing 
expectations; it should ultimately be transformed 
into maximizing expected utility, which is the most 
general case of coherent behavior (Savage 1954). 

As may be seen rather easily from the behavior- 
istic devices on which personal probability is 
founded, coherence is equivalent to the condition 
that the whole usual calculus of probability be 
satisfied. For instance, if C = AB (as in the exam- 
ple about the Davis Cup), the necessary and suf- 
ficient condition for coherence is 


P(A) + P(B) —1<P(C) <min[P(A), P(B)]. 


Properly subjectivistic authors (like the present 
one) think coherence is all that theory can pre- 
scribe; the choice of any one among the infinitely 
many coherent probability distributions is then 
free, in the sense that it is the responsibility of, 
and depends on the feelings of, whatever person 
is concerned. 

Subjective probability, rational (SR). We de- 
note as rational, or rationalistic, a theory that aims 
at selecting (at least in a partial field of events) 
just one of the coherent probability distributions, 
supposed to be prescribed by some principles of 
thought. (This is called a “necessary” view. Some 
writers use “rational” for what we call “coherent.” ) 

In most cases, such a view amounts to present- 
ing as cogent the feeling of symmetry that is likely 
to arise in many circumstances and with it the 
conclusion that some probabilities are equal, or 
uniformly distributed. But what really are the con- 
ditions where such an argument applies? A sym- 
metry in somebody's opinion is the conclusion ít- 
self, not a premise, and the ostensible notion of 
absolute ignorance seems inappropriate to any real 
situation. Less strict assumptions, such as sym- 


metry of syntactical structure of the sentences 
asserting a set of events, are likely to permit arbi- 
trariness and to lead to misuses—in most radical 
form, to the d'Alembert paradox, according to 
which any dubious event has p = 3, on the pretext 
that we are in a symmetric situation, being unable 
to deny either E or non-E, which means simply 
stopping at “dubious.” 


The objectivistic approach 


The picture changes utterly in passing from the 
subjectivistic to the objectivistic approach, if due 
attention is paid to the underlying ideas. There are 
no longer people intent on weighing doubts. It is 
Nature herself who is facing the doubts, irresolute 
toward decisions, committing them to Chance or 
Fortune. The mythological expressions are but 
images; yet the expressions commonly used in ob- 
jectivistic probability are, at bottom, equivalent to 
them, and the objectivistic language is so wide- 
spread that even attentive subjectivists sometimes 
lapse into it. It is considered meaningful to ask, 
for example, whether some effect is due to a cause 
or to chance (that is, is random); whether a fact 
modifies the probability of an event; whether a 
random variable is normal, or two are independent, 
or a process is Poisson; whether chance intervenes 
in a process (once for all, at one step, or at every 
instant); whether this or that phenomenon obeys 
the laws of probability or what their underlying 
chance mechanism is; and so on. 

In fact, an objective probability is regarded as 
something belonging to Nature herself (like mass, 
distance, or other physical quantities) and is sup- 
posed to “exist” and have a determined value even 
though it may be unknown to anyone. Quite nat- 
urally, therefore, objectivistic theories do actually 
deal with unknown probabilities, which are of course 
meaningless in a subjectivistic theory. Further- 
more, it can fairly be said that an objective prob- 
ability is always unknown, although hypothetical 
estimates of its values are made in a not really 
specifiable sense. How can one hope to communi- 
cate with such a mysterious pseudoworld of objec- 
tive probabilities and to acquire some insight on it? 

Objective probability, symmetrical (OS). The 
first partial answer comes from the objectivistic 
Interpretation of the “symmetry principle,” or “the 
Principle of cogent reason,” according to which 
identical experiments repeated under identical con- 
ditions have the same probability of success. This 
applies also to the case of several symmetric pos- 
sible outcomes of one experiment (such as the six 
faces of a die) and is often asserted also for com- 
binations (such as the 2" sequences of possible 
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outcomes of tossing a coin n times). Accepting— 
perhaps on the basis of SC, admitting that objec- 
tive probabilities must be consistent with subjective 
ones, or perhaps by convention—the rule of favor- 
able divided by possible cases, probability is defined 
in a range and way very similar to those of SR; 
but, even apart from the change from subjective 
to objective interpretation, it is not so close as it 
may seem, because the role of information is now 
lost. We can no longer content ourselves with 
asserting something about symmetry of the real 
world as it is known to us but are compelled to 
entangle ourselves in asserting perfect symmetry 
of what is unknown if not indeed unknowable. Or 
we may switch from this supernatural attitude to the 
harmless one of regarding such assertions as merely 
“hypothetical,” but then obtain only hypothetical 
knowledge of the objective probabilities, since the 
perfect symmetry is only hypothetical. For in- 
stance, what about differences arising from mag- 
netism before magnetism was known? Strictly, the 
perfect symmetry is contradictory, unless unavoid- 
able differences in time, place, past and current 
circumstances, etc. are bypassed as irrelevant. 

Objective probability, frequency (OF). Another 
answer is: “Objective probability is revealed by fre- 
quency.” It is a property that somehow drives 
frequency (with respect to a sufficiently large num- 
ber of “identical” events) toward a fixed number, p, 
that is the value of the probability of such events. 
Statistical data are, then, a clue to the ever un- 
knowable p if the events concerned are considered 
identical or to an average probability if their prob- 
abilities differ. 

OF actually presupposes OS whenever we intend 
to justify the necessity of arranging events in 
groups to get frequencies and to use these fre- 
quencies for other, as yet unknown, events of the 
group. Nonetheless, there may be a conflict if, for 
instance, the frequencies that occur with a die, 
accepted as perfect, are not almost equal, as can 
happen. 

Objective probability, limit of frequency (OL). 
In order to remove the unavoidable indeterminate- 
ness of OF, the suggestion has been made to in- 
crease the large number to infinity and to define p 
as the limit of frequency. Whether one does or 
does not like this idea as a theoretical expedient, 
clearly no practical observations or practical ques- 
tions do concern eternity; for dealing with real 
problems this theory is at best only an elusive 
analogy. 

Critique of objectivistic theories. Objectivistic 
theories are often preferred (especially by some 
practically oriented people, such as statisticians and 
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physicists), because they seem to join the funda- 
mental notions with practically useful properties 
by a direct short cut. But the short cut leaps un- 
fathomable gulfs. It admits of no bridge between 
us, with our actual knowledge, and the imagined 
objectivistic realm, which can be turned into an 
innocent allegory for describing some models but 
has no proper claim to being complete and self- 
supporting. The needed bridge is supplied by sub- 
jective probability, whose role seems necessary and 
unchanged, whether or not we want also to make 
some use of the unnecessary notion of objective 
probability for our descriptions of the world. If this 
bridge is rejected, only recourse to expedients is 
left. It will shortly be explained why subjective 
probabilities are said to provide the natural bridge, 
while objectivistic criteria (such as the usual meth- 
ods of estimating quantities, testing hypotheses, or 
defining inductive behavior) appear to be artificial 
and inadequate expedients. 


Inductive reasoning and behavior 


A feature that has been postponed, to avoid pre- 
mature distraction, must now be dealt with. A sub- 
jective probability, P(E), is of course conditional 
on the evidence, or state of information, currently 
possessed by the subject concerned; to make that 
explicit, we may write P(E|A), where A is the cur- 
rent state of information, which is usually left 
implicit. Any additional information, real or hypo- 
thetical, consisting in learning that an event H is 
true (and H may be the joint assertion, or logical 
product, of any number and kind of “simpler” 
events) leads from the probabilities P(E) to P(E|H), 
conditional on H (or on AH, if A need be made 
explicit). The coherence condition of SC suffices 
for all rules in the whole field of conditional prob- 
abilities and hence specifies by implication what 
it means to reason and behave coherently, not only 
in a static sense, that is, in a given state of infor- 
mation, but also in a dynamic one, in which new 
information arises freely or may be had at some 
cost by experiment or request. 

The passage from P(E) to P(E|H) is prescribed 
simply by the theorem of compound probabilities, 
or equivalently—in a slightly more elaborate and 
specific form—by Bayes’ theorem. As for the deci- 
sion after knowledge of H, it must obviously obey 
the same rules as before, except with P(E|H) in 
the place of P(E); the sole essentially new question 
is how best to spend time, effort, and money for 
more information and when to stop for final deci- 
cision, but that too is settled by the same rules. 

Coherence thus gives a complete answer to deci- 
sion problems, including even induction, that is, 


the use of new information; no room is left for 
arbitrariness or additional conventions. Of course, 
the freedom in choosing P(E), the initial distribu- 
tion, is still allowed (unless we accept uniqueness 
from SR), and similarly for utility. The unifying 
conclusion is this: Coherence obliges one to behave 
as if he accepted some initial probabilities and 
utilities, acting then so as to maximize expected 
utility. 

The particular case of statistical induction is 
simply that in which H expresses the outcome of 
several “similar” events or trials. The simplest con- 
dition, called exchangeability, obtains when only 
information about the number of successes and 
failures is relevant for the person, irrespective of 
just which events, or trials, are successes or fail- 
ures; and the most important subcase is exchange- 
ability of a (potentially) infinite class of events— 
like coin or die tossing, or drawings with replace- 
ment from an urn, or repetitions of an experiment 
under sufficiently similar conditions. The model of 
an infinite class of exchangeable events can be 
proved equivalent to the model that presents the 
events as independent conditional on the value of 
an unknown probability. whether the “unknown 
probability” is interpreted objectivistically or other- 
wise. In fact, to deal with the latter model con- 
sistently, one must—if not explicitly, at least in 
effect—start with an initial subjective distribution 
of the unknown probability; but exactly the same 
results are obtainable directly from the definition 
of exchangeability itself, without recourse to any 
probabilities other than those to which we have 
direct subjective access (Feller 1950-1966, vol. 2, 
p. 225). Whatever the approach, Bayes’ rule acts 
here in a simple way (we cannot go into the details 
here) that explains how we all come to evaluate 
probabilities in statistical-induction situations ac- 
cording to the observed frequencies, insofar as our 
common sense induces approximate coherence. 

The inconsistency just noted of using any but 
the Bayesian approach, even under an objectivistic 
formulation, should be discussed further. The ini- 
tial subjective probability distribution of a person 
must be the same for all decision problems that 
depend on a given set of events. It cannot be chosen 
by criteria that, like the minimax rule, depend on 
the specific problem or on the instrument of ob- 
servation, because such criteria, although co- 
herent within each problem, are not coherent 
over-all. Also, there is no justification for calling 
the Bayesian method unreasonable because the 
needed initial distribution is “unknown.” More ac- 
curately, it is of dubious choice (see, for example, 
Lindley 1965, vol. 2, pp. 19-21). Actually, any 


x 
method proposed for avoiding the risk of such a 
choice is demonstrably even worse than a specific 
choice. The situation is as though someone were 
to estimate the center of gravity of some weighted 
points of a plane by a point outside the plane be- 
cause he did not know the weights of the points 
sufficiently well; yet the projection of the estimate 
back onto the plane must improve it. This is not a 
mere analogy but a true picture in a suitable mathe- 
matical representation. This picture should be em- 
phasized because it shows that, far from opposition, 
there is necessarily agreement between inductive 
behavior and inductive reasoning, which is contrary 
to an opinion current in objectivistic statistics. 


A few nuances of the views 


No sketch of some of the representative views 
can cover the opinions of all authors, if only be- 
cause some might take a particular idea in earnest 
and push it to its extreme consequences while 
others might consider it merely as a suggestive 
abstraction to be taken with a grain of salt. Our 
sketches may therefore appear either as insuffi- 
cient or as caricatures. Still, even mentioning a few 
of the nuances may create a more realistic im- 
pression, 

Each notion changes its meaning with the theory; 
let us take, as an example, independence. Two 
drawings, with replacement, from an urn of un- 
known composition are called independent by an 
objectivist (since for him the probability is the 
unknown but constant proportion of white balls) 
but not by a subjectivist (for whom “independent” 
means “devoid of influence on my opinion”). For 
him, such events are only exchangeable; the ob- 
served outcomes, through their frequency, do alter 
the conditional probability of those not yet ob- 
served, whether in terms of an unknown probability 
—here, the urn composition—or directly. 

In OF, on the contrary, independence and the 
law of large numbers are almost prerequisites for 
the definition of probability. To escape confusing 
circularity, duplicates of notions are invented. Thus, 
the preliminary form of the law of large numbers 
is called the “empirical law of chance,” making a 
distinction based on a similar one between “highly 
probable” and “almost certain”—itself created ex- 
Pressly to be eliminated by the “principle of Cour- 
not.” This ostensible principle is one that seems to 
Suggest that we can practically forget the “almost” 
and consider probability as a method of deriving 
certainties; in a safer version, it might offer a link 
between subjective and objective probabilities. 

But the first prerequisite for OF is the grouping 
of events into classes, sometimes tacitly slipped in 
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by the terminology that uses the word “event” for 
a sequence of events rather than for an individual 
event. What is intended by such a class raises con- 
fusing questions. Does maintaining that probability 
belongs to a class imply that all events, or trials, 
in the class are equally probable? Or is that ques- 
tion to be rejected and the probability of a single 
trial held to be meaningless? What information 
entitles us to assign a given single event to such 
a class? These questions are aspects of a more 
general one that has still other aspects. For any 
objectivist, frequentistic or not, which states of 
information allow us to regard a probability as 
known? Which are insufficient and leave it un- 
known? Is there only one specific state of informa- 
tion concerning a given event to which a probability 
corresponds? Does the whole information on which 
a probability is based consist of all relevant circum- 
stances of the present or of the past? Or can noth- 
ing be excluded as irrelevant so that it consists of 
the whole present and past? 

Actually, some conventional state of informa- 
tion, far from knowledge of the whole past and 
present, is usually considered appropriate. For in- 
stance, for extractions from an urn, when the 
proportion of white balls is known (and perhaps 
when it is also known that the balls are well mixed), 
objectivists say that the probability is known and 
equal to the proportion. But why not require fuller 
information? If, for example, we have noted that 
the child drawing generally chooses a ball near 
the top, then additional information about the posi- 
tion of the white balls in the urn would seem to be 
relevant. 

Something between equal probabilities for events 
in classes and individual probabilities for each 
trial may seem to be offered by the precaution of 
speaking of the probability of an event with respect 
to a given set of trials (see Fréchet 1951, pp. 15- 
16). An example will illustrate the meaning 
and the implications not only of this attitude but 
more generally of all attempts to evade the main 
question above, namely, whether the probabilities 
of events in a class are equal, unequal, or non- 
sense, There is no proper premium for insurance 
on the life of Mr. Smith, unless we specify that he 
is to be insured as, for instance, a lawyer, a wid- 
ower, a man forty years old, a blonde, a diabetic, 
an ex-serviceman, and so on. How then should an 
objectivistic insurer evaluate Mr. Smith’s applica- 
tion for insurance? 


Probability and philosophy 
The view of a world ruled by Chance has also 
been opposed for philosophical reasons—not to 
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mention theological and moral ones. Chance is in- 
compatible with determinism, which, it was once 
said, is a prerequisite to science. By replying, 
“Chance is but the image of a set of many little 
causes producing large effects,” faith in perfect 
determinism in the microcosm was reconciled with 
probability, but perhaps only with SR rather than 
with OS, which was the real point of conflict. At 
any rate, the advent of probabilistic theories in 
physics and elsewhere later showed that determin- 
ism is not the only possible basis for science. 
Subjectivistic views have been charged with 
“idealism” by Soviet writers (for example, Gneden- 
ko 1950), and “priesthood” by objectivistic statis- 
ticians (for example, van Dantzig 1957), inasmuch 
as these views draw their principles only from 
human understanding. For SR that may be partially 
justified; it should, however, be ascribed simply 
to misunderstanding when it is said of SC, which 
seems exposed to the opposite charge, if any. 
Namely, SC allows too absolute a freedom for a 
person's evaluations, abstaining from any prescrip- 
tion beyond coherence. Is that not assent to arbi- 
trariness? The answer is, Yes, in freedom from 
prefabricated schemes; but the definition calls on 
personal responsibility; and mathematical develop- 
ments based on the coherence conditions show how 
and why the usual prescriptions—above all, those 
based on symmetry and frequency—ought to be 
applied, not as rigid artificial rules, but as patterns 
open to intelligence and discernment for proper 
interpretation in each case. 
BRUNO DE FINETTI 


[See also BAYESIAN INFERENCE; CAUSATION; SCIENCE, 
article on THE PHILOSOPHY OF SCIENCE; SCIENTIFIC 
EXPLANATION. } 
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PROBABILITY SAMPLING 
See SAMPLE SURVEYS. 


PROBATION 
See under PENOLOGY. 


PROBLEM SOLVING 


Frequent references to “the problem-solving proc- 
ess,” “the decision-making process,” and “the crea- 
tive process” may suggest that problem solving can 
be clearly distinguished from decision making or 
creative thinking from either, in terms of the proc- 
esses involved. These phrases can also imply that 
each involves a single process. On the basis of pres- 
ent evidence, however, it appears that each in- 
volves a variety of processes. Moreover, it appears 
that processes important in problem solving are also 
often important in decision making or in creative 
thinking (Guilford 1960; Taylor 1960). Each of 
these activities may therefore best be defined not 
in terms of process but in terms of product. For 
example, “Creativity is that thinking which results 
in the production of ideas (or other products) that 
are both novel and worthwhile. Similarly, deci- 
sion making is that thinking which results in the 
choice among alternative courses of action; prob- 
lem solving is that thinking which results in the 
solution of problems” (Taylor 1965, p. 48). Thus 
problem solving, decision making, and creativity 
are all to be regarded as kinds of thinking. The 
question of the degree to which these and other 
kinds of thinking—for example, concept attain- 
ment—involve the same or different processes is 
a question that remains to be answered by empir- 
ical investigation [see CONCEPT FORMATION]. 

The experimental psychological study of prob- 
lem solving may be said to have begun shortly 
before the turn of the century. In 1894 C. Lloyd 
Morgan introduced the term “trial and error” to 
describe the process by which his dog learned such 
tricks as opening a gate by raising the latch with 
his muzzle. Shortly thereafter, in 1898, Thorndike 
reported experimental studies of the behavior of 
cats in various problem boxes—boxes with doors 
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to be opened by turning a door button or pressing 
a lever or pulling a string attached to the outside 
bolt. In his book Animal Intelligence (1890-1901) 
he concluded, on the basis of his experimental re- 
sults, that animals are incapable of processes such 
as reasoning or insight, and he supported Morgan’s 
suggestion that their behavior in solving problems 
is to be accounted for in terms of trial and error. 
[See Morcan, C. Lioyp; and THORNDIKE.] 

Beginning with Hobhouse (1901), however, the 
objection has been made that the trial and error 
observed by Thorndike was the result of the kinds 
of problems that he employed and not a general 
characteristic of animal behavior in problem solv- 
ing. In the problems Thorndike presented, the ele- 
ments are not open to the inspection of the animal 
at the outset; hence trial-and-error behavior is in- 
evitable. Hobhouse devised a number of problems 
in which no essential element is concealed. Em- 
ploying such problems, he obtained results indicat- 
ing that animals are capable of what he called 
“practical ideas”—“a combination of efforts to ef- 
fect a definite change in the perceived object.” 

During the four decades that followed the pub- 
lication of Thorndike’s book, the investigation of 
problem solving was continued by psychologists in 
several countries. But much less attention was de- 
voted to this area or to the more general area of 
thinking than to other areas of psychology, such 
as perception, learning, motivation, and personal- 
ity. However, within the past decade and a half 
an important increase has occurred in the amount 
of time being devoted to the study of not only 
problem solving but also the closely related areas 
of decision making (Edwards 1954; 1961; Taylor 
1965) and creative thinking (Taylor 1960). 

A general review of published research on prob- 
lem solving is beyond the scope of this article. 
Hence, useful supplements to the present dis- 
cussion may be found in the review by Duncan 
(1959) or in the articles by Johnson (1950), 
Taylor and McNemar (1955), and Gagné (1959) 
that have appeared in the Annual Review of Psy- 
chology. Summaries of earlier work on problem 
solving are available in Osgood (1953, pp. 603- 
637), Woodworth and Schlosberg ([1938] 1960, 
pp. 814-848), Van De Geer (1957), and Humphrey 
(1951). 

A comprehensive discussion of theories of prob- 
lem solving is also beyond the scope of the present 
discussion. No description, for example, will be 
given of F. C. Bartlett’s treatment of thinking or 
of the writings of Piaget over a period of more than 
thirty years (see Piaget 1926; 1947; 1957; Inhelder 
& Piaget 1955; for a brief summary, see Taylor 
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1963). Instead, attention will be limited here to 
three theoretical approaches to problem solving. 
The first two approaches derive from more exten- 
sive work in another area of psychology. As a 
result, each is treated in articles dealing with other 
subjects in this encyclopedia. Accordingly, more 
attention will be given to the third approach. [See 
DEVELOPMENTAL PSYCHOLOGY, article on A THEORY 
OF DEVELOPMENT; BARTLETT.] 


Gestalt approach 


Gestalt theory, as embodied in the work of 
Wertheimer, Köhler, and Koffka, is first of all a 
theory of perception, The gestalt approach to prob- 
lem solving gives clear evidence of this fact. In 
1951 G. Humphrey remarked, “At the present time, 
the Gestalt theory of thinking constitutes a pro- 
gramme rather than a fulfilment” (1951, p. 150). 
This remains true, but it is nevertheless a theory 
sufficiently developed to demand consideration 
from one seriously interested in problem solving. 

In its approach to thinking, as well as to percep- 
tion and to other areas, the theory has as its cen- 
tral concept the organized whole, or gestalt. Thus 
Köhler wrote: “This, indeed, is the most general 
concept of gestalttheorie: wherever a process dy- 
namically distributes and regulates itself, deter- 
mined by the actual situation in a whole field, this 
process is said to follow principles of gestalttheorie” 
(1929, p. 193). These self-regulating wholes are 
characteristically extended in time; they are four- 
dimensional. 

The question may be raised as to whether the 
“whole field” involves experience or underlying 
brain processes. The answer is both, because ac- 
cording to gestalt theory, isomorphic relations exist 
between the two such that the same principles 
apply to both. 

When an individual is confronted with a prob- 
lem, processes occur in the brain (and in con- 
sciousness) that by dynamic interaction with each 
other and with memory traces result initially in 
“seeing the problem.” Seeing the problem sets up 
stresses in the psychological field that determine 
the subsequent course of thinking. These stresses 
lead to reorganization or restructuring of the field, 
enabling the individual to perceive the interrela- 
tions that form the solution to the problem. Such 
reorganization tends to occur suddenly, but a prob- 
lem may require for solution not one but a series 
of progressive reorganizations, thus giving the ap- 
pearance of gradual solution. 

The first gestalt studies of problem solving were 
those by Kéhler (1917) and were concerned with 


the behavior of anthropoid apes. Like Hobhouse, 
Köhler contended that the problems which Thorn- 
dike had employed artificially produced trial and 
error. Köhler presented to his chimpanzees prob- 
lems in which the elements necessary for solution 
were available and in which the solution required 
natural responses. Although he reported consider- 
able behavior that might be described as trial and 
error, he emphasized that the critical point in the 
solution of a problem occurred when the animal 
grasped the appropriate relational pattern—when, 
for example, he saw a box upon which he could 
climb as a means of making a workable connection 
between himself and the fruit suspended beyond 
his farthest reach. 

In discussing these early studies, Köhler accord- 
ed a central role to the concept of insight. Em- 
ployed by gestalt writers in subsequent treatments 
of thinking—for example, in the account by Wert- 
heimer (1945) of the exceptional course of thought 
that led Albert Einstein to his theory of relativity 
—this concept has received varying emphasis and 
has been used with more than one meaning. Thus, 
insight has been used to refer to the report by a 
subject of a particular kind of experience accom- 
panying the solution of a problem, It has also been 
employed as a descriptive term for a certain kind 
of behavior—behavior that satisfies a number of 
criteria, such as a sudden improvement in per- 
formance or the ability to repeat a solution imme- 
diately. On the other hand, it has also been em- 
ployed as an explanatory concept, more or less 
equated with restructuring. This last usage must 
be objected to, for neither insight nor restructuring 
is adequate as an explanatory concept, even taken 
together with the concept of the psychological field. 
A more adequate account is needed of the processes 
that result in the occurrence of insight. 

Gestalt psychologists, including Duncker ( 1935), 
have emphasized a distinction, first made by Selz 
(1924), between reproductive and productive 
thinking. Productive thinking is thinking that can- 
not be accounted for simply on the basis of past 
learning because it involves something essentially 
new. The same distinction is emphasized by Maier 
(1930-1931; 1940), who, although he studied in 
Berlin, would not be described as a gestalt psychol- 
ogist. In more recent gestalt writing, Katona 
(1940) and Wertheimer (1945) have suggested 
that the mere newness of the result is not sufficient 
cause for distinguishing reproductive and produc- 
tive thought, Both emphasize that the way in which 
learning originally occurs will be important in de- 
termining whether subsequent thinking is repro- 


ductive or productive. Katona has thus contrasted 
learning by memorizing with learning by organiz- 
ing. In the latter, the subject learns to grasp some 
principle according to which situations are struc- 
tured; as a result, he is able to recognize subsequent 
situations not in terms of their superficial features 
but in terms of their underlying organization, and 
hence he is able to deal with them productively. 


Stimulus—response approach 


Whereas gestalt theory is first of all a theory of 
perception, modern behaviorism, stemming from 
the work of Hull, is first of all a theory of learning. 
The central concern of the theory is the process 
whereby the individual animal or human subject 
acquires the appropriate response to a stimulus. 
The concepts developed to account for learning 
responses to stimuli are employed to explain prob- 
lem-solving behaviors, Among the important con- 
cepts are goal gradient, generalization, and habit- 
family hierarchy, 

The concept of the goal gradient includes, but is 
not fully represented by, the idea that the closer 
in distance or time an organism is to the goal, the 
stronger the organism’s motivation will be. Utiliz- 
ing the goal-gradient hypothesis, Hull (1932) 
made a number of predictions, one of which was: 
Animals will tend to choose the shorter of two 
alternate paths to the same goal. Thus, for the 
stimulus—response theorist as for the gestalt theo- 
rist, the blocking of the most direct path creates the 
situation in which problem-solving behavior may 
be expected to occur, 

Of central importance in accounting for the be- 
havior that occurs in solving a problem is the con- 
cept of the habit-family hierarchy. A convergent 
habit-family hierarchy is one in which a number 
of stimulus situations have been associated with 
the same response. A divergent hierarchy is one in 
which a given stimulus situation is associated with 
a number of different responses (Hull 1934). In 
the latter, responses form a hierarchy in the sense 
that they differ in probability of occurrence. The 
Position of a given response in the hierarchy varies 
with the degree to which it has been previously 
reinforced in similar situations. Given a new prob- 
lem, the hierarchy determines which responses will 
occur and the order of their occurrence. If a re- 
Sponse high in the hierarchy is unsuccessful, it will 
tend to be extinguished; this process will be repeat- 
ed until some response lower in the hierarchy is 
successful. The success of that response will rein- 
force it and increase the probability of its occur- 
rence when that same problem situation recurs. 
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The concept of generalization is employed to 
account for the fact that rewarding an organism 
for making a given response to a particular situa- 
tion will increase the probability of its making the 
same response not only to that situation but also 
to similar situations. This increase in probability 
varies directly with the similarity of the new to 
the original stimulus situation. Thus a man who is 
confronted with the need to tighten a screw but 
has no screwdriver may readily use a fingernail 
file, but he may engage in much trial and error 
before employing a thin coin for the same purpose. 

Perhaps even more important than simple stim- 
ulus generalization in accounting for human be- 
havior in solving problems is the concept of medi- 
ated generalization (Osgood 1953, pp. 359-360, 
392-412). Generalization may occur not because 
of similarity between the two stimulus situations 
but because of the prior establishment of a com- 
mon mediating response to the two situations. 
When the individual acquires a new response to 
the first situation, that response also becomes as- 
sociated with stimuli produced by the mediating 
response. Hence when the second situation occurs, 
the mediating response produces stimuli that result 
in the elicitation by the second situation of the 
response which has previously been given to the 
first situation. 

In order to account for the complex behavior 
involved in human problem solving, Maltzman 
(1955) suggested an extension of Hull's habit- 
family hierarchy, He proposed a combination of 
the concept of a divergent hierarchy with that of 
a convergent hierarchy in order to produce a com- 
pound habit-family hierarchy. In a compound hier- 
archy a given stimulus situation has the disposi- 
tion for arousing not only its own family hierarchy 
but, to a varying degree, the habit-family hier- 
archies of other stimulus situations. There is a 
hierarchy of habit-family hierarchies elicitable by 
that situation, An analogous condition exists for 
the other stimulus situations. Maltzman (1955) 
employed this concept of a hierarchy of hierarchies 
in discussing, for example, the distinction between 
reproductive and productive thinking. He suggested 
that in reproductive thinking a series of problems 
is presented that requires for a solution the elici- 
tation of different responses in the same family 
hierarchy. With the presentation of successive 
problems, this habit family becomes dominant in 
the compound hierarchy. In contrast, in situations 
requiring productive thinking, a habit family 
initially low in the compound hierarchy must be- 
come dominant before the problem can be solved. 
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Once the appropriate family is dominant, a solu- 
tion will occur provided that the correct response 
within that hierarchy in turn becomes dominant. 


Information-processing approach 


A newer approach to problem solving is that sug- 
gested by Newell, Shaw, and Simon (1958a), What 
they propose essentially is that the problem solver 
be regarded as an information-processing system. 
Such a system consists of a number of “memories,” 
which contain symbolized information and are in- 
terconnected by various ordering relations. It also 
includes precisely specified “primitive information 
processes,” which operate on the information in the 
memories, and rules for combining these processes 
in “programs.” 

An investigator employing this approach begins 
by selecting for study some particular kind of prob- 
lem solving. He attempts to identify the processes 
involved. In doing this he may observe and analyze 
the behavior of individuals engaged in solving such 
problems, have them “think aloud” while working, 
examine his own experience, or use any other 
source of information that may be helpful. When 
he has formulated tentative hypotheses to describe 
the processes involved, he undertakes to write the 
program that employs these processes and simu- 
lates the thinking of the human problem solver. 
[See SIMULATION, article on INDIVIDUAL BEHAVIOR. ] 

Viewed as a theory of behavior, a program is 
highly specific in that it represents only the behav- 
ior of one individual in one set of situations. If 
either the individual or the class of situations is 
changed, the program must be changed. There will, 
however, be important similarities among the pro- 
grams that represent the behavior of the same 
individual in different situations or among those 
that represent the behavior of different individuals 
in the same situations. On the basis of these sim- 
ilarities a more general theory of the behavior 
under study may be developed. 

Such programs are written in one of the so-called 
information-processing languages—precise lan- 
guages that have been created for this kind of use. 
Because the language employed is precise, it is 
possible to run the program on any modern high- 
speed computer by supplying the computer with 
the appropriate “interpretive deck.” In this way one 
may determine what behavior any given program 
will produce. It should be emphasized that this ap- 
proach to thinking does not depend upon any crude 

analogy between the structure of the computer and 
the structure of the brain, The function of the com- 
puter is simply to enable the investigator to deter- 
mine quickly whether the consequences of a pro- 


gram are in fact consistent with the observed 
human behavior that it is supposed to explain. If a 
program is to be more than an ex post facto ex- 
planation, it must not only simulate adequately the 
behavior it was written to account for but also pre- 
dict the effect of changing conditions on behavior. 

An important distinction is that between algo- 
rithmic and heuristic processes in problem solving. 
An algorithm is a process for solving a problem that 
guarantees solution in a finite number of steps if 
the problem has at least one solution. An example 
of a very simple algorithm would be that for ob- 
taining temperature on the Fahrenheit scale when 
the value for the centigrade scale is known: Multi- 
ply the known value by 1.8 and add 32. 

A heuristic process may aid in solution but offers 
no guarantee of doing so. In his little book How to 
Solve It, the mathematician Pólya (1945) de- 
scribed a variety of heuristic processes useful at 
the level of high school mathematics. In Mathe- 
matics and Plausible Reasoning (1954) he dealt in 
more detail with the role of heuristic procedures. 
Some heuristics are applicable only to a particular 
class of problems. Thus, the following heuristic 
would be appropriate for use only in discovering 
proofs for theorems in geometry: If the figure has 
one axis of symmetry and it is not drawn, then 
draw it. But other heuristics may aid in solving 
quite varied kinds of problems. One example of a 
more generally useful heuristic is what is called 
“means-end” analysis: Compare what you have 
with what you wish to obtain; identify a differ- 
ence between the two; find and carry out an opera- 
tion that may reduce the difference; repeat this 
procedure until the problem is solved. Another 
generally useful heuristic is the one called “make- 
a-plan”: Find a problem that is similar to, but 
simpler than, the one you are attempting to solve; 
solve the simpler problem; use the procedure suc- 
cessful in solving the simpler problem as a plan for 
solving the more complex problem. 

Within recent years several programs have been 
written employing heuristic procedures and simu- 
lating various kinds of human problem solving. 
One of the first was the Logic Theorist by Newell, 
Shaw, and Simon (1958a). In writing this program 
they undertook to identify the heuristics involved 
in discovering proofs for theorems in symbolic 
logic. When completed, the program produced 
impressive results in a number of experiments 
carried out by running it under various conditions 
on a high-speed digital computer. Principia mathe- 
matica by Whitehead and Russell is a classic of 
modern symbolic logic. In one experiment the 
Logic Theorist, employing the same axioms, defi- 


nitions, and rules used in the Principia, was pre- 
sented with the task of constructing in turn a valid 
proof of each of the first 52 theorems in Chapter 2 
of the Principia in the order in which they appear 
there. Whenever a theorem was proved, it was 
stored in memory and was available, together with 
the original axioms, for use in proving subsequent 
theorems. Under these conditions the Logic Theo- 
rist discovered proofs for 38 of the 52 theorems and 
did so in times ranging from less than a minute 
to more than 15 minutes. 

In a sense, the Logic Theorist has been super- 
seded by the General Problem Solver, another pro- 
gram by Newell, Shaw, and Simon (1960). Con- 
struction of this program began with the study of 
“thinking aloud” protocols produced by individuals 
engaged in solving a type of problem that also 
involved symbolic logic but did not involve dis- 
covering proofs for theorems. Although the original 
objective was to simulate the behavior involved 
in solving the particular type of problem chosen 
for study, the evidence clearly indicates that the 
program constructed can also do trigonometric 
identities and formal integration, solve algebraic 
equations, and may be extended to an even wider 
range of tasks. The program is called general for 
this last reason, not because it can solve problems 
in general. Of marked interest is the fact that the 
General Problem Solver employs only two principal 
heuristics, the means-end heuristic and the make- 
a-plan heuristic, In the past such procedures, if 
considered at all, were often regarded as ambigu- 
ous and trivial. One major result to date of the 
use of simulation has been the rigorous demon- 
Stration of the potential power of heuristic proc- 
esses in solving problems. 

The objection may be made that in view of the 
high speed and accuracy with which a computer 
works, it is hardly surprising that the Logic, Theo- 
rist is successful in discovering proofs for a variety 
of theorems or that the General Problem Solver can 
find solutions for other problems in logic or mathe- 
matics. This objection may arise from the incorrect 
belief that what is occurring is either an exhaustive 
Search or a random search of all possible alterna- 
tives. For simple problems solutions may well be 
attainable by exhaustive or random search. But 
Such searching is simply not feasible in solving 
problems of great complexity. The following anal- 
ysis may help clarify this. 

A useful abstract model of many kinds of 
problem-solving activity is provided by the maze 
(Newell et al. 1962). The task of the thinker in 
discovering the proof of a theorem, for example, 
may be regarded as that of finding one of the paths 
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through a very complex maze that begins with one 
or more of the axioms and ends in the specified 
theorem. The maze consists of all possible paths 
that might have been taken. Using the maze as a 
model, one may make some estimate of the diffi- 
culty of a particular class of problems by asking 
the essential question: How large is the maze that 
must be searched? The task of the thinker is to 
find a way through the maze the first time—not, 
as in studies of animal learning, to learn to run 
the maze without error. 

Estimates have been made of the size of the 
space of possible solutions for problems handled 
by the Logic Theorist. If no limit is imposed upon 
the length or other characteristics of sequences of 
symbolic expressions generated in the attempt to 
discover proofs, then the size of the maze is liter- 
ally infinite. If, however, the space is restricted, 
“for example, to proofs consisting of sequences of 
not more than twenty logic expressions, with each 
expression not more than twenty-three symbols in 
length and involving only the variables p, q, r, s, 
and t and the connectives ‘or’ and ‘implies,’ the 
number of possible proofs meeting these restric- 
tions is about 10*°—one followed by 235 zeros!” 
(Newell et al. 1962, p. 73). Even with the speed 
of a computer, either the random or the exhaustive 
search of such a maze is clearly out of the question. 

For those classes of problems for which efficient 
algorithms have been discovered, such procedures 
are to be preferred. They guarantee solution if the 
problems have solutions. But for many kinds of 
problems algorithms are unknown. Moreover, for 
some types of problems for which algorithms are 
known, their use is prohibited by the enormous 
amount of time that would be required to carry 
them out. 

There is, for example, an algorithm for playing 
chess: Consider all possible continuations of the 
game from the existing position to termination and 
then select one move that will lead to checkmate 
of the opposing king. The mathematician C. E. 
Shannon (1950) has estimated that if this pro- 
cedure were employed, it is unlikely that a single 
game would be completed within a lifetime, even 
if the players worked at the speed of the fastest 
electronic computer. The use of the algorithm is 
simply not feasible. Instead, those who play chess 
employ heuristic procedures which simplify the 
task of searching the maze—heuristics that in- 
clude, for example: “Protect your king,” “Develop 
your pawns,” “Try to control the center of the 
board,” etc. (see Groot 1946). Beginning in about 
1950, a number of attempts have been made to 
program a computer to play chess. A review of 
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these attempts reveals a clear trend toward the 
use in these programs of increasingly selective 
heuristics (Newell et al. 1958b). 

In the relatively short time that has elapsed 
since the initial proposal of the information-proc- 
essing approach, heuristic programs have been 
written for a wide variety of tasks. Although not all 
were written with the primary purpose of simula- 
tion of human thinking, it may be of interest that 
heuristic programs now exist for such varied tasks 
as attaining concepts, discovering proofs for theo- 
rems in plane geometry, composing music, dealing 
with the trust investment problem of portfolio se- 
lection, and balancing an assembly line. A more 
complete introduction to this approach to think- 
ing is provided in Miller, Galanter, and Pribram 
(1960), Hunt (1962), Reitman (1965), and Taylor 
(1965). 

DonaLtp W. TAYLOR 


[Directly related are the entries CREATIVITY, article on 
PSYCHOLOGICAL ASPECTS; DECISION MAKING; REA- 
SONING AND LOGIC; THINKING. Other relevant mate- 
rial may be found in GESTALT THEORY; INFORMA- 
TION THEORY; LEARNING; and in the biographies of 
Hutt; KOFFKA; KÖHLER; WATSON; WERTHEIMER.] 
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PROCESS, POLITICAL 
See POLITICAL PROCESS. 


PROCESS CONTROL 
See under QUALITY CONTROL, STATISTICAL. 


PROCESSES, SOCIAL 

See ACCULTURATION; ASSIMILATION; COHESION, 
SOCIAL; CONFLICT; COOPERATION; INTEGRATION, 
article On SOCIAL INTEGRATION; INTERACTION; 
SOCIALIZATION, 


PRODUCT DIFFERENTIATION 
See MARKETS AND INDUSTRIES; MONOPOLY; OLI- 
GOPOLY. 


PRODUCTION 


The economic theory of production is derived 
from (a) the behavioral premise that the decision- 
making unit (the firm or the production manager ) 
desires to minimize the total cost of producing any 
given output or specified combination of outputs, 
and (b) the postulate that normally there exist 
alternative techniques or processes of production, 
With different patterns of resource (labor, capital, 
materials, energy) consumption and, therefore, dif- 
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ferent associated costs. Cost minimization is nor- 
mally assumed to be a separable but contributing 
part of the primary objective of maximizing profit. 
However, this assumption is not necessary, for the 
decision maker may have other primary objectives, 
such as maximizing sales or market share, but still 
be concerned with producing the scheduled level of 
production at minimum cost. 

Production theory is often called marginal pro- 
ductivity theory, in reference to the decision rules 
that represent necessary conditions for the achieve- 
ment of maximum profit. These classical rules 
state that for maximum profit (including minimum 
cost) the quantity of each resource input used in 
a process must be adjusted until the value of the 
product resulting from the last increment of the re- 
source (the marginal product) is equal to the cost 
of the last increment of the resource. Thus, if 
MP, is the marginal product of labor and P is 
the price of the product of labor, then P : MP, is 
the value of the marginal product of labor. If, fur- 
thermore, the constant price or wage of labor is W, 
the decision rule for the employment of labor is 
to adjust its employment until P - MP, = W. If 
P - MP, > W, the return from an additional unit 
of labor would exceed its cost, and profit could 
be increased by expanding employment. If 
P - MP, < W, the return from an additional unit 
of labor would be less than its cost, and profit could 
be increased by contracting employment. The valid- 
ity of this argument requires MP, to be a decreas- 
ing (downward-sloping) function of employment. 
Mathematically, this condition is sufficient for a 
local maximum of profit. 

The original formulation of production theory 
is associated most closely with the names of John 
B. Clark, Philip H. Wicksteed, Francis Y. Edge- 
worth, and Léon Walras, whose writings on the 
subject appeared between 1874 and 1896. Walras’s 
1874, 1877, and 1889 treatments of production 
were based on the assumption of fixed technical 
coefficients (see below), with only a brief discus- 
sion of the case of variable coefficients. Walras’s 
statement of the marginal productivity theory does 
not appear until the third edition of his Elements, 
in 1896 (see 1874-1877). The first clear state- 
ments of the marginal productivity theory seem to 
have been made independently by Clark (1889), 
Edgeworth (1891-1921), and Wicksteed (1894). 
Wicksteed’s treatment was the most influential, 
and he is probably the one most often given credit 
as the originator of the theory. 

For a fuller discussion of the history of the mar- 
ginal productivity theory, the reader is referred to 
Stigler (1941) and to Walras (1874-1877), espe- 
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cially the extensive notes by the translator, William 
Jaffé. [See also the biography of WALRAS.] 


The production function 


Traditional production theory postulates an 
input-output relationship, or production function, 
showing the quantity of output (or the quantities of 
various outputs ) that can be produced as a function 
of the quantities of the various inputs consumed. 
In the case of one output and two inputs, if we let 
y be the output rate and x, and x, be the input 
rates, the production function can be written 


(1) 4 = f(x, x2): 


The output might be kilowatt hours of electricity 
per year; x, might be tons of coal per year; and x, 
might be maintenance hours per year. Or again, y 
might be a chemical product, with x, a raw mate- 
rial reactant and x, a fuel, a second reactant, or a 
catalytic material. 

Equation (1) represents what is called an effi- 
ciency frontier in the sense that it provides the 
largest y that can be produced for given x, and x, 
(or the least x, required for given y and xz, etc.). 
There are some production decisions that can be 
made on purely technical grounds, without knowl- 
edge of resource prices, These decisions are usually 
called engineering decisions, as opposed to eco- 
nomic decisions. Thus, if a modification of the 
manner in which a process is performed allows the 
same output while permitting the quantity of at 
least one input to be reduced without an increase 
in the quantity of any other input, then a decision 
in favor of the modification can be made on engi- 
neering grounds alone, without any knowledge of 
prices. An action that saves on one input without 
altering any other requirement of a process will 
lower cost, regardless of the price of that input. 

The production function in (1) presupposes that 
all such engineering decisions have been m ade. In 
constructing this function, all methods, techniques, 
or processes that require more of one input and no 
less of any other input are rejected. Once all such 
engineering decisions have been made, we are left 
with the best engineering technology. But this tech- 
nology presents a multiplicity of input possibilities 
which have the characteristic that output cannot 
be maintained at a given level, when one input is 
reduced, unless some other input is increased. The 
choice among these remaining input combinations 
is an economic decision, in that the decision re- 
quires knowledge of input prices. 

Briefly stated, economic decisions require knowl- 
edge of input prices and best engineering technol- 
ogy. Engineering decisions are concerned with best 


engineering technology and require technical 
knowledge of physical processes. 

The characteristics of a three-variable produc- 
tion function are readily summarized graphically 
with an isoproduct map. An isoproduct (constant 
output) contour is the locus of all inputs (x,, x,) 
that are required to produce a specified quantity of 
output. An isoproduct map is a family of such con- 
tours, each corresponding to a different level of 
output. Such a family of contours is shown in 
Figure 1 for outputs of 10, 20, 30, 40, and 50 
units. 

Substitution. Figure 1 also illustrates the phe- 
nomenon of substitution: for any given level of 
output, say 30, if input 2 is decreased, more of 
input 1 is required. In addition to having a negative 
slope, each isoproduct contour is convex to the 
origin (mathematically, ðx,/ðx, < 0, 0° x,/0x? > 0). 
This characteristic is often referred to as the prin- 
ciple of increasing marginal rate of substitution. 
For a given output, larger increments of x, are 
required to substitute for each incremental decrease 
in x+, and vice versa. 

There is, of course, great variation in the extent 
of substitution among different productive proc- 
esses. At one extreme is the case of perfect substi- 
tution, in which the isoproduct contours are 
downward-sloping straight lines. Thus, for some 
purposes it might make no difference whatever 
whether a certain mechanical part is made of 
aluminum or brass, and we would say that the 
two materials are perfect substitutes in that use. 
The production function would be y = a,x; + @:%25 
where a, and a, are the number of parts that can 
be produced from a pound of aluminum and a 
pound of brass, respectively; x, and x, are the 
pounds of input of each metal; and y is the total 
output of parts, that is, a,x, aluminum units plus 
a,x, brass units. 

At the opposite extreme is the case of limita- 
tional inputs or fixed proportions, in which each 
input requirement is rigidly proportional to the out- 
put produced, e.g., x, = b,y, x: = by, where b, and 
b, are the constants of proportionality. In this case 
the production function can be written y= min 
(x,/b,, %./b.), where x,/b, is the largest output 
producible from the amount of input 1 available, 
and x./b, is the largest output producible from the 
amount of input 2 available. The smaller of these 
two outputs determines how much can be produced 
—output is limited by the resource that is in 
shortest supply. 

Some inputs to a process may be perfect or im- 
perfect substitutes, whereas others are limitational. 
Suppose, for example, that x, and x, are perfect 
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Figure 1 — Isoproduct contours 


Substitutes, x, and x, are limitational inputs, and 
x; and x, are continuously substitutable according 
to the relation y = f(x;,%»). Then the production 
function can be written 


Y = min {a X, + aX», Xa/bs , X/D,, (Xs, Xo) }. 


Output is limited by the resource or resource com- 
bination that is in shortest supply. 

Diminishing returns. According to the law of 
diminishing returns, if one of the two inputs is held 
constant and the other input is increased, the mar- 
ginal productivity of that input must eventually 
decline (mathematically, beyond some value of xı 
we must have 4? f/x? < 0, and beyond some value 
of x., 6° f/dx% < 0). In Figure 1 it is seen that when 
X: = 12, 10 units of output can be produced with 
% = 3; 20 units of output can be produced with 
xı = 5; and so on. As output increases in 10-unit 
increments, the quantity of x, required increases, 
first at a decreasing rate and then at an increasing 


rate. Since d, > d, > d, < d, < d;, it is seen that 
the phenomenon of diminishing returns to x, for 
xX, = 12 is experienced when output rises above 
30 units. 

Returns to scale. The study of diminishing re- 
turns is concerned with how output changes with 
changes in an input when other inputs are held 
constant. The study of returns to scale is concerned 
with how output changes when proportional simul- 
taneous changes are made in all inputs and there 
is no generally accepted law or principle requiring 
returns to scale eventually to decline. 

In Figure 1, returns to scale are represented by 
reference to the straight line OR through the origin. 
Observe that if we begin with any input combina- 
tion, such as that represented by the point P(x, 
= 4, x, = 2), then a proportional increase in both 
inputs must be on the line OR. For example, if both 
inputs are doubled, the new input combination is 
represented by the point Q. If, beginning at point 
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P, both inputs are increased % times (to x, = 6 and 
X, = 3), it is seen that output increases by a factor 
of 2 (from 10 to 20). Over this range there are 
increasing returns to scale, If, beginning at Q. both 
inputs are increased % times (to x, = 12 and x, 
= 6), output increases by a factor of 4 (from 30 
to 40), showing decreasing returns to scale. In 
general, if D,>D,> D,>D,, etc., the process 
shows increasing returns to scale at all output 
levels. If D, = D; = D, = D,, etc., the process shows 
constant returns to scale at all outputs. If 

1 < D; < D, < D,, etc., the process shows decreas- 
ing returns to scale for all outputs. In the example 
of Figure 1, the process shows increasing returns 
to scale at first, then constant returns to scale, and 
finally decreasing returns to scale, 

The question of returns to scale can be discussed 
in terms of the mathematical concept of a homo- 
geneous function. The production function y= 
F(x;, x+) is said to be homogeneous of degree 1 if 
an increase in all inputs by the same proportion, 
say a, increases output by the proportion æ. That 
is, the production function is homogeneous of de- 
Sree 1 and therefore exhibits constant returns to 
scale for all outputs if 


(2) ay = f(ax,, ax,), a> 0. 


If the production function is not homogeneous 
of degree 1 and shows either increasing or decreas- 
ing returns to scale, then 


(3) By = f(ax,, ax,), a>0,B>0. 


If B > a we have increasing returns to scale, while 
if B < a we have decreasing returns to scale. 

Individual processes may show increasing, con- 
Stant, or decreasing returns to scale. Still other 
Processes may, over different ranges of output, 
show all three kinds of returns. However, it is 
usually argued that it is inefficient to operate any 
process at a size that is in the range of decreasing 
returns. The output is more efficiently produced 
by building multiple facilities of a size determined 
by the output at which returns just begin to 
decrease. 

Capital in the production function. 
Poses of production theory, the distinguishing fea- 
ture of capital goods is that their presence, in the 
form of physical stocks, is required if production 
is to take place (Smith 1961). Only in special 


current or flow inputs that appear in the produc- 
tion function. For these reasons, it is common to 
measure capital as a stock, as opposed to a flow, 
input resource. The significance of capital stock 
inputs is to be found in the fact that so much pipe, 
cable, concrete, etc., arranged in a certain way, 
must be present if Output is to be produced. Fur- 
thermore, the level of output that can be produced 
varies directly with the physical quantities of the 
capital inputs present when production takes place. 
Such inputs are not consumed in any sense similar 
to the consuming of raw materials, which may be- 
come physically embodied in the object produced, 
or of energy. This concept of capital as a collec- 
tion of assets whose presence is required for pro- 
duction applies also to nonphysical capital, such 
as knowledge, or “human capital,” as it is some- 
times called. Indeed, knowledge, in terms of dura- 
bility and relative nonconsumption in production, 
is the most capital-like of all such assets, 

In introducing physical capital goods into pro- 
duction function analysis, it is important to recog- 
nize another feature of such goods. Capital goods 
are usually freely variable in size only in the proc- 
ess design stage. In this drawing board stage, one 
may consider any size plant or individual items 
of equipment that one pleases, but once the invest- 
ment is made, the “amount of capital” can only 
be varied by replacement with smaller or larger 
units or by installing parallel facilities, 

Empirical examples. The earliest studies of 
empirical production functions were made by agri- 
cultural economists and agronomists concerned 
with experimental studies of soil, fertilizer, mois- 
ture, and seed productivity. Early studies were also 
conducted of the weight-gaining characteristics of 
different feeds and feed supplements for livestock. 
In these cases there is a firm biological basis for 
expecting meat and crop outputs to be dependably 
related to the combinations of feed, fertilizer nu- 
trient, and care expended. One experiment (Heady 
1957) in corn fertilization yielded the production 
function y = —5.68 — 0,32x, — 0.42x, + 6.35 V7; + 
8.52V/x + 0.34 V Xx, where x, is pounds of ni- 
trogen applied per acre per year, x, is pounds of 
Phosphate applied per acre per year, and y is yield 
of corn in bushels per acre per year. Production 
functions for many other agricultural products 
have been determined: the production of alfalfa 
as a function of the application of potash and 
Phosphate fertilizers; of pork as a function of the 
amounts of corn and soybean oilmeal; of milk as 
a function of the forage and grain concentrate 
used. 

In engineering design the earliest known use of 


an explicit cost minimization procedure using es- 
sentially, if not quite literally, the concept of a 
production function, was in the determination of 
economic conductor size by Lord Kelvin. Writing 
in 1881, before Clark, Edgeworth, or Wicksteed, 
Kelvin summarized his derivation in a form later 
known by electrical engineers as Kelvin’s law: “The 
most economical size of the copper conductor for 
the electric transmission of energy, whether for 
the electric light or for the performance of me- 
chanical work, would be found by comparing the 
annual interest of the money value of the copper 
with the money value of the energy lost annually 
in the heat generated in it by the electric current” 
(1882, p. 526). His solution provides the conductor 
size “which makes the two constituents of the loss 
equal.” 

In this case, the principle giving rise to substi- 
tution between energy and conductor cable is the 
energy loss mechanism in all forms of energy 
transmission and transformation. Energy output, 
y, is input, x,, minus loss, L, in the production 
of heat, i.e., y= x, — L. Losses vary directly with 
the square of output but inversely with conductor 
size, and therefore, for a given length transmis- 
sion line, inversely with conductor weight (or 
volume), X.. Hence, L = ky?/X., where k is a con- 
stant depending upon a variety of factors, such as 
the length of the line and the resistivity of the 
cable. The production function can therefore be 
written y =x, — ky?/X., or, in explicit form, y = 
X. (VIF 4k7X; — 1)/2k. 

The production functions for numerous other 
engineering processes have been studied: gas 
transmission (Chenery 1949); heat transmission, 
steam power production, filtration processes, and 
batch reactor chemical processes (Smith 1961); 
and metal cutting (Davidson et al. 1958; Kurz & 
Manne 1963). These studies have been sufficiently 
diverse to show that the principle of input substi- 
tution has a firm base in a wide variety of scien- 
tific and empirical engineering laws. Sometimes the 
laws generate substitution representing a negligible 
Part of the productive activity of some firms; in 
other cases the laws generate substitution among 
the principal inputs of the enterprise. Thus, in 
steam power production, the basic production func- 
tion governing the process arises from the law of 
energy conservation and the empirical laws gov- 
erning energy loss in boilers, turbines, and gen- 
erators. At each stage in the transformation of 
energy—from fuel into steam, steam into mechani- 
cal energy, and mechanical into electrical energy— 
losses occur at a rate which is an increasing func- 
tion of the output of the stage and a decreasing 


PRODUCTION 515 


function of equipment (capital) size in the stage. 
The production function is y = f(x,, X:, Xa, Xs), 
where y is power output rate, x, is fuel consump- 
tion rate, and X., X,, and X, are size measures of 
the boiler, turbine, and generator equipment, re- 
spectively. 

Linear programming. An entirely different mech- 
anism of substitution sometimes occurs where a 
product can be produced by any of several distinct 
processes, each process being characterized by a 
fixed coefficient production function. In that case 
one input substitutes for another when one process 
is chosen instead of another. 

Consider a product that can be produced by 
either of two processes, each of which requires two 
inputs—fuel and labor. The two processes might 
simply be two different kinds of equipment. Proc- 
ess 1 requires one man-hour per unit of output and 
1.5 gallons of fuel per unit of output. If y, is out- 
put, xı labor input, and x+, fuel input, then the 
process is characterized by the equations x,, = yı, 
Xə = 1.5y,. If process 2 requires two man-hours 
of labor and one gallon of fuel for each unit of 
output, then process 2 is characterized by the con- 
ditions x,, = 2y., Xz = Y2. 

Each of these processes taken separately exhibits 
isoproduct contours like those encountered in any 
process whose inputs are used in fixed proportions, 
Figure 2 shows an isoproduct contour (IGI' and 
JHJ’) for an output of 300 for each of the two 
processes. However, if the two processes can be 
operated in parallel combination at any desired 
levels of output, then it will be possible to achieve 
any of the input-output combinations on the line 
segment GH. 

To see this, consider the two rays OB and OP 
(Figure 2), whose slopes represent the ratios at 
which the two inputs must be combined in each 
process. The scale laid off along each ray repre- 
sents the output that would result if the indicated 
input combinations were employed in that process. 
For example, 300 units of output can be produced 
with process 1 when 300 man-hours of labor are 
expended and 450 gallons of fuel consumed. Simi- 
larly, 300 units of output can be produced with 
process 2 when 600 man-hours of labor and 300 
gallons of fuel are used. But 300 units of output 
can also be produced by employing the two proc- 
esses in combination. For example, we could pro- 
duce 100 units of output with process 1 and 200 
units of output with process 2. In Figure 2 the 
length of the “vector” 0A represents 100 units of 
output by process 1, while the projections of this 
vector on the horizontal and vertical axes give the 
amounts of the two inputs required, 100 and 150 
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Figure 2 — Derivation of isoproduct contours for combinations of two processes 


units respectively. Similarly, the vector OD repre- 
sents 200 units of output by process 2, and its 
projection on the axes identifies the input require- 
ments of the process at this output. Now, combin- 
ing or adding the two processes together is geo- 
metrically equivalent to adding the vector 0A to 
OD or, since OD equals AC, to adding 0A to AC, 
giving O0C—a vector representing a total output 
of 300 units and whose projections on the axes 
give the total input requirements of the two proc- 
esses together. Therefore, the point C represents 
a point on the isoproduct contour for an output of 
300 units which is obtained by operating process 1 


at an output level of 100 units and process 2 at 
an output level of 200 units. Thus, all the points 
on the line GH are achievable, and the isoproduct 
contour for 300 units is IGH]’. 

It is now possible to write a mathematical state- 
ment of the efficient production function for this 
two-process system. Since the individual process 
outputs are y, and yz, total output is yı + y:. Proc- 
ess 1 uses xı =y, units of labor, and process 2 
uses X,.=2y. units of labor. Hence, if x, is the 
total labor input, we must have y, + 2y; < x;, since 
not more than the available labor can be used. 
Similarly, if x, is the fuel available for consump- 


tion, we must require 1.5y, + y+ <x,. Mathemati- 
cally, the production function relating y, x; , and x. 
is defined by the condition that y = maxy, y, (Yı + Y2), 
subject to y,+2y.<%, and 1.5y,+y.<%2, or 
yy t+ 2y,+S,=x, and 1.5y,+y.+S.=x,, where 
S, > 0 and S, > 0 are surplus unused quantities of 
the inputs. This maximum problem is a linear pro- 
gramming problem (Dorfman et al. 1958). From 
Figure 2 it can be seen that if x, = 500 and x. = 350, 
we would have y= 300, with y, = 100 and y, = 200. 
It is easily verified that y, = 100, y: = 200, S,= 
S. = 0 is a solution to the above maximum problem. 
Similarly, if x, = 800 and x, = 300, the solution to 
the linear programming problem is y, = 0, y+ = 300, 
S, = 200, S, = 0. 

The above reasoning can be extended to any 
number of processes and any number of inputs. 
For m processes and n inputs, the production func- 
tion y= f(x,,°**,%,) is obtained by solving the 
following linear programming problem: 

y= max, (Cy ty.ts--+y,), 
subject to 
+ 3+ + AinYn SX 


atts + AimYm S Xs 


any, +a 


An Y, + Aye 


Amy + AY: + +++ + AnmYm S Xn, 


where aj; is the quantity of the ith input required 
per unit of production in the jth process. [See Pro- 
GRAMMING.] 


Cost and production theory 


Given the production function for a process, say 
Y= f(x,, x+), and the prices of the inputs, e.g., 
W, W., the problem in the theory of cost is to de- 
termine decision rules for minimizing the cost of 
producing any given output. That is, for any speci- 
fied y, we want C = w,x, + w:x, to be a minimum 
Subject to the requirement that y = f(x, , x+). Nec- 
essary conditions for C to be a minimum are that 
Ww, — A(ðf/ðx,) > 0, where x, = 0 if the inequality 
holds, and that w. — A(df/dx.) > 0, where x, =0 
if the inequality holds. (Mathematically, the 
Problem is to minimize the Lagrangian $ = w,x, + 
WX, — A[f(x,, x.) — y], where A is the Lagrange 
multiplier.) The A in these expressions is inter- 
Preted as marginal total cost (A = dC/dy). Written 
in the form w,/(df/dx,) > A < w./(df/dx:), these 
conditions require that no input be employed in an 
amount such that the marginal cost of an addi- 
tional unit of that input—the price of the input 
divided by its marginal product, w./(df/dx,)—is 
smaller than the marginal total cost for the proc- 
ess. If the marginal cost of any input to the process 
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exceeds marginal total cost, that input is not used 
(we have a “boundary” solution), Thus, for two in- 
puts, if w,/(0f/dx,) =A=w./(Of/dx:), or w,/ w: = 
(df/0x,)/(0f/dx.), then we have an interior tan- 
gency solution, such as point P in Figure 3. 
If, however, w,/(df/0x,) >A =w./(O0f/dx:), or 
w/w. > (Of/dx,)/(0f/dx.), we have x, =0 and a 
boundary solution, as illustrated by point Q in 
Figure 3. 

The condition w,/w. = (f/0x,)/(0f/dx:), in 
combination with the production function y= 
f(x,,2%2), defines the minimum cost input combi- 
nation x?,x?, as a function of the parameters 
Y, w,, w,. Hence, total cost, C° = w, x? + w, x), and 
marginal cost, \°, are determined as functions of 
output and input prices. Over-all profit, that is, rev- 
enue minus minimum cost, R(y) —C%(y), is a 
maximum when dR/dy < 0C/dy, where y = 0 if the 
inequality holds. That is, if marginal total cost, 
öC/ðy, exceeds marginal revenue, dR/dy, at any 
output, that output should be contracted until 
either the two are equal or y = 0, whichever occurs 
first. For example, in the case of two inputs, if it 
pays to use both inputs and to produce some out- 
put, the optimum input—output combination sat- 
isfies the conditions 


OR_0C_,_ w, _ Wi 
5 ay a ooa a 
and 
(b) y = f(X, x2). 


If it does not pay to produce, this fact is expressed 
by the inequality dR/dy < ðC/ðy. 
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The above analysis extends readily to the case 
of n inputs. If all inputs are used and production 
is positive, the necessary conditions are 

wW, Wna 


ORAN ME rat 

y of/ðxz — (OF Xn 
If, say, all but inputs 1 and 2 are used, the condi- 
tions are 


( w, Ws ) SaR ` 
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—_ Ws cy P Wa 
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If one of the inputs, such as 2 in the two-input 
example, is a capital input, the formal aspects of 
the analysis remain unchanged except for two con- 
siderations: (1) the problem of specifying the 
“price,” w,, to be associated with the capital input, 
X;, requires some analysis, and (2) the capital 
input, as noted above, is not freely variable in 
response to changing prices, once it has been 
installed. 

Considering the first problem, the cost function 
is C = w, x, + w,X;. Note that because X; is a stock 
variable, w, must have the dimension “dollars per 
unit per unit of time.” That is, capital cost must 
be expressed as a rental time cost for the use of 
capital, including maintenance, service, “deprecia- 
tion,” and interest on investment. As a simple ex- 
ample, if the upkeep costs are independent of age, 
say, m dollars per unit per year; the life of capital 
fixed, say, L years; the rate of interest i; and the 
unit construction or purchase cost of capital W, ; 
then w: = iW,/[1 — (1 +i)*] + m. The expression 
i/[1 — (1 + i)“] is the annual payment stream over 
L years that is necessary to return a loan of one 
dollar plus interest at 100i per cent per year. 

The second problem can be illustrated as fol- 
lows: Suppose the firm has purchased a capital 
good of size X., representing a minimum cost cap- 
ital investment. Now suppose demand, and there- 
fore the optimal output rate, increases. The new 
production function is y= f(x,, X.) + f(x, X!), 
where Š, is fixed and x,, x|, and X; are variable, 
The variable X, is the size of a second parallel 
facility that might be operated in conjunction with 
the old facility of size X.. The current inputs x, 
and x; are, of course, variable in both the old 
and a potential new facility. Total cost, ie., 
C =w, (x, + x1) + w, (X, + X,), is to be minimized, 
subject to the above production function, with re- 
spect to the choice of x,,x’, and X’,. Necessary 
conditions can be written as before (Smith 1961), 
yielding the solution (1) x, =X; =0, x, > 0, if it 
does not pay to introduce a second facility; 
(2) x, = 0, x; > 0, X; > 0, if it pays to discard the 


original facility when the new one is added (re- 
placement); or (3) x, > 0, x, > 0, X; > 0, if it pays 
to add a new facility and operate it parallel to the 
old. 

Dynamics and uncertainty. The theory of cost 
and production has been extended to treat cases 
in which demand and, therefore, the output re- 
quirements of production are assumed to be (1) 
given functions of time, or (2) random variables 
with known probability density functions (Arrow 
et al. 1958; Modigliani & Hohn 1955; Smith 1961). 
These problems are often discussed under the title 
“inventory theory,” since changes in output require- 
ments over time and demand uncertainty are the 
two conditions giving rise to the necessity of hold- 
ing product inventories. Under dynamic production 
requirements, inventories provide a means of 
smoothing the production plan. If output require- 
ments are known to rise seasonally, the increased 
marginal costs associated with high output can be 
avoided by producing in advance for inventory. 
Optimal smoothing of the production plan requires 
a balancing of direct production costs against in- 
ventory holding costs. 

Under uncertain demand requirements, inven- 
tories are held as buffer stocks to meet temporary 
unpredictable increases in demand. Optimal buffer 
stocks require a balancing of production and sales- 
loss costs (the costs of failing to meet demand) 
with the costs of storage. [See INVENTORIES, article 
On INVENTORY CONTROL THEORY.] 


Aggregate production functions 


The production function for particular processes 
can in principle be, and often is in practice, derived 
directly from engineering laws and data. Logically, 
therefore, if the economist has an interest in labor 
and capital productivity at the industry, regional, 
or national level, the appropriate aggregate produc- 
tion function could be obtained by aggregating 
individual process functions. But such a procedure 
is hopelessly impractical. Instead, economists have 
postulated the form of the production function and 
then used aggregative data to estimate its param- 
eters. The analytical forms used have necessarily 
been restricted by considerations of convenience in 
estimation, [See PRODUCTION AND COST ANALYSIS.] 


VERNON L. SMITH 


[Directly related are the entries CAPITAL; ECONOMIES 
OF SCALE; FIRM, THEORY OF THE; INPUT-OUTPUT 
ANALYSIS.] 
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PRODUCTION AND COST ANALYSIS 


_ Production processes can be studied empirically 
in terms of either production functions or cost 
functions. Estimates of the parameters of these 
functions provide valuable insights into the tech- 
nology of firms and industries. The central ques- 
tions relating to technology are (1) whether pro- 
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duction processes display decreasing, constant, or 
increasing returns to scale; (2) how technological 
progress affects the parameters of production proc- 
esses; and (3) at what rate technological progress 
has occurred. Estimation and interpretation of the 
estimates is complicated by the fact that observa- 
tions on inputs, outputs, and costs reflect not only 
the state of technology but also the economic deci- 
sions made by producers and factor suppliers. 
Assumptions regarding economic behavior and com- 
petition in input and output markets often play a 
crucial role in the statistical analyses, and it is not 
always easy to determine whether the results reveal 
the nature of technology or serve instead to test 
the validity of the economic assumptions. 

Production functions. The fundamental pro- 
ductive organization is the firm, which enters into 
contractual arrangements in buying, transforming, 
and selling goods and services. The production set 
of a firm describes at a given time the possible 
relationships between inputs and outputs. For the 
single-product firm, the production function de- 
scribes the maximum output that can be produced 
from given quantities of inputs. Let X denote out- 
put in physical homogeneous units, and let L and 
K denote two inputs—labor and capital—in homo- 
geneous units; then the production function is 
Xma = F(K,L), or simply X = f(K,L). The numbers 
X, K, and L can take on positive or zero values 
only, and for a given technology the function is 
normally specified as univalued. 

An important practical distinction in statistical 
studies is between the ex ante (or planning) pro- 
duction or cost function and the ex post (or real- 
ized) function. Decisions about the type and scale 
of plant are made years before the plant is com- 
pleted. Expectations, formed in previous years, 
about prices and output levels determine the quan- 
tity and character of new capital employed in the 
current period. The ex post function is the realized 
relationship and is the one that is normally meas- 
ured in practice. If all plans and expectations are 
perfectly realized, the ex ante and ex post functions 
are equivalent. 

Functional forms. The most popular form of 
the production function for statistical testing is the 
Cobb-Douglas function (Cobb & Douglas 1928), 
ke. 


X = AK*L, A, a, B>0, 


where a and £ are elasticities of output with respect 
to capital and labor. Assuming perfect competition 
in the markets for output and for factors of pro- 
duction, with prices P for output, W for labor, and 
R for capital, we obtain the marginal productivity 
conditions 8 = WL/PX and a = RK/PX. In equilib- 
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rium the elasticities are equal to the ratios of factor 
rewards to total revenue. The Cobb-Douglas func- 
tion is a homogeneous function of degree a + £. 
If a + 8 = 1, then increasing both K and L by the 
same proportion will increase output by that pro- 
portion, i.e., constant returns to scale prevails. 
Decreasing and increasing returns to scale corre- 
spond to the cases œ + 8 <1 and a+ 8> 1. The 
Cobb-Douglas function can easily be extended to 
cover the case of many inputs and many outputs. 
Another form of the production function often 
used for statistical testing is the general constant- 
elasticity-of-substitution (CES) form, i.e., 


X = y[8K-e + (1 — 8) L~]-%, 


where y is a positive constant reflecting the scale 
in which X is measured, ô is a “distribution” param- 
eter (0<68<1), and v is a nonnegative scale 
parameter which increases with economies of scale 
and takes a value of unity when there are constant 
returns (Arrow et al. 1961). The parameter p is 
closely related to the elasticity of substitution (o) 
between capital and labor: 


sie dlog (K/L) XP i 
dlog[(@X/dL)/(0X/0K)] ~~ 1 +p 

When the elasticity of substitution approaches 
unity (i.e. when p approaches zero), the CES 
function simplifies to the Cobb-Douglas form. 
When the elasticity approaches zero (i.e., when p 
approaches infinity), the CES function becomes a 
Leontief-type input-output function. The marginal 
productivity conditions with v = 1 give 

WL _1-—8/K\ 

Te- 1(E) 


and 


tog (7) = hos - 8) - 5 4 log (4): 
The CES function can be extended to incorporate 
many inputs, but it is not so simple an extension 
as in the case of the Cobb-Douglas function. 

Specification and estimation. In estimating the 
parameters, one may fit either the production func- 
tion itself or the marginal productivity equations. 
Most Cobb-Douglas function studies have estimated 
parameters directly from the logarithmically linear 
production function, thus leaving open an apparent 
test of the marginal productivity law. Most CES 
studies have avoided fitting the nonlinear produc- 
tion relation and have instead concentrated on fit- 
ting the logarithmically linear marginal productivi ty 
equations. 

Simultaneous estimation of both production and 
marginal productivity relations is beset with dif- 


ficulties of statistical and economic specification, 
identification, and interpretation (Marschak & 
Andrews 1944). Thus, for the Cobb—Douglas form 
(and denoting logarithms by lower-case letters) we 
have for the production function 


x—ak—pl=a+uy 
and for the marginal productivity relations 


x-k =-a+r-p+iun 
x -l=-ß+w-p+u. 


The variables us, u,, and u, may be interpreted as 
random variables, u, affecting “productive effi- 
ciency” and u, and u, affecting “economic efficiency” 
in choosing the correct factor inputs. Different 
interpretations are, however, required for cross- 
section and time series studies. To estimate the 
parameters, restrictions must be imposed on the 
joint distribution function of us, u,, and u.. If 
there is no correlation between u, and u, and be- 
tween u, and u., then a simple extension of the 
usual method of least squares will provide suitable 
estimates, but the empirical evidence suggests that 
there are in fact significant correlations between 
technical and economic efficiency (Walters 1963). 
Cost functions. An alternative way of analyzing 
the production process is to estimate the cost func- 
tion, which describes cost as determined by the 
level of output and the prices of inputs when the 
firm uses the most efficient technique. If the per- 
fectly competitive marginal productivity conditions 
are substituted into the Cobb-Douglas production 
function, we obtain the cost function (C = cost), 
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which is linear in the logarithms of factor prices 
and output. Most empirical studies of cost have 
been concerned with measuring the variation of 
C and X, and not with estimating the effect of 
factor prices on C. It is clear, however, that if factor 
prices do change, there is an opportunity to meas- 
ure the coefficients of the production function by 
regressing cost on factor prices and output (Ner- 
love 1963). 

Cost functions, although apparently more useful 
than production functions because of the avail- 
ability of accounting data, are often more intract- 
able, owing to the difficulties of defining and meas- 
uring cost. The cost function reflects not only the 
technological conditions of production but also 
competitive conditions in factor markets. If, for 
example, the business is faced with a rising supply 
curve of labor, the parameters of that supply curve 
will enter as determinants of the cost function. In 
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addition, one does not escape the simultaneous 
equations difficulty referred to above. 

Time series estimates. The production function 
or cost function of the firm can be measured by 
observing the firm as it reacts to different stimuli— 
such as changes in relative factor and output 
prices. A time series of observations will produce 
variations in output, inputs, and cost from which 
production and cost functions can be traced. These 
observations will usually generate the short-run pro- 
duction and cost relationships—very short-run for 
monthly observations—and what may be called an 
intermediate-run production function for annual 
observations. The main difficulty with this type of 
study is that it samples a dynamic adjustment 
process—a mixture of factor price movements, 
technological change, and exogenous shocks. One 
cannot be sure that one has identified the static 
production function or cost function. 

Most time series studies of firms have been cast 
in the form of cost functions. Accounting data are 
adjusted for changes in factor prices. Usually the 
cost is measured as “direct” or “production” cost, 
but sometimes total cost has been used. The dif- 
ficulties with accounting data are that (a) the 
unit period (the financial year) is longer than 
the short period of economic theory, and (b) the 
valuation of stocks, capital, and depreciation is 
usually conventional and based on the require- 
ments of tax law. Studies vary according to their 
success in dealing with these problems. The gen- 
eral result is, however, reasonably clear; marginal 
cost is constant or even declining over the range 
of outputs recorded in the statistical studies. There 
is no striking evidence of sharply rising marginal 
cost. Close scrutiny of these results reveals that 
some can be explained by the fact that output levels 
were cyclically low relative to the size of the plant. 
There does remain, however, evidence of excess 
capacity in certain industries. 

Cross-section estimates. To avoid the problems 
of technological and other changes over time, re- 
searchers frequently use observations on a number 
of firms for a particular year—a cross-section 
sample. Variations in inputs and outputs from one 
firm to another provide the raw material for cross- 
section estimates. Differences in the sizes of firms 
are so large and have normally persisted for so 
long that the cross-section results are usually in- 
terpreted as long-run relationships. 

The main difficulty with cross-section analysis, 
however, is that in a competitive market there is no 
Separate and quantitatively different stimulus for 
each firm, The success of cross-section studies of 
households in measuring Engel curves is due to the 
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fact that each household has a different income, 
which generates different expenditure patterns. 
But this is not the case with businesses in a com- 
petitive industry. For firms in a competitive mar- 
ket, prices are the same throughout, so that re- 
corded variations in inputs and outputs must be 
due to influences other than price—perhaps due to 
accidental influences or to special nonexchange- 
able inputs, such as “entrepreneurship.” There is 
no variation in factor price ratios to generate 
variations in relative factor inputs. Any observed 
differences in factor inputs are caused by differ- 
ences in production functions (or accidents), and 
the observations do not identify a particular pro- 
duction function. Similarly, the cost function in a 
perfectly competitive cross section shows that aver- 
age costs, as measured by price, are constant over 
the sample. If one measured costs by deducting 
entrepreneurial rewards, one would find only how 
these rewards per unit of output varied over the 
population of firms; nothing can be deduced about 
the variation in cost as a representative firm ex- 
pands its output. Constant cost curves in cross sec- 
tions may be evidence of competition rather than 
of constant returns to scale. . 

This criticism does not apply to studies of firms 
in isolated factor or goods markets or to instances 
where there are imperfections in factor markets 
and output markets or where the level (or price) 
of output is controlled exogenously by a govern- 
ment agency. In these cases, there is some op- 
portunity for measuring the production and cost 
functions (Nerlove 1963). When the production 
relationship is measured directly, i.e., when quanti- 
ties of inputs and outputs are used, the results 
should approximate the underlying production func- 
tion. But in measuring marginal productivity con- 
ditions and cost functions, the parameters of the 
supply curve of factors must be known before one 
can calculate the production function or cost func- 
tion parameters. With international or, in some 
cases, interregional cross sections it is sometimes 
reasonable to specify both separate factor or goods 
markets and competitive conditions in each; in 
these circumstances a cross section may give ob- 
servations which are suitable for estimating pro- 
duction functions. 

The general result from cross-section cost studies 
has been that average cost declines rapidly for 
small outputs and is more or less constant for 
large outputs. There is no evidence that large firms 
incur high costs. For a competitive industry the 
above interpretation would suggest that small firms 
probably produce specialized products which com- 
mand a high price, whereas the large and medium- 
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size firms produce similar commodities which must 
sell at more or less the same price. For public 
utilities, certain monopolistic industries, and rail- 
ways, however, the predominant result—declining 
cost curves—does indicate economies of scale; but 
in view of the regulations imposed by governments 
and (sometimes) the noncompetitive nature of 
factor markets, the result must often be interpreted 
with caution. 

For studies of production functions from cross 
sections of firms (except public utilities), the re- 
sults tend to be that (a) the order of the homo- 
geneous production function is close to unity and 
(b) the share of wages is not very different from 
the value predicted by the production function 
(ie., B in the competitive markets, Cobb-Douglas 
case). There are, however, many exceptions to 
these generalizations. Whether one may interpret 
these findings as evidence of (a) constant returns 
to scale of a “representative” production function 
and (b) the workings of the marginal productivity 
law is open to question (see Walters 1963, p. 37). 
Even if one waives the argument that the observa- 
tions do not trace a “representative” production 
function, there are still two main problems. First, 
least squares estimates may give seriously biased 
results because of the simultaneous equations form 
of the underlying model. Second, to identify the 
production coefficients, one must avoid measuring 
some combination of the marginal productivity 
conditions or the condition that all revenue is ex- 
hausted. In very few studies have simultaneous 
identification methods been used with suitable safe- 
guards. The development of joint cross-section and 
time series data has enabled investigators to dis- 
tinguish “firm effects” from random “time effects,” 
but no general results are yet available. 

Cross-section, interindustry studies. Some in- 
vestigators have measured an “aggregate” produc- 
tion function from industrial aggregates of output, 
labor, and capital. The function so measured de- 
scribes how production relationships vary not with 
the size of firms but with the size of industries. 
Large industries may be composed of small firms, 
and small industries may consist of one or two 
giant firms. The function tells us how the net 
values added of industries vary with their inputs. 
The results might be expected to reveal whether 
there are external economies depending on the size 
of industries. The estimates have not been inter- 
preted in this way; normally they have been thought 
to shed light on whether economies of scale exist 
in the firm. Additional difficulties of interpretation 
arise from the variation in techniques between in- 
dustries, which are reflected in differences in coef- 


ficients for each industry. The heterogeneity of 
output and capital forces investigators to measure 
these in money terms. The supply conditions in 
factor markets and monopolistic power in product 
markets then affect the estimated coefficients, 

The general results of interindustry studies are 
of the familiar pattern, i.e., œ + 8 = 1 and £ is ap- 
proximately labor’s share of total income. But the 
fact that the coefficients sum to unity is not evi- 
dence of constant returns to scale; it tells us merely 
that industries with large values added use pro- 
portionately more inputs than do industries which 
have proportionately smaller values added. Thus 
the productivity of factors is not much affected by 
classification into industries. This is a result which 
one would expect in a more or less free competitive 
market. Similarly, the labor-share result is simply 
a consequence of measuring the equation: Value 
added = V = RK + WL, which gives d(V/RK) = 
Wd(L/RK). From the Cobb-Douglas function 
(a+ B=1) one derives the same slope, W, be- 
tween net value added per dollar of capital and 
labor input per dollar of capital, so that the labor- 
share result proves merely that the value added is 
exhausted by the factors. In sum, the cross-section, 
interindustry studies do not measure the produc- 
tion function and shed no light either on marginal 
productivity theory or on economies of scale. 

Time series studies of technical change. The 
first investigations of production functions were 
made with aggregate time series data, usually for 
the manufacturing sector of the economy, Data 
before the great depression of the early 1930s re- 
vealed the two common results, i.e., œ + 8 = 1 and 
8 = labor's share of total income. For periods which 
include the 1930s, 1940s, and 1950s the results 
deviate considerably from the pre-1930 values. This 
clearly indicates that the early results were largely 
the product of a peculiar historical period. 

One early criticism of aggregate time series 
studies was that the data simply measured trends 
in technological progress. In measuring the effects 
of technical progress with a Cobb-Douglas func- 
tion, the most common specification is a geomet- 
rical or exponential time trend. Progress is then 
“neutral” in both the Harrod and the Hicks senses. 
A surprisingly large fraction of the percentage 
change in output is attributed to neutral technical 
progress (between 60 per cent and 90 per cent is 
the common result), and only a correspondingly 
small fraction is attributed to the increased use of 
capital in production. This outcome is partly due to 
labeling as “technical progress” the increase in out- 
put which one cannot statistically attribute to other 
causes. When the basic model has taken into ac- 


count returns to scale, it seems that a substantial 
fraction of the progress is in fact due to economies 
of scale. 

A further development has been to “embody” 
technical progress in capital equipment at the date 
at which the plant was built. But with time series 
data, embodied and neutral technical change are 
confounded, so the division is not observable. Data 
on education and other indicators of the quality of 
the labor force have also been used to examine 
embodied technical progress. 

Another main development has been to con- 
sider the dynamic adjustment of the economy over 
time; this involves separate specifications for short- 
run and long-run adjustments. The last and most 
extensive development has been to put the produc- 
tion function into the framework of an economic 
system specified in the form of a set of simulta- 
neous equations. The production relationships ap- 
pear as a subset of the equation system. 


In spite of the effort devoted to research in pro- 
duction and technical progress, our ignorance on 
this fundamental problem is the Achilles heel of 
many economic models and policies. The planning 
models of a Leontief or linear programming type 
probably suffer more from incorrect specifications 
of production functions and progress functions 
than from any other error. Painstaking, detailed 
research into processes and innovations is the only 
answer. 

A. A. WALTERS 


[Directly related are the entries AGRICULTURE, article 
On PRODUCTIVITY AND TECHNOLOGY; ECONOMIES OF 
SCALE; FIRM, THEORY OF THE; INPUT-OUTPUT AN- 
ALYSIS; PRODUCTION.] 
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PRODUCTIVITY 


Productivity, as discussed here, refers to a class 
of empirical output—input ratios that is widely used 
in economic history, economic analysis, and eco- 
nomic policy. In one sense, productivity measures 
the fruitfulness of human labor under varying cir- 
cumstances. In another sense, productivity meas- 
ures the efficiency with which resources as a whole, 
including capital as well as manpower, are employed 
in production. In still another sense, productivity 
measures the forces that underlie the trend of real 
wages. And in a fourth sense, productivity meas- 
ures a major factor in the determination of labor 
or capital requirements. Productivity measurements 
are addressed to important questions. 

As we shall see, some of the questions are best 
answered with one kind of productivity measure- 
ment, and other questions with other kinds. For 
this reason, productivity ratios appear in a variety 
of forms. In addition, there are some forms that 
simply reflect the dearth of statistical information. 
These productivity ratios serve as the best available 
approximations to the desired measurements. To 
grasp the subject of productivity, therefore, it is 
necessary to begin by considering both the concept 
of productivity and its measurement. 


Kinds of productivity indexes 


In each of its forms, productivity appears as a 
comparison of an output with one or more inputs; 
that is, as a comparison of an output with the serv- 
ices of one or more of the resources used in pro- 
ducing the output. The comparison is most fre- 
quently put in the form of a ratio of the one to the 
other. Then, because the output-input ratio of 
a particular time and place has limited significance 
standing by itself, it is compared with the corre- 
sponding ratio of another time or place to estimate 
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changes or differences in productivity levels. The 
comparison is usually expressed in relative form; 
that is, as an index of productivity. 

In the output—input ratios with which we are 
concerned, both numerator and denominator are 
measured in physical units or (when heterogeneous 
items are combined, which is most frequently the 
case) in “constant-price” money values, This “phys- 
ical productivity” is to be distinguished from “value 
productivity,” in which output is measured in cur- 
rent money values while input is measured in phys- 
ical units or constant-price money values. It should 
be distinguished also from “cost of production per 
unit,” an input-output ratio in which input is meas- 
ured in current-price money values and output in 
physical units. 

Change in productivity can be measured by 
change in the relation between output and one or 
more inputs, either with other inputs kept constant, 
as in an experiment under controlled conditions, 
or with other inputs free to vary. We shall deal with 
the latter case. This means that a change in the 
relation between output and input, if the input is 
not the total input, may reflect substitution between 
the inputs covered by the productivity ratio and 
those not covered. 

Finally, productivity in a given period can be 
measured by the ratio of the output to the input 
of the period, or it can be measured by the ratio 
of the increment in output to the increment in in- 
put during the period. That is, productivity may be 
“average productivity” or “marginal productivity.” 
We shall deal with average productivity. It should 
be noted, however, that under certain conditions 
(mentioned in the discussion of productivity and 
wages) indexes of average and marginal produc- 
tivity are identical. 

Our discussion is limited, then, to average phys- 
ical productivity; that is, to the ratio of physical 
output to one or more physical inputs, other inputs 
not necessarily being constant. It is this subfamily 
of output—input ratios to which most productivity 
indexes relate. 

Measure of input. While the field has been 
narrowed, it is still rather wide. To illustrate: (1) 
Output can be compared with the sum of all the 
hours of labor—“the manhours”—spent in produc- 
tion (output per manhour). Account is thus taken 
not only of the number of persons engaged but 
also of the long or short hours put in by those per- 
sons who work more or less than the average work 
week or year. When comparisons are made over 
time, allowance is thus made for the declines that 
have occurred in the length of the usual work week 
or year. When the comparisons are between na- 


tions, allowance is made for international differ- 
ences in the length of the work period. (2) Output 
can be compared with the weighted sum of man- 
hours employed (output per unit of labor input). 
That is, an hour of high-quality labor—a highly 
paid manhour of work—is counted as proportion- 
ately more than an hour of low-quality labor—a 
lower paid manhour. In this way, account is taken 
of differences in education, length of experience, 
and other factors determining the quality of labor. 
Thus, also, account is taken of the education and 
other intangible capital invested to improve the 
quality of labor. Or, (3) output can be compared 
with the services of the tangible capital employed 
in production—plant and equipment, tools and roll- 
ing stock, land and mines, and inventories of all 
sorts—as well as the services of the labor resources, 
each appropriately weighted (“output per unit of 
total input,” or “total productivity,” or most specifi- 
cally, “output per unit of labor and tangible capital 
input”). 

Obviously, of the three productivity ratios men- 
tioned, output per manhour requires the least in- 
formation and is easiest to calculate, and output 
per unit of labor and capital input requires the 
most information. It is not surprising, therefore, 
that the more complex measurements are also of 
more recent vintage and are less frequently avail- 
able than the relatively simple output per man- 
hour. 

However, the productivity measurements that 
are more complex in terms of data requirements 
are—from our point of view—less complex in 
terms of content or meaning. Output per unit of 
labor input, for example, describes more than does 
output per unit of labor and capital input. The 
former is equal to the latter multiplied by another 
ratio, the weighted combination of labor and 
capital input per unit of labor input. Output 
per unit of labor input will rise faster than 
output per unit of labor and capital input when 
tangible capital input grows more rapidly than 
labor input. Similarly, output per manhour will 
rise faster than output per unit of labor input when 
education and other investments cause the quality 
of labor to improve. In short, change in output per 
manhour may be viewed—and later we will find it 
useful so to view it—as reflecting the combined 
effect of change in three things: (1) efficiency, as 
measured by output per unit of labor and capital 
input; (2) the relative supply of tangible capital, 
as measured by labor and capital input per unit of 
labor input; and (3) average labor quality, as meas- 
ured by labor input per manhour. 

This chain of productivity ratios can be readily 


extended to include ratios that are more or less 
complex (in either of the senses mentioned) than 
the ratios included in our illustrative series. Thus, 
when the productivity of an industry or other sec- 
tor of an economy is considered, and the informa- 
tion is available (which it seldom is), output can 
be compared not only with the services of the labor 
and tangible capital employed by the sector but 
also with these plus materials, components, fuel, 
supplies, and other commodities and services pur- 
chased from other sectors. This productivity ratio, 
“output per unit of labor, capital, and material in- 
put,” is another, and somewhat different, measure 
of output per unit of total input. Change in output 
per manhour can then be said to combine the ef- 
fects of change in (1) efficiency, now measured by 
output per unit of labor, capital, and material in- 
put, and (2) total resources per manhour, meas- 
ured by labor, capital, and material input per man- 
hour. 

When the data are severely limited (which is 
often the case), output can be compared simply 
with the average number of persons engaged in 
production. The productivity ratio would then be 
simply “output per man.” Change in output per 
man combines the effects of change in (1) effi- 
ciency and (2) total resources per manhour, as 
just defined, and (3) average hours worked per 
man. 

Measures of output. Productivity ratios may 
differ also with regard to the output included in 
the numerator. The output of a country may be 
identified with the goods and services produced 
within its borders (net domestic product) or with 
the goods and services available to its normal resi- 
dents (net national product). National product dif- 
fers from domestic product by the amount of goods 
and services (or the equivalent in claims on for- 
eigners) financed by net incomes received from 
abroad and by net gains from improvements in 
the terms of foreign trade, One must decide 
whether or not to define a country’s productivity in 
such a way as to make it depend in part on changes 
abroad, or at least one must recognize that its pro- 
ductivity has been so defined when it is measured 
by national product per unit of input. 

The use of output concepts like gross national 
product and gross private domestic product reflects 
not so much a preference for these concepts over 
net national product or net domestic product as a 
serious doubt about the accuracy of the measure- 
ment of capital consumption in the former case 
and of the government component of national prod- 
uct in the latter case. This means, of course, that 
ross national product per manhour, for example, 
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is only an approximation to net national product 
per manhour. 

The output of an industry can also be assigned 
different meanings. It can be measured either by 
the “real value added” by the industry or by its 
“real value of product,” where the former excludes 
and the latter includes the value of purchased ma- 
terials, fuel, and energy. To measure an industry's 
“total productivity,” then, one can compare its real 
value added with its labor and tangible capital in- 
put, or its real value of product with its labor, 
tangible capital, and material input. Less consist- 
ent, but usually unavoidable because information 
is lacking, is the comparison of real value of prod- 
ucts with labor and tangible capital input. The 
three productivity indexes will not necessarily par- 
allel one another. Only the first index is directly 
comparable with the national index of output per 
unit of labor and tangible capital. 

In most discussions of productivity, it is cus- 
tomarily assumed that the differences among the 
several output concepts are small and can there- 
fore be safely ignored, largely because scarcity of 
information makes a choice impossible. It is cer- 
tain, however, that the differences are not always 
negligible. But we shall have to follow the custom, 
except for a later comment or two, and concen- 
trate on those differences among productivity in- 
dexes that arise from differences in the inputs. 

Even when there is a choice among alternatives 
and even when the output and the input indexes 
can be made fully consistent, productivity indexes 
are at best crude and somewhat ambiguous meas- 
ures. As with all index numbers, troublesome ques- 
tions arise, and rather arbitrary decisions are made, 
concerning the aggregation of heterogeneous and 
qualitatively changing inputs or outputs. And there 
are special difficulties—for example, in the meas- 
urement of capital input and of the employment 
and hours of family workers. Users of productivity 
indexes must learn not to make fine comparisons. 

Not every important productivity ratio has been 
mentioned. However, for the immediate purpose, 
which is to identify some of the major productivity 
ratios and to make clear why it is necessary to dis- 
tinguish carefully among these and other ratios, 
our list is sufficient. 


Productivity indexes and production functions 


The differences among the several productivity 
indexes can be put more clearly in symbolic terms. 
The symbols are useful also for showing the rela- 
tion between the indexes and statistical production 
functions. 

Following the usual convention, let the following 
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symbols stand for index numbers, on a common 
base, of the items specified: Y = output, N = man- 
hour input (unweighted manhours), L = labor in- 
put (weighted manhours), and K = tangible cap- 
ital input. To combine labor and capital input it 
is necessary to express them in comparable units, 
namely, a dollar's worth of services in the base 
period or place; or, what is the same, to calculate 
the index of labor and capital input by taking a 
weighted arithmetic average of the indexes of labor 
input and of capital input, with the weights equal, 
respectively, to the total value of labor's services 
and of capital’s services in the base period. We 
have, then, aL + bK = labor and tangible capital 
input, where a is the fraction of total input con- 
tributed by labor, b is the fraction of total input 
contributed by capital, and a + b = 1. 

The several productivity indexes are Y/N = out- 
put per manhour, Y/L = output per unit of labor 
input, Y/(aL + bK) = output per unit of labor and 
capital input. In these terms, output per manhour 
is related to output per unit of labor and capital 
input as follows: 


Y/N = [Y/(aL + bK)] [a + bK/L] [L/N]. 


When L and K are combined by means of a weight- 
ed geometric mean, which is quite rare in the cal- 
culation of productivity indexes, this relationship 
becomes 


Y/N = [Y/(L*K?)] [(K/L)?] [L/N]. 


Statistical production functions are empirically 
determined relationships between output (or an 
index of output) taken as the dependent variable 
and the two inputs (or the corresponding indexes ) 
taken as the independent variables, The most 
widely used function assumes a logarithmic-linear 
relationship between output and input, and a uni- 
form rate of shift over time in this relationship as 
technology and other determinants change. The 
function is 


Y=LeK"(1+7)!, 


with a*+b* taken equal to 1. The coefficients 
a’ and b” are, in effect, the weights of L and K, 
respectively, as determined by fitting the func- 
tion to the statistics by one of the usual regression 
methods. Under competition and certain other con- 
ditions, to be discussed later, a* tends to be equal to 
a, and b’ tends to be equal to b. The coefficient r 
measures the average annual rate of shift in the 
relationship between output and input when t is 
measured in years. The variable t is used as a proxy 
for changes in technology, the scale of operations, 


and the other factors that change over time and 
cause this shift, and r is therefore usually (and 
loosely) called the rate of technological change. 
In our terminology, r is the average annual rate of 
change in output per unit of labor and tangible 
capital input. To illustrate: When input is the same 
in the given as in the base period, output will be 
greater in the given period, t years after the base 
period, by the amount (1 + r)‘. When the compar- 
ison is between places rather than periods, t can no 
longer be used as a proxy for technology and the 
other factors that determine shifts in the relation 
between output, Y, and the inputs, L and K. 

In most cases, N rather than L is used to meas- 
ure labor input. The production function is then 
more accurately expressed as 


Y = N@K™(1 +r). 


In this case, (1 +r)! is equal to (Y/L**K’")(L/N)", 
not to (Y/L*K"*) alone. That is, r measures the 
average annual rate of change in output per unit 
of labor and capital input combined with the 
change in the quality of labor. 

An alternative function, which assumes an arith- 
metic-linear relationship, is 


Y=(a@L+b°K)(1+7r)', 


with a'+b'=1. Here (1 +r)'=Y/(a'L + b'K), 
which is identical with the index of output per unit 
of labor and capital input, as usually calculated, 
except for possible differences between a and a’, 
and therefore between b and b*, There may also 
be a difference arising from the fact that r is the 
trend rate of increase over the whole period covered 
by the function, and not just the rate of change 
between the base year and the given year. Ob- 
viously, either type of equation can be modified so 
as to treat separately the input of (unweighted) 
manhours and the input of intangible capital in- 
vested in education, to take account of materials, 
fuel, and other purchased inputs, to include the 
terms of foreign trade as an independent variable, 
and so on. 

Statistical production functions provide an alter- 
native way of calculating productivity indexes, 
particularly indexes of output per unit of labor and 
capital input, and of their rates of change. So 
viewed, the production function approach has the 
advantage of requiring no direct information on 
the weights needed to combine L (or N) and K. 
The approach also has the advantage of requiring 
explicit recognition of the assumptions made about 
the character, or “shape,” of the relationship be- 
tween output and input. At any rate, it provides 


a better way to state these assumptions and a better 
setting in which to study them, as we shall see. 


Variation in labor productivity 

Indexes of output per manhour—commonly 
called “labor productivity” indexes—state the yield 
of commodities and services obtained, under vary- 
ing circumstances, from the expenditure of an hour 
of human labor. In these terms they report on facts 
of high economic importance—the relative power 
of men of different generations or different nations, 
or of men of the same nation and generation under 
different conditions, to produce the things men 
want. 

Secular trends. A glance at secular trends over 
the last hundred years or so, which is about as far 
back as most of the information goes, suggests a 
remarkably general, perhaps world-wide, develop- 
ment. In all countries for which acceptable esti- 
mates are available for at least fifty years—mostly 
today’s developed countries—trends in national 
output per manhour have been upward. We cannot 
be sure about the more remote decades of the past 
century, when the records even of the developed 
countries are poor. And there is doubt about the 
trends even over the more recent decades in the rest 
—the larger part—of the world, which is heavily 
weighted with countries in which present levels of 
labor productivity are very low by Western stand- 
ards, Yet even for the latter, there are grounds— 
Scraps of direct information, and information on 
the causes and the consequences of higher labor 
productivity—for supposing that the present low 
levels are probably no lower, and in most countries 
are in fact higher, than the levels of a hundred 
years ago. The exceptions, if any, are likely to be 
the countries suffering the effects of war or social 
upheaval. Their current levels of productivity are 
below the secular trend lines rather than on them. 

While nearly all countries, if not quite all, have 
Probably raised their labor productivity levels over 
the past century, the rates of advance have been 
highly varied. Most of the countries for which di- 
Tect estimates are available for a sufficiently long 
Period fall in the range between 1.5 and 2.5 per 
cent per annum. There are none higher than about 
2.5, and only a few lower than 1.5, with none 
much below 1.0. This sample is bound to be biased, 
however, because it consists largely of the devel- 
‘oped countries. Including the other countries, which 
are probably mostly in the lower portion of the dis- 
tribution, it is a fair guess that a range of 0.5 to 
2.5—almost surely, of 0.0 to 2.5—would embrace 
virtually all the countries of the world. This range 
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is wide indeed. Over a century it means the differ- 
ence between little or no change and an 11-fold 
increase. It means, also, that countries have dif- 
fered greatly in the degree of change of their tan- 
gible capital per worker, or the quality of their 
labor, or the efficiency with which they use capital 
and labor, or in all three respects. 

Even if there has been no improvement in some 
parts of the world, it is clear that for mankind 
as a whole the average level of labor productivity 
has risen. Over the century taken as a whole— 
though not in recent years—it has been pushed up 
also by a more rapid population growth in coun- 
tries with high levels of output per manhour. It 
is hazardous to express these developments in 
quantitative form, but it seems reasonable to be- 
lieve that the average rate of increase in the 
productivity of all human labor during the past 
century has been something over 1 per cent per 
annum and perhaps closer to 1.5 than to 1. This 
much is certain: the rise in labor productivity over 
the past century has been far greater than the 
average rate of increase experienced during man's 
existence on earth. It is also certain that it has 
been far less than the rate of increase the world 
now assumes in its plans for the future. 

Over the century as a whole, the earth’s popu- 
lation has risen less than 1 per cent per annum, 
and manhours worked per capita have probably 
fallen a bit. It is highly probable, therefore, that 
the average rate of increase in output per manhour 
has been higher than the average rate of increase 
in the aggregate amount of work done. In other 
words, the present generation of men produces far 
more than earlier generations because there are 
more men; but, even more important, also because 
each man today is able on the average to get more 
from an hour of his labor than were his ancestors. 
He has less land to work, it is true, but far more 
capital of other kinds—largely inherited—to work 
it with. And on the whole, the use of his time and 
capital is more efficiently organized. 

International differences. The disparities in out- 
put per capita that appear over the surface of the 
earth are also to be explained very largely, if not 
entirely, by differences in output per manhour. In 
the poorer half of the earth, output per manhour 
in 1960 was probably at a tenth or less of the 
western European level, and the latter was half 
the level of the United States. International differ- 
ences in manhours per capita are very small com- 
pared with those differences in output per man- 
hour and cannot account for much of the variation 
in per capita output. It may be, in fact, that per 
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capita output is greater in the countries working 
fewer manhours per capita. 

Some part of the present differences among na- 
tions in levels of labor productivity is the result 
of differences in the long-term rates of growth of 
labor productivity. But today’s large variation can- 
not be accounted for entirely by differences in 
trends, over a single century, of the order of mag- 
nitude mentioned. A difference even of 2 per cent 
per annum would make the current productivity 
level of western Europe, for example, no more than 
seven times the level of southern Asia, had these 
areas started from equal levels. The current dis- 
parities between the highly productive and the less 
productive countries must therefore bear a resem- 
blance to those of a century ago but be on the 
whole much wider, This is not inconsistent with 
important shifts in the rank of individual nations, 
which have also occurred. A century ago the labor 
productivity level of the United Kingdom, for ex- 
ample, was about the same as that of the United 
States and about double that of Sweden. In 1960, 
the British level was less than half the U.S. level 
and somewhat below that of Sweden. 

Fluctuations. The average rate of growth in 
labor productivity over the century as a whole is 
not the whole story, The upward march of labor 
productivity has often been speeded up and as 
often slowed down, sometimes to the point of de- 
clining absolutely for a time. Among the sources 
of these fluctuations have been the weather, which 
has often put a visible mark on national output 
per manhour in agricultural countries; business 
cycles, of particular importance in industrialized 
countries; and the catastrophes of war and political 
and social upheaval. Not only domestic conditions 
but also conditions abroad, it should be noted, have 
caused fluctuations in the yield from an hour of 
labor as measured by national product. 

The impress of business cycles on labor pro- 
ductivity is pronounced when output is compared 
with the manhours that might be supposed to be 
“available” for work under conditions of full em- 
ployment. But it is visible also in the more usual 
comparison of output with the manhours actually 
worked. Annual figures show a more rapidly ris- 
ing labor productivity during business expansions 
than during business contractions. [See BUSINESS 
CYCLES.] Quarterly figures, confined largely to 
the commodity-producing industries of the United 
States, suggest that the rise is slower during the 
second half than during the first half of expan- 
sions, and especially slow during the first half of 
contractions. This cyclical behavior (which could 
hardly have been foreseen on the basis of a priori 


considerations alone) is evidence of a systematic 
swing in the balance between the factors that make 
for rise and those that make for decline in labor 
productivity—the factors that determine the aver- 
age quality of labor input, the volume of capital 
services available per worker, and the efficiency 
with which labor and capital are utilized. It has 
interesting implications for the explanation of the 
cyclical behavior of labor costs, of profit margins, 
and thus also of decisions to invest. 

The effects on labor productivity of wars, revo- 
lutions, and other serious disturbances have been 
less frequent but usually more violent and pro- 
longed than those of crop cycles or business 
cycles. Disorganization, undermaintenance and 
destruction of plant and equipment, and the loss 
or diversion of trained personnel have sometimes 
caused labor productivity to fall drastically. During 
recovery, labor productivity has often risen for 
some time at an exceptionally rapid pace, but these 
high rates have not always continued long enough 
to erase the effects of the catastrophe from the 
long-term trend. 

Apart from these fluctuations, there have also 
been significant, though less obvious, differences 
among trends measured over a decade or more. In 
most countries, for example, the decade beginning 
about 1952—which is presumably free of most of 
the immediate effects of World War 11—witnessed 
rates of growth in labor productivity that appear 
high in comparison with the long-term averages. 

What is revealed by the fuller record may be 
illustrated by reference to the United States. Some- 
thing like five long waves, of irregular length, are 
discernible in the rate of increase in output per 
manhour in the United States between 1870 and 
1960, excluding the war periods. Rather clearer 
is a change in the long-term trend at about the 
time of World War 1. During the thirty years be- 
fore 1919, output per manhour rose at an average 
annual rate of 2.0 per cent, as measured by real 
gross private domestic output per manhour; during 
the forty years after 1919, the rate was 2.6 per 
cent per annum. These changes, and roughly 
similar changes in some other countries, have fed 
speculations about long cycles and about a tend- 
ency of labor productivity, and of the technological 
change that contributes to it, to accelerate. How- 
ever, the available evidence is too scanty and ir- 
regular to be conclusive. Of course, the net balance 
among the factors that influence trends in labor 
productivity (among which are changes abroad, 
as well as domestic factors) may be larger at one 
time than at another, without necessarily generat- 
ing systematic patterns of change. 


Whether the fluctuations are more or less sys- 
tematic, it is clear that growth of labor produc- 
tivity has been subjected to them. Yet the main 
impression conveyed by the records of labor pro- 
ductivity in various countries is not so much one 
of fluctuation as of a persistent and often powerful 
upward push. This impression is also conveyed 
by the records of individual industries within coun- 
tries. Even industries not obviously adapted to 
mass production methods, such as the service in- 
dustries, seem to have raised output per manhour, 
to judge from studies now under way. And this is 
also true of industries in which natural-resource 
limitations might be expected to play a consider- 
able role, such as mining and agriculture, Tech- 
nological changes, and in some cases also the 
opening of new sources of supply, have prevented 
the appearance in these industries of the “dimin- 
ishing returns” that would otherwise be apparent. 

Sometimes, perhaps because the term “labor 
productivity” is misunderstood, it is believed that 
wage earners (or “labor” as a whole) are wholly 
responsible for these widespread changes in out- 
put per manhour, It is true that among the factors 
that can cause labor productivity to rise (or to 
be higher in one place than in another) is change 
in the quality of labor, for labor productivity is 
measured by output per manhour and not by out- 
put per unit of labor input. But also involved 
are changes in the volume of tangible capital per 
worker and in the whole host of factors—besides 
those embodied in labor—that determine the effi- 
ciency with which manpower and capital resources 
are utilized in production. 


Variation in efficiency 


When efficiency in the use of resources as a 
whole is in question, the appropriate index of na- 
tional productivity is output per unit of labor and 
tangible capital input. It is hardly the perfect 
index, for reasons to be mentioned, but it provides 
the best practical approximation. Yet measurement 
of efficiency even in this way has hardly begun, 
except in the United States. It is therefore to the 
U.S. figures that we turn for a first view of the 
relevant magnitudes. 

It is no surprise to learn that the average quality 
of labor improved—by 0.3 per cent per annum, 
over the period 1889-1960, according to data on 
labor input (weighted manhours) per manhour; 
and that tangible capital per manhour also rose— 
by 1.5 per cent per annum, over the same period. 
Efficiency in the use of resources therefore went 
up less rapidly than labor productivity—by 1.6 per 
cent per annum, as compared with 2.2 per cent 
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per annum. What may be surprising is the differ- 
ence of only 0.6 percentage points between the two 
rates of increase. 

One reason for the small difference is the rela- 
tively heavy weight given labor input when it is 
combined with tangible capital input. The U.S. 
weights, based on national income data, are cur- 
rently 8 to 2. There may be a second reason: un- 
derstatement of the rise in the quality of labor and, 
therefore, overstatement of the rise in efficiency. 
The estimate of 0.3 per cent per annum increase 
in labor quality, based on the use of weighted man- 
hours, assumes all the labor within an industry 
to be homogeneous. Perhaps more important, it 
takes no account of the broad advances in educa- 
tion, health, and the like, which improve the quality 
of labor in industries generally. An estimate based 
mainly on the amount of schooling received by 
workers and therefore free of most of the defi- 
ciencies mentioned (but suffering from others) 
suggests a much higher rate of increase in the 
quality of labor—some 0.8 or even 0.9 per cent 
per annum. 

Also involved, although the direction of its effect 
on the index of efficiency is less certain, is the fact 
that labor and tangible capital input do not cover 
investment in the intangible capital of science, 
technology, and social organization that serves to 
increase production, It is clear that “research and 
development,” for example, is an alternative to 
investment in other types of resources. Whatever 
its origin—whether in activities economically mo- 
tivated or not—scientific, technological, and other 
useful knowledge is in some degree a substitute for 
other resources. But the services of the national 
stock of knowledge cannot be included in input 
because the concept of such a stock, or of changes 
in it, is elusive and not yet within the domain of 
practical measurement. Only in a few studies of 
industries or parts of industries—not entire econ- 
omies—has it been possible to include some in- 
vestments in technology among the inputs. This 
omission means that the available index of total 
input is biased downward if intangible capital in 
the form of technology has risen more rapidly than 
the resources that can be included. But no one 
knows what the difference is between these rates 
of growth, or can even be sure that that of intan- 
gible capital is the higher. This ignorance is also 
one reason why the index of output per unit of 
labor and tangible capital input is not a good 
measure of technological change, as some suppose 
it to be. 

An alternative and more widely used estimate 
of growth in efficiency is available for the United 
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States in the form of an index of gross private 
domestic product per unit of labor and capital in- 
put. This index went up by 1.7 per cent per annum, 
as compared with the 1.6 per cent increase of net 
national product per unit of labor and capital in- 
put. The latter measure is preferable on conceptual 
grounds because it covers the large and growing 
government sector as well as the private sector and 
because of other reasons already given. But the 
former index is more reliable because it avoids the 
conventional, and probably erroneous, assumption 
that labor productivity in government has not 
changed. The difference between the two—and be- 
tween these and other alternative indexes—under- 
scores the lack of precision even in the best of 
available productivity indexes. 

Two conclusions are suggested by the United 
States figures, subject to the reservations required 
by this lack of precision. First, the growth in labor 
productivity reflects upward trends in all three of 
its component factors—labor quality, tangible capi- 
tal per manhour, and efficiency—and not a net 
difference among positive and negative factors. 
Second, of the three factors, it is increase in effi- 
ciency that appears to have made the largest con- 
tribution to the increase in labor productivity. This 
means, also, that it has made the largest contri- 
bution to the growth of output per capita. 

As for countries other than the United States, 
there are grounds for supposing that they too have 
experienced not only the rather general growth 
of labor productivity already mentioned but also— 
at least in some degree—improvements in the 
quality of labor and increases in the volume of 
tangible capital per manhour. If the first conclu- 
sion drawn from U.S. experience holds for them 
also, we may expect that efficiency has generally 
risen, although less rapidly than labor productivity; 
and we may infer that international differences in 
efficiency will be smaller than the corresponding 
differences in labor productivity. 

The measures of efficiency that have been cal- 
culated—and they are far fewer than the indexes 
of labor productivity—seem consistent with these 
expectations. It must be noted, however, that 
for countries other than the United States, effi- 
ciency is measured (when it is measured at all) 
by a comparison of output with a combination 
of unweighted manhours and the services of tan- 
gible capital. The use of unweighted rather than 
weighted manhours means that the measure of 
changes in efficiency also covers the changes in 
the quality of labor. In the case of the United 
States during the period 1889-1960, the index of 


efficiency so calculated went up 0.3 per cent per 
annum more rapidly than output per unit of labor 
and tangible capital input. 

Growth in efficiency does seem to have been a 
rather general phenomenon. Efficiency has risen in 
each of the dozen or so countries for which trends 
over a decade or more have been calculated, al- 
though always less rapidly than labor productivity. 
There is evidence for some of these countries that 
efficiency has also risen in most, if not all, in- 
dustries, and—again—generally less rapidly than 
labor productivity. And there is some evidence, as 
well, that when the average level of efficiency of 
a country is high relative to other countries, effi- 
ciency is generally high also in each of its in- 
dustries relative to comparable industries in other 
countries. Further, as with labor productivity, the 
average rate of increase of efficiency has differed 
widely among countries, although somewhat less 
widely than labor productivity. Efficiency has dif- 
fered also—and to a greater extent—among in- 
dustries within countries, and it has differed 
among periods as well. 

If increase in efficiency has in fact been general, 
but highly varied, something is suggested about 
the factors involved. Advances in technology, for 
example, must consist not only of the major inno- 
vations that revolutionize this or that industry but 
also of a host of smaller innovations that arise 
over the whole range of economic life. Also, knowl- 
edge of innovations, even of those quite specific 
to particular industries or regions, is sooner Or 
later adapted, to some extent, to the peculiar con- 
ditions of other industries and other places. 

The available figures also support the second 
conclusion suggested by the U.S. data. Increase 
in efficiency seems to be important, judged by its 
contribution to the growth of output, output per 
capita, and output per manhour; and international 
differences in the level of efficiency seem to be 
important in explaining international differences 
in these quantities. 

This may be the most striking lesson learned 
from the measurements, to judge by the surprise 
expressed by many economists. Between 1928 
and 1948, under the influence of empirical work 
with the production function, Y= NK (with 
a’+b’=1), it was rather widely held that in- 
creases in output per worker or manhour, and also 
differences among countries in this respect, could 
be accounted for very largely, if not entirely, by 
the volume of tangible capital per worker or per 
manhour. Improvements and extensions of the em- 
pirical data, and calculations of indexes of output 


per unit of labor and capital or of production func- 
tions which allow for changes in efficiency, such 
as Y = N**K**(1+,1)!, have radically revised this 
opinion. It is now generally recognized that neither 
tangible capital alone nor even weighted manhours 
and tangible capital alone are sufficient to explain 
differences in output per manhour. Indeed, only 
a third of the average annual increase in the U.S. 
national output per manhour of 2.2 per cent be- 
tween 1889 and 1960 is accounted for by the in- 
crease in measured input—weighted manhours 
plus tangible capital input—per manhour. The rest 
is accounted for by the increase in measured 
efficiency. Figures available for Norway, Finland, 
Russia, and West Germany, covering periods of 
two to five decades, and for these and a few other 
countries for the postwar decade, tell a similar 
story. 

The issue now takes the form: How much would 
an adequate measure of investment in education 
and other forms of human capital leave to be ex- 
plained by efficiency more narrowly defined and 
measured? It seems questionable, however, to con- 
sider as negligible such factors as the growth and 
diffusion of knowledge (apart from what is cov- 
ered by investment in human beings); widened 
markets, which give more scope to the specializa- 
tion of workers, machines, and business establish- 
ments; reductions in hours of work, which enhance 
labor productivity and may also enhance efficiency; 
improvements in economic organization; and per- 
haps also changes in the character of consumption 
and in the way in which leisure time is used. 
Something is known about each of these factors, 
but to improve this knowledge and to determine 
the relative importance of each factor continues 
to be a major task in the analysis of economic 
growth. 


Productivity and wages 


The major determinants of the average real 
hourly earnings of a nation’s labor force are the 
nation’s efficiency in using labor and tangible capi- 
tal, the scarcity of labor in relation to tangible 
capital, and the quality of the labor force. The 
Productivity index that combines these factors is 
the index of national output per manhour. It is 
this index, therefore, that is used in the analysis 
of trends and of international differences in real 
' Wages, enters wage negotiations, and is looked 
upon as a guidepost in wage policy. 

Theoretical considerations. National output per 
manhour is a particular combination of particular 
Measures of efficiency, relative labor—capital scar- 


PRODUCTIVITY 531 


city, and labor quality. It is the appropriate com- 
bination for the purposes mentioned only under 
certain conditions that need to be made explicit. 
Even with these conditions satisfied, it is the 
appropriate combination only in relation to the 
average hourly earnings of all workers combined. 
These are workers of changing—usually improv- 
ing—quality. Most often, however, the earnings in 
question are those of workers of a substantially 
fixed quality, in a particular occupation or indus- 
try. In this case, the appropriate productivity index 
—if just one index is to be chosen—is national 
output per unit of labor input; that is, output per 
weighted manhour, not output per unweighted 
manhour. 

This will be understood if we first note that 
real labor income includes the earnings from the 
services of the intangible capital invested in edu- 
cation and other improvements in the quality 
of labor, as well as from the services of labor 
of minimum quality—roughly, “unskilled” labor. 
When this total labor income, per manhour, moves 
identically with national output per manhour, the 
fraction of national output that goes to workers 
in the form of real labor income and the fraction 
that goes to owners of tangible capital are con- 
stant. We can then ask under what conditions 
these fractions will in fact be constant. 

Such fractions will be constant when an increase 
in the quantity of tangible capital's services rela- 
tive to the quantity of labor’s services is accom- 
panied by an exactly proportionate decline in the 
price of tangible capital's services relative to the 
price of labor's services. Now, it is clear that a rise 
in the capital—labor quantity ratio will tend to push 
up the usefulness of labor relative to that of capi- 
tal. In more technical language, it will tend to raise 
the ratio of the marginal product of labor to that 
of capital. 

The required conditions are, then: (1) The 
rise in the capital-labor ratio must be accompanied 
by an exactly equal and opposite proportionate 
change in the ratio of the marginal products of 
labor to capital—which means that the elasticity 
of substitution between capital and labor must be 
exactly equal to unity. (2) The ratio of the mar- 
ginal products must be unaffected by changes in 
the composition of output and in the technology 
and other factors that underline efficiency—which 
means that changes in the product-mix and in 
efficiency must be “neutral.” (3) The ratio of the 
marginal products must be equal to the ratio of 
the prices of labor to capital services—which means 
that competition must prevail in the markets for 
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commodities, labor, and capital, so that change in 
the volume of tangible and intangible capital per 
manhour and increase in efficiency are free to 
bring appropriate adjustments in the rates of re- 
turn to labor and to capital. 

This reasoning may be put in terms of the, log- 
arithmic-linear aggregative production function 
described earlier, for this function (but not the 
arithmetic-linear function) assumes unitary elas- 
ticity of substitution and neutrality of technologi- 
cal change, among other things. It is the marginal 
product of labor, not the average product measured 
by output per unit of labor input, which tends to 
parallel the real wage rate under competition. But 
under the conditions described, the index of the 
marginal product of labor is identical with the 
index of the average product of labor, Y/L, as can 
easily be shown, If the relationship between na- 
tional output, labor input, and tangible capital 
input is as specified in the logarithmic-linear pro- 
duction function, and if competition prevails, the 
conditions hold. The index of national output per 
unit of labor input tends to equal the index of real 
wages per weighted manhour—that is, the index 
of real wages per hour of fixed (and average) 
quality. 

The index of productivity usually employed 
in discussions of wages is national output per 
(unweighted) manhour, however. But national 
output per manhour is output per manhour of 
changing quality. It is Y/N, not Y/L: Y/N = 
(Y/L)(L/N ), Under the conditions specified, then, 
this index of productivity tends to equal the index 
of average hourly earnings of labor of changing 
quality, not fixed quality. Given the same condi- 
tions, when the quality of labor improves, the index 
of productivity generally used (Y/N) will tend to 
rise more rapidly than an index of average hourly 
earnings of labor of fixed quality. 

Statistical evidence. Whether the specified con- 
ditions do hold is an empirical question not yet 
satisfactorily answered by direct tests of the elas- 
ticity of substitution, neutrality of change in tech- 
nology and product-mix, or even competition. A 
test of a sort is provided for all the conditions 
combined, however, by historical information’ on 
the fraction of national output received as real 
labor income. The share of national product going 
to hired workers in the form of wages and salaries 
(including “fringe” benefits) has generally risen, 
but the share of national product going to entre- 
preneurial income (which includes property as well 
as labor income) has generally fallen. It is difficult 
to determine the net result, and there is some tend- 
ency to overstate its stability—a change from 70 


per cent to 80 per cent may appear rather modest. 
On the whole, however, the fraction going to labor 
in various countries seems to have fluctuated 
around a substantially horizontal trend prior to 
World War 1 and a somewhat higher but still hori- 
zontal trend afterward. [See INCOME DISTRIBUTION, 
article on FUNCTIONAL SHARE. ] 

In terms of a comparison of long-term changes 
in productivity and real hourly earnings of produc- 
tion workers in manufacturing industries, available 
for the United States for 1889-1960, the conditions 
meet the test moderately well. The real hourly earn- 
ings rose at the annual rate of 2.3 per cent. Real 
national product per unit of labor input rose at the 
rate of 1.9 per cent per annum, and real private 
domestic product per unit of labor input rose at 
the average annual rate of 2.0 per cent. Because 
the manufacturing earnings are in some degree 
affected by improvements in labor quality, it is well 
to compare them also with the corresponding rise 
in output per manhour: 2.2 per cent per annum 
in the national economy, 2.4 per cent in the private 
domestic economy, Larger differences are revealed 
by comparisons over periods that, although shorter, 
are long enough to reveal a trend rather than a 
cyclical or an erratic fluctuation. For example, real 
hourly earnings in manufacturing in the United 
States rose less rapidly than national output per 
manhour in seven such periods between 1889 and 
1960, more rapidly in six, and at the same rate in 
only two. And the absolute difference between the 
annual rates of change in real wages and in na- 
tional output per manhour in all 15 periods aver- 
aged about 40 per cent of the average annual rate 
of change in output per manhour. 

If it is the real hourly income of an occupation 
of relatively fixed labor quality that is of concern, 
the appropriate productivity index is Y/L, not Y/N, 
as we have seen. The difference is not negligible, to 
judge from the figures just cited. If the wage rate 
of every occupation were to parallel national out- 
put per manhour and if workers benefited not only 
from these increases but also from the increases 
in their labor income resulting from training and 
subsequent upgrading, the share of national prod- 
uct going to labor would rise rather than remain 
constant; and it would rise at an average annual 
rate equal to the difference in rates of growth be- 
tween Y/L and Y/N. 

It should also be stressed that Y/L is not the 
appropriate productivity index for every occupa- 
tion of fixed labor quality but only for those close 
to the occupation of average quality. It is plausible 
that the higher the quality of labor in a particular 
occupation, the more rapid has been its growth in 


numbers employed. If this is so and if the condi- 
tions mentioned hold for each class of labor, the 
appropriate productivity index cannot be the same 
for every occupation, This conclusion also follows 
if the human capital resulting from education and 
other investments is thought of as homogeneous. 

On either basis, the appropriate index for un- 
skilled labor would be best approximated by Y/N; 
for occupations of average labor quality, it would 
be Y/L; and it would be something less than Y/L 
for occupations of high labor quality. This amounts, 
of course, to saying that the rate of return to invest- 
ments in education and the like, and therefore the 
average hourly earnings of skilled workers, would 
be expected to decline in relation to the earnings of 
unskilled labor, when the number of skilled workers 
increased more than the number of unskilled. 
Whether or not the rate of return declined abso- 
lutely would depend on the other determinants of 
earnings—the rate of increase in efficiency and the 
relative growth of tangible capital. 

When separate categories of labor are distin- 
guished, however, the assumption of neutrality 
must be questioned. If, for example, technological 
change has not been neutral but rather has tended 
to diminish the demand for unskilled labor signifi- 
cantly, the logarithmic-linear production function 
no longer summarizes fairly the determinants of 
the wages of unskilled labor, and expectations 
based on this function are no longer valid. Other 
qualifications, already mentioned, also need to be 
remembered; no fully adequate measure of L is yet 
available; and the quality of the labor force in most 
occupations in a progressive economy is improved 
over time, if only by the increase in literacy. 

Wage policy. The questions raised indicate 
some of the difficulties in the way of using a rela- 
tively simple productivity index as a summary even 
of the major factors affecting the trend of wages. 
The difficulties are greater when a productivity 
index is used as a general guidepost to short-term 
change in wages in an “incomes” policy—or, as it 
is sometimes put, as a guide to how wages would 
change in a competitive industry in a noninflation- 
ary economy. Some of these difficulties should be 
mentioned. 

First, the trend of national productivity is in- 
tended to serve as a guide to the rate of increase 
in wages from one year to the next, or over the 
brief period (two or three years) of a wage agree- 
ment, and not to the trend rate of increase in 
Wages. There is little ground for supposing that 
these would parallel one another even in a compet- 
itive, noninflationary economy. During the expan- 
sion phase of a business cycle, for example, the 
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average real wage—in competitive and noncompet- 
itive industries alike—usually rises more rapidly 
than its trend rate or than the trend rate of increase 
in productivity, During the contraction phase of a 
business cycle, the average real wage usually rises 
less rapidly than these trend rates. To apply the 
general wage guide only to less than fully competi- 
tive industries—letting wages in competitive indus- 
tries move freely—would mean imposing on the 
former “competitive criteria” that are competitive 
only in a very special, if not peculiar, sense. If the 
application were successful, wages in the industries 
affected would move differently from wages in the 
industries that remained free to respond to cyclical 
as well as other changes in supply and demand. 

Second, the reason for choosing national output 
per manhour (or per unit of labor input) even 
when the wage in a sector of the economy is under 
consideration is not because the efficiency and the 
capital per worker of the sector have no effect, or 
should have no effect, on the sector's wage. They 
do influence the sector's wage, even though they 
cannot be counted as a major factor in the long 
run. They belong among the “qualifications” that 
must always be attached to any short list of factors 
affecting wages in order to explain, or justify, a 
discrepancy between changes in the wages of a 
sector and changes in national output per man- 
hour, 

Third, and related to the second point, changes 
in technology, in tangible capital per worker, in the 
quality of labor, and even in the cost of living take 
time to work out their effects in the economy and 
make their impact on wages. This is true even in 
a competitive economy and even with such allow- 
ance as needs to be made for the influence of ex- 
pectations on the lags, No industry, nor the econ- 
omy as a whole, is ever in full equilibrium either 
in the present or in the base period with which 
the present is compared. 

Fourth, with government—and the taxes govern- 
ment levies, the transfers it makes, and the services 
it renders—economically so much more important 
now than in earlier periods, changes in the struc- 
ture and level of taxes and of government expend- 
itures also significantly affect the relation between 
wages and national output per manhour, at least 
in the long run. In addition, the difficulties of 
measuring wages and productivity are aggravated. 
This explains the custom in the United States of 
using indexes for the “private economy” to repre- 
sent national productivity, but the problem is not 
fully solved by these indexes. 

Finally, changes in the terms of foreign trade 
may properly be treated as one of the “qualifica- 
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tions” in the few countries in which foreign trade 
is of relatively minor importance. As a general 
tule, however, it is better to define productivity to 
take account of these changes, for it is national 
output, not domestic output, that is distributed in 
the form of real wages and other real income. 

To overcome some of the difficulties posed by the 
limitations of a general guidepost, recourse may be 
had to a rule providing for modifications or excep- 
tions. It is seldom clear, however, when an excep- 
tion is justified or—perhaps more to the point— 
when an exception is not justified. And there are 
problems in determining the permissible degree of 
deviation from the general guidepost. 

There are other troublesome points that arise 
when it is asked whether productivity can provide 
an adequate guidepost for wages. It should be evi- 
dent by now, however, that many assumptions are 
involved in applying national output per manhour 
in wage negotiation and determination and that 
they do not hold fully in every period and in every 
situation. 


Productivity and employment 


Productivity indexes are frequently used, along 
with other information, to estimate labor and other 
resource requirements. For example, future levels 
of national employment and unemployment have 
been calculated by (1) estimating the future out- 
put implied by certain assumed levels and condi- 
tions of demand, (2) extrapolating the past rate 
of increase in output per manhour, (3) dividing 
the output by the output per manhour to estimate 
manhour employment, (4) determining the ex- 
pected number of persons employed by assuming 
some level of hours per person, and (5) subtract- 
ing the expected employment from the expected 
labor force to estimate unemployment. If the un- 
employment so estimated seems likely to be 
excessive, a policy may be proposed to avoid it— 
by speeding up the increase in demand, or slowing 
down the increase in productivity, or reducing 
hours below what they would otherwise be. Similar 
calculations have been made of national require- 
ments of land, other tangible capital, water, and 
power, and of these and other requirements of 
particular industries. Of the questions raised by 
these procedures, the two discussed here are put 
in terms of labor requirements only. 

The first question concerns the adequacy of the 
productivity projections. As has been indicated, 
output per manhour may be viewed as determined 
by efficiency in the use of resources as a whole, the 
scarcity of labor relative to tangible capital (or 
total input), and the quality of labor. The direction 


of change of these factors can usually be foreseen 
with some confidence. As a rule, efficiency and 
labor quality may be expected to rise, and man- 
hours relative to tangible capital may be expected 
to decline. However, seldom is enough known about 
their expected rates of change to make possible a 
quantitative projection of labor productivity. What 
is usually done, therefore, is to assume a continua- 
tion of past trends and other patterns of change in 
productivity. The crucial point is whether these are 
sufficiently stable to yield reliable projections. 

To estimate future labor requirements, it is usu- 
ally necessary to take account of cyclical fluctua- 
tions. As we have seen, labor productivity generally 
rises much more rapidly during business expan- 
sions than during contractions. The rate of increase 
in national output per manhour in the United 
States between 1889 and 1960 averaged 3.4 per 
cent per annum during years of expansion in out- 
put and —0.6 per cent per annum during years of 
contraction in output. However, there was a good 
deal of variation around these averages, only part 
of which can be explained by differences in the de- 
gree of rise in output. Use of the average cyclical 
pattern of labor productivity, or of the average rela- 
tion of change in productivity to change in output, 
will help to improve the projection but can hardly 
eliminate all uncertainty. 

Even in making long-term projections of output 
per manhour, cyclical fluctuations must be kept in 
mind. The calculation may be seriously marred if 
the business-cycle phase at the beginning of the 
period used to determine the past trend is different 
from the phase at the end. Also, the business-cycle 
phase of the year from which the projection starts 
must be taken into account. The more important 
question for long-term projections, however, con- 
cerns the past trend. Can it be adequately expressed 
by a constant rate of growth, or must allowance be 
made for acceleration or retardation of the rate of 
growth or for less regular changes in trend rates? 

At one time it was supposed that output per man- 
hour tended to rise at a declining rate as an indus- 
try or economy grew. It does appear to be true of 
individual industries during their early careers. 
(This must be judged from the prices of their prod- 
ucts, for information on the output per manhour 
of young industries is usually lacking; but there is 
good evidence that the two are significantly—and 
negatively—correlated.) However, any systematic 
tendency toward retardation seems to disappear 
once the first period of development has passed. 

As for trends in national output per manhour, 
no evidence has been found of a tendency for rates 
of increase to decline systematically with the pas- 


sage of time. Indeed, speculation has veered in the 
other direction, and it is now more often asserted 
that national output per manhour has been accel- 
erating, at least in recent years. But it is not clear 
whether the current high rates of growth, men- 
tioned earlier, represent acceleration or simply a 
higher plateau. The rates may rise to still higher 
levels; or they may continue at present levels; or 
they may return to earlier levels, as they have often 
done. Even apart from cyclical fluctuations, then, 
and even assuming a continuation of past levels 
of rates of growth, trend extrapolations will be sen- 
sitive to the choice of the past period that provides 
the estimate of the future trend. Allowance must 
be made for rather wide margins of error. 

Our second question concerns the nature of the 
connection between labor productivity and employ- 
ment. During periods of persistent unemployment, 
for example, some of the blame for unemployment 
is often put on mechanization, automation, and 
the other sources of increase of output per man- 
hour, especially in industries in which output per 
manhour is rising rapidly, It is often implied that 
the more rapid the increase in an industry's output 
per manhour, the less rapid is the increase in the 
manhours of employment offered by it. It is true 
that the direct effect of a rise in an industry's out- 
put per manhour is to depress its employment. But 
there are also indirect effects. An exceptionally 
rapid rise in the output per manhour of an indus- 
try has usually caused the price of its product to 
Tise much less, or fall much more, than prices gen- 
erally; as has been mentioned, there is a fair degree 
of negative correlation between trends in output 
per manhour and trends in selling prices. In turn, 
the relative price decline has served to stimulate 
the industry's production and thus, indirectly, to 
maintain or even stimulate a rise in the employ- 
ment it offers. Indeed, long-term rates of growth 
of output per manhour in the industries for which 
British and U.S. data are available—largely the 
commodity-producing industries—are positively, 
not inversely, correlated with employment and with 
output, 

Other factors enter, of course. The elasticity of 
demand in response to changes in price varies con- 
siderably among industries. The elasticity of de- 
mand in response to changes in real national in- 
come—which is largely the result of changes in 
national efficiency—also varies among industries. 
High income elasticities of demand may help to 
explain the growth of employment in the service 
industries despite an apparent lag in output per 
manhour and a rise in selling prices relative to 
the commodity-producing industries. For industries 
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generally, at this stage of knowledge it seems safe 
to say only that especially high rates of growth in 
output per manhour are not necessarily accom- 
panied by absolute or even relative declines in the 
employment offered. More must be learned about 
this, as about other aspects of productivity. 


SOLOMON FABRICANT 


[See also AGRICULTURE, article on PRODUCTIVITY AND 
TECHNOLOGY; PRODUCTION; WAGES, article on THE- 
ORY.] 
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PROFESSIONS 


The development and increasing strategic im- 
portance of the professions probably constitute 
the most important change that has occurred in 
the occupational system of modern societies. The 
growth of the professions has brought to promi- 
nence a set of occupations which never figured 
prominently in the ideological thinking that, after 
having crystallized in the late nineteenth century, 
has tended to dominate public discussion in the 
twentieth. Professional men are neither “capital- 
ists” nor “workers,” nor are they typically govern- 
mental administrators or “bureaucrats.” They cer- 
tainly are not independent peasant proprietors or 
members of the small urban proprietary groups. 

As for so many categories of social status, the 
boundaries of the group system we generally call 
the professions are fluid and indistinct. There are 
many borderline groups whose professional status 
is, for one reason or another, equivocal. However, 
the core criteria within the more general category 
of occupational role seems to be relatively clear. 
First among these criteria is the requirement of 
formal technical training accompanied by some 
institutionalized mode of validating both the ade- 


quacy of the training and the competence of trained 
individuals. Among other things, the training 
must lead to some order of mastery of a gen- 
eralized cultural tradition, and do so in a man- 
ner giving prominence to an intellectual compo- 
nent—that is, it must give primacy to the 
valuation of cognitive rationality as applied to a 
particular field. The second criterion is that not 
only must the cultural tradition be mastered, 
in the sense of being understood, but skills 
in some form of its use must also be developed. 
The third and final core criterion is that a full- 
fledged profession must have some institutional 
means of making sure that such competence will 
be pit to socially responsible uses. The most obvi- 
ous uses are in the sphere of practical affairs, such 
as the application of medical science to the cure of 
disease. However, the skills of teaching and of re- 
search in the “pure” intellectual disciplines are also 
cases of such use. 

Thus the occupational complex we call the pro- 
fessions is organized about that element of the 
modern cultural system ordinarily called the intel- 
lectual disciplines—the humanities, and the sci- 
ences, both natural and social—and about their 
general significance in both modern societies and 
the cultural systems with which they articulate 
(Weber 1919a; Shils 1961; Ben-David 1963-1964; 
Parsons 1965). As this relationship between the 
intellectual disciplines and society has become in- 
stitutionalized, it has come to center in the two 
complexes of universities and research institutions. 
In the English-speaking world, especially the United 
States, the tendency has been for both functions to 
become primarily centered in the universities. In 
continental Europe, especially the communist coun- 
tries, the research function has more frequently 
become institutionalized in separate organizations, 
usually called academies of science. 

As the institutional structure of the professional 
world has crystallized, the university-academy 
complex has come to be its center. From this cen- 
ter its structure ramifies in two directions. The first 
direction concerns the professions’ involvement with 
elements of the cultural system other than the intel- 
lectual disciplines, as well as with these disciplines 
themselves. Historically, the most relevant area of 
the cultural system has been religion, but in the 
modern world the areas of the arts, and of moral- 
ity and ethics in their relation to ideology, are also 
involved. The second field of ramification concerns 
the application of knowledge—that is, of technical 
competence in the mastery and use of one or more 
of the intellectual disciplines or sectors of them— 


to practical affairs in which the interests involved 
are social and psychological rather than cultural 
as such. 


Two sectors of the professional system 


In the course of Western social and cultural de- 
velopment, the two fields of ramification described 
above have gradually become differentiated from 
each other. Accordingly, the core of the profes- 
sional system now lies in two areas: the institu- 
tionalization of the intellectual disciplines in the 
societal structure, and the practical application of 
these disciplines. Hence two primary categories of 
professions are unequivocally central to the modern 
system, 

First is the profession of learning itself. This 
is organized in terms of two primary functions: 
contributing further to learning through research 
and scholarship, and transmitting the learning to 
others, The others involved may be successors of 
the current generation of learned men, or “laymen” 
who value instruction in intellectual matters, or 
both. 

Second is the “applied” branch of the professions. 
Its historic focuses, as represented by the two fields 
of law and medicine, have been the regulation of 
order in society and caring for the health of so- 
ciety’s individual members. However, the modern 
list of applied professions includes a much wider 
range of fields, the boundaries of which are some- 
what uncertain. Engineering—the use of profes- 
sional levels of competence to control physical proc- 
esses—is the most obvious extension. Various 
others are reviewed below. 

Very broadly, we may distinguish these two main 
branches of the modern professional system in 
terms of the cultural primacy of the interests served 
by the academic branch, and of the social primacy 
of the applied branch. Safeguarding the health of 
individuals or controlling physical processes may 
be considered to constitute social interests, since 
they are activities that are generally carried out 
through socially organized groups and processes. 

Each of these spheres of primacy gives rise to a 
main type of problem in defining the limits of the 
Professional pattern. In the applied sphere, the 
Problem concerns the importance, for the practical 
Interests involved, of competence in culturally de- 
fined technical subject matter. However, no social 
System is only, or even primarily, a field for the 
implementation of the kind of technically specific 
Soal-interests that can ignore complex interrela- 
tions with nontechnical concerns. Hence, the ques- 
tion emerges as to whether the nontechnical con- 
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cerns that impinge on the professional function 
can be more or less neutralized so that the profes- 
sional expert need not concern himself too seriously 
with them. Answers to this question vary according 
to the nature of the functions involved and the de- 
grees to which they have been institutionalized in 
a differentiated social system. 


The cultural boundary 


The modern intellectual disciplines developed 
through a process of differentiation from a primar- 
ily religious matrix. If we go back far enough into 
the Middle Ages, all “learned men” were in some 
sense religious specialists. The autonomy of the 
secular intellectual disciplines crystallized in the 
Renaissance, yet anything like the full profession- 
alization of competence in them took some time 
and required the development of a variety of con- 
ditions. For the applied fields, the main problem 
raised by this process of professionalization has 
been a matter of the penetration of professional 
competence into fields of noncultural interest which 
had previously been treated by rule of thumb—that 
is, treated “empirically,” in the older sense of the 
word. For the primarily cultural fields, the problem 
has been the differentiation out of the diffuse cul- 
tural matrix of fields of special competence capa- 
ble of attaining autonomy—in religious matters, 
for example—relative to sources of more general- 
ized prestige and authority. 

In the Judaeo-Christian world the clergy is clearly 
the primary historical matrix from which the mod- 
ern professions have differentiated. The sense in 
which the clergy continues to be a profession, how- 
ever, is at least partly equivocal. Certainly, one of 
the most distinctive features of the Western cleri- 
cal tradition is that from the great writings of the 
rabbis and the fathers of the church down to the 
present it has continued to be a major component 
in the total body of intellectual disciplines. Hence, 
intellectual training has to some degree been a pre- 
requisite of clerical status, however, incompletely 
the ideal has been realized at various points. 

But in the modern world, the element of intellec- 
tual training is central. Only relatively “primitive” 
fundamentalist groups regard direct inspiration as 
adequate qualification for clerical functions. Al- 
though the clergy includes more specifically pro- 
fessional roles, particularly that of theologian, the 
central clerical role must be regarded as marginal 
to the professional system because the “application” 
of technical competence is only one part of the 
complex of its role components (Gustafson 1963). 
In two broad spheres the clergy have not had the 
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insulation from other modes of cultural and social 
involvement or from the responsibility that is requi- 
site to relatively pure types of professional roles. 
One sphere concerns the social aspect of the role, 
which in this case is pastoral in the broadest sense. 
It is widely recognized that the pastoral role in- 
cludes an important therapeutic component which 
in some ways overlaps with that of psychological 
medicine. However, the typical pastor is also the 
leader and administrator of a congregation with 
diffuse social responsibilities within the sphere of 
the parish as a collective system. These responsi- 
bilities are not grounded in any specific technical 
competence. The second sphere is that of authority. 
Clerical leadership and authority are grounded not 
so much in competence in specifically intellectual 
matters as in the diffuse religious and moral au- 
thority of the religious tradition as such. It is the 
charisma of clerical office which is the primary 
focus of the legitimation of the authority and in- 
fluence of the clergy rather than the exercise of 
competence in a sector of the system of intellectual 
disciplines [see CHARISMA; RELIGIOUS SPECIAL- 
IsTs]. 

Nevertheless, it remains of first importance that 
the clergy share with the more specific professions 
a basic anchoring in cultural commitments as dis- 
tinguished from the primary and direct service of 
some category of social interest, notably economic 
or political interests. 

Art and ideology. As the cultural system has be- 
come further differentiated in its articulation in 
society, two further focuses of institutionalization 
of roles have become prominent in addition to the 
more strictly professional and the religious ones. 
These involve (1) primacy of concern for expres- 
sive symbolism (especially for “the arts,” in the 
modern sense); (2) primacy of concern for the 
moral problems of the human condition, and par- 
ticularly of the society. The latter may perhaps be 
called the “ideological” focus. I think that both 
types of concern must be classed with the clerical 
one as marginal to the central professional com- 
plex, sharing certain central features with it but 
having involvements which, in different respects, 
are incompatible with full professionalization as 
defined here. 

Artists. One critical reference point regarding 
the degrees of professionalization concerns the ex. 
tent to which roles can or cannot be fully occupa- 
tionalized in the sense that the incumbent treats 
performance of the function as a full-time job— 
that is, as his primary job or responsibility, on 
which he can safely depend for income to meet not 
only his personal needs but very generally also 


those of a family. In this sense the committed art- 
ist, whether in literature, the visual arts, or music, 
has tended to oscillate between two main non- 
professional focuses of social organization. One 
alternative assimilates him to the economic entre- 
preneur in that he exploits opportunities to live 
from the sale of his product on some kind of mar- 
ket, and so to operate as an “independent” person. 
The other typical alternative has been that of 
patronage, whereby the artist as incumbent is spon- 
sored and supported by some individual or organi- 
zation so that he can do his own work without 
directly meeting the exigencies of independent con- 
tinuance. Of course, individuals have often been 
their own patrons, being artists as a side line while 
deriving their main subsistence from personally 
controlled sources such as independent property 
income or another job. 

The performing arts, particularly music, have 
come to be the most fully occupationalized by and 
large, the symphony orchestra probably being the 
most developed example. Here one may truly speak 
of the “professional musician,” but with the proba- 
bility that the skill component predominates con- 
siderably over that of intellectual mastery, impor- 
tant though the latter may be. When one moves 
from that focus toward the individual performing 
virtuoso or the creative composer, however, it be- 
comes decreasingly feasible to professionalize the 
role in our sense. Yet various forms of economic 
subsidy for such persons, and of validation of their 
political and prestige positions, are common. Such 
forms may, for the most part, be treated as de- 
rivatives of patronage institutions which have be- 
come impersonalized in various ways—for example, 
through the common European institution of pro- 
viding publicly for orchestral music, opera, and 
stage, or through the American use of philanthropic 
mechanisms to support (in both economic and 
other senses) those functions [see CREATIVITY, af- 
ticle on SOCIAL ASPECTS]. 

Intellectuals. In the modern world, concern 
with the expression of moral commitments and 
with their application to practical problems, social 
and otherwise, has to a considerable degree become 
differentiated in the function of ideology and insti- 
tutionalized as a primary concern of the groups 
rather loosely called intellectuals. In part because 
of the emotional fervor and sense of urgency sO 
characteristic of such groups, this concern is per- 
haps even more difficult to professionalize than the 
traditional clerical role. In terms of social organi- 
zation, the intellectuals have hence been in a posi- 
tion similar to that of artists. They have tended 
either to find some type of combined economic and 


influence market for their output, generally as au- 
thors of publications, or to depend on patronage. 
On occasion, this shades over into self-patronage, 
the ideologist himself becoming the major leader 
of a movement attempting to implement a moral 
position. 

Of course, the function of moral evaluation 
spreads through a great many types of collectivities 
and roles. For example, it has always been a major 
function of the clergy. The modern phenomenon 
is the emergence of secular moralists who attempt 
to ground their evaluative judgments in the intel- 
lectual disciplines as such. This particular category 
of ideologists or intellectuals constitutes a major 
boundary group of the modern academic profession, 
if by the latter is meant the groups who specialize 
in different sectors of the intellectual disciplines as 
such and in their more immediate uses in research 
and formally organized teaching. 

This very cursory and schematic review indicates 
very generally that what we are calling the central 
professional type of role and collectivity organiza- 
tion must, with respect to the involvement of its 
cultural component, be limited to the sector of the 
cultural system where the primacy of the values 
of cognitive rationality is presumed. Broadly, this 
is the case for the “profession of learning” in its 
various branches. The groups anchored predomi- 
nantly in religious commitments as such, in expres- 
sive symbolization through the arts, or in moral 
evaluation through ideology, approximate to the 
ideal type of professional organization, but much 
less closely and only in forms subject to important 
restrictions. 


Historic position of the university 


In view of the central—and in recent decades 
rapidly increasing—importance of the profession 
of learning in the total complex of the professions, 
an outline of the status of the Western university 
ìs an appropriate starting point for our discussion 
of the professional application of disciplined knowl- 
edge in the principal socially relevant fields. Both 
in general structure and in relation to the applied 
Professions, there has long been a striking contrast 
between the Continental and English systems of 
higher education, the latter in part determining the 
subsequent developments in the United States. 

The Continental system, which crystallized 
Toughly in the sixteenth century, was organized 
about the four faculties of theology, philosophy, 
law, and medicine. In England, there were for a 
long time only the two universities of Oxford and 
Cambridge, both organized into colleges, which 
were generally nonspecialized. This difference had 
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much to do with the fact that in England the center 
of gravity of professional development, especially in 
law and medicine, but also in science, lay largely 
outside the universities, whereas in Continental 
countries the university has throughout constituted 
the center of that development. However, the paral- 
lelism between the four original faculties of the 
European university clearly illustrates the sense in 
which the era of their predominance was one of 
only partial professional development. 

The universities, along with the systems of sec- 
ondary education which developed in that period, 
had the very important function of building up a 
formally educated class to underpin the hereditary 
aristocracy which played such an important role in 
the transition from medieval to modern society. In 
Roman Catholic Europe, of course, celibacy pre- 
cluded the clergy from becoming a hereditary class 
in any direct sense. At the same time, the character 
of the Roman Catholic church, with its special 
seminaries for training the clergy and its monastic 
orders, tended to minimize the role of university 
faculties of theology. In Protestant countries, how- 
ever, the theological faculties assumed particularly 
important functions both as the main carriers of 
the religious tradition and as the training ground 
for the most important elements of the clergy. Nev- 
ertheless, the limitations, described above, on the 
degree to which the clerical role can be predomi- 
nantly professional apply also to the place of the 
universities in the training and organization of the 
clergy. 

The diffuse cultural character of university tra- 
ditions with a strong emphasis on religion made the 
English universities generally into something rather 
more like theological faculties, and related impor- 
tantly to the failure of training in the applied pro- 
fessions of law and medicine to find a major place 
in the universities. In spite of some highly dis- 
tinguished scientific interludes, such as Newton's 
period of residence at Cambridge, the English uni- 
versities came primarily to combine being the fo- 
cuses of the theological traditions of the Church of 
England with being finishing schools for the hu- 
manistically educated sector of the general social 
elite. This lasted until the latter part of the nine- 
teenth century, when these universities were caught 
up in the more general interrelated developments 
of the intellectual disciplines and of the profes- 
sional occupations. 

In a very broad sense, the Continental law facul- 
ties were to the early modern state what theological 
faculties were to the Protestant church: legal edu- 
cation became the primary prerequisite of qualifi- 
cation for positions in the developing higher civil 
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services of the state. Hence, on the Continent gen- 
erally, private legal practice has always been some- 
thing of a junior branch of the profession relative 
to the civil service. In England, on the other hand, 
law faculties in this sense did not develop, nor did 
law come to have the same relation to public 
administration. 

The faculties of philosophy became the trustees 
of the broad humanistic cultural traditions, which, 
as a result of the Renaissance, had become insti- 
tutionalized on a new level. They also became in- 
volved in various ways—generally through a pat- 
tern that was heavily indebted to the humanistic 
tradition—in the education of the upper social 
groups. For all but the highest aristocracy, they 
gradually took over the functions that had been as- 
cribed to private tutors, some of whom had reached 
a very exalted intellectual level (Thomas Hobbes, 
for instance, served most of his life in such a ca- 
pacity). The line between universities and second- 
ary schools has been hard to trace. It has varied 
considerably in different sectors of Western society. 
But perhaps it is safe to say that for most of the 
period that is important for the early development 
of the professions only a minority of the European 
social and political elites were actually educated in 
the universities, but that by and large the groups 
who taught them were located there. Both human- 
istic scholarship and science remained relatively 
unprofessionalized, as matters of interest only to 
talented “amateurs,” and were until the important 
nineteenth-century expansion of the German uni- 
versities pursued largely outside the university 
framework. This expansion had certain rather spe- 
cial organizational features which precluded full 
professionalization, but it was a crucial precursor 
of the contemporary burgeoning of the university 
as the center of the whole professional complex. 

We thus encounter a historical paradox. Three 
of the four traditional European university faculties 
have been absolutely central to the institutionaliza- 
tion of the cultural traditions which have underlain 
the modern professional complex, But, at the same 
time, these faculties have been involved in a matrix 
of social organization that has prevented them from 
developing very far in the professional direction. 
Only in England did a private legal profession 
gain a firm foothold—mainly because it developed 
outside the university system. 

What, then, of the medical faculty? Its history is 
far harder to interpret than that of any of the other 
three faculties. Moreover, it is particularly impor- 
tant in regard to the place of the professions in the 
social system as such. Perhaps we can approach the 
problem initially by pointing out that both the theo- 


logical and the philosophical faculties are charac- 
terized by a clear cultural primacy, To be sure, they 
are social in that, since their cultural interests and 
activities are to be pursued in a disciplined manner, 
they must be socially organized. Further, I do not 
deny that both religion and secular humanistic 
culture are socially important. But I do suggest that 
both are of relatively diffuse orders of significance 
for the society as a system, especially in compari- 
son with their significance for the cultural system. 

By contrast, law is specifically social, being con- 
cerned with the intellectual grounding and systemat- 
ization of the normative bases of social order. For 
the status of law, including its professional con- 
cerns, a principal problem is its relation to govern- 
ment. Broadly, we may say that the Continental 
tradition has been to assimilate law very closely to 
government, whereas the Anglo-Saxon tradition 
has tended toward differentiating government from 
what I have elsewhere called, because of its differ- 
ent basis of order, the societal community. In terms 
of cultural content, the most important difference 
between the two traditions involves the status of 
private legal rights eithér against the government 
or in independence from it. In terms of social 
organization, the difference consists in the much 
more pronounced development in England of an 
independent legal profession that, in the course of 
the repercussions attending its development, has 
also attained control of access to the judiciary. 

In this perspective, medicine constitutes the prin- 
cipal historic reference point in Western culture 
for understanding and controlling the boundary re- 
lation between the social system and the concerns 
of the individual as both biological organism and 
personality system. The problem of the health of 
the individual is basically salient to the social sys- 
tem because health is a major condition of normal 
social participation, and incapacity for such par- 
ticipation is a central criterion of illness. The cen- 
tral position of the body—mind problem in Western 
philosophical thought is testimony to the critical 
importance of this area. 

In this area, too, the Renaissance and the early 
modern period that followed saw immense cogni- 
tive advances, such as the anatomy of Vesalius, 
Harvey's discovery of the circulation of the blood, 
the work of Linnaeus, and the invention of the mi- 
croscope. Along with these cultural advances, how- 
ever, the institutionalization of medicine as a pro- 
fession, like that of law, was beset with serious 
difficulties. A special one was the tradition that 
“gentlemen” (including practically all learned men, 
obviously) did not work with their hands, and 
hence were precluded not only from surgery, which 


was relegated to the barbers, but even from per- 
forming physical examinations. Only gradually did 
a basis develop upon which scientifically trained 
physicians could actually become the primary 
agents for dealing with the illnesses of individuals. 
On the whole, this basis also developed farther and 
earlier in England than on the Continent. It was 
not until well into the nineteenth century, however, 
that the career of the medical profession could be 
said really to have been launched [see MEDICAL 
PERSONNEL]. 


The discipline-profession complex 


On the cultural side, the triangle formed by the 
interrelations of philosophy, law, and medicine pro- 
vided in an important sense the points of differen- 
tiation for the main system of modern intellectual 
disciplines. In the first place, the differentiation of 
all three from the religious matrix was crucial. This 
was particularly the case with the humanities, since 
they formed the center of the philosophical complex. 
Law and medicine, then, constituted the framework 
of the practically applied complex. However, the 
status of the developing pure disciplines, with their 
practical implications, long remained highly am- 
biguous, After the Renaissance the natural sci- 
ences, especially physics (and astronomy), had 
various forms of anchorage in the academic system 
but were not unequivocally part of it. Thus the 
Royal Society was for a considerable period a more 
important center for the sciences than any univer- 
sity complex in England. Teaching or studying the 
practical applications of physical science knowl- 
edge found considerable difficulty in gaining accept- 
ance in the academic system. For example, the 
German technische Hochschulen (institutes of tech- 
nology ) were long denied the status of universities, 
including the right to grant doctoral degrees. In- 
deed, engineering developed at least as much from 
the “know-how” of the empirical crafts and the con- 
tributions of “inventors” as it did from the academ- 
ically formalized sciences [see ENGINEERING]. 

Even greater difficulty attended the development 
of the social and behavioral disciplines, which in 
one respect are anchored both in medicine and 
a law (with religion always in the background). In 
historical perspective, it is perhaps significant that 
John Locke, a principal father of the British em- 
Piricist tradition in philosophy and, hence, of both 
economics and psychology, was by main occupa- 
tion a physician. By the mid-nineteenth century, 
biological science had begun to reach maturity, and 
around the turn of the last century one could begin 
to say the same not only for economics and psy- 
chology but also for anthropology, political science, 
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and sociology. Of course, the discipline of history 
played a very important role in the development of 
these social and behavioral disciplines, since it 
served, among other things, as a bridge between 
them and the humanities. 

The circumstances just sketched relate in turn to 
the fact that the earlier applied professions, pre- 
dominantly law and medicine, first crystallized in 
the form of private practice. For law, the most im- 
portant field of this development was certainly 
England; the same was at least partly true for med- 
icine. In the United States in the early nineteenth 
century, law and medicine were organized very 
much in the English pattern; thus professional 
training, for instance, was organized almost wholly 
on the basis of apprenticeship. 

Relative dissociation from the university system 
removed the professional practitioner from the type 
of large-scale organization most closely related to 
the professions and placed him in the practical 
position of being a type of independent artisan with 
particularly high social status. These circumstances 
help explain why professional groups have tradi- 
tionally adopted a guildlike status, and why, in par- 
ticular, two major features have characterized their 
orientation. One of these features is the ideology of 
service which long distinguished the professions 
from the market-oriented business groups during 
the period when the latter were gaining increasing 
prominence. The other was the predilection, where 
complex organization did emerge, for an associa- 
tional pattern which differed both from the bu- 
reaucratic models that were becoming increasingly 
prominent in business as well as in government 
and from the types of strict market orientation 
which for a time seemed to dominate the economy. 
The American medical profession has been partic- 
ularly tenacious in its adherence to the pattern of 
individual fee-for-service practice and has tried, in 
ways which include diverse forms of regulation of 
the relations between members of the profession 
and their patients or clients, to discourage the in- 
volvement of physicians in various types of large- 
scale organization. 

The fully modern phase of the development of 
the professional system is peculiar to the present 
century—indeed we cannot foresee how nearly 
mature it has yet become, or what major devel- 
opments it may still hold in store. Its two major 
focuses are the development of the modern univer- 
sity, and the demand for and capacity to use uni- 
versity-level training over a very wide range of 
practical affairs. Since the organization of most 
practical interests increasingly involves complex 
organization, it is becoming less and less practica- 
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ble for a majority of professional personne] in any 
field to practice in the individual fee-for-service 
pattern. 

But if the involvement of professionals in com- 
plex organizations, both governmental and private, 
has forced very substantial modifications in the way 
in which professional services are run, the converse 
is just as true—and probably even more important. 
The involvement of high-level professional person- 
nel in most types of modern organization has been 
the occasion for major changes in the character of 
the organizations themselves. Most conspicuously, 
the predominance of the older type of bureaucratic 
“line authority” is no longer characteristic of any 
but a small set of large-scale organizations. The 
basically associational pattern of structuring rela- 
tions between professional peers—which has had 
to be extended to the relations between professional 
and executive personnel operating at similar levels 
in their respective hierarchies—has come to be of 
paramount importance in the modern type of for- 
mal organization (Parsons 1960a, part 1), 


The new university system 


In a way which few social analysts of even a half 
century ago would have predicted, the United States 
has become the spearhead of several major new 
developments in the field of education. First, it has 
developed, within a century, an essentially new 
type of university system, the keystone of which is 
the graduate school of arts and sciences. In part, 
this system was borrowed from the German philo- 
sophical faculty of the nineteenth century. But it 
was profoundly modified, precisely in the direction 
of professionalizing the faculty role. In short, the 
American system has been devoted to the primacy 
of teaching and research in the secular and pure 
intellectual disciplines, On the whole, this has freed 
the faculty members from their previous assimila- 
tion to the gentleman-scholar tradition of the edu- 
cated upper class—a tradition that has not disap- 
peared but has in general come to focus in the 
surviving undergraduate college. The typical pro- 
fessor now resembles the scientist more than the 
gentleman-scholar of an earlier time; the academic 
emphasis is now much more on achievement cri- 
teria and on reputation in a national and inter- 
national cultural forum than on locally defined 
Status. The graduate faculties, above all, have be- 
come the center of gravity not only of teaching, 
and hence of the mechanisms by which the aca- 
demic profession perpetuates itself, but also of the 
newly professionalized large-scale research func- 

tion. Research, then, has become the primary 
source of a major set of factors operating in the 


dynamics of the society as a whole, most conspic- 
uously in the field of technology, but increasingly 
proliferating far beyond that. 

Second, professional training in the United 
States, in sharp contrast with its English back- 
ground and with most contemporary socialist coun- 
tries, has been brought overwhelmingly within the 
university system. This began with the movement 
in the late nineteenth century to establish univer- 
sity law schools and medical schools on a new level 
of intellectual sophistication. This pattern has now 
extended so far that a major credential of a new 
applied profession is the acceptance of its training 
program within the university framework. 

A particularly important consequence of this 
development has been a complex pattern of inter- 
penetration between the faculties of arts and sci- 
ences, on the one hand, and the so-called “profes- 
sional” faculties, on the other. Thus, a university 
medical school today has a substantial number of 
basic scientists on its faculty who are not very 
different in training and type of competence from 
their peers in the arts and sciences departments. 
Furthermore, research has entered the professional 
school in such an important way that the school 
is no longer only, or even primarily, a teaching 
institution. Finally, it is highly important in this 
broad context that the much fuller acceptance of 
engineering in the academic complex has been ac- 
companied by the incorporation of engineering 
education into the more general pattern of the uni- 
versity system. For example, the Massachusetts In- 
stitute of Technology, from an undergraduate engi- 
neering school, has become a general university 
with an over-all scientific and engineering emphasis. 

Another important feature of the modern uni- 
versity is its insistence on covering, within its arts 
and science faculties, the whole range of the pri- 
mary intellectual disciplines, now generally grouped 
in the three categories of the humanities, the nat- 
ural sciences, and the social sciences. The much- 
discussed trend toward specialization has not re- 
sulted, except for a scattering of cases, in schools 
devoted only to one or another of these groups of 
disciplines, or to specific fields within any one. 
Thus we have not developed graduate schools of 
chemistry that have nothing to do with either phys- 
ics or biology. Moreover, the professionalization of 
the university complex has not led to an abandon- 
ment of the traditional undergraduate college, but 
rather to its strengthening and to its maintenance 
as an integral part of the university structure. To 
an increasing degree, all those destined for the 
higher occupational levels, whether in an academic 
specialty, a specialized applied profession, or 4 


nonprofessional role, are expected to undergo the 
common experience of an undergraduate college 
program. 

Relation to the practicing professions. The mar- 
riage of the university system to the practicing 
professions in the United States was spearheaded 
by medicine and proceeded more slowly and with 
greater difficulty in both engineering and law. In 
the medical case, the crucial stimulus was the 
bringing to bear of European scientific medicine on 
medical education and, with this, the emergence 
of new forms of practice. The primary spearhead 
was the new bacteriology, with its application, 
stemming from Pasteur and Koch, to the control 
of infectious disease, and its further extension, 
stemming from Lister, to asepsis in surgery. From 
this basis, the upgrading process has branched out 
into many fields of organic medicine, very promi- 
nently into pathology and more gradually into the 
psychological and, finally, the social realms. The 
most direct channel of developmental influence op- 
erated through the American medical scientists and 
entrepreneurs who studied in Europe, especially 
Germany, in the last third of the nineteenth cen- 
tury and brought their new knowledge back to this 
country. Particularly central was the Johns Hop- 
kins Medical School in its first generation. 

Engineering, except for the electrical and chem- 
ical fields, remained more or less “empirical”’—that 
is, it operated by rule of thumb—considerably 
longer than medicine, Indeed, the massive break- 
through that founded an entire family of applied 
professions for the physical sciences came with the 
spectacular success of the mobilization of science 
for the war effort of World War 11. To be sure, 
some of these applied professions had predecessors, 
especially in the applications of physics to electrical 
technology and of chemistry to dyes and explosives. 
However, the crucial scientific fields in this case 
seem to have been not only nuclear physics but, 
especially, electronics, with its close relation to the 
development of the information theory—cybernetics 
complex. This “new engineering,” if such a term 
may be used, has clearly dominated the mid-twen- 
tietH-century scene. 

The professions’ opportunities for impressive in- 
stitutionalization at this juncture have been linked 
with their involvement in certain levels of social 
structure. A major factor is clearly the anchoring 
of the university system in the structure of the so- 
ciety. This anchoring includes the university's finan- 
cial and other support, as well as the generalization 
of the belief that it can contribute something 
Special to the public welfare. At the same time, 
Medicine has developed an especially important or- 
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ganizational form, one which is critically different 
from individual private practice, namely, the teach- 
ing hospital. This is rooted in a complex variety of 
modes of social support—religious and philan- 
thropic, private and governmental. But it has pro- 
vided a sufficiently complex and flexible form of 
organization for the testing and working out of the 
highly complicated clinical uses of the new scien- 
tific knowledge. The affiliation of such hospitals 
with university medical schools has ensured that 
primacy is given to scientifically based standards 
of competence, while the other affiliations of these 
hospitals has provided them with a whole series of 
links with the wider community. It is crucial that, 
in the course of its development, the teaching hos- 
pital has moved from purely clinical concerns for 
the immediate welfare of its patients, through con- 
comitant emphasis on use of the patients and fa- 
cilities for the training of physicians, to a steadily 
increasing involvement with the research function. 

For engineering, the comparable institutional 
settings have been industry and certain spheres 
of government, notably the military. The develop- 
ment of industrial laboratories and, increasingly, 
those controlled by governmental agencies, such 
as the three national laboratories for research in 
atomic physics, have played facilitative roles some- 
what similar to the teaching hospitals in medicine. 
These developments clearly rest on the assumption 
that applied science has come to have a truly mas- 
sive practical payoff. It should be realized that this 
situation is altogether new in the present century. 

The major new phase in the development of law 
in relation to society is quite different but exceed- 
ingly important. In the latter part of the first half 
of the nineteenth century, legal thought became 
increasingly important on the American intellectual 
scene, somewhat replacing theology as a focus of 
interest. In its first major phase, it followed the 
English tradition in centering, above all, about the 
courts and so about the bases of judicial decision 
as represented by such figures as Chief Justice 
Marshall, Chancellor Kent, and Justice Story 
(Miller 1966). Toward the end of the century, 
however, the major trends in legal thinking were 
toward a new affiliation with the university law 
schools; the social sciences, too, were beginning to 
exert a more general kind of intellectual influence 
on law. On the judicial side, certainly justices 
Holmes, Brandeis, and Cardozo were especially in- 
fluential; on the academic, perhaps Roscoe Pound 
was the most important figure. 

This development has in turn been related to a 
practical situation in which, since the United States 
is a relatively pluralistic society, the legal aspect 


544 PROFESSIONS 


of the normative elements of societal integration 
has been able to take a certain precedence over 
direct governmental disposal of normative issues. 
Indeed the American constitutional system, com- 
bining federalism with the separation of powers, 
has favored this strong emphasis on law and on 
the role of the courts as the first-order agencies 
of its implementation, Moreover, the services of 
lawyers have been in very high demand in connec- 
tion with the complicated affairs of large business 
organizations and with the complex interrelations 
of governmental agencies involved with one an- 
other and with the private sector. 


Place of the newer applied professions 


As the differentiation of American society has 
proceeded, and problems of inclusion and upgrad- 
ing have become particularly prominent, the legal 
definition of rights and obligations has gained in 
importance. The most conspicuous example in this 
field is the use, by the National Association for the 
Advancement of Colored People (NAACP) and 
other civil rights organizations, of the legal path 
to the definition of the basic rights of its constit- 
uency—a process that, more than any other single 
factor, has precipitated the new revolution in 
American race relations. 

It is perhaps not too much to say that the new 
legal profession, with the strong involvement of its 
academic branch, has become the most important 
professional agency for implementing the moral 
consensus (incomplete as it is) of American so- 
ciety, in the first instance, but more broadly of 
modern society in general. The crucial focus, per- 
haps, is on the concept of right, which controls the 
area in which moral claims, in one of the most 
important spheres of interpenetration in all of 
social structure, fuse with governmental and other 
social obligations. Law is only partly an intellec- 
tual discipline, crucial as its intellectual component 
may be. Thus, in the present context, it is seen as 
the crucial link between the cultural system and 
the practical order of society. 

Law and medicine, then, have constituted the 
frame within which a more elaborate system of 
applied professions has begun to proliferate. On 
the cultural side, allowing for the complex involve- 
ments of the biological and physical sciences with 
medicine, and for the kinship between medicine 
and such fields as agronomy and horticulture, the 
base for this development has been the behavioral 
and social disciplines, which are barely a century 
old, and still far from maturity. Three main sectors 
of concern with the application of these disciplines 
may be very tentatively distinguished. At one pole 
stands the sector which focuses on the problems 


of the individual as a member of society. Here the 
intellectual point of departure is the developing 
discipline of psychology, which interpenetrates in 
a complex manner with the biological sciences. 
However, it necessarily ramifies into the more spe- 
cifically social disciplines, particularly sociology, 
through the involvement of the individual's person- 
ality in various aspects of social structure, espe- 
cially the kinship nexus. Psychology as an intellec- 
tual foundation underlies three main fields of 
partly developed professionalization. First is the 
therapeutic complex, of which the more psycho- 
logical aspects of medicine constitute the core. This 
complex focuses in the first instance on the indi- 
vidual—the processes by which he is socialized, 
and his personal adjustment. Second is the edu- 
cational complex, which is developing as a rami- 
fied profession in the areas where the learning 
process and its organization take precedence over 
the technical features of cultural content. Third 
is the social work complex, which fits into this 
context at a level of application very similar to that 
of education, All of these fields have practical 
antecedents, such as early medical practice, edu- 
cational common sense, and philanthropy, which 
are comparable to the antecedents of engineering 
in the traditional expertise of skilled craftsmen. 

At the other pole, closest to law, stand the at- 
tempts to apply the intellectual disciplines that are 
especially concerned with the processes of large- 
scale social systems. By far the most striking suc- 
cess in this area has been attained in the applica- 
tion of Keynesian and post-Keynesian economics 
to the management of societal-level economies, 
with special reference to the field of problems once 
called “the business cycle.” Very fragmentary pro- 
fessionalization has emerged in regard to some 
aspects of political processes and relationships, 
particularly in the sphere of international relations, 
and some problems relating to the social aspects 
of economic development. Continued concern with 
these areas may eventuate in fields of professional 
application comparable to those in economics. 
However, they are plagued by that functional 
equivalent of common sense that operates in large- 
scale, fluidly changing circumstances—namely, 
ideological concern, which can only gradually be 
brought under the control of a more strictly profes- 
sional type of intellectual discipline. 

In the middle sector of this range lies another 
complex of emerging professions that are broadly 
concerned with the management of complex or- 
ganizations. Particularly in the United States, but 
also in other nations, this complex has crossed the 
line between private and governmental sectors of 
social organization—a development that is ex- 


pressed in the phrase “business and public admin- 
istration.” The cultural base of this complex con- 
sists both in findings of the several social science 
disciplines that can be synthesized to bear upon 
this field of application, and in a rapidly growing 
technology of information gathering and process- 
ing which has become crucial to most responsible 
executives. 


Possible limits of professionalization 


Over the whole range of technical competence, 
including the early reference points of law and 
medicine, there are serious limits to, and possibly 
inherent limitations on, the extension of the process 
of professionalization, a process which in one 
aspect is almost synonymous with that of rational- 
ization, These limits have implicitly entered into 
many of the basic intellectual movements and con- 
troversies of this century. Thus law impinges on 
religious charisma through the problems of the 
ultimate legitimation of authority and legal order, 
not only in general but of specific types of author- 
ity under specific auspices. Similarly, the supposed 
irrationality of man’s so-called instinctual drives 
and interests is often said to be only very partially 
and precariously amenable to organization into any 
kind of orderly system. Other considerations arise 
at all points along the spectrum of areas of pro- 
fessional concern. Perhaps the organizational ex- 
ecutive in particular, because he must deal with 
so many diffuse intangibles (including the basic 
uncertainties of changing situations in complex 
environments), must always “play it by ear” in- 
stead of submitting to codified rules or definitions 
of essential technical knowledge (Barnard 1938). 
In the cultural sphere, it appears to be the special 
mission of the arts in the modern era to emphasize 
what is in the present sense the irrational com- 
ponent of the human condition. At any rate, the 
arts do seem at times to be opposing, in the name 
of freedom of expression, any attempts to reduce 
the emphasis that they place on irrationality. Per- 
haps it is permissible at this point to warn that 
even such basic distinctions as rational—irrational 
must be treated with a certain relativity. In current 
parlance, the relation between them is more likely 
to be one of dialectical opposition than of insoluble 
conflict. Moreover, the main historical trend cer- 
tainly seems to be extending the range of rationally 
ordered organization. 


Leadership status of the professions 


It is my view that the professional complex, 
though obviously still incomplete in its develop- 
ment, has already become the most important 
single component in the structure of modern so- 
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cieties. It has displaced first the “state,” in the 
relatively early modern sense of that term, and, 
more recently, the “capitalistic” organization of the 
economy. The massive emergence of the profes- 
sional complex, not the special status of capital- 
istic or socialistic modes of organization, is the 
crucial structural development in twentieth-century 
society. 

Every major innovation in the organization of 
societies involves a new leadership element—in 
structural rather than personal terms—which 
tends to precede that innovation’s massive institu- 
tionalization. For the modern state, the critical 
leadership element seems to have been institution- 
alized aristocracy. Kings and their lineage were 
the hereditary focuses of what, in terms of our 
sketch, was basically the extension, by legal means, 
of rationalized order through the major politically 
organized units of Western society. Royalty, alone, 
without backing from a broad aristocratic system, 
could not have accomplished this. 

In organizational terms, the capitalistic aspect 
of the industrial revolution was that it “democra- 
tized” the royal—aristocratic system. Careers came 
to be open to talents; the hereditary basis was no 
longer legitimized. But the famous capitalistic 
business firm was a “petty monarchy,” with the 
position of its proprietor being legitimized in terms 
of his services to his lineage. In turn, the lineage 
shared in the benefits of the proprietor’s success 
through the legitimacy that it afforded him as its 
head, and especially through the mechanism of 
inheriting and bequeathing property. 

The differentiation of ownership from control, 
which became evident in the large-scale sector of 
American “capitalism” in the 1920s (Berle & Means 
1932), was comparable to the democratization of 
the national state, an institution that has under- 
gone many vicissitudes since it first became salient 
in the Age of Revolution at the end of the eight- 
eenth century. Since that period, the ultimate sup- 
port of organizational order and authority has had 
to rest on some kind of nonhereditary, if not demo- 
cratic, basis, although legitimation was general- 
ized to more universalistic levels in such concep- 
tions as the “national” or, better, the “public” 
interest. 

The old leadership element was clearly the 
“capitalistic” upper class—or, as I have put it 
elsewhere, the “decentralized” aristocracy—which 
could also legitimate its status in achievement 
terms. The new leadership element is based on 
cultural criteria of legitimacy rather than criteria 
of political power or economic success. Given the 
general secularizing trends of modern society, these 
criteria could not be religious in the older sense. 
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They have had to command both moral and intel- 
lectual authority, while being at the same time 
religiously pluralistic. 

One possible contender for leadership is the 
“artist”—of course, as category not individual. His 
leadership may be characteristic of certain devel- 
oping societies, but it cannot provide an adequate 
basis in the organization of technical functions for 
the modern social world. As a result, whatever may 
happen in the long run, at least a temporary lead- 
ership status seems to have settled on the repre- 
sentatives of the intellectual disciplines. Clearly, 
the primary organizational locus of these disci- 
plines in the modern world is the university. The 
fundamental origin of the modern professional 
system, then, has lain in the marriage between the 
academic professionals and certain categories of 
practical men. These latter have taken responsi- 
bility, on a basis more of specialized competence 
than of a diffuse religious or ideological legitima- 
tion, for a variety of operative functions in the 
society. 

We do not know what lies in store for the next 
phase of professionalization. I suggest, however, 
that the professional complex has already not only 
come into prominence but has even begun to domi- 
nate the contemporary scene in such a way as to 
render obsolescent the primacy of the old issues of 
political authoritarianism and capitalistic exploita- 
tion. Indeed, most of our sensitive contemporary 
dissidents stress quite other themes, such as alien- 
ation. 

TALCOTT Parsons 


[Directly related are the entries CIVIL SERVICE; ENGI- 
NEERING; FINE ARTS, article on THE RECRUITMENT 
AND SOCIALIZATION OF ARTISTS; INTELLECTUALS; 
Journatism; Law, article on THE LEGAL PROFES- 
SION; MEDICAL PERSONNEL; MILITARY; OCCUPA- 
TIONS AND CAREERS; RELIGIOUS SPECIALISTS; SCI- 
ENCE, article on SCIENTISTS; SOCIAL WORK; 
TEACHING. Other relevant material may be found 
im BUREAUCRACY; EDUCATION; KNOWLEDGE, SOCI- 
OLOGY OF; MEDICAL CARE, article on SOCIAL ASPECTS; 
STATUS, SOCIAL; UNIVERSITIES; and in the biogra- 
phies of WEBER, MAX; ZNANIECKI.] 
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PROFIT 


Profit is normally held to be the amount by 
which the total revenues of an enterprise exceed 
its total costs. There has been much confusion in 
the literature over terminology, however, and the 
first task here must be to clear up that confusion. 
It stems, I think, from the changes which have 
taken place with the development of the small, 
owner-managed, competitive firm of the classical 
period into the great corporation of today. In con- 
temporary corporate enterprises the functions of 
the plant manager, who works for wages, of the 
capital supplier, who receives either interest or 
dividends, and of the entrepreneur, who makes the 
strategic decisions, must be distinguished. In this 
article profit will be treated as the return to the 
corporation determined by entrepreneurial success. 


History of the concept 

In the days before the modern corporation, when 
the typical business concern was managed, owned, 
and directed by one man, his reward included the 
wages of management, the return on capital, and 
the gains of entrepreneurship. If wages and raw 
materials were the total costs, all the rest was 
lumped together as profit. Thus, Adam Smith, in 
the Wealth of Nations (1776), speaks of the 
“profits on stock.” In Part 1, Chapter 9, he says 
that “high profits will eat up rents” and indicates 
that the reward to capital is a real cost, which 
must be paid to elicit or maintain the supply. In 
the same passage, Smith distinguishes between 
profits and interest. The former is the reward to 
“stock,” or capital, the latter the price paid for 
borrowed money, i.e., for loanable funds. 

Ricardo, in his Principles (see Works and Cor- 
respondence, vol. 1), normally treats profit as a 
real cost, which must be paid to maintain the sup- 
ply of capital, although sometimes, as Knight 
(1921) says, it is treated as a residual like rent. 
Whatever the confusion in the Principles, in his 
celebrated “Essay on Profits” (“An Essay on the 
Influence of a Low Price of Corn on the Profits of 
Stock”; see Works and Correspondence, vol. 4, pp. 
9-41) Ricardo clearly defines profit as the return 
to capital and its organization in production, i.e., 
its combination with the other factors. 


548 PROFIT 


Alone among the classical writers, Marx (1867— 
1879) has an explicit theory of profit. Marx offers 
a simple, two-sector model with two inputs, labor 
and capital. Labor receives a subsistence wage, 
which according to the Marxist definition equals 
the value of the commodity. The market value, or 
price, exceeds true value, and it is from this “sur- 
plus value” that profit is obtained. (The rate of 
surplus value calculated only on circulating capital 
differs from the rate of profit calculated on all cap- 
ital.) Since the only input other than labor is cap- 
ital, for Marx “profit” is a return on capital. It 
seems, from his discussion, to include returns to 
entrepreneurship, but such returns are purely and 
simply returns to the entrepeneur as the controller 
of the capital input. Marx’s theory of profit has 
been called an exploitation theory. He believed that 
the wage earner will receive only a subsistence 
wage and that all the rest, the residual, is profit 
and is a return to capital which is earned because 
of its bargaining power. Because ownership en- 
ables the capitalist to employ labor, he is able to 
drive a very hard bargain and keep the wage earner 
to a subsistence wage. The ownership of the means 
of production is the clue to Marx’s theory of profit. 
This theory might equally well be called a bargain- 
ing-power theory of profit. However, the theory takes 
no account of the distinct contributions of the cap- 
ital supplier and the entrepeneur, In fact, the two 
functions are considered identical or are comprised 
in the same input. The theory may bear a superfi- 
cial resemblance to that of the English classical 
school, but we should note this difference: for the 
English classical writers, up to and including Mill, 
the reward to capital was a real cost, that is to say, 
a cost which must be paid if the supply of capital 
was to be maintained. It was a reward variously 
described as a reward for waiting, for abstinence, 
or for saving; in any case, it involved a real psychic 
cost. This was not so for Marx. The total reward 
is a residual and is the amount of surplus value 
that could be taken from the worker, by reason of 
the employer's superior bargaining power, over and 
above the true value of the product, which is meas- 
ured by the subsistence of the laboring class, 

In Marshall (1890) the confusion disappears. 
In the stationary state, under conditions of compe- 
tition, the wages of labor, the reward (now called 
interest) to capital, the wages of management, and 
the replacement of capital are all costs. “Normal 
profits” are made up of the payment to the compet- 
itive owner—entrepreneur, i.e., interest on capital 
invested and wages of management. “Abnormal 
profit,” a surplus over total costs, is ephemeral in 
a competitive economy, Profit is therefore a surplus 


attributed to abnormal circumstances, such as lack 
of competition or dynamic and unanticipated 
changes. In the stationary competitive system there 
is no place for the forward-looking, risk-taking 
entrepreneur and, thus, no place, except by acci- 
dent, for profit. 

The Continental school (Pareto, Walras, and 
Wicksell) similarly make no place in their models 
of general equilibrium for any profit residual, Each 
input is paid the value of its marginal product, and 
the sum of the marginal product shares exhausts 
the total product. These general equilibrium models 
are based on severe restrictions: in particular the 
production function is assumed to be linear and 
homogeneous of the first degree [see PRODUCTION]. 
The models are static and competitive. Among 
more contemporary writers the significance of the 
Walrasian model is shown in the simplified equa- 
tion system of Phelps Brown (1936) and in 
Schumpeter’s circular flow equilibrium (1912, 
chapter 1), Profit could not appear in these models 
of general equilibrium unless there was a disturb- 
ance of the static system or a disturbance of per- 
fect competition. This is made clear when Schum- 
peter shows that profit within his system can appear 
only with the emergence of his entrepreneur, that 
aggressive innovator who introduces improved 
techniques of production or in other ways alters 
the production function. 

Some modern authors (such as Joan Robinson 
and Kaldor) use simple two-sector, two-factor 
models and have only labor and capital as their 
inputs. Like the classicists and Marx, they speak of 
only two shares of the national income, namely, 
wages and profit. Profit in these models is, conse- 
quently, the return to capital. This may be a con- 
venient usage, but by not permitting the important 
distinction between the reward to capital and the 
reward to enterprise, it leaves the models without 
the dynamic of the expectational entrepreneur. 


Contemporary views 


Let us turn now to the question of the source of 
profit, its nature, and its function in the private 
enterprise system of contemporary Western society. 
One is considering here the modern corporation, in 
which the functions of the plant manager, the 
owners, and the entrepreneur are distinct. The 
entrepreneur makes the major policy decisions and 
must make these decisions in the present although 
they will bear fruit only in the future. The decisions, 
therefore, are made under conditions of genuine 
uncertainty. 

The writers of the contemporary period were nat- 
urally at first under the strong influence of the 


neoclassical school in England, of the Continental 
writers to whom we have referred, and of John 
Bates Clark in the United States. No profit could 
be earned under the conditions of equilibrium. 
Whether equilibrium was defined as the equilibrium 
of the firm and the industry or whether it was 
defined, in the Continental manner, as the equilib- 
rium of the system as a whole, there was perfect 
foreknowledge. Because there was perfect compe- 
tition, there were no surpluses owing to monopoly, 
apart from Ricardian rents which might be earned 
by those monopolists who controlled a resource so 
scarce that its supply curve was perfectly inelastic. 
Hence, if one was to talk about profit, one had to 
turn one’s attention to conditions within the mar- 
ket which were not those assumed by the neoclas- 
sical writers. Schumpeter had to introduce into the 
circular-flow equilibrium that rude fellow, the in- 
novating entrepreneur, a man who foresaw the 
possibility of profit (even though, under competi- 
tive conditions, it was ephemeral) by altering the 
production function in one way or another, whether 
by a technological improvement or by improved 
techniques in management or improved techniques 
of selling [see ENTREPRENEURSHIP]. 

Joan Robinson (1933) and E. H. Chamberlin 
(1933) showed that profits may be earned and re- 
tained by the entrepreneurs who succeed in achiev- 
ing a privileged monopolistic position in the 
selling market, or, in the case of Robinson, a monop- 
sonistic position in the factor market. Lerner (1944), 
Triffin (1940), and Kalecki (1954) also identi- 
fied the amount of profit with the degree of monop- 
oly. Finally, we may note that Keynes (1930) 
attributed the incidence of profit to a lack of equi- 
librium in the system as a whole, that is to say, 
in his context profit is an accidental windfall oc- 
casioned by temporal changes in the price level 
as a whole. In summary, one may say that all the 
writers of this period were still strongly under the 
influence of the equilibrium models of the nine- 
teenth-century writers. They attributed profit to 
various types of imperfections in the static compet- 
itive models of their immediate predecessors. 

It was Knight (1921) who created an eclectic 
theory of profit under which all the previous theories 
could be subsumed. He treated profit as a residual 
return earned by the entrepreneur as a result of 
correct decisions taken in the present to bear fruit 
sometime in the uncertain future. Distinction must 
be made between risk and uncertainty, Risk exists 
when there is uncertain knowledge of a future re- 
sult of a particular action but when there are ade- 
quate data to make a probability calculation. For 
example, if one is building a new factory, one may 
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provide against loss through fire before it goes 
into operation by taking out insurance. A risk can 
always be insured against because there is a suffi- 
cient sample from previous experiments to permit 
an actuarial calculation. Uncertainty, by contrast, 
exists when the present decision is made under 
conditions where no probability calculus is possible 
—when, therefore, the decision maker has to rely 
upon his own judgment, his intuition, if you will, 
as well as on whatever data he may accumulate in 
order to make his decision, (Some mathematicians 
maintain that the difference between risk and un- 
certainty is one not of kind, but of degree, depend- 
ing only on the amount of data available for a 
sample for experimental purposes. The point can- 
not be debated here, but we may accept a difference 
in the degree of experience as adequate to sustain 
the argument presented in this discussion. ) 

Knight’s entrepreneur will reduce the area of 
uncertainty as much as he can. He will insure 
against risk; he will make forward contracts wher- 
ever they are possible; he will accumulate all the 
data he can. There remains, however, an area in 
which true uncertainty obtains. There may remain 
some elements of cost about which he can make 
no certain calculations. More likely, the strategies 
of his competitors cannot be exactly estimated. 
Decision is peculiarly the responsibility of the entre- 
preneur. If he is, on balance, proved by events to be 
right in his decisions, a profit will result. If, on the 
contrary, he is repeatedly in error, the concern will 
experience negative profits, or losses, and under 
these circumstances the entrepreneur is not likely 
to continue in his decision-making capacity. Briefly, 
we may conclude that the entrepreneur is that 
person—either an individual or a collective—who 
makes decisions under conditions of genuine un- 
certainty, the results of which give rise to profits. 
These profits may come as a result of decisions 
concerning the state of the market or decisions 
which result in increasing the degree of monopoly 
or decisions with respect to the forward holding of 
liquid stocks that give rise to windfall gains or 
decisions taken about the introduction of new tech- 
niques or innovations that, if successful, will give 
rise to innovators’ profits. Thus, all types of profits, 
whether their origin be in the market structure, 
in general movements of the price level, in changes 
in government, commercial, fiscal, or monetary 
policy, or in successful innovation, may be sub- 
sumed in this general concept of profit as the return 
to the entrepreneur for successful decision making 
under conditions of uncertainty. 

If this is agreed, we have an answer to our ques- 
tion as to the origin of profit. We have now to ask 
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how the entrepreneur makes this decision. This 
question is the subject of current debate among 
economists. We may distinguish various answers. 
There is the theory that the entrepreneur does 
make some kind of probability calculation, the best 
possible he can make in the absence of adequate 
data about previous like situations. Hart (1940) 
may be considered representative of this school. 
Others who may be said to belong to the same 
school have added to the probability view the cal- 
culus drawn from the theory of games (von Neu- 
mann & Morgenstern 1944). They have argued 
that it would be possible for the entrepreneur to 
decide upon his own strategy after having made 
a calculus of the possible strategies and responses 
of his competitors. Still others believe that decision 
making will be largely determined by the data 
fed to the entrepreneur by his various chiefs of 
bureau, that is to say, his various technical ad- 
visers, his engineers, his accountants, and those 
who undertake market surveys. Some, also, have 
argued that forward contracts enable the entrepre- 
neur to make his crucial decisions with the min- 
imum degree of uncertainty and that profit, in con- 
sequence, is a kind of accident which, if one may 
be controversial, can be attributed only to the in- 
eptitude of the entrepreneur. To the present writer 
it seems as if all these groups have ignored the true 
meaning of “uncertainty,” which, if we have cor- 
rectly defined it, rules out these various methods 
of giving to the entrepreneurial decision a purely 
rational quality. [See DECISION MAKING, article on 
ECONOMIC ASPECTS,] 

Another school of thought, which is usually con- 
nected with the name of G. L. S. Shackle (1949; 
1958; 1961), believes that the entrepreneurial de- 
cision partakes of an intuitive element. Shackle 
goes so far, in one context, as to compare the entre- 
preneur with the artist rather than the scientist. For 
Shackle, the crucial or strategic decision is neces- 
sarily made under conditions of genuine uncer- 
tainty. No probability calculus is possible, whether 
directly or by use of game theory. Forward con- 
tracts serve only to narrow the critical area of un- 
certainty. Within that area the entrepreneur con- 
fronts a series of hypotheses about future results 
of his decision. Some of these hypotheses will be 
favorable, some unfavorable; some will be highly 
surprising, some not so surprising. As these hypoth- 
eses are ranged in the decision maker’s view, two 
will command his attention. One of these (“focus 
loss”) will appear to him to be the hypothesis with 
the minimum surprise—loss combination, and the 
other (“focus gain”) the one with minimum sur- 
prise-gain combination. Since the hypotheses are 


not additive in either the favorable or unfavorable 
range, choice or decision is made between these two 
focuses of attention (“attraction” is Shackle's 
word). A positive decision results only when the 
focus gain outweighs the focus loss. This view, 
while it has been criticized or amplified by many 
contemporary writers, has acquired a considerable 
following. [See Economic EXPECTATIONS.] It must 
not be supposed, however, that it has come to be 
part of “received theory.” We have shown that many 
contemporary economists, while taking their lead 
from Knight, have preferred a more rationalistic 
theory of entrepreneurial decision making. 

The problem of decision making, whatever may 
be thought in economics, is a wide question and 
has been examined in recent years by psychologists, 
sociologists, military strategists, and others. [See 
DECISION MAKING.] 

It is clear that the above approaches imply that 
the function of profit is to reward the decision 
maker. A subsidiary function is to provide a source 
of new investment funds for the widening or deep- 
ening of capital. Perroux (1926; 1957), among 
others, looks on profit partly as the earning of 
monopolists but principally as a source of savings 
for new economic growth. Perroux’s emphasis is 
on economic development and the sources of cap- 
ital supply. His colleague Jean Marchal (1951) has 
advanced what we might call an institutional 
theory of profit. For Marchal profit is residual, that 
is, its amount or, more properly, its rate depends 
in part on institutional factors, e.g., the social ac- 
ceptance of profit in public utilities as compared 
with that of profit in highly speculative enterprises. 
It is not possible in a brief compass to do justice 
to Marchal, but it seems that his contribution can 
be comprised in an uncertainty theory. 

The position that profit has the function of 
serving as the reserve fund for capital investment 
and, thus, as a principal source of funds for eco- 
nomic growth depends crucially upon certain as- 
sumptions. We have to consider corporation policy 
with respect to retained profit as compared with 
the decisions of shareholders with respect to the 
disposal of their dividends (one may refer to the 
works of W. C. Hood [1959] and R. M. Solow 
[1963]). In this context we have only to note that 
many modern writers attach a great importance to 
Corporations’ retained profits as a source of new 
investment. This proposition depends, however, on 
certain assumptions about the use of retained profit. 
If the corporation uses its surplus funds or its re- 
tained profit for further investment, in the proper 
sense of the word (that is to say, for increasing its 
capital structure for productive purposes), then 


the corporation surplus fund is one source for new 
capital development. If, however, the corporation 
tends to hedge by holding a portfolio of various 
stocks and even government bonds, we cannot re- 
gard retained corporation profits as a source of 
new venture capital. The attitude of corporations 
toward the use of retained profit must be compared 
with that of shareholders. If dividends exhausted 
the total profit of a corporation, then dividends 
would be reinvested through the capital market and 
would presumably be redirected by the price mech- 
anism into the most remunerative of new employ- 
ments of capital. Consequently, there is a case in 
favor of the complete distribution of all profit. It 
is not maintained that profit distributed to share- 
holders will necessarily be invested in a manner 
approximating the optimum allocation of capital 
more closely than would be done by the corpora- 
tion’s allocation of retained profit. It is simply sub- 
mitted that, contrary to the opinion of some authors 
who have written about the capital market, there 
is a case to be made in favor of the full distribution 
of profits to the shareholders of the corporations. 


Measurement of profit 


We must now consider the problem of the deter- 
mination of the amount or rate of profit. From 
what we have already said, it appears that profit 
may be treated as a residual left over after the 
other costs of production have been paid. This 
proposition, however, leaves us with an unsatisfac- 
tory theory of income distribution. As Knight 
(1935) has said, the treatment of profit and rent 
as residuals leaves us with no rigorous determina- 
tion of the difference between one residual and the 
other, It is necessary to distinguish profit from any 
other income share. We do not believe that this can 
be done by marginal productivity theory, which 
depends on the assumption that the input factor 
is completely divisible and is homogeneous in qual- 
ity. The entrepreneur by definition is indivisible, 
and entrepreneurs are not homogeneous as to qual- 
ity. Thus, marginal productivity theory, in the 
Strict sense, does not apply, although it is perhaps 
correct to observe that the entrepreneur who earns 
a profit is apparently more productive in value 
terms than his less gifted competitors. 

It has been argued by some that the amount of 
Profit is a measure of degree of monopoly (Kalecki 
1954; Lerner 1944; and others), The converse 
Proposition would be that the degree of monopoly 
could serve as a measure of profit. This proposition, 
however, is unacceptable. From what has already 
been said, it would appear that monopolistic con- 
ditions reduce the degree of uncertainty, and one 
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would expect that the profit sufficient to attract in- 
vestment under conditions of monopoly would be 
less than that needed to attract investment under 
conditions of dynamic competition. Such indeed 
appears to be the fact. Empirical studies of profit 
rates do not show that profits increase with the 
degree of monopoly. (It is only fair to admit that 
they do not show, on the other hand, that firms in 
monopolistic competition earn markedly lower 
profits than firms under conditions of competition. ) 
It would seem likely that the amount and rate 
of profit are determined by the degree of uncer- 
tainty. If the previous argument is accepted, the 
more remote the time horizon for the planning 
period, the higher the degree of uncertainty which 
must attach to all hypotheses. Thus, we may argue 
that the degree of uncertainty is a function of the 
length of the planning period. It would follow that 
the greater the extension of the time horizon, the 
higher must be the expected profit if a favorable, 
that is to say, a positive, decision is to be made. 
Some economists would like to argue that bargain- 
ing power between the entrepreneur and the cap- 
ital suppliers, on the one hand, and between the 
corporation and the trade union, on the other hand, 
is also relevant to the determination of the rate of 
profit. However, in the eclectic theory put forward 
above, it would appear that such questions of bar- 
gaining power form a part of the entrepreneurial 
expectations or hypotheses with respect to the 
future results of present decisions, We may, there- 
fore, conclude that the proper measure of profit is 
the degree of uncertainty, and if we wish to give 
a quantitative measurement to such degrees of un- 
certainty, we may with some confidence do so by 
the measurement of the extension of the planning 

period to the time horizon, 
B. S. KEIRSTEAD 


[See also the biography of KnicHT.] 
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“Programming”—more properly, “mathematical 
programming,” to be distinguished from “computer 
programming”—is a term applied to certain mathe- 
matical techniques designed to solve the problem 


of finding the maximum or minimum of a function 
subject to several constraints that are usually ex- 
pressed as inequalities. In itself, mathematical pro- 
gramming has no economic content, but mathe- 
matical programming problems arise frequently in 
the context of economics and operations research, 
where inequality restrictions are often encoun- 
tered. A large variety of programming formulations 
exists, and mathematical programming has been 
applied in a considerable number of concrete cases. 
The existence of large electronic computers has 
made the actual computation of solutions to mathe- 
matical programming problems fairly routine. 

Because of theoretical and computational differ- 
ences among various programming formulations, 
it is convenient to classify programming problems 
according to whether they are linear or nonlinear 
and according to whether they are discrete or con- 
tinuous. This article focuses mainly on linear 
continuous, nonlinear continuous, and linear dis- 
crete (integer) programming, Baumol (1958) pro- 
vides an elementary introduction to these topics, 
and Dorfman, Samuelson, and Solow (1958) dis- 
cuss them at an intermediate level. 


Linear programming 

Formulation. As an illustration of a linear pro- 
gramming problem, consider the case of an entre- 
preneur who may produce any or all of n commodi- 
ties, the produced amounts of which are measured 
by xı, Xz, ***, Xn, Assume that the profit per unit 
of the ith commodity is constant and is given by c; . 
Assume, further, that the entrepreneur has m re- 
sources (land, labor of various kinds, raw materials, 
machines) available to him, their amounts being 
measured by b,, bs, +++, Dm. Assume, finally, that 
the activity of producing one unit of the ith com- 
modity requires (or “uses up”) a units of resource 
1, a.; units of resource 2, and so on. The problem 
of choosing production levels for each commodity 
in such a manner that total profit is maximized 
can then be stated as follows: maximize 


(1) Z= CX, + CaXa + +++ + OnXn, 


subject to 


Gy, Xi + Aya Ky + oes FAX, Sb, 


(2) oy X, + sy Xp + +++ + Asn Xn S bs, 


Amr + AmXa + ++ + Andy S bm; 
(3) x,20,x. 20, +++, x, 20. 
Equation (1), referred to as the objective function, 


represents total profit in this example. Inequalities 
(2) express the requirement that for each of the m 
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resources the total amount of a resource used up 
in the production of all commodities must not ex- 
ceed the available amount. Inequalities (3) express 
the requirement that production levels be nonnega- 
tive. Equation (1), together with inequalities (2) 
and (3), is called a linear program, The term 
“linear” refers to the fact that both the objective 
function and the inequalities (2) are linear. 
Linear programs in which the x; measure the levels 
or intensities at which productive activities are 
operated are sometimes called activity analysis 
problems. In compact matrix notation the above 
linear program can be written as follows: maximize 


x, 
subject to 
Ax = b; 
xz. 


(Here the symbol “S” is used to indicate that one 
matrix is less than or equal to the other, element 
by element, without any requirement that strict in- 
equality hold for any component. The definition of 
“=” is analogous.) 

A set of x-values (x9, x2, -+:,x°) is called a 
feasible solution if it satisfies inequalities (2) and 
(3). If the set of all feasible solutions contains an 
element for which (1) attains a maximum, then 
that element is called the optimal solution—or 
simply the solution—of the linear program. The 
solution need not be unique. 

The inequalities (2) may be rewritten as equali- 
ties by introducing m new nonnegative variables 


Vi, ***, Um and rewriting the constraints as 
Oy, Hy + yey Ht + Ain Xn + V =b,, 
(2) pats + Gest eee ee +u = a à 
AmXı + AmXa + t H AmnXy + Um = bm. 


The new variables, called slack variables or dis- 
posal activities, are also required to be nonnegative. 
Their introduction changes neither the set of feas- 
ible solutions for æ nor the optimal solution(s), In 
a linear programming problem with m constraints 
of type (X), a basic feasible solution is one in 
which at most m of the n +m variables appear 
with nonzero values. A fundamental theorem of 
linear programming is that if a linear program- 
ming problem possesses a feasible solution with an 
associated profit level zy, it also possesses a basic 
feasible solution with an associated profit level z, 
such that z» 2 Zy. 

The mathematical meaning of this theorem is 
that if the linear programming problem has any 
solution, then at least one solution will always 
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occur at an extreme point of the convex polyhedron 
defined by (2). The economic meaning, in terms 
of the previous example, is that the number of 
commodities produced will never exceed the num- 
ber of resource limitations. 

Duality. One of the fundamental facts of linear 
programming is that with every linear program one 
can associate another program, known as the dual 
of the first, that stands in a particular relationship 
to it. The relationship between a (primal) linear 
program and its dual program leads to a number 
of important mathematical theorems and meaning- 
ful economic interpretations. 

Given the linear program defined by (1), (2), 
and (3), the corresponding dual program is to 
minimize 

t=b,U, + bow. +--+ + bpum, 
subject to 
Gy Uy + Ay Uz + +++ + Any Um 2 Cy, 


AU, + ArU + +++ + Amm 2 C2, 


Ainus + Arn Us + +++ + AnnUm Z Cn; 
u, 20,u,20,-++,u, 20, 


It may be noted that the relationship between a 
primal problem and its dual is as follows: ( a) If 
the primal problem is a maximization problem, 
then the dual problem is a minimization problem, 
and conversely. (b) The coefficients on the right- 
hand side of the inequalities (2) in the primal 
problem become the coefficients in the objective 
function in the dual, and conversely. (c) The co- 
efficient on the left-hand side of the constraints 
(2) in the primal that is associated with the ith 
constraint and jth variable becomes associated with 
the ith variable in the jth constraint in the dual. 
(d) A new set of variables is introduced in the 
dual, and each dual variable is associated with a 
primal constraint. If a primal (dual) constraint is 
an equality, the corresponding dual (primal) varia- 
ble is unrestricted as to sign; otherwise it is re- 
quired to be nonnegative. (e) The symmetry of 
these relations implies that the dual of a dual is 
the primal. 

The economic interpretation of the dual variables 
and the dual program is as follows: Let u; be the 
imputed price of a unit of the ith resource, The 
dual program then minimizes the total imputed 
value of resources subject to the constraints that 
the imputed cost of producing a unit of any com- 
modity not be less than the profit derived from pro- 
ducing a unit of that commodity. This interpre- 
tation becomes meaningful in the light of the 
following duality theorems: (1) If both the primal 


and the dual problems possess feasible solutions, 
then z<t for any feasible solution; that is, the 
value of the (primal) objective function to be maxi- 
mized does not exceed the value of the ( dual) ob- 
jective function to be minimized. (2) If both the 
primal and the dual problems possess feasible solu- 
tions, they both possess optimal solutions, and the 
maximum value of z is equal to the minimum value 
of t. (3) If in an optimal solution to the primal 
and dual problems a constraint in the primal 
(dual) problem is satisfied as a strict inequality, 
the corresponding dual (primal) variable has zero 
value; if a primal (dual) variable in the solution 
is positive, the corresponding dual (primal) con- 
straint is satisfied as an equality. (4) In an optimal 
solution to the primal and dual problems, the value 
of the ith dual variable is equal to the improvement 
in the primal objective function resulting from an 
increase of one unit in the availability of the ith 
resource. 

In terms of the earlier economic interpretation, 
the meaning of these theorems is as follows: If a 
resource is not fully used up (one of the primal 
constraints is satisfied with strict inequality), the 
corresponding imputed price of that resource is 
zero, from which we can also infer that an addition 
to the entrepreneur's supply of that resource will 
not contribute to profit. If the jth constraint of the 
dual program is satisfied as an inequality, diverting 
resources to production of the jth commodity will 
reduce profits resulting from production of all 
other commodities by an amount greater than the 
profits resulting from production of the jth com- 
modity; hence, that commodity will not be pro- 
duced. These relationships, expressed in theorem 
(4), are known as complementary slackness. 

The discovery and development of the theory of 
linear programming and of the notions of duality 
can be credited to John von Neumann (see Dantzig 
1963, p. 24), D. Gale, H. W. Kuhn, and A. W. 
Tucker (1951), L. V. Kantorovich (1960), G. B. 
Dantzig (1951), and others. A notable application 
of linear programming to the theory of the firm is 
due to R. Dorfman (1951). 

Computational methods, Linear programs are 
normally solved with the aid of the simplex method, 
which consists of two phases: phase 1 provides a 
feasible solution to the linear program, and phase 01, 
taking over at the point at which the task of phase 1 
is completed, leads to the optimal solution (if one 
exists). In many problems a feasible solution can 
be found by inspection, and phase 1 can be by- 
passed. ; 

Phase 11 of the simplex method is an iterative 
procedure by which a basic feasible solution to the 
linear program is carried into another basic feasible 


solution in such a manner that if the objective func- 
tion is to be maximized, the objective function is 
monotone nondecreasing—that is, the value of the 
objective function at the new basic feasible solution 
is greater than or equal to the value of the objec- 
tive function at the preceding basic feasible solu- 
tion, It is a finite method (except in the case of 
degeneracy; see below) in that it terminates in a 
finite number of steps either with the optimal solu- 
tion or with the indication that no maximum exists 
—that is, that the objective function is unbounded. 

In order to solve the problem given by (1), (2), 
and (3), we first rewrite it in the form of the con- 
densed simplex tableau: 


In this tableau the basic variables are v,, -**, Um, 
and the nonbasic variables are x,,-°-*,%,. The 
current solution is thus v, = b,, +++, Um = Dm, X, = 0, 
= 0. It is assumed in the current formula- 
tion that the b; are nonnegative coefficients. An 
iteration consists of a pivot operation by which the 
roles of some v; and x; are exchanged. If the vari- 
able to enter the current solution is x, and the 
variable to leave it is v,, the exchange is effected 
by solving the constraint equation 


Vp = Ap Hy — +++ = Ayn hy — + — ArnXn + Dy 


for x, and substituting the solution into the other 
equations. This substitution yields a new tableau 
that can be interpreted in the same fashion as the 
old one. The substitution also yields the rules by 
which the elements of the new tableau can be cal- 
culated from the old one. Specifically, denoting the 
Semene of the new tableau by aj,, bj, c’,,z,, we 
ave 


a, = 1/ax; 
a, = a,)/Anx, for j # k; 
b; = b,/ax; 
—Aix/Ax, fori#r; 
—Cr/ Ax; 
Qij — Gik Oy; / Ax, fori#r,j7 #k; 
Cj — Cyr) /An, for j #k; 
= b; —b,aix/an, fori#r; 


Za + B,Cy/ Ar. 


The element a is called the pivot of the iteration. 
It is chosen in the following manner: (a) Its 
column index can refer to any column k that has 
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Cx > 0; this ensures that the objective function is 
nondecreasing. If there is no such cx, the maximum 
has been reached. (b) After such a column k is 
chosen, if all a; < O in that column, the problem 
is unbounded. Otherwise we choose the a, > 0 for 
which b;/ai; is smallest. This ensures that the new 
basic solution is feasible. 

It is standard procedure to choose for introduc- 
tion into the solution that x, from among all eligible 
ones for which c, is largest. Extensive experimental 
evidence suggests that the number of iterations re- 
quired to solve linear programs is highly sensitive to 
the particular pivot choice criterion employed and 
that many criteria exist which are more efficient 
than the standard one (Kuhn & Quandt 1963). 

A characteristic of this form of the simplex 
method is that it is a primal algorithm—that is, 
the current solution to the primal problem is feas- 
ible throughout the computation, The dual simplex 
method is completely analogous and differs only 
in that dual feasibility is preserved throughout the 
computations. 

The column vectors in a simplex tableau can be 
expressed in terms of the inverse (if it exists) of 
the matrix of coefficients corresponding to the 
basic variables. This inverse matrix is altered when 
a basic variable is replaced by a nonbasic variable. 
The change in the basic inverse can be expressed 
by multiplying the old inverse by a certain elemen- 
tary matrix. From the computational point of view 
we can simplify the solution of a linear program 
by keeping track of the successive elementary 
matrices necessary to effect the requisite trans- 
formations. Using this procedure is called using 
the product form of the inverse. 

If k denotes the pivot column and if a;, > 0 and 
b; = 0 at any stage in the solution of a linear pro- 
gram, the program is said to be degenerate. Degen- 
erate linear programs can cycle—that is, it is pos- 
sible that a particular sequence of simplex tableaus 
will repeat indefinitely without any further im- 
provement in the objective function, even though 
the maximum has not been reached, Although arti- 
ficial examples exhibiting the property of cycling 
have been constructed (Hoffmann 1953; Beale 
1955), in practice, cycling does not seem to have 
occurred, Since the appearance of degeneracy in a 
problem is preceded by a nonunique choice of a 
pivot in a given column of a tableau, cycling could 
normally be avoided by making a random choice 
among potential pivots if more than one is avail- 
able. Should this procedure fail to avoid degen- 
eracy, there are various methods of perturbing the 
original problem that guarantee that cycling and 
degeneracy will be avoided. The development of 
the simplex method as a whole can be credited 
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largely to G. B. Dantzig (Dantzig, Orden, & Wolfe 
1954; Dantzig 1963). 


Applications and special techniques 


The diet problem. The diet problem is to find 
the cheapest diet that satisfies prescribed nutri- 
tional requirements. One of the first problems to 
be formulated as a linear programming problem 
(Stigler 1945), it is formally analogous to the 
activity analysis problem. Suppose there are n 
foods whose prices are p,, ---, p, and m nutrients 
whose minimum requirements are b,, ---, bm. Let 
%;,°**,%, be the amounts of each food included 
in the diet, and let a;; be the amount of the ith 
nutrient in the jth food. Then the diet problem is 
to minimize 


n 
È px, 
ja 
subject to 
n 
D aux >b, i=1,:,m; 
ja 
x20, j=1, >n 


Matrix games. Two-person zero-sum games can 
easily be formulated and solved as linear programs 
[see GAME THEORY]. Let ai, i=1, +, m, j=1, 
‘++, n, be the payoff from player m to player 1 if 
1 employs his ith pure strategy and 11 his jth. Irre- 
spective of the (pure) strategy employed by player 
u, player 1, utilizing a maximin criterion, would 
wish to use the strategy that will ensure him an 
amount at least equal to some number V, called 
the value of the game. He will play his various pure 
strategies with probabilities x,,--+,2,. The re- 
quirement that his expected gain not be less than 
V (irrespective of the strategy employed by player 
11) can be expressed by 


D ayx >V, a N 
de 
» 
xa le 
Fost 
x 20, i=1,:--,m 


We transform this statement of the problem into a 
linear program by defining z; = X/V, i=1, =, m. 
Then, assuming that V>0, the problem is to 
minimize 


1 3 
ya x Zis 
subject to 
m 
X ajz > 1, sn; 
i 
z, 20, ‚m. 


If V<0, we choose a d large enough so that a 
solution can be found to $% a,x; > V, j=1,-++,n, 
where a;; = a,, + À. This transformation leaves the 
optimal mixed strategy unchanged. For further 
material on the application of linear programming 
to matrix games, see Tucker (1960). 
Transportation and assignment problems. As- 
sume that a commodity is available at m sources 
in amounts ai, i=1,---,m, and is required at 
n destinations in amounts b,, j = 1, +-+, n., Assum- 
ing that the total supply (3{7.,4,) equals the total 
demand ()j.,2,) and that the unit cost of ship- 
ping from source i to destination j is the constant 
Cij, t=1,+++,m, j=1,--:,n, how can demands 
be satisfied so that no source is required to ship 
more than is available at that source and so that 
the total cost of shipping is minimized? The formu- 
lation of this problem, known as the transportation 
problem, is as follows: define x;; as the amount 


shipped from source i to destination j; then 
minimize 
mon 
Z 2 Citir 
ist jet 
subject to 
m 
X xy =b,, JE OE 
in 
" 1 
B xy = a, EE i aG: 
j=1 
xy = 0, for all i, j. 


The equations above express the requirements that 
all demands be satisfied and that no availabilities 
be exceeded. 

One of the m + n equalities above will always be 
linearly dependent on the others; thus, in the ab- 
sence of degeneracy, a solution will consist of 
m +n—1 nonzero shipments. An important prop- 
erty of the transportation problem is that if the 
a; and b; are integers, the solution of the transpor- 
tation problem will also consist of integers. Vari- 
ants of the transportation problem are obtained 
(a) by assigning capacity limits to the various 
routes from source to destination, expressed by 
constraints of the type xi; <ki;, and (b) by per- 
mitting a shipment from i to j to go via some inter- 
mediate destination(s), resulting in what is called 
the transshipment problem (Orden 1956). 

The transportation problem is closely related to 
the assignment problem. Consider the problem of 
assigning n persons to n tasks in such a manner 
that exactly one person is assigned to each task. 
It is assumed that we can measure the effectiveness 
of each person in each task on some cardinal scale. 


If the effectiveness of the ith person in the jth task 
is given by e;;, we wish to maximize 


subject to 


xij 20. 


The solution to the above mathematical problem 
is the solution to the assignment problem, since its 
formal analogy with the transportation problem 
guarantees that the solution values for the x), will 
be integers. Hence, each x;; will be either 0 or 1; 
x;;= 0 means that the ith person is not assigned 
to the jth task, and x;; = 1 means that he is. The 
equality constraints ensure that one and only one 
man will be assigned to each task. 

There are various methods for solving the trans- 
portation and the assignment problems. The most 
common are the simplex method—which is com- 
putationally easier in these cases than in a general 
linear programming problem—and variants of the 
Hungarian method due to H. W. Kuhn (1955). 
Some of the variants of the Hungarian method can 
also be used to solve the so-called network flow 
problem, the objective of which is to maximize the 
flow from a single source to a single destination 
(sink) through a network of intermediate nodes 
(Ford & Fulkerson 1962). The close relation be- 
tween these network-oriented problems and the 
technique of graph theory has resulted in fruitful 
applications of graph theory to certain types of 
linear programming problems. The transportation 
problem was originated by F. L. Hitchcock (1941) 
and T. C. Koopmans (1951). 

Decomposition in linear programming. Certain 
problems are characterized by the fact that they 
can be represented as two (or more) almost inde- 
pendent problems. A case in point is the production- 
scheduling problem of a corporation that has two 
divisions. Assume for argument’s sake that each 
division faces production constraints that are inde- 
pendent of those faced by the other division. Thus, 
the amounts of labor, raw materials, and other 
material inputs used by one division in no way 
depend upon the amounts of these inputs used by 
the other division. There may, however, exist cer- 
tain over-all corporate constraints, perhaps of a 
financial nature. In such circumstances the profit 
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maximization problem of the corporation may be 
expressed as the problem of maximizing 


eX, + eX., 


subject to 


Here X, and X, are vectors of activity levels; b,, 
b, , and b. are vectors of resource availabilities; and 
A,, A., B,, and B, are matrices of input coef- 
ficients specifying the amount of the ith resource 
necessary to sustain one unit of the jth activity. 
The first set of equalities represents the over-all 
corporate constraints; the second and third sets rep- 
resent the two divisions’ respective constraints. 

Very large problems (involving tens of thousands 
of constraints) exhibiting this type of structure can 
be solved efficiently by the decomposition principle. 
The decomposition principle rests upon the follow- 
ing basic observations: (a) The solution of a linear 
program can always be expressed as a convex com- 
bination of the extreme points of the convex feas- 
ible set of solutions. This representation is called 
the executive or master program. (b) If the given 
problem is rewritten in terms of the extreme points, 
we obtain a new problem with substantially fewer 
constraints but with many more variables. (c) The 
solution method is not contingent on our finding 
all extreme points. On the contrary, columns of 
coefficients can be generated when needed (for in- 
troduction into the simplex tableau ) by the solution 
of certain small subprograms, each consisting of 
the independent divisional constraints. (d) The 
solution process is an iterative one that consists 
of the following steps: (i) a feasible solution to 
the executive program is found; (ii) subprograms 
are solved to determine what new column is to be 
introduced into the executive program; (iii) the 
new (restricted) executive program is solved; (iv) 
the corresponding subprograms are solved again; 
and so on, until an optimal solution is reached in a 
finite number of steps. 

The decomposition principle lends itself to ap- 
plications of decentralized decision making. From 
the computational point of view, the decomposition 
principle extends the power of electronic computers 
(Dantzig 1963, chapter 23). 


Nonlinear programming 


Formulation. Cases of mathematical program- 
ming in which either the objective function or the 
constraints, or both, are nonlinear are referred to 
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as instances of nonlinear programming. Examples 
of nonlinear programming arise frequently in eco- 
nomic contexts. The activity analysis example dis- 
cussed in the section “Linear programming” con- 
tains two particularly severe restrictions: (a) the 
profit that can be obtained from producing an extra 
unit of a commodity is constant and is thus inde- 
pendent of the level of production, and (b) the 
amounts of additional inputs needed to produce an 
additional unit of a commodity are constant and do 
not depend on the level of production. Thus, the 
linear programming formulation accounts neither 
for the possibility of (eventually) declining aver- 
age profit owing to negatively sloped demand func- 
tions nor for the possibility of increasing marginal 
amounts of inputs, which are required for the main- 
tenance of constant unit additions to output be- 
cause of the well-known phenomenon of diminish- 
ing returns. Nonlinear programming is well suited 
to dealing with the case of such realistic modifica- 
tions of economic models. The general nonlinear 
programming problem can be formulated as fol- 
lows: maximize 


(4) fa, 


subject to 


Xa), 


PCX, t, n) 20, 
(5) : : 

Gm( Xs, 1 y Xa) > O; 
(6) x> 0, +++, x> 0. 


Duality and the Kuhn-Tucker conditions. The 
necessary and sufficient conditions for a solution 
to the inequalities (5) and (6) to be an optimal 
solution are called the Kuhn-Tucker conditions 
(Kuhn & Tucker 1951, pp. 481-492). These con- 
ditions represent a powerful generalization of the 
duality theorems of linear programming. They are 
based on the assumption that the objective func- 
tion f and the constraints g,, ---, gm are differ- 
entiable, concave functions. (The function k(x) 
is concave if, for any choice of two points x, and 
x, and 0<6@<1, the value of the function at 
any point between x, and x, f[@x, + (1 —@)x.], 
is not smaller than the value of the linear func- 
tion between f(x,) and f(x.) given by OF (x,) + 
(1 — @)f(x:).) The maximization problem can then 
be reformulated in terms of the differential calcu- 
lus using Lagrangian multipliers. The new La- 
grangian objective function is 


plx, u) = F(x, 19) + Engin, AT 


The so-called saddle value problem is to find 
nonnegative vectors x’ = (x?, +++, x?) and w= 
(u, +++, u2) such that 


h(x, w°) < h(x, u°) < h(x, u). 


(Here $(x°, u?) is called the saddle value of $, and 
the point (x°, u°) is called the saddle point of ġ.) 
That is, the saddle value problem is to find x° and w° 
such that the Lagrangian function ¢ has a maximum 
with respect to x at x”, a minimum with respect 
to at w’, and a saddle point at (x°, u°). The fol- 
lowing are the crucial theorems: (1) A necessary 
and sufficient condition for x° to be a solution to 
the (primal) maximum problem is that there exist 
a w such that æ’ and u° are a solution to the sad- 
dle value problem. (2) Necessary and sufficient 
conditions for æ? and u? to be a solution to the 
saddle value problem are 


Ob. ~ Exe E <0, woo 
(7) a ne 0; dx} Bx; 0; x?20, 
E rat for all j, 
and 
Ob » Supt | $ 
(8) TA = 0; Zut T ba 0; u20, 
cus PE for all i. 


If theorem (2) is applied to the linear case, it 
immediately reduces to the familiar duality the- 
orems, with the middle conditions in (7) and (8) 
ensuring complementary slackness. The frequent 
case in which the constraints are linear but the 
objective function is not linear has an immediate 
economic interpretation: the first part of conditions 
(7) states, for example, that the imputed value of 
the resources necessary to produce a unit of the jth 
commodity must not be less than the marginal 
profit contribution of that commodity. 

Recently, the conditions under which the Kuhn- 
Tucker conditions hold have been somewhat re- 
laxed by H. Uzawa (1958). 

Computational methods. A number of methods 
have been developed for solving nonlinear pro- 
grams of various types. The success achieved by 
these methods depends in general on the configura- 
tion of the particular problem. Several methods re- 
quire that the objective function and the constraints 
be concave; strict concavity is necessary for some 
methods. If the objective function is not concave, 
the maximum is generally not unique; thus, solu- 
tion methods that are capable of solving nonlinear 
programs of this type usually obtain local maxima 
but not global maxima. Solution methods fall 
broadly into three classes: (a) gradient methods, 


which, starting with some solution to the problem, 
evaluate the slope (gradient) of the nonlinear ob- 
jective function and alter the current solution by 
moving in the direction of the steepest gradient; 
(b) the simplex method, applicable in the case of a 
quadratic objective function and linear constraints, 
which obtains its usefulness by virtue of the fact 
that in the case mentioned the Kuhn-Tucker con- 
ditions are expressible as linear equations; and 
(c) decomposition approaches, which arise from 
solving certain suitable linear programming prob- 
lems based on evaluating the nonlinear objective 
function over a lattice of points in the space of 
feasible solutions (Wolfe 1963). 


Discrete (integer) programming 

Formulation. In a large variety of linear pro- 
gramming problems, one has to dispense with the 
usual assumption that the space of solutions is 
finely divisible. Cases in which the set of solutions 
consists of discrete points at which the components 
of the solution vector have integer values are called 
discrete or integer programming problems. 

Integer programming problems arise essentially 
for one of two reasons that are perhaps concep- 
tually distinct but are in practice not always dis- 
tinguishable. One reason is that the values of some 
or all variables in a linear programming problem 
may have to be restricted to integers because frac- 
tional solutions may not make sense from the eco- 
nomic point of view. Certain economic activities 
(the building of a bridge, the dispatching of an 
aircraft) are by their nature indivisible, and any 
solution involving such variables must be integer. 
The other reason is that certain otherwise intract- 
able mathematical problems, often of a combina- 
torial nature, have natural formulations in terms 
of integer-valued variables. In such instances frac- 
tional-valued variables in the solution must be ruled 
out for mathematical and logical reasons. 

Applications. The following examples illustrate 
the types of problems that can be formulated as 
integer programs. 

The fixed-charges transportation problem. Con- 
sider a standard transportation problem with m 
sources, n destinations, unit shipping costs Cij, re- 
quirements b,, and availabilities a;. In addition. 
assume that a certain fixed charge e;; , independent 
of the amount shipped, is incurred if xj; is positive 
—that is, if the ith source ships to the jth destina- 
tion—and is not incurred if x;; is zero. An integer 
programming formulation is to minimize 
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subject to 
Ux = by, j=l, =n 
Ery = t, S1, m 
Xi 2 0; 
(9) ty SXi; 
(10) tu Sty <1, 


where t,, is an integer and M is any suitably large 
number, say M = max (a;, b;). Constraint (9) en- 
sures that t,;=0 whenever x,,=0. Inequalities 
(10), together with the requirement that t;; be an 
integer, ensure that t;; = 1 if x; is positive. Hence, 
a cost e; will be incurred if and only if xı; is 
positive. 

The dispatching problem. Let there be m cus- 
tomers whose orders must be delivered and n 
activities, each of which represents the act of de- 
livering one or more customers’ orders by truck. 
These activities A, can be represented by column 
vectors of zeros and ones; the ith element of A, is 
O if the jth activity does not deliver the ith cus- 
tomer’s order and 1 if it does. These various activi- 
ties arise from the fact that several orders can 
occasionally be combined in a single truck. If c; 
represents the cost of the jth activity (per unit), an 
optimal dispatching pattern is found by minimizing 


z CX, 
subject to 
= A;x =d; 
í x; 20, 


where x, is an integer and J is a column vector of 
ones. In the solution, x; = 1 or x; = 0, depending 
on whether the jth method of customer deliveries 
is or is not employed. This problem is closely re- 
lated to the covering problem of graph theory. 

The traveling salesman problem. Given n cities 
and the distance between each two, what is the 
shortest tour that begins and ends in a given city 
and passes through every other city exactly once? 
This problem can be represented as an integer 
program in several ways. 

Computational methods. The computational 
methods employed in solving integer programs are 
due primarily to R. E. Gomory (1958). An outline 
of the working of these methods is as follows: 

(a) The problem is solved by conventional 
methods as a linear program. If the solution con- 
sists of integers, that is the solution to the integer 
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program. This is, in fact, always the case with the 
transportation and assignment problems. 

(b) If the solution to the linear program is not 
integer, a new constraint is generated from the 
parameters of the previous solution with the prop- 
erty that the constraint excludes part of the feasible 
region (hence the name “Gomory-cut”) without, 
however, excluding any integer points in the feas- 
ible region. The introduction of such a constraint 
adds a new, fictitious variable to the problem and 
renders the primal problem infeasible; hence, a 
pivot step by the dual simplex method can be 
undertaken. 

(c) After a finite number of Gomory-cuts and 
dual simplex iterations, the integer solution to the 
problem can be achieved. 

There exists some choice about how to gener- 
ate new constraints. Several methods appear to be 
more successful with some problems than with 
others. The relative success of various computer 
codes in solving integer programs seems to depend 
in some sense on the structure of the problem. The 
relative computational efficiency of the various 
methods is only imperfectly understood. 

It may be noted that the dual values in integer 
programs do not have all the properties and in- 
terpretations that dual values ordinarily have 
(Gomory & Baumol 1960). These problems do not 
possess the property of continuity, and the Kuhn- 
Tucker conditions are therefore not applicable. (For 
an excellent recent treatment of various aspects of 
integer programming, see Balinski 1965.) 


Miscellaneous problems and methods 


This final section is devoted to a brief description 
of some additional problems in programming. 

Parametric programming. Parametric program- 
ming deals with (linear) programming problems 
in which either the objective function or the con- 
straining inequalities are parametrized as follows: 
maximize 


L (ce; + ke; x), 


in 
subject to 


2 ax; S bi + kadi; 
a 
x > 0, 


where k, and k, are parameters. Maximizing with 
respect to the x; as before, one can deduce from 
such formulations the sensitivity of the solution to 
variations in the coefficients of the objective func- 
tion and to variations in the right-hand sides of the 
first set of inequality constraints. By employing 


variants and extensions of the simplex method, 
problems of this type can be solved, and the range 
of k-values for which a solution exists can be 
determined. 

Stochastic programming. If some of the coeffi- _ 
cients in a programming problem are not known 
with certainty but can be regarded as depending 
upon the outcome of some chance event, the prob- 
lem is one of stochastic programming. Such pro- 
gramming problems arise in a variety of situations 
in which it is desired to schedule production to 
meet an uncertain demand. Various formulations 
may differ in the following respects: (a) the coeffi- 
cients in the objective function, in the left-hand 
sides of the inequality constraints, or in the right- 
hand sides of the inequality constraints may be 
random variables; (b) the decision problem may be 
nonsequential or may be sequential in that decision 
variables fall into two or more groups and deci- 
sions must be made with respect to each group at 
a different time; (c) the objective may be to max- 
imize expected profit or to minimize the variance 
of profit, subject to a constraint on expected profit. 
Stochastic programming problems are often non- 
linear, and solution methods may differ from case 
to case, involving ordinary linear programming, 
separable programming, and other techniques 
(Hadley 1964). 

Dynamic programming. Dynamic programming, 
which was developed primarily by R. Bellman 
(1957), employs a’ set of powerful methods that 
can be applied to problems in which decisions can 
be represented as occurring sequentially. Mathe- 
matically it rests upon Bellman’s principle of opti- 
mality, according to which an optimal set of 
decisions is characterized by the fact that the last 
n — 1 decisions in the set are optimal with respect to 
the state resulting from the first decision, whatever 
that first decision may have been. Many problems 
in operations research are capable of being formu- 
lated as dynamic programs. These problems resem- 
ble the types of mathematical programming prob- 
lems discussed above in that they characteristically 
involve maximization (or minimization) of some 
function subject to inequality constraints and to 
nonnegativity requirements. The computational dif- 
ficulties involved in reaching solutions vary, de- 
pending on the problem. 

RICHARD E. QUANDT 


[See also OPERATIONS RESEARCH. ] 
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PROGRAMMING, COMPUTER 
See COMPUTATION. 


PROJECTION, PSYCHOLOGICAL 
See DEFENSE MECHANISMS and PROJECTIVE 
METHODS. 


PROJECTIONS 

See POPULATION, especially the article on POPU- 
LATION GROWTH; PREDICTION AND FORECASTING, 
ECONOMIC; TIME SERIES. 


PROJECTIVE METHODS 


1. Projective TECHNIQUES Gardner Lindzey and 


Joseph S. Thorpe 
um. THE RORSCHACH TEST Samuel J. Beck 
m. THE THEMATIC APPERCEPTION 

Test William E. Henry 


1 
PROJECTIVE TECHNIQUES 


Projective techniques and their application rep- 
resent a point of significant intersection for psycho- 
analytic theory, clinical psychology, social psychol- 
ogy, and cultural anthropology. These instruments 
include a wide array of materials, instructions, and 
interpretative rules, but in spite of this diversity, 
there are certain common qualities that distinguish 
them from other personality measures, such as 
inventories, ratings, or situational tests. 

The most distinctive feature of projective tech- 
niques concerns the presentation to the subject of 
a task that is relatively unstructured. This lack of 
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structure involves ambiguous or vaguely defined 
stimuli and a relatively unrestricted set of response 
alternatives by means of which the subject may 
assign meaning to the stimulus material. It is gen- 
erally assumed that in the process of choosing be- 
tween the many alternatives for structuring or 
interpreting the ambiguous stimulus material, the 
subject reveals significant and fundamental aspects 
of himself, Ordinarily such devices are considered 
to be particularly responsive to latent or uncon- 
scious components of the person, and consistently 
there is a minimum of subject awareness concern- 
ing the purpose of the test. The response data are 
typically profuse and often involve fantasy or 
imaginary creations. The instruments are highly 
multidimensional; that is, they are intended to 
provide information concerning a large number of 
personality dimensions. 

Historical development. The origins and early 
development of projective tests are closely inter- 
twined with developments within psychoanalysis. 
Indeed, Freud’s theory of dreams and technique of 
dream interpretation may be considered the most 
important single source of the dominant ideas in 
this area. At about this time Jung’s Studies in Word- 
association (1904-1909) appeared, which also 
has been heavily influential in the development of 
projective testing. The Rorschach test, best known 
of all projective tests, was developed by a student 
of Jung’s and was undoubtedly influenced by psy- 
choanalytic theory and practice. The term “projec- 
tive technique” came to be associated with these 
instruments primarily as a result of the writings 
of Henry A. Murray and L, K. Frank during the 
1930s; however, the relationship between the con- 
cept of projection and these instruments is any- 
thing but precise (see Lindzey 1961). Indeed, one 
can say that “projection,” as the term is conven- 
tionally used within psychoanalysis today, is sel- 
dom directly involved in projective-technique re- 
sponses. 

In the two decades following World War 1 there 
was a very rapid development and application of 
aptitude and ability tests as exemplified by the 
Stanford-Binet and various group tests. Following 
World War 1 there were comparable developments 
in the area of personality testing, with projective 
techniques receiving the most prominent attention. 
It was during this period that the Society for Pro- 
jective Techniques, made up of persons involved 
in the study and application of these devices, was 
formed. At the same time the Journal of Projective 
Techniques, a journal that contains only papers 
dealing with these instruments, appeared. In addi- 
tion, an extraordinarily large number of articles 

and books concerning these techniques have been 


published. Even for a single instrument, such as 
the Rorschach test, a total bibliography would now 
include more than a thousand items. Moreover, 
within most modern graduate programs in psy- 
chology the inclusion of one or more courses deal- 
ing with projective techniques has become standard, 

The development and application of these in- 
struments have been associated closely with clinical 
activities. For the most part, individuals drawn to 
these devices have been working in clinical set- 
tings, often with disturbed persons and typically 
under circumstances where individual subjects 
could be subjected to a relatively intensive and de- 
tailed study. Moreover, the techniques have been 
considered most attractive and useful by persons 
who are heavily influenced by psychoanalytic the- 
ory or some variety of holistic theory. Seldom has 
a heavy emphasis upon objectivity, precision, and 
psychometric elegance been characteristic of work 
with these devices. 


Major techniques and their classification 


Given these general remarks concerning the 
nature and origins of projective techniques, it may 
be helpful to provide concrete identification of the 
instruments by means of representative illustra- 
tions, There are many different approaches to clas- 
sifying these tests (Lindzey 1959), but the most 
useful approach focuses on the nature of the re- 
sponse evoked from the subject. On this basis pro- 
jective techniques may be grouped broadly into the 
following categories: (a) association, (b) con- 
struction, (c) completion, (d) choice or ordering, 
and (e) expression. y 

Association techniques. Included under associa- 
tion techniques are those instruments that tend to 
minimize ideation and emphasize immediacy of 
response. The subject is instructed not to reflect 
but rather to respond to the stimulus with the first 
word, image, or percept that occurs to him. 

Word association. An early and well-known ex- 
ample is the word association technique, which 
involves the oral presentation of a series of discon- 
nected words, one at a time, to which the subject 
is instructed to respond with the first word that 
occurs to him. Typically, both the subject's associa- 
tion and reaction time to each word are recorded. 
While many lists of words have been used, the 
most frequently encountered are the early list by 
Kent and Rosanoff (1910), which was designed 
as a psychiatric screening device, and the more 
recent list by Rapaport, Schafer, and Gill (1945- 
1946). The latter is similar to Jungs (€-g- 
1904—1909), which contains many words of psy- 
choanalytic significance, particularly with regard 
to psychosexual conflict. Depending upon the par- 
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ticular purpose of the user, scoring and interpre- 
tation may involve an analysis of the content of 
the associations, in terms of their relevance to 
particular motives or conflict areas such as domi- 
nance or aggression, or if a judgment concerning 
the degree of general adjustment is to be obtained, 
the subject's responses may be compared with the 
typical performance of psychiatric and normal 
groups on the same list of words. In addition to 
their content, associations are often analyzed in 
terms of proportion of common responses, tem- 
poral deviations, and associative disturbances, such 
as repeating the stimulus word or giving extremely 
remote associations. 

Rorschach and other inkblot techniques. By far 
the most popular projective technique, in terms of 
frequency of clinical and research application, is 
the Rorschach technique (Rorschach 1921), which 
consists of ten symmetrical inkblots. Typically the 
cards are presented individually and in a set order, 
with instructions to the subject that he report what 
the figures resemble or suggest to him. Responses 
to each card are recorded verbatim and reaction 
times are noted, following which an inquiry is 
conducted wherein the subject identifies the char- 
acteristics of the stimuli which determined or con- 
tributed to his associations. [See RoRSCHACH.] 

There are a variety of scoring systems in use, 
the most popular of which are by Beck (Beck et al. 
1944) and Klopfer (Klopfer et al. 1954-1956). 
Those aspects of responses which are usually 
scored are location, the relative size of the blot area 
incorporated (including the use of white space); 
determinants, the use of color and shading in com- 
parison with the form or shape of the blot; and 
the presence of movement—either human, animal, 
or inanimate—in the response. Also noted are the 
accuracy of form perception, the incidence of pop- 
ular and original responses, and the total number 
of responses. The content of responses is also 
classified, at least in terms of human, animal, 
anatomical, nature, and abstract classes, although 
some scoring conventions are quite elaborate in 
this respect. Various scores, representing the num- 
ber of responses falling into each category, are 
usually further transformed into ratios of scores 
which are considered essential for interpretation. 

Most systems of interpretation of these quan- 
titative scores and ratios prescribe inferences to 
be drawn regarding the more general organization 
of personality, rather than specific motives or con- 
flicts. It is usually emphasized that the interpreter 
must consider the entire profile of scores rather 
than isolated scores. 

Modifications of this technique have involved 
developing new stimuli and more objective scoring 
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procedures, as well as alternate forms and group 
versions of the test. Perhaps the most interesting 
modification is the Holtzman Inkblot Technique 
(Holtzman et al. 1961), which introduced desira- 
ble psychometric properties to the basic Rorschach 
approach. 

Construction techniques. Construction tech- 
niques require the subject to go beyond simple 
association to a stimulus and to create or construct 
a more elaborate product, which is typically a com- 
plete art form, such as a story or picture. 

Thematic Apperception Test, Next to the 
Rorschach, the most widely used of all projective 
techniques is the Thematic Apperception Test 
(Murray 1943). The TAT consists of cards con- 
taining black-and-white pictures of vague scenes, 
and the instructions request the subject to com- 
pose a story to fit each picture, describing what 
the people are thinking and feeling, what led up 
to the scene depicted, and what the outcome will 
be. Analysis of TAT stories, which are usually re- 
corded verbatim, is far from standardized, although 
a number of general scoring systems have been 
proposed (e.g., Aron 1949; Bellak 1947; Stein 
1948; Tomkins 1947). Most of these systems focus 
on the content of behavior and experiences de- 
scribed in the story, although they differ consider- 
ably in the unit of analysis employed and the 
degree of inference required. As used clinically, 
interpretation typically has not rested upon a set 
of objective scores for specific variables. The basic 
assumption of most schemes is one of a psycho- 
logical isomorphy between dispositions attributed 
to the part of the major character in the story and 
those that exist in the storyteller. 

The Blacky Pictures. Closely related to the TAT 
is the Blacky Pictures test (Blum 1949), which is 
intended to investigate 11 specific psychoanalytic 
variables, including oral eroticism, oedipal inten- 
sity, and castration anxiety. The test consists of 
12 cartoons concerned with experiences in the life 
of a dog named Blacky, including his relationships 
with Mama, Papa, and Tippy, a sibling. As its 
central feature the procedure involves story con- 
struction in response to the pictures but adds indi- 
cations of preference and a series of direct ques- 
tions for each picture. While administration is 
highly standardized, scoring and interpretation 
are not. 

Completion techniques. Completion techniques 
present some type of incomplete product, with the 
requirement that the subject complete it in any 
manner he wishes and the restriction that the com- 
pletions meet certain standards of good form or 
rationality. Best known of these devices are the 
sentence-completion tests (e.g., Rotter & Willer- 
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man 1947), which are widely used by both clinicians 
and personality investigators. Typically such a de- 
vice consists of thirty to one hundred brief sen- 
tence stems which the subject is instructed to com- 
plete with the first words that come to mind. In 
some cases the instructions to the subject empha- 
size that the completions should reveal his own 
feelings. The type of scoring employed and the cus- 
tomary inferences derived from this technique 
suggest its close kinship with the TAT. The tech- 
nique is considered most efficient in assessing the 
content of personality (attitudes, motives, and con- 
flicts) at a more conscious or manifest level than 
such instruments as the Rorschach or TAT. Other 
examples of completion tests are story-completion 
and argument-completion tests and the Rosenzweig 
Picture-Frustration Study (Rosenzweig 1949). 

Choice or ordering techniques. Choice or order- 
ing techniques require the respondent to choose 
from a number of alternatives the item or arrange- 
ment that fits some specified criterion, such as 
meaningfulness, relevance, or attractiveness. In 
some cases, such as multiple-choice versions of the 
Rorschach (Harrower & Steiner 1945) and the 
TAT (Goodstein 1954), the subject is required to 
select from a number of hypothetical responses 
the one that seems most appropriate to him. 

The Szondi Test. Perhaps best known of these 
techniques is the Szondi Test. This test was de- 
vised in the late 1930s by Szondi, a Hungarian 
psychiatrist (1947), and made popular in the 
United States through the efforts of Susan Deri 
(1949). The test materials consist of 48 photo- 
graphs of individuals drawn from eight psychiatric 
diagnostic categories. A rather complex and lengthy 
administration procedure elicits from the subject 
expressions of preference for the different photo- 
graphs. The test is based on the assumption that 
a person can be described in terms of eight “need- 
systems” (presumably reflected in the photo- 
graphs), with the subject's selections and rejections 
indicating the degree of strength or tension in 
each. 

Although the test enjoyed considerable popu- 
larity in clinical settings during the 1950s, more 
recently there has been less evidence of applica- 
tion. Despite some positive features, such as the 
recommendation of repeated administrations and 
a highly objective scoring system, the theoretical 
rationale of the test has appeared to be anything 
but compelling to most American psychologists. 
Attempts at empirical verification of the test seem 
to be overwhelmingly negative (Borstelmann & 
Klopfer 1953). A recent volume by Szondi, Moser, 
and Webb (1959) provides a strongly partisan 
description of the technique and its rationale, 
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Picture Arrangement Test. A more recent ex- 
ample of an ordering technique is the Tomkins- 
Horn Picture Arrangement Test. This device con- 
sists of 25 plates, each of which contains three 
line drawings that depict the same figure involved 
in different but related activities. The subject is 
asked to indicate the order in which these activities 
took place and to provide a sentence indicating 
what is going on in the picture. The efforts of the 
authors (Tomkins & Miner 1957) have been de- 
voted primarily to developing objective methods 
for dealing with the arrangements selected by the 
subject, although they provide suggestions for the 
use of the descriptive statements as well. 

This test is unique in its consideration for in- 
terpretation of only those responses that are rela- 
tively rare in occurrence. Rare response patterns 
from a subject provide a basis for estimating cer- 
tain tendencies such as hypochondriasis or avoid- 
ance of people and for distinguishing normal from 
abnormal subjects. The existence of a highly ob- 
jective scoring procedure, coupled with the dem- 
onstration of certain distinct differences in the 
performances of normal and disturbed subjects, 
identifies this instrument as possessing objective 
and normative properties that are not encountered 
anywhere else in the domain of projective tech- 
niques. On the other hand, the data collected are 
sharply delimited, and the process of demonstrat- 
ing that the test is sensitive or useful has only 
begun. 

Expressive techniques. Expressive methods dif- 
fer from construction techniques in that they place 
as much emphasis upon the manner and style in 
which the product is created as upon the product 
itself. They are often considered to be therapeutic 
as well as diagnostic devices, since the subject is 
presumed to relieve his difficulties in the process 
of revealing them. 

Play. Most popular among the expressive in- 
struments are play techniques. Originating in play 
therapy, these methods have been adapted for the 
diagnosis and measurement of personality, pri- 
marily in children. All the approaches present the 
subject with an array of toys which he is encour- 
aged to use in some manner. Among the objects 
frequently selected are dolls representing adults 
and children of both sexes and various age levels. 
Typically the examiner is responsible for recording 
as much of the subject’s behavior as possible, in- 
cluding his choice and arrangement of toys, ac- 
companying comments, and expressive behavior. 
Except for the work carried out by Sears and his 
associates, which has focused on measures of 
aggression and dependency (Sears et al. 1953), 
little has been done with this technique in the way 


PROJECTIVE METHODS: Projective Techniques 


of standardization and specification of just how 
the process of interpretation shall take place. In 
most instances it is assumed that the examiner 
will recognize important motives and conflicts 
when he sees them. 

Drawing and painting. There is also a variety 
of drawing and painting techniques that have been 
used in personality assessment. Attention has 
centered chiefly on procedures using drawings of 
the human figure, the most publicized of which 
have been those of Buck (1948) and Machover 
(1949). When the test is administered individually, 
the examiner usually notes the subject's com- 
ments, the sequence of parts drawn, and other pro- 
cedural details. Scoring of human-figure-drawing 
tests is essentially qualitative, being concerned 
with such stylistic features as the figure’s stance, 
size, and position on the page, disproportions, 
shading, and erasures. Interpretation of both draw- 
ing and painting techniques is far from precise, 
in some cases depending largely upon general clin- 
ical wisdom, coupled with a knowledge of some 
specific rules or generalizations relating certain 
features of the drawing or painting to personality 
characteristics. 


Eyaluation and interpretation 


The preceding survey of the more important ex- 
amples of different types of projective techniques 
illustrates the extreme diversity of these instru- 
ments, Discussions of projective techniques in a 
critical context will be found in volumes by Ana- 
stasi (1961), Cronbach (1949), and Lindzey 
(1961). General sources of more detailed informa- 
tion are books by Abt and Bellak (1950), Ander- 
son and Anderson (1951), and Zubin, Eron, and 
Schumer (1965). 

The interpretation of projective-technique re- 
sponses is carried out under conditions that vary 
from the highly molecular and objective to the 
most generalized and intuitive. Virtually all the in- 
dividual tests have interpretative manuals to assist 
the user in extracting useful psychological in- 
formation from them, and in the case of the 
better-known instruments, such as the TAT and 
Rorschach, there are dozens of books and mono- 
graphs and hundreds of articles dealing with gen- 
eral and specific problems of interpretation. In 
spite of the bulk and diversity of this interpreta- 
tive array, there are certain generalizations that 
can be made concerning customary interpretative 
practice. 

The interpretation of projective techniques in 
individual assessment is more often qualitative and 
subjective than it is quantitative and objective. 
Even in those instances where the responses may 
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be subjected to objective scoring that leads to 
numerical values, the translation of these scores 
into psychological statements usually involves com- 
plex and unspecified processes (Levy 1963), In 
research use, on the other hand, it is much more 
common to find the response data subjected to 
objective scoring, which in turn leads to quantita- 
tive scores on specified personality dimensions. 

If we make the reasonable assumption that the 
chief interest of the person using projective tech- 
niques is in enduring dispositions or in personality 
characteristics, we find that the customary task of 
the interpreter is enormously complicated by the 
sensitivity of these instruments to a wide variety 
of determinants, only a small proportion of which 
are relevant to enduring motivational or personality 
dimensions. Thus, it is well established (Gleser 
1963; Lindzey 1961; Masling 1960) that projective- , 
technique responses are influenced by temporary 
or fleeting psychological states, by the nature of 
the stimuli that are presented, by a number of re- 
sponse sets, by individual differences unrelated to 
personality factors (for example, differences in in- 
telligence), and by a wide variety of situational 
factors (for example, examiner-subject interac- 
tion, past testing experiences, reason for taking the 
test). Given these observations, it is clear that the 
interpreter must make every effort to demonstrate 
that those aspects of the response data being used 
in interpretation are not products of variation in 
nonpersonality factors. In order to be reasonably 
certain of this, it is almost always necessary that 
the examiner know a great deal about the circum- 
stances under which the test was administered, as 
well as a good deal about the subject in addition 
to his test responses. For this reason, most projec- 
tive techniques administered in clinical settings 
form only part of a larger battery of intelligence 
and personality tests, with interpretation leaning 
heavily on the consistency of performance among 
the several tests, Despite this practice, one of the 
major shortcomings in the customary application 
of these techniques is the tendency on the part of 
many users to assume that all test responses are 
determined solely by personality variables; conse- 
quently, they exhibit a readiness to overinterpret. 
This caution is somewhat less important in re- 
search settings, where it is possible to control or 
randomize some of the factors that might other- 
wise distort interpretation. 

As we have indicated, the most extensive use of 
projective techniques has been in clinical situations. 
In the hands of clinical psychologists these instru- 
ments have played a key role in psychodiagnosis 
and personality description in a wide variety of 
psychiatric and psychological settings. These tech- 
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niques have also been used extensively in connec- 
tion with motivation research, or the study of con- 
sumer personality or motivation as a means of 
predicting buying behavior. Not surprisingly, little 
of the work in this area has achieved very high 
standards of rigor and control. 

Of more scientific interest has been the use of 
these instruments in personality research. A sizable 
number of the controlled studies relevant to psy- 
choanalysis that have been conducted have in- 
volved the use of projective techniques (Hilgard 
1952; Rickers-Ovsiankina 1960). The extensive use 
of projective measures of achievement and affilia- 
tion motivation in a wide variety of social psycho- 
logical and personality research represents an- 
other major area of application (Atkinson 1958; 
McClelland et al. 1953). Experimental studies of 
psychopathology have often made use of projective 
measures, The use by psychologists and cultural 
anthropologists of projective techniques in cross- 
cultural studies is another area of extensive appli- 
cation (Kaplan 1961; Lindzey 1961; Zubin et al. 
1965), Indeed, a large proportion of the studies of 
modal personality or national character, psycho- 
logical correlates of acculturation, and personality 
development in nonliterate settings has involved 
more or less extensive dependence upon projective 
techniques. 

In view of this widespread application and the 
frequently partisan enthusiasm that has been as- 
sociated with these techniques, it is not surprising 
that they have attracted a good deal of critical 
attention. Criticisms have generally focused upon 
the psychometric flaws of the instruments, includ- 
ing dubious reliability and validity and the frequent 
lack of objectivity or specificity in the administra- 
tion and interpretation of the tests. There is no 
doubt that much of this criticism is reasonable and 
based upon relatively convincing data. On the 
other hand, certain projective instruments under 
certain conditions have been demonstrated to 
attain reasonable standards of objectivity and 
standardization (e.g., Atkinson 1958; Murstein 
1963). 

The matter of validity is always complicated 
and particularly so when we are dealing with com- 
plex motivational concepts, especially when these 
are presumed to relate to covert or latent aspects 
of personality. While there are numerous Studies 
(Ainsworth 1954; Lindzey 1959; Murstein 1963) 
that demonstrate limited predictive and concurrent 
validity—ability to predict future behavior or per- 
formance and to relate to concurrent criterion 
measures—most findings have not held up well 
upon cross-validation. Further, such studies do not 


ordinarily deal adequately with the covert person- 
ality characteristics for which these tests are sup- 
posed to be singularly appropriate. At risk of over- 
simplification, we may state that when we are 
concerned with the validity of projective techniques, 
their presumed association with covert variables, 
complex motivational processes, and psychoana- 
lytic concepts forces the investigator and theorist 
into the tangled jungle of construct validity, and 
this guarantees the absence of easy answers. The 
relation between projective techniques and psycho- 
logical theory is of key importance in any consider- 
ation of the construct validity of these devices. It 
seems clear on the basis of a number of past ex- 
aminations (e.g., Holt 1954; Conference on Con- 
temporary Issues . . . 1961; Lindzey 1961) that, 
while there are some important links between these 
tests and several varieties of psychological theory, 
particularly psychoanalytic theory, there is little 
in the way of firm and formal association between 
theory and method. Relatively detailed discussion 
of the validity of these instruments may be found 
in Ainsworth (1954), Cronbach (1949), Harris 
(1960), and Lindzey (1961). At the present time, 
no simple statements can be made concerning the 
over-all validity of even a single projective test, let 
alone the entire class of tests. Indeed the question 
of general validity is meaningless; one must always 
ask, Valid for what purpose? and, perhaps, Valid 
when used by whom and under what conditions? 
All things considered, it is fair to say that these 
techniques have not lived up to early expectations 
in regard to their stable sensitivity to covert mo- 
tivational processes. On the other hand there is 
little doubt that in the hands of the sophisticated 
and energetic investigator they have fulfilled satis- 
factorily a number of important functions. As the 
previous discussion has indicated, there are some 
important areas of personality research that have 
been heavily dependent upon these devices, and 
there is agreement that some of these areas have 
involved significant research contributions. It is 
more difficult to assess the use of the instruments 
as they have been employed typically in clinical 
settings, since their function there is highly varia- 
ble, depending upon the particular setting and the 
individual clinician. For example, it is difficult to 
examine Schafer’s discussion (1954) of Rorschach 
interpretation without being convinced that the 
test can serve important assessment and diagnos- 
tic functions when employed in the manner he 
recommends and by a person of his ability. At the 
same time, one must admit that few clinical appli- 
cations are likely to attain these high standards. 
One may conclude that if projective techniques 
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represent the “royal road to the unconscious,” the 

necessary road maps have yet to become common 

knowledge. Nevertheless, the instruments seem to 

have found a useful place within the psychologist’s 
array of assessment tools. 

GARDNER LINDZEY 

AND JosEPH S. THORPE 


[For discussion of other techniques of personality as- 
sessment, see FACTOR ANALYSIS, article on PSYCHO- 
LOGICAL APPLICATIONS; INTERVIEWING, article on 
PERSONALITY APPRAISAL; PERSONALITY MEASURE- 
MENT; SOCIOMETRY; Traits. Additional relevant 
material may be found in ACHIEVEMENT MOTIVA- 
TION; FANTASY; PERSONALITY: CONTEMPORARY 
VIEWPOINTS, article on COMPONENTS OF AN EVOLV- 
ING PERSONOLOGICAL SYSTEM; PSYCHOMETRICS; and 
in the biography of RorscHacuH.] 
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0g 
THE RORSCHACH TEST 


The Rorschach test is one attempt at concur- 
rently examining an individual's functioning intelli- 
gence, his emotions, and his unconscious. Aside 
from theoretical considerations, this has very im- 
portant practical meaning for at least three disci- 
plines—clinical psychology, psychiatry, and social 
work, By the time Hermann Rorschach first pub- 
lished his test, in 1921, all three had been forced 
to grope for multidimensional concepts of person- 
ality. While they were ready for objective tests that 
were sophisticated enough to penetrate to the 
deeper recesses of human behavior, they were not, 
to be sure, prepared to have this accomplished by 
ten inkblots. 

The Rorschach test’s basic data consist of a per- 
son’s free associations to ten inkblot figures. These 


associations are the behaviors from which the ex- 
aminer can conclude that the person is a hysteric, 
an obsessive neurotic, a schizophrenic, and so on, 
or is in sound mental health, These associations 
are behaviors just as are any other visible activities 
of an individual by which we judge his character, 
such as combing his hair or wearing certain kinds 
of clothes. 

The term “projective” was originally attached to 
this kind of test by Frank (1939). It can be justi- 
fied etymologically but not clinically. The person 
does put forth, or project, elements of his own per- 
sonality onto the stimulus. However, the word 
“project” had already acquired its meaning, spe- 
cifically with reference to paranoid behavior, in 
clinical language. So “projection” now has two 
meanings in the languages of two closely related 
disciplines. Frank’s designation has become affixed 
to these tests and appears likely to remain so. In 
order to attain an understanding of the behaviors 
which constitute the Rorschach test data, the in- 
vestigator needs to know the grammar and the 
syntax of the test’s language. The present article 
undertakes to sketch them, 


Psychological processes measured 


Rorschach found the inkblot data informative 
regarding three major psychological processes: in- 
tellectual control, emotional life, and fantasy life, 
and his own thinking pivoted around these three 
areas (1921). Rorschach’s initial interpretation of 
responses to inkblots explicitly follows psychoana- 
lytic theory, and the subsequent evolution of the 
test as a clinical instrument can readily be traced 
within the framework of that theory. We are en- 
abled by means of the Rorschach test to describe 
a person in terms of his ego functioning, his emo- 
tional state, and his unconscious ideation, that is, 
the core areas of Freudian psychology. We are 
also able to identify that other set of dynamic 
traits, the defenses. 

Intellectual controls and good form. Concern- 
ing the intellectual controls, the subtitle of Ror- 
schach’s monograph (1921) indicates that diag- 
nosis is based on perception. The person looks at 
inkblot forms and tells what he “sees.” These per- 
ceptions of forms are the basic behavioral data 
derived from the test. If now we administer the 
test to, say, one hundred presumably normal indi- 
viduals, we infer from the distribution of their re- 
sponses what a normal population “sees” in the 
inkblots, that is, what their perceptual behaviors 
are like in reaction to the test. Administering it to 
a person about whom we know nothing, we com- 
pare his responses with those of the normal popu- 
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lation. To the extent that this person responds as 
they do, one can infer that he sees as they do and 
that he is to that degree normal. 

This is the foundation on which the test rests. 
What I have described in the preceding paragraph 
is Rorschach’s concept of good form (F+). The 
patient's ability to respond with such “good forms” 
within a certain empirically determined range 
is an index of his ability to see accurately; the 
criteria for accurate perceptions are assumed to 
be set by one’s peers. 

What the test does then is to set up ten situa- 
tions which establish their own frames of refer- 
ence for what is accurate and inaccurate, good 
form or bad, reality or deviation from it. The ink- 
blots are a cosmos in themselves. Not having any 
conventional meanings or reality, they can start 
de novo to trace out their own realities. This is the 
great advantage of the test’s using “unstructured” 
stimuli. 

With this advantage, there also exists a corre- 
sponding responsibility for the dependability of 
the frame of reference, that is, the normative 
foundation. 

“Good forms” (or F+) are the nucleus for ego 
evaluation. The logic for this rests on the fact that 
the persons by whom Rorschach established his 
good-form criterion were assumed to be “normals,” 
who take care of their affairs in life with the usual 
respect for law, custom, and morals as well as 
for the social graces. Such living is assumed to 
be motivated by the wish for the respect of one’s 
fellow citizens. The respect that we want and that 
we know we derive from others is also considered 
to be a measure of our self-respect, of ego. Ac- 
cordingly, an F+ score can be considered as a 
measure of ego. 

This deductive reasoning has been borne out 
inductively by clinical experience. Among persons 
who are very disturbed neurotically, many are also 
very erratic in their form perceptions. Neurotics 
are assumed to be persons in conflict. They have 
emotional urges which are inconsistent with their 
standards of good and bad. That is, they are moti- 
vated by urges that are ego-alien. In most cases 
of neurosis, the conflictual stress drains the person 
of his critical control power, and so his perceptual 
accuracy breaks down. The further observation is 
that the milder (clinically) the neurosis, the higher 
the F+ percentage; the reverse is also true. This 
relationship is also obtained in the study of devel- 
opment; the lower the age range, the lower the 
percentage of accurate form perception. Accuracy 
of form perception progresses with maturation, 
which is to say, with the ego’s growth. A plateau 
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is reached at the early adolescent years, and the 
percentage remains stable thereafter, assuming the 
relative stability of the personality. [See NEUROSIS; 
PERCEPTION, article on PERCEPTUAL DEVELOP- 
MENT; SENSORY AND MOTOR DEVELOPMENT. ] 

When an individual’s response deviates con- 
siderably from expectancy, we look for the explana- 
tion in psychological factors, usually emotional, 
that minimize or enhance the accuracy of percep- 
tual responses. Excitability or agitation may break 
down perception. Obsessive anxiety may heighten 
it. Or the person’s maturation may have been ar- 
rested, perhaps as a result of inadequate or dam- 
aged brain tissue, mental deficiency, or a patho- 
logical environmental climate—a family setting 
that has contributed toward halting the person- 
ality’s growth within a childish or even infantile 
phase, which often occurs in schizophrenia. In 
these patients the Rorschach perceptual range is 
usually below the critical minimum both for the 
normal and the neurotic ranges. 

Such are the actual empirical data in clinical 
groups. To restate: the percentage of good-form 
perception varies predictably according to what is 
known clinically of the ego’s growth level and of 
its stamina in the clinical condition. Findings of 
heightened perceptual accuracy agree with this 
logic. The responses of depressives and some very 
severely disturbed neurotics are very close to, and 
sometimes at, one hundred per cent accuracy. 
This is due to their crushing anxiety. It makes 
them incapable of seeing anything except in the 
most rigidly correct fashion. They overreact to their 
tensions and pain. They develop a character armor 
that severely disables them for effective adjust- 
ment to life. These conflicts are just as stressful as 
those of the patients whose conflicts lead to dis- 
torted perception. Such is the clinical logic. The 
empirical fact is that healthy individuals, and this 
includes those of highest intelligence, never rate 
at one hundred per cent F+, or form accuracy. 
They are liberated emotionally, and they do make 
mistakes. 

Perceptual accuracy is not the only index of the 
individual's ego functioning, Ego functioning in- 
cludes other intellectual attributes. Principally 
these are attentiveness, the ability to synthesize 
one’s percepts, the intact or pathological quality 
of the thought processes, and thematic content in 
thinking as a clue to the personal values which the 
patient holds. These variables and their numerous 
nuances are also assessed by the Rorschach test. 

Emotional life and color responses. Emotional 
life is tapped by the Rorschach test principally 
through the person's reactions to the color and to 
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the shading tones of the blot figures. Rorschach 
himself described the variety of the color reactions 
and their psychological significances. Both his tech- 
nical observations and his interpretative principles 
on this point have remained essentially unchanged. 
He had only begun to glimpse the meanings of the 
reactions to the variations in gray, as a posthumous 
paper shows (see Rorschach & Oberholzer 1923). 
Binder (1932) first described these in detail; Ober- 
holzer (1944) has elucidated an especially im- 
portant response to the grays, that is, the vista 
percept; and Klopfer (1942) was the first to de- 
scribe another nuance, the texture response, that 
is, the person responds as though he feels the 
stimulus. What any of these variables tell us psy- 
chologically would take us far beyond the scope of 
the present article. 

Fantasy and the movement response. The most 
original of Rorschach’s discoveries is the “move- 
ment” (Bewegung) response. As he described it, 
both in his monograph (1921) and in a posthu- 
mous paper (see Rorschach & Oberholzer 1923), 
it is a perception of a human form as if in move- 
ment. The patient may see this in the whole blot 
figure or in any portion of it. While the content in 
the perceived form may be that of some animal, the 
movement must nonetheless be one which is ana- 
tomically feasible for a human being (e.g., “two 
bears standing on their hind feet, as if boxing,” or 
even “two birds talking together”). 

Rorschach was both explicit and definitive in ex- 
cluding from his fantasy concept any animal move- 
ment that is a specific characteristic activity of 
that animal (e.g., “a dog snapping at a butterfly”). 
His caution about identifying the movement de- 
terminant accurately was dictated by the critical 
psychological significance of these associations. 
They are the language by which the test reports 
fantasy activity in which the person is engaging, 
fantasy which is a wish or a fear. Fantasy is con- 
sidered to be introverted emotion, that is, a re- 
pressed fear or wish. Nearly all Rorschach fan- 
tasy associations also have a significance for the 
patient's unconscious that is not evident in the 
overt associational content. The association, like 
the dream, expresses an idea which the patient 
cannot entertain in his conscious awareness. It is 
thus the lead to the unconscious, [See FANTAsy.] 

The importance of such an invention in the field 
of personality study can hardly be overestimated. 
The test becomes a depth instrument. Like the 
dream, it communicates in an idiom seemingly in- 
nocent because it is only responses to inkblots. The 
test thus enables the material to bypass censorship 
of the ego, enabling the examiner to discern con- 


flictual needs in the patient and to relate them to 
the entire personality structure—schizophrenic, 
neurotic, or healthy. It can be seen too why Ror- 
schach was so circumspect in defining his fantasy 
response. The fantasy concept has since his time 
been extended, principally under Klopfer’s influ- 
ence (Klopfer 1942; Klopfer et al. 1954-1956) to 
include apparent movement associations that do 
not conform to Rorschach’s definition, They have 
been identified by the symbol FM, that is, “form 
tending to movement.” Examples of FM provided 
by one of Klopfer’s co-workers (Klopfer 1942, chap- 
ter 12) are “spiders crawling up on a leaf”; “a bat 
turned upside down, a bat flying with his wings 
spread out”; “a squirrel on a stub, or a nest; just 
the back, with the tail sticking up.” The present 
writer has not been convinced that these FM re- 
sponses have the psychological significance of the 
Rorschach Bewegung, and he follows Rorschach’s 
original technique. 


Application and evaluation 


The test currently is being used in almost all 
parts of the world. Aside from psychology proper, 
the disciplines that are much interested in it are 
social work, psychiatry, and anthropology. Social 
work agencies having recourse to it as an explora- 
tory instrument find it an aid in assessing the 
severity of a client’s disorder and thus are able to 
arrive more quickly at a treatment plan. In psy- 
chiatry the test provides important leads and thus 
an economical approach to differential diagnosis. 
To the degree of its validity the Rorschach test 
does, in comparatively short time, what might 
otherwise require many exploratory clinical ses- 
sions. 

As a research tool in psychotherapy the test can 
be administered at various stages in the course of 
an illness. The progress of the patient (and so the 
effectiveness of the therapy) can be gauged from 
changes in the test findings. Developmental growth 
can be similarly traced by applying the test to 
children at various ages, something that Ames, 
Métraux, and Walker have been doing (1959); a 
similar study was reported by Thetford, Molish, 
and Beck (1951). 

Anthropology is still another field in which the 
test has been useful in research (see Hallowell 
1956). It yields information about how psycholog- 
ical structures vary with cultural differences. 

Research problems stemming from the Ror- 
schach test are grouped broadly into three areas. 
One is general and experimental psychology, deal- 
ing with questions of the relations of color and 
shading to the affective states of the person, the 
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rationale for employing the human movement re- 
sponse as an index of unconscious ideation, and 
the way in which the intellectual variables of the 
test provide information about the ego’s functioning. 

In the second area are certain theoretical prob- 
lems that are specifically derived from Rorschach 
test findings and principles. First among these is 
Rorschach’s Erlebnistypus, or “experience balance,” 
the ratio between the number of movement re- 
sponses and the number of color responses (1921; 
Rorschach & Oberholzer 1923), and with it its 
derivative, the “experience actual” (Beck 1960). 
These concepts bear on the innermost emotional 
and individualized needs of each person. Yet these 
are made accessible through the objective stimuli 
—the Rorschach inkblot figures. The experience 
balance and the experience actual are known from 
the relative amount of externalizing or internaliz- 
ing of the feelings. This recalls of course Jung’s 
introversion-extraversion hypothesis. But the ex- 
perience balance data have significance apart from 
introversion-extraversion, Related, too, are con- 
cepts of the psychologic structure of personality as 
a balance of forces—affective and intellectual, id 
and ego. 

Third, in the practical clinical field the research 
needs are for more and more controlled observa- 
tions. Here the task is threefold: (1) Statistical 
data are to be gathered for various diagnostic 
groups to determine the means and measures of 
dispersion for each of the Rorschach test variables, 
for each clinical category. (2) To determine what 
the entire structure is like in the typical indi- 
vidual of each group, that is, how much F+ per 
cent, human movement, and color and other test 
responses occur in the typical individual. Then, 
assuming the Rorschach test principles are valid 
psychologically, how does the cofunctioning of the 
data in any one individual make clinical sense? 
(3) What are the finer nuances of responses that 
differentiate individuals in one clinical group from 
others? The test is sensitive to whether thinking 
is integrative or pathologic, to whether fantasy is 
regressive, autistic, or promising of constructive 
imagination, and to whether emotional life is 
uninhibited, a painful withdrawing, or controlled 
release. The goal is to constantly reinforce the test’s 
major foundations and at the same time to discern 
the ever finer and finer differential diagnostic 
features. 

Validity and research value. The big question 
with regard to the test relates naturally to its 
validity. The essence of the problem is: What can 
be the criterion for validity of a psychological tool 
that undertakes to explore the person in four 
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dimensions—the intellectual, the emotional, the 
unconscious, and the unique personal dynamics 
(idea content in the test associations)? A single 
descriptive score that can take account of the 
myriad features in human behavior has not yet 
been devised. 

Statistics are of course essential in this as in 
any psychological test. Rorschach noted his de- 
pendence on them (1921). The present writer and 
his colleagues have established the statistical bases 
in adults (Beck et al. 1950) and in children ( Thet- 
ford et al. 1951). These are means and meas- 
ures of dispersion for each Rorschach variable as 
found in the population samples. These statistics 
enable us to say whether the person under scrutiny 
deviates, and how far, from the norm. What he is 
like in his total adjustment is then inferred from 
the configuration set up by the several variables 
and the comparison of the configuration with 
those established for the several clinical groups. 

The most sophisticated view that can for the 
present be taken is that which has been summed 
up for the clinical field generally by the American 
Psychological Association’s Committee on Clinical 
Tests. According to Cronbach and Meehl, “it is not 
a question of finding an imperfect criterion but 
of finding any criterion at all. The psychologist . . . 
cannot expect to find a clear unitary behavior cri- 
terion” ([1955] 1956, p. 180). A correspond- 
ence between a response to a Rorschach test pic- 
ture and some outside, objective measure is not 
now within this writer’s aspirations, or within 
those of anyone dealing realistically with the test. 
The clinical information about each patient, and 
predictability of test results throughout the course 
of the patient's treatment, must for the present be 
our validating source. 

A psychological tool with the objective and 
promise of the Rorschach test was bound to stimu- 
late new research from a variety of viewpoints, 
Reports of investigations with the test have in- 
creased with the years. The 1965 Mental Measure- 
ments Yearbook contains 733 Rorschach references, 
bringing the total up to 3,030 in the six volumes 
of this series published to date. As was to be 
expected, validating studies have been anything 
but one-sidedly favorable. They range from sharp 
refutations and rejections to unequivocal accept- 
ance and enthusiasm. A serious difficulty in evalu- 
ating the reported publications is that very few 
replicate others in method, whether of administra- 
tion or processing results. Almost none publish 
samples of the raw data, that is, the patients’ as- 
sociations. Variations in basic procedures and in 
interpretive principles are in fact very wide. In 
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part these are consequences of the new perspec- 
tives that have brought about alterations both in 
technique and in interpretation. The most radical 
innovations have been Klopfer’s (1942; Klopfer 
et al. 1954-1956). He follows the phenomenolog- 
ical orientation in evaluating the test's data. 

The objective of the phenomenologist—one may 
say his aspiration—is to know and describe the 
mental process of the other. This assumes the 
ability to enter the mind of the other. While the be- 
haviorist sees this as a consummation devoutly to 
be wished, his orientation is that such entry can 
never be made; the objective is illusory; all that 
we can know of another, including his mental 
processes, we know from his behavior. The be- 
haviorist can respect the phenomenologist. The 
school has its very able exponents. But as Mandler 
and Kessen put it, phenomenology is “unashamedly 
subjective” (1959), The behaviorist’s temperament 
limits him to objective data, 

Other investigators who have made important 
contributions are, in the United States: Piotrowski, 
on brain pathology (1937); Molish (1958), on 
Navy, especially submarine, personnel, and also 
on a South Polar expedition; Schafer, with refer- 
ence to psychoanalytic concepts (1954); Schachtel 
(1941; 1943), on theoretical questions; and Ames, 
Métraux, and Walter (1959), Hertz (1960), and 
Rabin and Haworth (1960), on childhood and 
adolescence. The principal American general texts 
are Klopfer (Klopfer et al. 1954-1956); Piotrowski 
(1957); Anderson and Anderson ( 1951); Rickers- 
Ovsiankina (1960); Beck (1960); and Beck, with 
others (1944), In Europe, the principal names are 
those of Loosli-Usteri, whose work deals with dis- 
turbed children (1938); Oberholzer, on various 
clinical or theoretical topics (1931; 1944); and 
Zulliger, who published (posthumously for Behn- 
Eschenburg) an alternate series of test cards (see 
Zulliger 1941). An important Danish text is that 
of Bohm (1951). 

SAMUEL J. BECK 


[Directly related are the entries Personatity MEAS- 
UREMENT; PSYCHOMETRICS; and the biography of 
RORSCHACH. ] 
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m 
THE THEMATIC APPERCEPTION TEST 


The Thematic Apperception Test (TAT) is a 
projective method for the investigation of person- 
ality that consists of a series of 20 pictures about 
which individuals are instructed to tell stories 
(Murray 1943). While instructions vary somewhat 
among users of the test, they all request that the 
person being tested relate a plot with a beginning, 
a middle, and an end. They ask the individual to 
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use his imagination freely in order to tell what the 
people are thinking and feeling. Individuals tell a 
story for each picture presented to them, with 
minimal intervention of the tester except mild en- 
couragement and occasional instruction to be sure 
to include any crucial part (the beginning, the 
intervening middle events, or the outcome), should 
it be omitted. 

The pictures were chosen by Henry A. Murray, 
Christiana D. Morgan (Morgan & Murray 1935), 
and the staff of the Harvard Psychological Clinic 
involved in the research work resulting in the vol- 
ume Explorations in Personality (1938). These 
pictures were derived from a wide range of sources 
according to the judgment of that staff as to which 
were most consistently revealing of personality 
trends in subjects in clinical study. As originally 
presented the pictures consist of a series of 10 pic- 
tures used for all subjects and an additional group 
of 20 that are coded for age (prepubescent and 
postpubescent) and sex. 

The examiner draws from these 20 an additional 
10 appropriate to the age and sex of the subject 
being tested. Pictures are numbered and are given 
in a standard order reflected by this numbering. 

Underlying assumptions. The underlying logic 
of the full test, and of the particular pictures which 
compose the test, rests upon the assumption that 
when persons are asked to assign meaning to a 
stimulus which is essentially ambiguous, they will 
tend to reveal their own basic assumptions about 
people and human interaction; from these state- 
ments, underlying personality trends may be de- 
duced. This assumption leads to the selection of 
pictures covering a fairly wide range of usual, and 
some highly unusual, kinds of personal or inter- 
personal situations. Some pictures are considered 
more structured—less ambiguous in their apparent 
meaning—than others; that is, the situations as 
described are quite clear-cut, and there is little 
question about the identification of the scenes. The 
specific actions in which the persons are engaged, 
however, what brought them about, how they will 
come out, what the persons are thinking and feel- 
ing, do remain ambiguous. The subject is asked to 
imagine these elements in his story, and his de- 
cisions are seen as reflecting his underlying person- 
ality traits. 

Elsewhere in the series are pictures that are 
more ambiguous in their basic identification. These 
less-structured pictures are normally seen as call- 
ing for greater imagination on the part of the sub- 
ject. While they provide information on the re- 
sponse of the subject to less definite stimuli, they 
are not necessarily more revealing of the subject’s 
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personality. A totally blank white card is also 
included. 

Administration. Variations upon administration 
methods revolve around the questions of the num- 
ber of cards to be used, whether they are given in 
one session or two, and the use of various labor- 
saving procedures. Murray originally proposed that 
the set be divided into two sets of 10, the second 
being administered one day after the first and with 
increased emphasis in the instructions upon full 
use of the imagination. Some investigators prefer 
this method (Stein 1948), while others (Henry 
1956) utilize a single session and reduce to about 
15 the total number of cards administered. Admin- 
istration in groups, with projected slides, is gen- 
erally not preferred but has been shown to be a 
useful method (Murstein 1963, pp. 45-48). Self- 
administered forms have been devised and found 
useful, at least for adult subjects of average to 
above-average education (Henry & Gardner 1949). 
Other labor-saving devices have also been used 
successfully during the preferred individual admin- 
istration, though in each instance effort is made 
to maintain good rapport between subject and 
examiner and to reduce any possible distractive 
effect of such devices as shorthand or recording 
equipment. 


Interpretation 


Any of a variety of psychodynamic personality 
theories are appropriate to the analysis of the TAT 
for its original purpose—the study of unconscious 
and underlying aspects of personality. Murray, in 
his manual for testers (1943), suggested a frame- 
work based upon his conceptual analysis of per- 
sonality in need—press terms. This analysis pro- 
vides a series of categories of motivational states. 
Needs refer to inner states that can possibly be 
expressed, for example, N achievement (need for 
achievement), N affiliation, N sex, N order. Press 
refers to inner states, but to those involving a per- 
ception of some outer event or force acting upon 
the individual. The analysis of TAT stories in this 
system involves the translation, using a set of defi- 
nitions, of story content into the appropriate need 
or press and the interconnecting of these cate- 
gories into a thema descriptive of the entire story. 
The need-press or the combination of them into a 
thema is thought to represent motivational trends 
in the subject. To derive a personality description 
or a clinical diagnosis from such statements of 
need, press, and thema still requires that the ex- 
aminer interpret the material along the lines of 
some more comprehensive personality theory. This 
is commonly done in some form of psychoanalyti- 


cally based theory. Some workers with the test 
(e.g., Bellak 1954) see their analyses as dealing 
with unconscious elements of personality or follow 
a strongly psychoanalytic form of analysis. Some, 
still within this essentially clinical framework of 
individual case analysis, see their analyses as, in 
addition, reflecting upon preconscious, ego-related 
aspects of behavior. This latter focus of analysis 
would include more attention upon the cognitive 
aspects of personality and social role preferences 
and would lead more directly to examining under- 
lying motivations in relation to behavioral events. 

Procedures in analysis and theoretical concepts 
used depend somewhat upon the conception of the 
examiner as to the basic nature of TAT data. Most 
interpreters who have published *statements of 
their methods assume that the TAT data repre- 
sent interactions between deeper personality trends 
and current images of interpersonal situations. 
While the TAT data are clearly prompted in part 
by underlying dynamic forces, it is apparent that 
the stories as told are also influenced by various 
other factors. These include some cognitive abilities 
of the subject, his understanding of the formal 
instructions, his assumptions about the nature of a 
story, the unique properties of the pictures them- 
selves, and the fact that the test-taking act and 
circumstances are themselves social interactions. 
Holt (1961) provides an important discussion of 
this topic. For documentation of specific methods 
and their logic and conceptual background, see 
Bellak (1954), Henry (1956), Tomkins (1947), 
Stein (1948), Holt (1958), Schafer (1958), and 
Wyatt (1947). An interpretive lexicon is provided 
by Lindzey and his associates (1959). This lexicon 
is not a system of analysis but rather a summary 
of findings from a variety of empirical studies or 
of differentiating properties of TAT stories pro- 
posed by various interpreters. A summary and 
analysis of common assumptions regarding inter- 
pretive logics is to be found in Lindzey (1952). 

In general these systems of interpretation rest 
upon the earlier stated assumptions that the story 
as told is, in large part, the invention of the story- 
teller and that the properties characteristic of that 
invention stem from tendencies characteristic of 
the subject. Interpreters vary in the emphasis 
which they give to the determining influence of 
the stimuli pictures themselves, though in prin- 
ciple all acknowledge that influence. The analysis 
of the properties of the stories thought relevant to 
the personality analysis and the particular rele- 
vance proposed may best be seen in the references 
given in the preceding paragraph. For illustrative 
purposes, however, the properties of stories sug- 
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gested by Henry (1956) will suffice. In broad out- 
line, it is suggested that stories may be analyzed 
in two basic categories with various subdivisions. 
These are content and form. Content calls atten- 
tion to the fact that stories differ in topic, in plot, 
in characteristics attributed to persons, in actions 
and emotions reported, in the interrelations of per- 
sons, and in the outcome and fate of all of these. 
Form refers to the manner in which any particular 
story is told. It would call attention to the length 
and linguistic facility, to the sophistication or 
crudeness of language, to the order and organiza- 
tion of plots, and to the utilization of large inclu- 
sive concepts or observations of small details. The 
nature of the sequence of time (references to past, 
present, future events), the clarity of portrayal of 
the actual recognizable elements of the stimulus, 
and the degree of conformity to the instruction as 
given would also be considered. Having observed 
such properties as these for all stories told, the 
interpreter must then attribute meaning to them 
in personality terms and order those terms into the 
conceptual framework he finds appropriate. It is 
at this point that the most skill is called for and 
from this process that the widest variation in re- 
sulting analyses may arise. It is perhaps appro- 
priate here to suggest that the TAT is more a 
method of investigating personality than a test of 
it. In this sense, and in this individual personality 
use, it is a clinical investigatory aid, not an instru- 
ment with known absolute properties reflective of 
definite and circumscribed personality attributes. 
For this clinical use the TAT provides an order of 
data on inner processes and on their relation to 
outer events, which yields its fullest meaning only 
upon study and analysis by the interpreter already 
skilled in personality theory and experienced with 
wide varieties of social and personality groupings. 
In this sense the TAT response constitutes a social 
and a personal document as complex and varie- 
gated as the subject providing the data, and as 
tortuous and subtle as most efforts to condense and 
systematize the life career of any individual. 
Reliability and validity. Complexity by no 
means suggests that issues of reliability and valid- 
ity are to be resolved by estimates of the profes- 
sional competence of the interpreter. The real 
issues of reliability or validity for the TAT are 
complex ones; they involve far more than the usual 
definition of reliability as reproducibility either of 
specific elements of stories on the part of a subject 
or of analytic statements from such data on the 
part of interpreters—or the definition of validity 
as a correlation between analytic statements and 
an external criterion of the same attribute. The 
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problems are methodological ones of a nature as 
yet unresolved. They involve both procedural and 
statistical matters, as well as decisions regarding 
the specificity or abstractness of statements from 
both TAT and criterion data. In general these prob- 
lems must be resolved for projective data as a type 
of data before any specific projective instrument 
can soundly be assayed, To date, the evidence for 
the reliability and validity of the TAT is extremely 
ambiguous. Jensen (1959) and Eron (1959), in 
reviewing the test for the fifth Mental Measure- 
ments Yearbook, report many negative or inconclu- 
sive findings. Studies reporting validity have varied 
greatly in method, in kinds of criteria used, and 
in the kinds of variables from the TAT used. The 
best summary of the state of affairs regarding 
these topics is provided by reviewing the general 
analyses and proposals regarding the problem in 
the works by Cronbach (1948; 1956) and Murstein 
(1963). A recent study by Shipman (1964) is of 
particular interest in that it deals with several 
kinds of units of analysis from the TAT and pro- 
vides evidence of differential degrees of congruence 
with an external criterion. These differences de- 
pend in some instances upon the specific versus 
the global nature of the units dealt with in the 
TAT and in part upon the personality constructs 
available from the criterion instrument. The study 
which most directly parallels the normal clinical 
use of the TAT and which attempts to reflect di- 
rectly upon varying types of validity is that of 
Henry and Farley (1959). This study is based upon 
the design proposed by Cronbach (1948). It relates 
blind analysis of TAT tests of 36 adolescents from 
a homogeneous social environment to a variety of 
criterion instruments—life history from interviews, 
the Rorschach test, objective tests, intelligence 
tests, sociograms—representing data from overt 
levels of behavior, subjective levels, and projective 
levels. The authors believe that a crucial element 
in the substantially significant results obtained is 
the conscious and exhaustive attention given to 
the understanding and definition of the system of 
personality constructs in terms of which all in- 
struments were analyzed and interpreted. [See 
PsYCHOMETRICS.] 


Modifications and applications 


The Murray TAT was developed for use with 
subjects from the mainstream of American social 
life. While the pictures used were intended to 
refer to personality issues that are not culture- 
bound, the fact that these pictures contain specific 
objects, clothes, and physical settings has tended 
to lend them a cultural specificity which restricts 
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their use. In some instances, therefore, the TAT 
principles have been maintained but new pictures 
designed. There have been instances where it was 
felt that there was some inappropriateness in the 
pictures or where a shorter and more focalized set 
was desired. The Children’s Apperception Test 
(Bellak 1954) and the Michigan Picture Test 
(Andrew et al. 1951) are examples designed for 
younger age groups on the grounds that the gen- 
eral tone of the Murray is too adult. In the Michi- 
gan Picture Test, in addition, the focus was upon 
elementary school adjustment, and thus the pic- 
tures were developed to reflect many school set- 
tings. One special set has been developed by 
Thompson (1949) for use with American Negroes. 
It duplicates the regular Murray cards, but all fig- 
ures are Negro. Special sets have also been com- 
posed for the study of elderly persons (Lieberman 
& Larkin 1963) and for the study of personality in 
work adjustment of business executives (Henry & 
Moore 1950). The Blacky pictures (Blum 1949) 
consist of animals in humanlike scenes and are 
designed for the study of children. Lennep (1948; 
1951) has prepared a 4-picture set used in educa- 
tional and industrial settings. A special set by 
Phillipson (1955) consists of particularly vague 
pictures aimed at the study of object relations in 
the psychoanalytic sense. 

Cross-cultural, role, and group studies. The 
principal use of the TAT for cross-cultural pur- 
poses has focused upon the study of nonliterate 
societies. Here it is normally assumed that the 
modern, middle-class tone of most Murray TAT 
pictures is inappropriate. For these reasons special 
sets have been developed for several nonliterate 
societies, These tend to follow the principle of 
design as illustrated by the Murray but are modi- 
fied for special problems of the society and for 
general cultural appropriateness (Henry 1956, pp. 
47-53; Henry 1951). The experience with such 
sets has varied. Generally they have been felt to 
make a useful and positive contribution to cross- 
cultural study, though the formal evidence of 
their validity is mixed. The best presentation and 
analysis of such use may be found in Lindzey’s 
Projective Techniques and Cross-cultural Research 
(1961). 

These special sets are commonly used in con- 
nection with problems having anthropological or 
sociological emphasis, as well as personality con- 
cerns. In some instances, particularly the special 
sets for nonliterate societies, the intention has 
been to examine personality factors as they relate 
to cultural differences or as they form the psychic 
correlates for intracultural variations. Increasingly, 


the TAT is also being used in contexts where social 
roles are at issue, especially where the investigator 
sees the role behavior as having personality com- 
ponents, This was the focus of Henry’s study of 
business executives, where the aim was to clarify 
certain personality components of the behavior of 
executives in large organizations rather than to 
analyze the personality of the individual adult 
working in that setting. In some instances, other 
social role concerns have directed the selection of 
pictures to be used and indicated the mode of 
analysis. In a study of adults in Kansas City (Neu- 
garten et al. 1964), four of the regular Murray 
cards were given, but the fifth was a specially de- 
signed card. It contained four persons in a non- 
descript living-room setting. One was clearly an 
older male, one an older female, one a younger 
male, and one a younger female. They could readily 
be seen as the older parents of a younger couple. 
In designing this picture and in planning its analy- 
sis, Neugarten and Guttman (1958) were inter- 
ested in age-graded perceptions of male and female 
figures. Their interest was in the differential per- 
ception of these age roles and sex roles by subjects 
of differing ages. With such pictures the instruc- 
tions normally remain the same—tell a story— 
though these may be supplemented by specific di- 
rective questions. Similarly modified sets might 
be used to study supervisor—worker situations in 
industry, doctor—nurse interactions, teams inspect- 
ing new automobiles, interactive small groups, 
racial conflict situations—in fact, almost any social 
interaction that lends itself to portrayal in this am- 
biguous pictorial form. 

The use of the TAT in the study of groups has 
had several focuses. One revolves around the study 
of interactive processes in groups themselves, 
usually groups of modest size. Another revolves 
around interest in properties common to various 
natural social groups, sometimes in cross-cultural 
contexts and sometimes in cross-class or cross-age 
contexts within a single society. The Group Pro- 
jection Sketches (Henry & Guetzkow 1951) is a 
special set of five cards designed to reflect various 
concepts for the analyses of small-group interac- 
tion. The analysis of TAT data in various racial or 
social groups has a somewhat different focus. It is 
best seen as overlapping with cross-cultural studies 
in which properties of stories common to particular 
cultures is the issue (see Lindzey 1961). A closely 
related use involves the social-psychological study 
of subgroups of our own society. The monograph 
of Warner and Henry (1948) is one illustration. 
In the effort to provide data on the functional sig- 
nificance of the daytime radio serial for the normal 
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listeners of those programs, the authors attempt 
to identify common psychological concerns and 
modes of adjustment in their principal group of 
subjects, middle-class American housewives. A 
more extensive study utilizing this and other 
methods is that of Rainwater, Coleman, and Handel 
(1959) on lower-class American housewives. A 
special system of analysis has been developed by 
Schaw and Henry (1956); they illustrated its use 
by comparing three groups of business executives 
differing in age. 

Personality variables. A common use of the 
TAT has been the study of specific personality 
variables under a wide range of circumstances. 
Here the interest is on the TAT as a measuring 
device of personality variables thought to be of 
unconscious or “beyond awareness” nature—hos- 
tility, anxiety, achievement drives, and other mo- 
tives such as affiliation and power. The most 
extensive work has been that of McClelland and 
his associates (1953) on need-achievement as de- 
fined in the Murray need—press system of motiva- 
tional categories. The general logic of this approach 
is that if achievement imagery is present in stories 
elicited by selected TAT cards, then the subject 
may be said to be motivated to achieve. Efforts to 
demonstrate that logic have varied considerably: 
some show that persons with high achievement 
imagery do indeed attempt to improve their per- 
formance or that they are in real life “high 
achievers,” while others clearly disprove these same 
points. There has also been substantial work on 
the variable of hostility. Complex relationships tend 
to be found, which suggests that there is a positive 
but by no means clear-cut relation between such 
variables and any particular form of overt behav- 
ior. The most exhaustive treatment of studies in 
this area may be found in Murstein (1963, pp. 84— 
106, 300-322). [See ACHIEVEMENT MOTIVATION; 
FIELD THEORY; ROLE.] 

Special scoring systems which deal with some- 
what more complex units have been devised for 
the study of particular variables. Arnold (1962) 
has been interested in achievement and has devel- 
oped a system of analysis focused upon story themes 
and a scoring method dealing with motivation t 
achieve. This method shows some interesting 
differences between high-achievement and low- 
achievement students in school, among school- 
teachers and business executives, and among well- 
adjusted and poorly adjusted men in the Navy. 
Dana (1959) has devised a system based upon 
degrees of approximation to various expected story 
form and content properties and uses it to dis- 
tinguish normals, neurotics, and psychotics. 
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An additional type of variable has been used by 
Rosen and Neugarten (1960), who have utilized 
the TAT as a source of data on ego energy. In this 
instance they proposed that ego energy would be 
characterized by (1) the ability to integrate wide 
ranges of stimuli, (2) the readiness to perceive or 
deal with complicated, challenging, or conflictful 
situations, (3) the tendency to perceive vigorous 
and assertive activity, and (4) the tendency to 
perceive feelings and affects as related to life situ- 
ations. As measures of these attributes, they chose 
certain elements of TAT stories commonly used 
in clinical analysis but here treated only as scorable 
indicators of these specific ego elements. The cen- 
tral point of relevance about this study, illustrative 
of others of related type, is that the TAT was used 
as a source of data for certain properties of ego 
functioning selected as a test of a theory. Specific 
TAT response categories were selected as scorable 
measures of these properties and the behavior of 
these measures predicted and tested in an appro- 
priate design. In this format, reliability is handled 
by the common interjudge-agreement method. 
Validity is thus subsumed under issues of the 
viability of constructs used in the theoretical for- 
mulation, and the usefulness of the TAT estimates 
of those constructs is determined by discovering 
relationships to other data. 


Evaluation 


The TAT was developed to aid in the “explora- 
tion” of personality (Explorations in Personality 
1938). It has quite justified its existence in the 
sense of the access which it provides to personality 
variables, especially when these are seen as ego- 
related—as opposed to deep unconscious fantasies 
—and as they are seen as reflecting the juncture 
of psychodynamics and social behavioral events. 
The TAT is best viewed as a source of data on this 
order of personality event rather than as a test of 
specific personality attributes of known character. 
It can, nevertheless, assume testlike properties 
under certain conditions. These conditions are, gen- 
erally speaking, outside the instrument itself and 
reside in the manner in which it is used, the data 
to which it is related, and the personality constructs 
in terms of which the basic data are ordered. Its 
successes and failures, in the usual validity sense, 
are explained by analysis of these external events. 
In the usual clinical use, for personality descrip- 
tion, considerable variation in stability appears. 
Such variations may be seen in terms of the speci- 
ficity-generality of personality units used and in 
terms of the order of corresponding external cri- 
teria. The clarification of the use and misuse of the 
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instrument will rest upon continued exploration of 
these factors—the personality constructs utilized, 
the logic of translating data into these constructs, 
the manner of relating these constructs to other 
data about the individual. Among the sensitivities, 
as opposed to the stabilities of the instrument, are 
those reflecting change and ambiguity in the indi- 
vidual, factors which confound the effort to dis- 
tinguish error from genuine difference. The future 
of the instrument will be decided by the study of 
other conditions of use, by increased sophistication 
in the manipulation of the basic data, and by in- 
creases, from any source, in the user’s knowledge 
of psychodynamics and of social systems. A crucial 
element in the future use of the TAT may be found 
in the degree of richness and flexibility of the basic 
concept and in the fact that constructs from several 
fields of psychological and social science investiga- 
tion may be the focus of its use, in the original as 
well in as modified forms, 

WILLIAM E. HENRY 


[See also PERSONALITY MEASUREMENT. Other relevant 
material may be found in ACHIEVEMENT MOTIVA- 
TION; CONTENT ANALYSIS; FANTASY; MOTIVATION, 
article on HUMAN MOTIVATION; PSYCHOANALYSIS; 
PSYCHOMETRICS.] 
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PROPAGANDA 


Propaganda is the relatively deliberate manipu- 
lation, by means of symbols (words, gestures, flags, 
images, monuments, music, etc.), of other people's 
thoughts or actions with respect to beliefs, values, 
and behaviors which these people (“reactors”) re- 
gard as controversial. 

The elements of deliberateness and manipula- 
tiveness distinguish propaganda from merely casual 
communication or the “free” exchange of ideas. 
These elements also distinguish propaganda from 
education: whereas the propagandist presents a 
prefabricated argument or a single set of symbols, 
the educator aims to present “all” sides of an issue 
and leaves mainly to the audience the decision 
concerning the truth (if any) of the claims pre- 
sented and the values (if any) at stake. Inasmuch 
as some communicators and some audiences regard 
as controversial what others regard as self-evident 


PROPAGANDA 579 


truth, it follows that under some conditions one 
man’s “propaganda” may be another man’s “edu- 
cation.” 

The term “propaganda,” in most of its modern 
usages, apparently derives from the shortened 
name, “the Propaganda,” of the Congregatio de 
Propaganda Fide (Congregation for Propagation 
of the Faith), a standing committee of cardinals 
in charge of missionary activities of the Roman 
Catholic church since 1622. Hence, to many Cath- 
olics the word may have, at least when referring 
to ecclesiastical utterances, a responsible and re- 
spectable connotation that it lacks in many other 
contexts. 

Something similar is the case, in communist 
circles, at least, when the term “propaganda” is 
used with the special definition and meanings given 
it by Lenin in a collection of writings published in 
1929 as Agitation und Propaganda. In that book, 
which continues to furnish much of the basis for 
communist reasoning and practice on the subject, 
Lenin distinguished between (1) “propaganda,” 
which he defined as the reasoned use of arguments 
from philosophy, history, and science to influence 
the educated and reasonable few, and (2) “agita- 
tion,” by which he meant the use of emotional 
slogans, “Aesopian” parables, and half-truths to 
influence the uneducated, the semieducated, and 
the unreasonable. Thus, to the disciplined com- 
munist who follows in his Agitprop activities the 
theory and rules laid down by Lenin, the use of 
“propaganda” in Lenin’s sense is highly commend- 
able and unqualifiedly honest. 

A related term is “propaganda of the deed.” This 
means the performance of a nonsymbolic (e.g., 
coercive or economic) act, not primarily for its 
military or economic effects but primarily for the 
symbolic effect it presumably will have on some 
reactor—for instance, staging the public torture of 
a criminal for its presumable deterrent effect on 
others or giving economic “foreign aid” with more 
of an eye to influencing a recipient’s opinions than 
to building his economy. 

Diplomatic negotiation, legal argument, com- 
mercial bargaining, and advertising obviously are 
likely to include considerable elements of both 
“propaganda” and “propaganda of the deed” as here 
defined. 


History 


No doubt propaganda has existed ever since 
primates have been sufficiently articulate to use it. 
Artifacts from prehistory and from early civiliza- 
tions give evidence that dazzling raiment, mystic 
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insignia, and monuments were used to advertise 
the purported majesty and supernatural powers of 
early rulers and priests. 

In Western civilization, the systematic design of 
propaganda and of counterpropaganda appears to 
have begun in the Greek city-states about 500 B.c., 
with the codification of “rhetoric,” a set of tricks 
of argumentation deduced in part from the actual 
practice of successful lawyers, demagogues, and 
politicians, Such teachers as Plato and Aristotle, 
and certain of the Stoics, compiled the rules of 
rhetoric with two aims: not only to make their 
own arguments more persuasive but also to im- 
munize “good” citizens against the use of logical 
fallacies and emotional terms by “bad” lawyers and 
demagogues, and to point out the possible dangers 
of following irrational leaders. So well did the Greek 
rhetoricians do these jobs that they have been 
studied and quoted for over 2,500 years. Aristotle's 
Rhetoric was emphasized in higher education 
throughout the Middle Ages and even after the 
Renaissance, and it is often drawn upon today 
in classes in public speaking and logic and by 
many sophisticated propagandists and counter- 
propagandists. 

In other civilizations, a number of parallel devel- 
opments appeared after 400 B.c. Thus, Kautilya, 
purportedly chief minister to the Indian emperor 
Chandragupta Maurya, counseled in his Arthasdstra 
(“Principles of Politics”) the use by princes of 
prudent and often deceptive wording in their diplo- 
macy and in their public utterances, especially in 
times of war and preparation for war. Like modern 
propagandists, Kautilya was much preoccupied with 
techniques for sowing fear, dissension, and con- 
fusion in the opponent's ranks (psychological war- 
fare) and for showering blandishments on allies 
without becoming excessively dependent upon them. 
[See Kautitya.] 

Parallel advice can be found in The Art of War 
by the early Chinese theorist Sun Tzu. The use of 
“good” and truthful rhetoric and “proper” forms of 
speech and writing was urged by Confucius in his 
Analects as a means of persuading men to live the 
good life —a Platonic admonition deliberately echoed 
as a legitimating device (under the name of “brain- 
washing”) by the present-day rulers of Communist 
China. 

The spread of Christianity, like that of all other 
religions, has of course been due very largely to a 
mixture of earnest conviction and the deliberate use 
of propaganda. Recent scholarship points out the 
striking extent to which the legend of the Jewish 
messiah has been reshaped in the course of the 
centuries, beginning with the earliest writers of 


the Gospels and Epistles, who apparently made 
numerous changes and invented details that seemed 
calculated to engage the attention and influence 
the actions of non-Jews as well as Jews and that 
bore only an allegorical relation to historic fact. 
In other major religions—for instance, in the re- 
casting of the legends of the Hindu epic Mahabha- 
rata, of Gautama Buddha, of the ancestral Japanese 
Sun Goddess, and of the life and relatives of 
Muhammad—a parallel mixture of faith, convic- 
tion, and propaganda can be found. Undoubtedly 
a similar mixture has been involved in the spread- 
ing of every major political doctrine or “ism.” 

Remnants of election propaganda have been 
found on Roman ruins. Such writers as Quintilian 
and Quintus Cicero described campaign tactics. In 
early modern times Machiavelli underscored, like 
Kautilya and Sun Tzu, the effectiveness of calcu- 
lated duplicity in politics and war. In Shakespeare, 
many characters display and discuss the principles 
of propaganda in concepts and language that a 
present-day behavioral scientist could hardly im- 
prove upon. Mark Antony's funeral oration comes 
readily to mind; and such English aristocrats as 
the Duke of Buckingham (see Richard III, Act m1) 
comment knowingly upon such propaganda strata- 
gems as the seizure and monopolization of prop- 
aganda initiatives, the displacement of guilt onto 
others (“scapegoating”), the presentation of one- 
self as morally superior, and the coordination of 
propaganda with violence and bribery. 

After Aristotle, however, only small advances in 
either the highly organized practice or the system- 
atic theory of propaganda took place until the in- 
dustrial revolution made mass production possible 
and thus opened the way for immensely high 
profits through mass marketing. As part of the 
modern trend toward well-calculated high-profit 
distribution, studies began to be made after about 
1900 of the wants and habits of many types of 
consumers and of their susceptibility to alternative 
kinds of salesmanship, advertising, packaging, and 
publicity. f 

In the early 1930s, commercial “sample surveys 
began to develop rapidly. Almost every conceivable 
aspect of opinion, attitude, belief, and behavior 
involved in “consumer motivation” has been inves- 
tigated with respect to ever more refined sub- 
samples of the populations of most major coun 
tries. At present, vast banks of such information 
are stored and processed in computer centers; they 
are used as a basis for increasingly precise “pin- 
pointing” of commercial and other propaganda. 
Nationwide and international advertising campaigns 
cost billions of dollars annually and occupy a very 


large percentage of radio and television time and 
of newspaper, magazine, and billboard space in 
countries where this is permitted. It is generally 
believed that this investment also exerts strong 
influence over some or many of the noncommercial 
contents of these media. One consequence has been 
the evolution of efforts to hold the more Machia- 
vellian advertisers in check through such devices as 
consumers’ unions, “truth-in-advertising” laws, and 
nonprofit publishing, radio, and television along- 
side (or to the exclusion of) the profit-making 
media. [See MARKET RESEARCH.] 

Concurrently with the spread of commercial 
rationalism and the related outburst of commercial 
propaganda, the spread of social rationalism, of 
mass education, and of mass democracy since the 
eighteenth century has deepened awareness among 
the educated and some of the semieducated of the 
roles of fictions and utopian aspirations in social 
and political systems. Long ago, Plato pointed out 
the social functions of “the noble lie.” The exten- 
sion of the suffrage to ever broader and ever more 
ignorant or ill-educated strata of the population in 
the past two centuries has brought enriched oppor- 
tunities to observe the possibilities this offers for 
both the demagogic and the public-spirited propa- 
gandist. 

One early observer was Jeremy Bentham. His 
Theory of Fictions, written near the end of the 
eighteenth century and a forerunner of the modern 
study of general semantics, emphasized the extent 
to which a careful choice of symbols can contribute 
to the respect and awe with which otherwise worth- 
less individuals and institutions can be invested. 
The immense growth of nonrational forms of na- 
tionalism and of plebiscitary despotism throughout 
the nineteenth and twentieth centuries was to re- 
confirm his view. [See BENTHAM.] 

Many subsequent writers, including Georges 
Sorel (1908), Vilfredo Pareto (1916), Lenin (1929; 
and other works), Karl Mannheim (1929-1931), 
and Harold D. Lasswell (1930; 1935; and other 
works) have explicitly or implicitly taken the posi- 
tion that men in the mass, and even men on high 
educational and social levels, often react more 
favorably to “utopian myths,” “Aesopian language,” 
and “nonrational residues” of earlier experiences 
than to sober analytic statements. Pavlov’s experi- 
ments with conditioned reflexes, and the Freudian 
and Neo-Freudian explorations of the unconscious 
mind, have tended to give strong support to this 
view. 

Both democratic and authoritarian regimes of 
recent decades have reacted with varying mixtures 
of warmhearted acceptance and coldhearted cyni- 
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cism to such modern elaborations of ancient in- 
sights. Military inventions and the spread of 
particularist nationalism and of attempts at democ- 
racy have caused recent wars, social revolutions, 
and counterrevolutions to reach proportions that 
called for mobilization of entire national popula- 
tions. Hence, in World War 1, and even more in 
World War u, each of the major contending gov- 
ernments made massive outlays for propaganda, 
both domestic and foreign, and much magic was 
attributed to various “propaganda techniques.” Since 
then, almost every large state has had its ministry 
of propaganda (or ministry of culture, enlighten- 
ment, or international information) or some func- 
tional equivalent—at least in wartime or when 
danger was acutely felt. Likewise it has had its 
official mechanisms for censorship, “information 
control,” or news management. The use of such 
agencies became prominent after World War n 
and during the subsequent cold war. 

Concurrently, nearly every significant political 
party, major pressure group, or mass movement 
has developed its own corps of specialized propa- 
gandists (some of them referred to as “lobbyists” 
or “legislative representatives”). Many are em- 
ployed full time and have relatively high levels of 
skill, training, or both. Many such propaganda 
agencies possess or hire elaborate “research and 
intelligence” facilities to conduct (overtly and 
covertly) observations, opinion polls, and informa- 
tion polls among various strata of the elites, the 
middle classes, and the rank and file. Many kinds 
of data are tabulated concerning those contents of 
the press, films, television, and other media that 
reach the respective strata, “Symbol campaigns” 
and “image building” are conducted in mathemati- 
cally calculated ways. The ancient art of rhetoric, 
practiced by a few skilled leaders, has become the 
modern quasi science of opinion management, em- 
ploying armies of governmental, party, and pres- 
sure-group employees, including a wide range of 
real and purported psychoanalytic, psychological, 
and social scientists. 

The means of controlling these opinion manip- 
ulators and “hidden persuaders” are sought with 
increasing concern by consumers who are aware 
of having been duped, and especially by politically 
conscious persons who fear the spread of regimes 
in whose operations deceptive propaganda may be 
combined with tight censorship to produce a nearly 
unbreakable control. 


Trends in theory 


Trends in the theory of persuasion, including 
the theory of propaganda, have conformed some- 
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what to various functions required by the social 
systems of given times and places, as perceived by 
the intellectuals concerned. 

Thus, the rules of rhetoric were devised when 
Greek urbanism had evolved enough to support a 
considerable number of rival schools of politics, 
logic, and philosophy, which sought to settle con- 
flicts by persuasion as well as by violence and com- 
mercial deals. Greek thinkers explored many facets 
of the relations of rhetoric to various forms and 
functionings of the city-state. In so doing, they 
defined certain rules of far more general applica- 
bility for distinguishing between the discourses— 
including propaganda—of logical, socially integra- 
tive communicators and those of the demagogues 
of a less respectable stripe. 

Yet communication theory among the Greeks 
appears to have remained essentially particularis- 
tic, since it focused on the transactions within the 
social system of the individual city-state. Even after 
Aristotle collected and compared a great many city- 
state constitutions—indeed, even after the Roman 
conquest—the Greek imagination did not reach out 
far enough to evolve a coherent, empirical theory 
of intersystem (i.e., intercity-state, or supracity- 
State, or intercultural, imperial, international, or 
worldwide) social relations. Hence, it did not de- 
velop a corresponding cross-cultural theory of value 
conflicts or of the possible resolution of such con- 
flicts through propaganda or other types of com- 
munication. 

Even the later Greco-Roman Stoics, who did 
envisage a universalistic, polycentric system of 
justice and order under a tolerant system of plural- 
istic law, did not appear to visualize, much less to 
formulate explicitly, the intersystem communication 
processes that would be required to institute such 
a social order and keep it functioning. They dwelt 
on a highly abstract plane, among broad moral con- 
cepts such as justice, harmony, and “the good life.” 
They did not often descend from man in the ab- 
stract to men in specific social systems; hence they 
did not codify the full range of individual, cultural, 
and social differences among men that must be 
considered in order to form any lasting social sys- 
tem above the level of the city-state. 

Imperial Rome, Byzantium, and early Islam, of 
course, were hardly hospitable to flights of natural- 
istic data-collecting or comparative social and psy- 
chological theorizing; nor were the Holy Roman 
emperors, the Christian churches, or the monarchs 
and barons and caliphs and sultans who dominated 
the assorted social systems of western Europe and 
the Middle East between the fall of Rome and the 


industrial revolution. Machiavelli, it will be re- 
called, circulated The Prince in secrecy, like many 
empirical investigators before and since. 

For comparable reasons, and perhaps for other 
reasons as well, the growth of behavioral science 
was inhibited in the other major civilizations— 
China and India—during the same two-thousand- 
year period. It remained for the scientifically ori- 
ented investigators enjoying the degree of freedom 
of inquiry tolerated in the late nineteenth and 
twentieth centuries to discover much more than 
the ancient Greeks had learned about details of 
the psychology and the sociopolitical applications 
of propaganda. 

Present-day theory considers propaganda a spe- 
cial case of the theory of communication in gen- 
eral, which in turn has increasingly been treated 
as a subdivision of the general theory of social 
systems. The latter aims to study the full range of 
possible behaviors of all real and possible social 
actors, ranging from the individual acting exclu- 
sively on his own behalf, through the “dyad” (pair 
of people) and the small group (several people), 
to such large collective actors as interest groups 
(e.g., industrial or farmers’ unions or business as- 
sociations ) and territorial groups (towns, provinces, 
nations, international regions, and the world as a 
whole). 

Contemporary analysts of communication (and 
therefore of propaganda) are aware that the world 
as a whole has become, to a considerable extent, a 
single, relatively manipulable social system of 
which all the surviving previous social systems are 
now subsystems. Most parts of the globe are now 
inhabited, highly interdependent economically and 
militarily, and so richly provided with newsgather- 
ing, telecommunications, and travel facilities that 
symbols of events occurring at any point on earth 
can be transmitted instantly and in detail to any 
other point. 

On the other hand, contemporary social science 
also recognizes that the world system is not yet a 
community, if by “community” we mean a set of 
persons widely accepting a common culture (i.e. 
habitually receiving a common set of “information 
bits” and experiencing a common pattern of cogni- 
tion and values). Hence, modern theory views the 
current state of the world social system as highly 
polycentric: the cultural patterns, and hence the 
economic patterns and political patterns, of its 
component subsystems are at once highly inter- 
dependent and highly diverse and often appear 
more or less incompatible. Yet the set of subsys- 
tems as a whole shows powerful though slow- 


moving tendencies to evolve, convulsively, toward 
global community, mainly through increases in 
population density and through partly planned and 
partly accidental diffusion of common sets of infor- 
mation bits. 


Factors in propaganda 


Confronted by the highly unstable global context 
in which he must operate, the sophisticated propa- 
ganda analyst employing present-day behavioral 
theory tends to formulate his problem as including 
at least 11 sets of factors. He asks: 

(1) To what ends (i.e., to bring about what 
distributions of values), in (2) the present and 
expected states of the world social system and of 
(3) each of its subsystems (nations, lesser terri- 
torial groups, interest groups, etc.) with which the 
propagandist is concerned should (4) the propa- 
gandist or some agent of his distribute (5) what 
symbols through (6) what channels (media, such 
as press, radio, film, face-to-face contact, mass 
demonstrations, religious or cultural organizations, 
etc.) (7) to whom (e.g., elites, opinion leaders, 
middle classes, masses, customers, friends, oppo- 
nents, neutrals), and (8) how can the effects of 
the propaganda be measured (i.e., how can one 
measure the value reallocations attributable to the 
propaganda as distinct from other causes)? In the 
present state of social science, this intricate ques- 
tion can of course be answered with only a moder- 
ate degree of confidence. 

Once the propaganda campaign has begun, the 
propagandist, and also his opponents or counter- 
propagandists, will encounter at least three addi- 
tional sets of factors: With respect to (9) what 
alternative value allocations and (10) by what 
means (e.g., counterpropaganda, censorship, coer- 
cion, or economic pressure) can the propaganda 
be neutralized or controlled, and (11) how can 
effects of such countermeasures be measured? 
These 11 sets of factors will now be discussed. 

Ends (values). When the problem is simply to 
acquire money for oneself or one’s group(s) by 
inducing others to buy a safe and useful commod- 
ity, the stating of ends is easy. When the com- 
modity is of doubtful value or positively injurious 
(e.g., a dangerous drug or a weapon), the problem 
grows complicated. Where the problem is to con- 
vert multitudes to a new religion or a new social 
system, it may be extremely hard to specify just 
what redistributions are desired among large num- 
bers of different sorts of persons, with respect to 
a large cluster of values such as prestige, income, 
“ease of soul,” military security, etc. Yet the prop- 
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agandist can hardly proceed rationally unless he 
can tell at least himself what reallocations of these 
and many other values he is trying to bring about 
and what applecarts he is therefore willing to upset. 

Changes in the world social system. Each act 
of propaganda—whether of commission or omis- 
sion—is very likely to have effects of some sort in 
at least several parts of the global system. Further- 
more, that system itself is inexorably evolving be- 
cause of such factors as population growth, the 
invention and diffusion of new cultural sets and 
technologies, and the consequent emergence of 
new centers of cultural, military, and economic 
power. Social evolution, nowadays often very rapid, 
may decrease the feasibility of many sorts of prop- 
aganda—especially of the more simplistic, paro- 
chial, and particularistic varieties—and increase 
the feasibility of the more sophisticated, scientifi- 
cally formulated, and universalistic. In general, 
the currents of social change, over the past four 
thousand years or so, appear to have been drifting, 
in step with the rising world population and rising 
educational levels, from smaller to larger social 
units. Concomitantly, the currents of cultural 
change apparently have been drifting from less 
rationality and scientism toward more, and from 
primary territorial-group loyalty and interest-group 
loyalty toward primary loyalty to world social 
unity. Is the propagandist, for the sake of his short- 
run or long-run ends, to swim with or against these 
mainstreams of history? If against, at what cost? 
If far ahead of his time, again at what cost? 

Subsystems of the world system. In the past, 
there were many times and places when the prop- 
agandist could effectively ignore world-system re- 
quirements and employ such particularist symbols 
as “My country (or my family, tribe, race, religion, 
or business), right or wrong.” In the present and 
future states of the world system, this self-centered 
type of propaganda may be suicidal. Yet strident 
particularisms persist. The prudent propagandist 
has therefore to decide what mix or reconciliation 
of world-system and subsystem symbolism will best 
serve his purposes in particular places at given 
points in time. With the spread of high-capability 
weapons, the eventual choice, even in the relatively 
near future, may be between universalist coexist- 
ence and particularist nonexistence. 

The choice may be easy to state in theory, but 
it is hard to make in practice, in view of the wide 
variety of particularist subsystems in the world 
and their frequent incompatibility both with one 
another and with the requirements of a world 
system. 
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Present-day social science, still much entangled 
in nationalistic and other small-scale preoccupa- 
tions, is unclear as to details of the value conse- 
quences of promoting, in the present and proximate 
states of the world social system, adherence to 
any given set of positions; yet in every utterance 
the practicing propagandist is explicitly or implic- 
itly making such value choices. 

Use of agents. The use of innocent-looking 
agents or “front” organizations while the propa- 
gandist himself remains behind the scenes can 
maximize his prospects in two principal ways: 
(1) The agent(s) may seem to the audience to be 
much more credible or acceptable than the propa- 
gandist himself or the group(s) for which the 
latter speaks. Especially in areas where the prop- 
agandist is not very familiar with the language and 
customs, or where cultural, racial, religious, or 
nationalist attitudes would deny him a favorable 
hearing, the use of agents is inescapable. Some 
four-fifths of the employees of the United States 
Information Agency abroad, for example, are non- 
Americans; and Soviet propaganda abroad relies 
heavily on local communists as well as on person- 
nel of the Soviet missions. (2) If a given propa- 
ganda stratagem fails in a pretest (a “trial balloon”) 
or in execution, the agent can, if necessary, be 
dismissed or even deliberately “scapegoated” while 
the principal behind the scenes attempts a new 
approach, 

Since modern propaganda in its sophisticated 
forms requires so high a level of rationality and 
of familiarity with public affairs and behavioral 
sciences, the planning of major campaigns prob- 
ably can best be entrusted to qualified intellectuals 
whose backgrounds include both knowledge of 
social science and “hard-nosed” experience with 
public affairs. However, such personalities may be 
viewed askance by many reactors. Hence it is im- 
portant to select “front men” and “contact agents” 
with whom the intended audience is likely to feel 
rapport. 

Choice of symbols. The propagandist aware of 
the findings of the behavioral sciences no longer 
has as much confidence as his counterparts from 
the late eighteenth to the early twentieth century 
had in the ability of rational arguments or even 
of catchy slogans to influence human behavior. 
The evolution of psychoanalysis, clinical psychol- 
ogy, and experimental research on communication 
has made it clear that reactors’ responses are af- 
fected not only by the immediate input of symbols 
but also (and often more powerfully) by three 
other sets of forces: (1) the stored residues of, 
and associations to, previous inputs of related sym- 


bols, which often give the reactor a predisposition 
and capacity to ignore or to rationalize away the 
current inflow of symbols; (2) the economic in- 
ducements (gifts, bribery, commercial deals, etc.) 
or threats (job loss, boycotts, etc.) and the physical 
inducements (caresses, violence, protection from 
violence, or threats of violence) which the propa- 
gandist can apply in conjunction with his symbols; 
and (3) the coercive structures and processes in 
the surrounding social systems, which may either 
facilitate or inhibit the expression of whatever new 
thoughts or other behavioral impulses the current 
input of symbols may activate. 

The sophisticated propagandist, therefore, at- 
tempts to take relevant economic and physical 
action in conjunction with the propagandizing at 
each stage of his campaign. He also tries to select 
his symbols in the light of the findings of all the 
behavioral sciences, from psychoanalytic studies of 
the stored-up and unconscious reactions of particu- 
lar sorts of individuals and groups through the 
psychology, sociology, economics, and politics of 
international relations and of the emerging world 
social system. There is substantial agreement today 
in psychology on what Lasswell has termed the 
“triple-appeal principle.” This principle holds that 
sets of symbols are likely to be most persuasive if 
they appeal simultaneously to three components 
of the individual reactor’s personality. That is, the 
propagandist tries to offset the resistances due to 
previous information inputs by presenting the 
thoughts and acts he desires to induce as if they 
were: (1) rational, advisable, and expedient (psy- 
choanalytically, this is an appeal to the ego); 
(2) pleasurable (an appeal to the id); and 
(3) moral (an appeal to the superego). Within any 
collectivity, the “mix” of these components varies 
from individual to individual; and in large collec- 
tivities it varies from subculture to subculture and 
from stratum to stratum. The propagandist tries 
to adjust his appeals accordingly. 

Research from the clinic also suggests the rel- 
ative effectiveness of choosing vocabularies and 
symbols and of casting the propagandist (or his 
agents) in roles, analogous to those associated 
with parents or parent substitutes (foster parents, 
uncles, aunts, schoolteachers, priests, witch doc- 
tors, political heroes, gods, goddesses, etc.), under 
whose influence the reactors have undergone many 
of their most formative, emotion-laden, and strongly 
sanctioned experiences. It is easy to sense the ap- 
peal of such familistic symbolisms as “the father- 
land,” “the mother country,” “the Mother Church, 
“the Holy Father,” “Mother Russia,” “Uncle Sam,” 
or “Uncle Ho Chi Minh.” The propagandist who can 


seize the emotional initiative and maintain a vir- 
tually parental or divine ascendancy (charisma) 
can arouse both the animosities and the consciences 
of his followers and of neutrals by “satanizing” the 
aims and associates of his opponents while ideal- 
izing or deifying his own objectives and allies. 
Reactors growing up in different social group- 
ings, or in the same groupings at different times, 
are bound to have at least somewhat differently 
structured egos and superegos. Hence, the contribu- 
tion that psychoanalysis and psychology in their 
generalized forms can make to the propagandist is 
not sufficient by itself. Furthermore, even those 
reactors who already have the attitudes the propa- 
gandist desires them to have may be prevented 
from acting upon these by counterpressures from 
the particular social groupings or social systems 
affecting them. It would be difficult, for instance, 
to act openly upon communist leanings in a totali- 
tarian fascist country, or vice versa. Hence, the 
propagandist must adapt his symbolism not only 
to the reactors’ conscious and unconscious impulses 
but also to the lines of action that are open to them. 
Propaganda is likely to be most effective if its 
contents include encouraging references (direct or 
implied) to all those actions that are feasible for 
the reactor and that the propagandist wishes him 
to perform, and if the contents include deterrent 
references to acts the propagandist wishes the 
reactor to inhibit (or, in some cases, no references 
to the latter, lest “ideas be put into the reactor’s 
head”). The structuring of propaganda contents 
around such action concepts increases the probabil- 
ity that the propagandist will be realistic in his 
demands upon the reactor and that the reactor will 
not be left with the feeling, “I agree with this 
message, but just what am I supposed to do about 
it?” Where military or political secrecy or surprise 
is important to the propagandist, he will be inclined 
to state his action demands obliquely or decep- 
tively: in some cases opponents can use systematic 
analysis of the content of propaganda to infer the 
propagandist’s secret or unconscious intentions and 
probable future actions. [See CONTENT ANALYSIS.] 
Much more could be said about the selection of 
symbols. One especially intriguing question for our 
epoch should be raised: Can behavioral research 
discover, and will influential propagandists be will- 
ing to employ, universalistic symbolisms that can 
Teduce interpersonal and intercollectivity destruc- 
tiveness to levels that might make possible a viable 
world social order? 
Channels of propaganda. A comprehensive list 
of media that a propagandist might use would be 
many pages long. It would include newspapers, 
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magazines, radio, television, film, handbills, posters, 
billboards, speechmaking, whispering and rumor- 
mongering campaigns, flags, street names, monu- 
ments, commemorative coins and postage stamps, 
Rhodes, Fulbright, and Soviet Friendship scholar- 
ships, awards and prizes, the composition of novels, 
plays, comic strips, poetry, and music “with a mes- 
sage,” and all human groupings from the dyad and 
the family through advertising and public relations 
firms, churches and temples, pressure groups, 
parties, and “front organizations” to the propaganda 
organizations (overt and covert) of nations, inter- 
national coalitions, and universal international 
organizations. 

Since World War 1 there has been a strong 
drift, in the practice of propaganda, away from 
attempts to “saturate” mass audiences with large 
quantities of simplified slogans. The new trend is 
toward the far more discriminating choice of those 
media to whose messages the intended reactors are 
thought to be especially receptive. This focus upon 
“placing the shots” instead of indiscriminately bom- 
barding the reactors is due in part to findings of 
behavioral research. 

Numerous controlled observations and experi- 
ments on the “media habits” and “source prefer- 
ences” of given reactors have established two views: 
(1) most persons tend to resist messages that 
reach them through media they do not especially 
trust and enjoy; (2) the most effective media, as 
a rule, for messages other than the simplest of 
commercial propaganda are not the impersonal 
mass media but rather those “reference groups” 
with which the individual feels strongly identified 
and in which he feels that he is at home and is 
surrounded with a certain degree of intimate emo- 
tional response and personal protection. First and 
foremost of these is, of course, the family. But 
many other organizations may perform quasifamil- 
ial functions—for instance, the small club of 
cronies, the church, the trade union, the business- 
men’s luncheon club, the clique or gang, the com- 
munist cell. If the propagandist can influence the 
leadership of such a reference group, he may estab- 
lish a “social relay point” that can vastly amplify 
the meaningfulness and acceptability of his mes- 
sage—far more effectively than a huge number of 
broadcasts, leaflets, or billboards, and at much 
lower cost. Hence, a great deal of research has 
been devoted in recent years to the identification 
of such reference groups. One important stratagem 
is the programming of mass media contents (e.g., 
newspapers or broadcasts) in such ways that in- 
stead of using scattergun techniques on undiffer- 
entiated mass audiences they carry material that is 
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considered likely to interest specified reference 
groups (and especially the elites and “opinion 
leaders” among these) and to be relayed by them, 
in their own ways and on their own initiative, to 
other sets of reactors. 

Audiences. Since propaganda deals by defini- 
tion with controversial matters, its ultimate audi- 
ences, whether reached by direct or indirect media, 
can be ranged into three groups: (1) those who 
are initially predisposed to react as the propagan- 
dist wishes, (2) those who are neutral or indiffer- 
ent, and (3) those who are antagonistic. It is 
advisable in many cases to include symbolism and 
to select media appropriate to many positions along 
this continuum. 

Recent research tends to indicate that the most 
dependable result of propaganda is likely to be an 
increase in the resolve or the efforts of those who 
are already the propagandist’s friends. No matter 
how great the volume or symbolic intensity of 
propaganda, neutrals and opponents are likely to 
be little affected unless the propaganda is rein- 
forced by relevant nonpropaganda transactions or 
other events. For example, propaganda that aims to 
induce loyalty to a given regime among a landlord- 
ridden population that is experiencing the modern 
“revolution of expectations” may have to be rein- 
forced by delivering as well as promising land and 
tax reform and bona fide physical protection. 

Measurement of propaganda effects. The prob- 
lem of measurement is almost as broad as the field 
of behavioral research methodology. The modern 
world is seething with rival propaganda campaigns 
and counterpropaganda and with countless other 
symbolic transactions. The problem of disentan- 
gling the effects of one’s own propaganda from the 
effects of the other transactions is often insoluble. 
Yet it is occasionally possible to conduct research 
whose results can be viewed with moderate con- 
fidence. 

Content analysis. Reasonably dependable quan- 
titative evidence as to the contents of propaganda 
can be obtained by the method known as “content 
analysis.” The numbers of column inches of printed 
Space or seconds of radio or television time that 
propaganda occupied can be tabulated, The sym- 
bols and themes it contained can be categorized, 
as already indicated, in terms of expressed or im- 
plied demands for actions of various types and in 
a number of other psychologically or socially sig- 
nificant ways. [See CONTENT ANALYSIS.] 

Intensive interviews. Fairly objective evidence 
as to the intensity and semantic Significance of 
propaganda (i.e., the cognitive and affective asso- 
ciations it evokes in given reactors) can be gath- 


ered by extended intensive interviews (of a psycho- 
analytic or psychiatric type) with small, carefully 
drawn samples of the intended audience. If this 
audience is in a place where freedom of such in- 
quiry is restricted (i.e., most of the world), the 
next best method is to interview any presumably 
informed persons who can be reached—e.g., ref- 
ugees, expellees, or scholars concerned with the 
area. [See INTERVIEWING.] 

Extensive observations. Sometimes participant 
observers can be sent to the relevant places. Voting 
Statistics, press reports, or the speeches and other 
actions of affected leaders can also give clues. Evi- 
dence on the size and composition of the inter- 
mediate (including “relay point”) audiences and 
the ultimate audiences can be obtained from ex- 
tensive sample surveys, press reports, and leaders’ 
reactions. Where printed or telecommunications 
media are used, their readership or listenership 
figures can perhaps be obtained. If public meetings 
or demonstrations are involved, there may be ob- 
servers’ reports. [See OBSERVATION.] 

Experiments and panel interviews. Evidence 
that ensuing behavior of the audience—for in- 
stance, its vote for candidate X or its buying of 
product Y—is due in whole or in part to the prop- 
aganda and not to something else remains far from 
conclusive, however, except in the rare situations 
where something like an experiment is possible. 
In some cases, matched groups can be compared— 
one of them exposed to the propaganda and the 
other not, or one of them exposed to version A 
of the propaganda and another to version B, and 
so on. In some cases, the propaganda reaching one 
group can be abruptly stopped or intensified and 
some of the presumably consequent reactions may 
be observed. However, there is always the possi- 
bility that it was not one’s own propaganda that 
brought about the changes, but someone else’s, or 
that the changes were caused by some unknown 
third factor or set of factors. There is also the 
problem of “sleeper effects” —long-delayed reactions 
that may not become visible until the propaganda 
has worked its way through or around resistances 
that it may encounter deep down in the reactor’s 
unconscious or until obstacles to expression of re- 
actions (e.g., political policemen or suspected in- 
formers) have left the reactor’s environment. And 
there is the possibility that the propaganda may 
have “boomerang effects”—effects the opposite of 
those intended—or combinations of boomerang and 
desired effects. Research design that does not allow 
for all these possibilities is of doubtful evidential 
value. [See EXPERIMENTAL DESIGN, article on QUASI- 
EXPERIMENTAL DESIGN. ] 


In view of the extreme difficulty of tracing effects 
of propaganda upon reactors in their native habi- 
tats, a great deal of effort has been spent in recent 
years on strictly controlled experiments and re- 
peated semi-intensive interviews (“panel inter- 
views”) with small matched groups, with a view 
to establishing general principles of propaganda 
and persuasion, Among the many factors examined 
have been the relative credibility and acceptability, 
to given audiences, of different sources of informa- 
tion, advice, and opinion; the uses of different prop- 
aganda contents aimed at the same results; and 
the effects of different ways of arranging and pre- 
senting the same contents. However, reliance on 
such findings is notably limited by the fact that 
the behavior of reactors available for such testing 
may or may not be representative of the behavior 
of those actual audiences in whom the propagan- 
dist is interested. It seems probable that effects of 
propaganda among actual reactors can in most 
cases only be estimated, not “measured” scientifi- 
cally, and that the most valid estimates are likely 
to be made by persons combining considerable 
training in the methods of social science with con- 
siderable direct experience among the reactors 
under analysis. [See PANEL STUDIES.] 

Opposition and social control. Once propaganda 
produces any effects it tends to evoke opposition. 
Opponents may try to offset it directly or to invoke 
community sanctions to bring it under control. 
Therefore, the propagandist has to estimate his 
Opponents’ values and the steps opponents most 
probably will take. In different sorts of polities 
along the continuum from the democratic to the 
authoritarian, a variety of social controls over 
propaganda may be found. 

By definition, a healthily functioning democracy 
is a polity in which opposition to propaganda is 
habitually expressed primarily through peaceful 
counterpropaganda. It is assumed that a variety of 
propagandists will compete vigorously in “the 
marketplace of ideas,” and it is hoped that the 
ideas best for the society will find the most takers 
in the long run. Prerequisites for such an outcome 
presumably include high levels of education, self- 
control, and civic spirit among the participants, 
and large amounts of freely available information, 
disinterestedly gathered and disseminated by rel- 
atively autonomous, uncensored newsgathering 
agencies. 

In self-protection against secret or “unfair” prop- 
aganda by “hidden persuaders,” modern democ- 
racies sometimes require registration or even licens- 
ing of some sorts of propagandists by public 
authorities, and “plain labeling” of propaganda out- 
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put. In the United States, for instance, periodicals 
using the second-class mails are required to pub- 
lish frequent statements of their ownership, circu- 
lation, and other data. Likewise, all propaganda 
agents of foreign principals must file registration 
forms with the U.S. Department of Justice, where 
the interested public may inspect the data sub- 
mitted about the agents’ and principals’ identities, 
activities, and finances. Such agents are also re- 
quired to place on each piece of printed matter 
they circulate a label identifying the principal. 
This principle of “disclosure,” which appears so 
useful with respect to foreign agents, is not applied, 
however, to all domestic propagandists, although 
similar principles are applied to the registration of 
securities prospectuses and of certain types of polit- 
ical campaign advertisements and contributions. 
Many nations require similar “plain labeling” of 
securities prospectuses and paid political advertis- 
ing, whether foreign or domestic in origin. In many 
countries, claims made in propaganda (including 
advertising) about the contents or characteristics 
of foods and drugs are also subject to registration 
and labeling. 

Other efforts made in democracies to provide 
public control over propaganda include laws con- 
cerning libel and slander; laws giving political can- 
didates and legislators exceptional privileges and 
immunities in the field of free speech; and laws or 
customs requiring equal space or time in public 
media for all major contenders in political cam- 
paigns. In some cases there may be a legally guar- 
anteed “right of reply,” sometimes at the propa- 
gandist’s expense, for any group or individual held 
to be seriously injured or exposed to injury by his 
propaganda. 

Obviously, however, opponents’ reactions to prop- 
aganda need not be limited to disclosure or coun- 
terpropaganda. All manner of economic or physical 
inducements or punishments may be tried, even in 
democracies; and this is much more the case 
in relatively authoritarian polities. In the extreme 
case, the authoritarian regime aims to monopolize 
for itself all opportunities to engage in propaganda 
and will stop at nothing to prevent any kind of 
counterpropaganda. How long and thoroughly such 
a policy can be implemented depends, among other 
things, on the amount of force the regime can 
muster, the thoroughness of its internal intelligence 
and policing activities, and, perhaps most impor- 
tant of all, the level and distribution of secular 
higher education in the social system of which the 
regime is the polity. 

The effects of steps taken to neutralize or sup- 
press propaganda can, of course, be measured by 
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the same methods as the effects of the propaganda, 
and such measurement is subject to the same 
caveats. 

Bruce L. SMITH 


[See also ATTITUDES, article on ATTITUDE CHANGE; 
BRAINWASHING; COMMUNICATION, POLITICAL; Psy- 
CHOLOGICAL WARFARE. Directly related are the 
entries ADVERTISING; COMMUNICATION, MASS; CON- 
TENT ANALYSIS; PERSUASION; PUBLIC OPINION. Other 
relevant material may be found in POLITICAL PAR- 
TICIPATION; SOCIALIZATION; and in the biographies 
of HOVLAND; STOUFFER.] 
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PROPERTY 


Definitions of what constitutes property, as well 
as attitudes toward the ownership of property, vary 
with different cultures and different historical 
epochs. Cross-cultural differences have been studied 
especially by anthropologists; the article that ap- 
pears below describes the institution of property in 
preliterate societies, as do ECONOMIC ANTHRO- 
POLOGY; EXCHANGE AND DISPLAY; TRADE AND 
MARKETS. Historians of economic theory and his- 
torians of social and economic institutions often 
note changes in concepts of property; see, for 
example, ECONOMIC THOUGHT, articles on ANCIENT 
AND MEDIEVAL THOUGHT and SOCIALIST THOUGHT, 
and such historically oriented articles as FEUDAL- 
ISM; MANORIAL ECONOMY; SLAVERY. Property can 
also be considered in its relation to other major 
social institutions. Thus, implicitly or explicitly, a 
wide variety of ideas about property emerges in the 
articles on the major world religions and in Mo- 
NASTICISM. Similarly, in the economic sphere, 
property is directly involved in several of the arti- 
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cles under Taxation, and somewhat less directly 
in CAPITAL; INCOME DISTRIBUTION; RENT. Moral 
and legal problems related to property (including 
intellectual property) are described in PATENTS; 
Privacy. Few thinkers in the social sciences have 
written works that are primarily devoted to the 
elucidation of the institution of property, but the 
problem is discussed in the biographies of AQUINAS; 
HeRsKovirs; HUME; LocKE; Marx; Mauss; QUES- 
NAY; POLANYI; SMITH, ADAM; VEBLEN. 


Property is the name for a concept that refers to 
the rights and obligations and the privileges and 
restrictions that govern the behavior of man in any 
society toward the scarce objects of value in that 
society, People everywhere desire the possession of 
things which are valuable by cultural definition 
and which by this desiring become scarce: sun- 
light, fresh air, land, food, rituals, medicine bun- 
dles, masks, or automobiles. Socially sanctioned 
customs or police-enforced laws, which define rights 
and obligations about ownership, control competi- 
tion for these desired goods. What is owned is 
property. 

The history of the English word property indi- 
cates something of the way Western man has 
thought about scarce objects, Both the Middle Eng- 
lish word propete and the Old French term pro- 
priete are derived from the Latin word proprietas, 
itself the noun form of the Latin proprius, mean- 
ing one’s own, which is akin to the French noun 
propre, meaning what is close or near. Thus, his- 
torically, property carries the implication that one 
has exclusive rights to objects because they are so 
close or near that they have become part of one- 
self through familiarity or common usage. 

Man is not the only animal to show an interest 
in objects close to him, In an unqualified usage of 
the term, property appears to be characteristic of 
many forms of animal life, with the important dis- 
tinction, however, that one should only talk of 
property rights when the control of scarce objects 
can be related to a unique normative system of 
values; such a system is found only in human 
groups. 

Various insects, such as bees, wasps, ants, and 
termites, will guard their prey and defend their 
nests from attack, The material they gather is used 
to feed themselves or their offspring or to build a 
home, In each case the ultimate aim appears to be 
the perpetuation of the species, especially in those 
cases in which the parent never sees the offspring. 
Control of objects is not for its own sake; it is 
merely a link in the life cycle. 

The few birds that accumulate stores or, oc- 


casionally, bright colored objects do so as apparent 
modifications of food-getting or courtship impulses. 
The occupation and defense of territory are part of 
feeding, sexual, or parental behavior rather than 
examples of the control of defined landed property, 

Rodents hoard food and struggle among them- 
selves for the exclusive possession of territorial 
rights. These complex behavioral processes are 
found in rats, squirrels, and beavers. This behavior 
is not entirely innate (it is the wildest speculation 
to talk of an “instinct of property”), but it is a com- 
plex stereotyped action pattern for which there is 
much experimental and other evidence to suggest 
a continuing interaction between nurture and 
nature (Barnett 1963, pp. 41-43, 81-92; Thorpe 
1956, pp. 397-398). 

Among some primate species there appears to be 
accumulation of miscellaneous objects used vari- 
ously for personal adornment, or as playthings, or 
as instruments for acquiring food. It is character- 
istic of apes that objects acquire added value when 
other apes desire them at the same time. Apes, in 
fact, appear to display a “protosense” of private 
ownership, 

Property among animals refers to objects neces- 
sary for species survival. The important and rele- 
vant distinction between man and other animals, 
only faintly foreshadowed among some apes, lies 
in the fact that for man material objects are in 
essence an extension and definition of personality. 
Without being strictly necessary for personal or 
species survival, many material objects are used 
by man to magnify his sense of self and to extend 
his influence by giving him power over other men 
or over the spiritual forces of nature. 

The objects that thus extend personality are gen- 
erally scarce and hard to come by. Their acquisi- 
tion and control result in social conflict. In this 
conflict the social concept of property is conceived 
and developed. The study of property among animals 
shows that exclusive possession of material objects 
subserves certain basic needs that are common to 
all animal species. Similarities are analogous rather 
than identical. Only in man can we postulate the 
emergence of a sense of selfhood that gives psycho- 
logical underpinning for the concepts of rights, 
duties, responsibilities, and obligations. Balancing 
of individual and group conflicts is a delicate task 
that is carried out by tribal customs and conven- 
tions. As these customs differ from one tribe to 
another, so will the content of human property 
rights and duties (Beaglehole 1931; Lowie 1920). 

An understanding of primitive ownership is best 
achieved by concentrating upon the property sys- 
tems of one or two preliterate groups for which 


there is reasonable documentation, always keep- 
ing in mind the fact that the groups chosen repre- 
sent a fragmentary range of customs rather than 
definitive examples of all preliterate societies. 

Cheyenne Plains Indians. The Cheyenne were 
an American Indian people inhabiting the Great 
Plains of what are now the states of Montana, east- 
ern Wyoming, Oklahoma, and Colorado. They lived 
a nomadic, semipastoral life, dependent on buffalo 
hunting. They valued horses and engaged in war- 
like raiding activities. Their property system was 
in part dependent upon nomadism. Cheyenne in- 
heritance customs were of two kinds. When a mar- 
ried man died, he carried with him to his grave one 
set of his best equipment: garments, buffalo robes, 
bows, arrows, gun, war club, knives, pipe, and 
other personal belongings. His favorite horse was 
slaughtered. These things were all his and his 
alone. If the man was a shaman, his medicine 
bundle also went with the body, unless previously 
willed to a friend or a son, The rest of the property 
of a married man who died intestate went to his 
widow, although an old man could make antici- 
patory gifts which were binding on his heirs. The 
parents of an unmarried warrior disposed of his 
goods by gift, Comparable giving away of property 
occurred in the case of married and unmarried 
women, 

Private property rights were well developed and 
well recognized among the Cheyenne. Misappropri- 
ation of property was not uncommon, Trading in 
property or giving and taking chattels often vali- 
dated the exchange of rituals and ceremonials and 
was used as compensation or restitution in legal 
disputes, to pay for services in curing illness, to 
count “coup,” or to mark the transfer of incorporeal 
properties such as supernatural powers. Property 
was also given to pay for the preparation of new 
implements and the teaching of such women’s arts 
as quilling, beadworking, or lodge making. Property 
rights were clearly expressed in regard to killed 
buffalo, war loot, horses, and other articles of orna- 
ment or utility. The owner had full rights of dis- 
posal, At times, legal rights became blurred by 
conflict between custom and personal desires. Ap- 
plication of the principle of “pluralism” suggests 
that among the Cheyenne, as among most other 
preliterate peoples, there were various levels and 
types of prevailing normative systems which at 
different times could be applied to particular cases 
of property disposal and ownership (Llewellyn & 
Hoebel 1941, pp. 20-21, 212-213; Malinowski 
1926). Those preliterates who have not invented 
courts of law to interpret disputed points of owner- 
ship have instead a range of property customs 
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which vary in definition from boldly accepted de- 
lineation to shadowy etchings. 

Hopi Pueblo Indians. Property rights among 
the Hopi Indians, a sedentary, agricultural, pottery- 
and basket-making people living in pueblo villages 
on the tablelands or mesas above the sandy desert 
of northeastern Arizona in the southwestern part 
of the United States, provide a neat contrast to the 
warlike, nomadic Cheyenne pattern (Beaglehole 
1935). 

Among the Hopi, a matrilineal and matrilocal 
people, a man owns his clothes, his ceremonial cos- 
tume (mask, kilt, belt, leg bands, rattles), his 
ornaments of silver, turquoise, and shell, his rid- 
ing equipment, tools, and implements, occasionally 
a horse, and perhaps a few sheep or cattle. Today, 
he keeps the money he earns as his own private 
property, but if he spends this money on goods for 
his wife and children, these goods become the prop- 
erty of these members of the matrilineal group, to 
which the husband does not belong. An unmarried 
woman similarly owns her own clothes and orna- 
ments. A married woman who is the head of her 
own household owns, in addition, her house and its 
contents; her property is customarily inherited by 
her daughters. The property of a man is usually 
inherited by his sisters, brothers, and other ma- 
ternal kinsfolk and not by his wife, sons, or 
daughters. The borrowing of personal goods is fre- 
quent, but prolonged possession or theft is rare. 

Much Hopi property, whether real or personal, is 
owned by members of the matrilineal clan, This 
property includes ceremonial medicine bundles, 
some masks and fetishes, eagles, and the semisub- 
terranean religious chambers called kivas. Nor- 
mally, the senior man of a clan is regarded as a 
trustee for this property, although he may neither 
use it nor dispose of it, except in customarily ap- 
proved ways. 

Land, the basis of Hopi economic life, is se- 
verely limited in extent and fertility. It is owned 
by the village and protected by commonly recog- 
nized boundaries. Village land is divided into clan- 
owned land, marked by stone-slab boundary signs. 
Families within each clan cultivate well-defined 
family areas, None of these ownerships is absolute, 
since land for the Hopi is inalienable. Ownership 
thus means effective trusteeship of the land and 
of the food and wealth that cultivation produces. 
In general all trusteeship rights are inherited within 
the family or lineage connection of the trustee 
clan. The matrilineal clan also owns access to 
fresh-water springs. 

Almost inevitably, a summary description of pre- 
literate property rights gives the misleading im- 
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pression of a fixed or static institution. In the illus- 
trations taken from Hopi and Cheyenne cultures, 
some aspects of ownership are fairly stable, but 
others are subject to change consequent upon in- 
creasing contact with Euro-American culture, The 
institution of property and the accompanying 
philosophical or protojuridical justifications have 
changed and will go on changing to meet the de- 
mands of an ever-changing society and its culture. 
Static models of culture are useful but unrealistic 
analytic tools. Concepts of property are subject to 
the endless erosion of time. 
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PROSTITUTION 


Prostitution is the granting of sexual access on 
a relatively indiscriminate basis for payment either 
in money or in goods, depending on the complexity 
of the local economic system. Payment is acknowl- 
edged to be for a specific sexual performance. Pros- 
titution is a service that may be performed by 
either males or females and for either males or 
females, although in practice in nearly all societies 


acts of prostitution are commonly performed by 
females for males or by males for males. The focus 
of this article is exclusively on female prostitution, 

Money and sexuality in prostitution. While 
prostitutes may be selective about the kinds of 
sexual activities that they will perform for money, 
it is always clear that some kind of sexual act is 
available from them. In like manner, prostitutes 
may be relatively selective about their clientele, 
choosing not to take those who are dirty or de- 
formed, those who request sexual activities that 
they think aberrant, or those who violate certain 
of the norms of conduct between prostitute and 
customer. However, within the limits set by these 
idiosyncrasies, the entree to the prostitute is the 
Capacity to pay for the relationship. Payment for 
the specific act is what distinguishes the prostitute 
from the mistress or from females who accept a 
range of gifts while having sexual contact with a 
male. While these latter cases are marginal, it is 
important to keep central to the discussion the 
immediate nexus between money and sexuality as 
a ctucial part of the prostitute-customer relation- 
ship. 

Money is the most public object in the society. 
In Simmel’s words, “Money, with all its colorless- 
ness and indifference, becomes the common de- 
nominator of all values; irreparably it hollows out 
the core of things, their individuality, their specific 
value, and their incomparability” ({1902-1917] 
1950, p. 414). Money is the universal lubricant and 
the universal metric; it provides the society with a 
fluidity and an ease of exchange that is deeply 
imbedded in the modern consciousness. In an era 
when marriage is no longer the exchange of goods 
for a reproductive object, and when there exists a 
romantic ideology dedicated to the self-selection 
of the mate, the private experience of sexuality is 
the polar opposite of the public quality of money. 
The sexual experience is expected to be individual, 
affective, and incommensurable, and it is in these 
attributes that it carries its special salience for 
society. 

Social functions of prostitution. It is in the act 
of prostitution that the most public and the most 
private values in the society meet without conflict- 
ing. Sexual gratification is legitimately expected 
only within the marital bond, although there are 
quasi-legitimate forms of sexuality that are pro- 
scribed but not punished, either because they are 
tacitly recognized to promote marriage (premarital 
Coitus between those in love) or because they re- 
duce sexual drives during the absence of a marital 
Partner (masturbation either in adolescence or 
during marriage). Prostitution had some of these 


quasi-legitimate functions in American society dur- 
ing the nineteenth century and the first quarter of 
the twentieth. The reason was that, so long as both 
premarital coitus and masturbation were frowned 
upon, the function of the “bad” woman could be 
seen as one of protecting the virtue of those who 
were “good,” and so of maintaining a certain 
property value. Given the large number of migrant 
males, many of whom never married, it was to be 
expected that police officials would conclude that 
the existence of prostitution was a necessary evil, 
or even that it was a positive good in that it re- 
duced the number of rapes in the community. 

From the evidence that is available, it seems 
that the way in which prostitution is organized, 
diffused, and evaluated is a function of the sig- 
nificance of marriage in the society and the ways 
in which society organizes access to legitimate sex- 
ual experience. Where there are great restraints on 
premarital coitus and females are highly valued for 
their reproductive capacities, prostitution among 
females will exist as an alternative form of grati- 
fication for large numbers of males. 


Prostitution in the United States 


A basic shift in the pattern of the use of prosti- 
tutes in the United States correlates with a major 
campaign against prostitution after World War 1, 
but it is more significantly tied to other shifts in 
the structure of premarital coitus. Figures pub- 
lished in 1948 in Kinsey’s Sexual Behavior in the 
Human Male chronicle this shift, for while the pro- 
portion among males born in 1910 and after who 
had ever gone to a prostitute is the same as for 
males born in 1909 and before, the number of times 
that members of later age cohorts had gone to pros- 
titutes was from two-thirds to one-half less (Kinsey 
et al. 1948, pp. 595-609). The most substantial 
changes were among those of lesser education; 
however, this change was true even of the college- 
educated, whose rate of contact with prostitutes 
had always been low and might therefore be ex- 
pected to have remained stable. The differences be- 
tween the college and noncollege rates in all co- 
horts are indicative of the impact of social class 
factors in this area of sexual behavior (ibid., 
pp. 351-355). It is clear that contact with prosti- 
tutes currently accounts for only a small proportion 
of the sexual experiences of American males in 
early adulthood and that most sexual experience 
is centered on peer group activity that results in 
initiation into heterosexual coitus (Kirkendall 
1961, pp. 22-55). It is among males who do not 
marry or who were divorced or separated in their 
thirties that the proportion going to prostitutes in- 
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creases to about 15 to 20 per cent of all sexual 
contacts after age 35 (Kinsey et al. 1948, pp. 
249-259). 

At first glance it would appear that these effects 
might be due to the decreasing visibility of prosti- 
tution, resulting from law enforcement and in- 
creased legal sanctions. But it is likely, given the 
intractability of the control of prostitution, that 
this change was the result of other forces, As a 
result of these forces, the proportion of never- 
married males in American society decreased; 
there was a shift in courtship patterns; and sexual 
contact between unmarried couples increased in 
saliency. During the same period in which the 
number of males going to prostitutes was decreas- 
ing, there was a concomitant rise in the proportion 
of respectable females who were having premarital 
coitus. This was an increase from about one-quarter 
of all females in the cohorts born before 1900 to 
about one-half in the cohorts born between 1900 
and 1929 (Kinsey et al. 1953, pp. 298-302; Ter- 
man 1938; Freedman 1965). This shift occurred 
at all social levels and is as true of females with 
grade-school educations as of those with college 
educations. These changes in rates represent a 
fundamental shift in the role of coitus before mar- 
riage in American society, a shift that indicates a 
major break with nineteenth-century values and a 
change in the status of women, 


Premarital coitus and prostitution 

While the data from Kinsey and others indicate 
that the social class factor has substantial effects 
on the sexual behavior of males, there is no such 
finding in regard to the sexual behavior of females, 
Indeed, the premarital coitus rates of males tend to 
be a great deal higher than those for females, ex- 
cept among the college-educated, where the differ- 
ences are more moderate (Kinsey et al. 1953, 
pp. 78-80). It is especially at the very lowest edu- 
cational level distinguished in the Kinsey reports 
that the rates of premarital coitus for males are 
much greater than for females. Part of this may be 
exaggeration by the males interviewed, given the 
saliency of such coitus to them. Nevertheless, it is 
clear that their rates of premarital coitus are far 
greater than those reported by females in the 
Kinsey sample at the same educational level. 

The explanation of these rate differentials lies 
in the fact that premarital coitus for the majority 
of females at all social levels, but most significantly 
among the better educated, takes place primarily 
with one male—the fiancé. Indeed, nearly half of 
all females who report having experienced pre- 
marital coitus state that it was only with the fiancé 
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(ibid., pp. 291-293), and about 55 per cent of 
them state that it was with only one male. Another 
35 per cent had premarital coitus with two to five 
males. Of women who had premarital coitus but, 
as is characteristic of lower-class girls, married be- 
fore twenty, the proportion reporting that pre- 
marital coitus was with five or fewer males is 90 
per cent (ibid., table 78, p. 336). 

These data indicate that patterns of premarital 
coitus are relatively similar for females across the 
social-class spectrum but that there exists a mi- 
nority of females, primarily of lower-class origins, 
who serve as sexual targets for fairly large numbers 
of males and at fairly high rates of contact. These 
females, in addition, account for most of the in- 
stances of sexual contact between middle-class 
males and lower-class girls. This suggests that a 
relatively small number of females are serving the 
same functions as prostitutes did in the past, but 
without charging for this sexual activity. However, 
it is from this small group of females that the 
largest proportion of women who go into prostitu- 
tion is drawn, 


Entry into prostitution 


The conventional image of the first experience 
of prostitution has commonly been one of inno- 
cence betrayed or, to judge from the biographies 
of many former prostitutes, a severely traumatic 
experience. However, for the bulk of girls who 
enter prostitution from a background of premarital 
promiscuity, the transition is untraumatic, and for 
some may even be an entry into a far more lei- 
surely and less pressured way of life (Young 1964; 
McManus 1960, pp. 81-86). Indeed, even in the 
nineteenth century the working conditions of the 
prostitute in England seemed to some observers to 
be less physically damaging than either work in the 
factories or the exhaustion of repeated childbearing 
(Acton 1857, p. 59), The harmful effects of pros- 
titution are far less obvious: they are attendant 
upon an increasing involvement in the life of pros- 
titution correlated with decreasing social relation- 
ships with others outside that world. To the degree 
that there is specific trauma upon entering prosti- 
tution, it appears more often among females, espe- 
cially middle-class females, who have not had 
earlier conditioning from multiple sexual contacts 
with a variety of males. 

With the decline of the brothel, or house of pros- 
titution, the apprenticeship experience in prostitu- 
tion is now dependent upon dyadic relationships 
between the apprentice and either a more experi- 
enced prostitute or a male who operates as a pimp 
(Bryan 1965). The learning experience involves 


more tasks than simply that of getting over the ex- 
perience of exchanging money for coitus, although 
this is the informing and central dilemma of the 
prostitute. It also involves learning ways of ap- 
proaching males, setting the price, collecting the 
fee, managing the sexual contact, and letting the 
customer go. Each of these tasks involves making 
explicit what has been implicit in all prior sexual 
contacts, since, no matter how many of these there 
might have been, there was always the possibility of 
seeing them as part of a more conventional struc- 
ture of sexual expectations. Once the commitment 
to take money has been made, it means that the 
female has given up even the pretense of an emo- 
tionally valued relationship with the male. The 
situation is no longer one of courting or dating 
but is limited to the specific exchange of sexual 
access for money. This means that even if she has 
refused no one previously, her present lack of dis- 
crimination is now a public event. During this ap- 
prenticeship period she must learn a specialized 
argot not only about sexual behavior but also about 
the naming of others in the environment: cus- 
tomers, steerers, the police, and other prostitutes. 
The argot is highly value-laden and in and of itself 
constrains the neophyte into patterns of action and 
belief (Maurer 1939). 

The most complex of these tasks, however, in- 
volves learning to speak about sexual acts and 
sexual preferences that in the past have arisen and 
been performed in gestural and nonverbal con- 
texts, and then learning to tie the new talk to the 
pricing of the specific activity that is requested. 
The problem is that while the relationship between 
money and sexuality is what makes the act pos- 
sible, the economic portion of the act must not be 
allowed to intervene in the nature of the sexual 
performance. The structure of talk, once learned, 
becomes highly ritualized and predictable, although 
it varies from one social level of customer to an- 
other and from one situation of prostitution to 
another. Thus the centrality of the cash exchange 
is high for the lower-class customer, the sexual 
activities preferred are limited, and the content of 
the sexual talk is small. On the other hand, in con- 
tacts with middle-class males the price is set and 
not referred to again (although there may be 
psychic gain for the maie as a result of payment), 
the sexual interest may be wide, and there is a cer- 
tain expectation of talk that transcends the im- 
mediate sexual character of the relationship. The 
capacity to meet all of these expectations is a rela- 
tively uncommon skill, and this fact may account 
for the mobility problems of girls who enter the 
profession at various levels. 


Entering “the life” thus requires learning a new 
conception of the self, a new relationship to males, 
and a new way of talking about the self, as well as 
learning to meet a new world populated by special 
kinds of others. At the same time there is a de- 
crease in the frequency of interaction with con- 
ventional others (except with men in their new 
role as clients) and consequently a decreasing ca- 
pacity to return to the conventional world. The life 
of prostitution, like other forms of deviance, com- 
mits a person at the most deeply experienced levels 
of the self and in the process produces far greater 
similarities between prostitutes than would be ex- 
pected on the basis of any set of etiological char- 
acteristics, 


World of the prostitute 


The culture of prostitution is, like all cultures, 
composed of significant others who lay claim to the 
prostitute’s time, energy, and affection. Entry into 
this culture requires the prostitute to make a great 
many changes in the ways in which she defines 
other people. These new definitions inescapably cut 
off many of her older, more conventional defini- 
tions. But the older definitions may still have latent 
claims on her, and in some ways entangle her even 
when she is most enmeshed in her new experiences. 

The world of the prostitute is composed of other 
prostitutes; clients (Johns); steerers and procurers; 
in some cases, pimps; in others, Lesbian lovers; 
and, finally, the police and other agents of law en- 
forcement (Bryan 1965; 1966; Jackman et al. 
1963). The relationships with other prostitutes are 
enormously complex, but they seem in all circum- 
stances to carry with them a fair amount of mutual 
dislike and mutual exploitation. The content of 
conversations is often limited to the occupational 
life, for exposure of the self on other levels invites 
exploitation due to the prostitute’s increasing social 
vulnerability. However, the sharing of a special 
kind of alienation and social distance from con- 
ventional society forces the prostitute back on other 
prostitutes, for there is no one else with whom she 
can share the largest part of her daily experience. 

Relationships with clients are equally difficult. 
They are most easily handled when they do not 
impinge on, or have properties of, relationships that 
might be sought in nondeviant ways. Thus the 
house girl, who lives in a world of other prostitutes 
and who services a lower-class clientele, is less 
likely to develop attachments to clients and more 
easily sees them as a series of replaceable objects. 
The call girl, on the other hand, insofar as she 
must appear in public with her clients in roles 
that are defined as nondeviant, runs the risk of 
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becoming emotionally involved with them. This 
emotional involvement is expressed in demands 
that clients pay lawyers’ fees, pay for bail bonds, 
or otherwise help out when she is in trouble, and 
it typically results in intense anger when they do 
not. In such cases, because a residue of old rela- 
tionships is still involved, it is clear that the girl 
has not fully made a separation between her work 
life and her personal life. Failure to achieve such 
a separation may also explain why a call girl often 
reports more intense feelings of antipathy for the 
customer than does the street or bar prostitute. The 
difficulty of the call girl's role here is complicated 
by the fact that she may well have had middle- 
class origins and thus may be more committed to 
the love ethic and more vulnerable to disappoint- 
ment. 

Clients and their needs. The clients of prosti- 
tutes use them for a variety of reasons (Winick 
1962; Gibbens & Silberman 1960; Kinsey et al. 
1948, pp. 605-609). For many men, especially 
those in the lower class, the motive is often simply 
sexual relief or the opportunity to experience a novel 
sexual contact, in terms of either of a new female 
or of a tabooed technique (usually mouth-genital 
contact). However, for many other males, more 
commonly middle class, the resort to the prostitute 
is often more complex and is invested with greater 
ambivalence. Novelty in partner and technique are 
certainly involved, but an equally potent factor, it 
seems, is the lack of future responsibility for the 
consequences of sexual contact, Since many of 
the organizing constraints on sexual activity are 
related to maintaining the family and providing 
for its future, contact with a prostitute is significant 
to the client because it allows him sexual expres- 
sion without such controls on behavior. The guilt 
consequent on the violation of the norms commonly 
deepens and intensifies the erotic character of the 
relationship, as does the degraded status of the 
prostitute, who provides sexual access separated 
from the entanglements of nurturance and affec- 
tion. In addition, the prostitute provides a sexual 
contact that does not involve the male in the time- 
consuming conventional buildup to coitus and frees 
him for other pursuits. The frequency of contacts 
with prostitutes by males at conventions and in 
other situations that are separated from the home 
suggests the loosening of social controls that are 
necessary for such contacts to take place. 

There are also those males whose sexual interests 
are so bizarre that there is no population of con- 
ventional women who could understand these de- 
sires or provide for them. Indeed, the expression 
of such desires as sadism or masochism would 
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undermine the rest of a nondeviant relationship. 
The proportion of such men is small, yet there 
exists a small number of prostitutes who will pro- 
vide such services. The prostitute has to learn to 
cope with the various demands of these clients 
and to sort out those who request things that she 
will not do even for a price. Prostitutes tell each 
other about clients, and it is in this interaction that 
patterns of client management are learned. 

Supporting relationships. The relationship of 
the prostitute to the pimp is poorly understood, and 
the anxiety that the thought of this relationship 
raises in other men is considerable. This reaction 
against the pimp has something to do with the 
asymmetry of a man living off the earnings of 
women in a society where men must provide for 
women. In addition, there is envy of the man who 
gets free that for which all others pay. The strength 
of the tie of the prostitute to the pimp is best 
understood in terms of her deprived emotional re- 
lationships with other males. In a world of chang- 
ing faces, in terms of both prostitutes and clients, 
the pimp remains a stable figure who will speak 
in the terms of love and affection. Given that this is 
often a minimal relationship, it still provides a 
stable anchor for the self in a world where the 
private sexual experience has been depersonalized 
and the replaceability of not only the client, but 
also the self, has been made most apparent. In- 
adequate as it might seem to others, the tie to the 
pimp remains for many a prostitute the single 
human relationship where she may provide both 
sexuality and love, even if it is contaminated by 
money. Indeed, the very act of giving money away 
to the pimp is like divesting the self of filthy lucre. 

Lesbian relationships have much the same char- 
acter as the relationship with the pimp, but are 
probably less demoralizing. They arise from the need 
for affection and the alienation from men that is 
implied in the nexus between sex and money. It 
is not that Lesbians become Prostitutes, but that 
prostitutes, out of their reaction to the emotional 
poverty of their world, seek loving human rela- 
tionships and that often the only such relationships 
available are with women. 

The world of secondary support for the prostitute 
is a shadowy one. The bell boy, bartender, or cab 
driver who directs the customer and the prostitute 
to each other is an untrustworthy ally, and the 
prostitute soon learns the cost of accepting help 
from these individuals, their relative unreliability, 
and the exploitative nature of their relationship to 
her. This system of Secondary support is a quasi- 
delinquent underworld that verges on the straight- 
forwardly criminal. 


Prostitution and law enforcement. Where 
prostitution is illegal, the prostitute is thrust into 
a world of police, courts, and correctional institu- 
tions. This act of exclusion from the larger society 
automatically includes her in a set of relationships 
with the criminal world. Thus her life, already 
difficult, is further complicated by contacts with 
narcotics, petty and major crime, and the corrup- 
tion of the agencies of criminal justice. Since the 
Jaw is enforced only against the paid participants 
in the act of prostitution (the customer is rarely 
arrested), the prostitute is forever confronted with 
the facts of unequal treatment. Since, in any case, 
enforcement of prostitution laws is uneven and 
often tied to changes in the political climate, she 
is vulnerable to blackmail for money from members 
of the police force who are free to arrest her or 
not, as they please. Since her position at the margin 
of the underworld makes her a listening post, she is 
pressured to give information to the police. In all 
of this she is further alienated from the society and 
forced into a criminal conception of the larger 
society and of herself. All of the enforcement pro- 
cedures fail in their intent to reduce prostitution, 
since they more often confirm persons in the ac- 
tivity than they remove them from it. 

Psychopathology among prostitutes. All forms 
of deviance may be seen as at least potentially 
conducive to the development of major psycho- 
pathology. One of the major functions of the cul- 
ture of the prostitute is to reduce this potential 
by providing a system of significant others who 
operate as community and culture. While there is 
a good deal of evidence for pathology among prosti- 
tutes, it is probably reduced by the existence of the 
culture of prostitution. At the same time, the ex- 
istence of this culture means that the prostitute’s 
capacity to return to conventional society is thereby 
reduced. The main potential for pathology is located 
in the amalgam of sexuality and money at its most 
obvious level and is complicated by the nature of 
the control procedures that society invokes. It 
should be clear that such pathology as exists is 
differentially distributed and probably occurs more 
frequently among middle-class girls who enter the 
life than among those who come from other kinds 
of backgrounds. 

Departure from prostitution. Most prostitutes 
do not spend their entire lives in the career of 
prostitution. Looks and physical health are eroded, 
if not by the life itself, then by the very process of 
aging. Some prostitutes marry out of the life into 
relatively stable family systems, while others drop 
out into service and related occupations, some of 
which are at the margins of delinquent communi- 


ties. There is probably some degree of upward 
mobility for women who enter the life at the call- 
girl level from backgrounds at a lower social level. 
Others remain in the system because they have 
police and prison records, still others because they 
are drug addicts. The problem of how the ex- 
prostitute copes with her past and learns to manage 
the renewed disjuncture between money and sex- 
uality has not been studied, If she does learn to 
manage it, part of her capacity to do so may be 
due to the very alienated relationships that she has 
had with clients, so that they really do not count 
as part of her sexual past. It is certain that con- 
ditions in the United States make exiting from the 
life of prostitution more difficult than, say, in 
Denmark, where there is no statute against the 
act of taking pay for a sexual contact as long as 
the female has another occupation. Denmark's 
policy is based on the hope that through such a 
mechanism the prostitute will maintain sufficient 
ties with the conventional community to give her a 
past other than one of prostitution when she 
chooses to leave this career (Kemp 1936; Jersild 
1958; 1959). 


Patterns of prostitution 


The structure and extent of prostitution in any 
society depend strongly upon the character of pre- 
marital sexual practices, It also depends on the 
numbers of unmarried males in the society at any 
one time. To the degree that premarital coitus is 
permissible and becomes part of the courting ritual, 
there will be p concomitant decrease in the use of 
prostitutes by young men. Further, the smaller the 
proportion of unmarried males of any age and the 
larger the proportion of married females, the less 
use of prostitutes there will be altogether. 

In the United States, the structural conditions 
of premarital coitus have remained relatively stable 
—that is, a small proportion of the females provide 
sexual services for a large number of males. How- 
ever, they do so through promiscuity rather than 
through engaging in the same behavior for pay. 
In other countries, to the degree that a woman's 
virginity is identified with her marriageability, and 
to the degree that there are homeless or womanless 
men, the practice of prostitution is likely to remain 
much the same as it was in the United States at 
the beginning of this century. 

The social organization and the culture of prosti- 
tution do vary considerably by nationality. How- 
ever, since prostitution in nearly all countries is 
illegal, socially stigmatized, or both, it is usually 
characterized by secrecy, intense occupational in- 
volvement, and difficulties in leaving the profession. 
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Its ties with the criminal underworld leave the 
prostitute open to harassment not only from the 
police but also from other prostitutes and other 
members of criminal subgroups. Even without the 
pressures created by illegality or stigma, the dilem- 
mas of managing an identification of money with 
sexuality and of establishing an economic yard- 
stick for sexual access would produce strains 
toward the creation of a prostitute subculture. Thus 
the prostitute needs to be provided with an oc- 
cupational milieu that will help her to learn a new 
concept of self, to manage the complex tasks in- 
volved in her role, and to satisfy her normal needs 
for social relationships. 

While it has been pointed out that the marriage 
tie also involves the exchange of sexual access for 
economic security (Davis 1961), the married wom- 
an does not resemble the prostitute in this respect, 
because the critical component in prostitution is 
making explicit the nature of this exchange and 
determining its value in the public medium of 
money. 

Joun H, GAGNON 


[See also POLICE; SEXUAL BEHAVIOR; SOCIOLOGY, article 
On THE EARLY HISTORY OF SOCIAL RESEARCH; and 
the biography of KinsEy.] 
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PROTESTANT POLITICAL THOUGHT 


Protestantism emerged in sixteenth-century Eu- 
rope as a movement within the Christian church, 
seeking chiefly to depoliticize religious thought 
rather than to establish a new religious influence in 
the public order. The aim of Martin Luther's at- 
tempt to reform Catholicism was not to deny the 
responsibilities of Christians in politics but to free 
the gospel, the “good news” of the Scriptures about 
Jesus Christ, from the proprietorship of either the 
church or the state. The Lutheran reformation 
brought into existence a great number of sects— 
Baptist, Congregational, etc.—whose concern was 
chiefly to develop a voluntary community of be- 


lievers whose passionate witness to the purity of the 
gospel’s commandments would change the hopes 
of men in the political order. The Calvinist move- 
ment, seeking to overcome the excessive preoccu- 
pation of Lutheranism with the gospel and of sec- 
tarianism with the development of a free Christian 
community, directed its attention to the problem of 
reconstructing the political order. 

The historical development of the three major 
types of Protestant political thought—Lutheranism, 
sectarianism, and Calvinism—was shaped by the 
fact that they emerged out of the criticism of a 
system of medieval Catholic thought which af- 
firmed that religious and political life are closely 
interdependent, that regnum and sacerdotium form 
complementary jurisdictions within the res publica 
christiana. From the Reformation to the twentieth 
century, Protestant political thought has focused 
on the question of the differentiation of gospel, 
church, and public order and on their free and 
responsible interaction. 


Lutheran political thought 


The nature of the first great Protestant re- 
former's political thought can be sensed at once 
from Luther’s statement addressed to Emperor 
Charles v in 1521 in response to an attempt to 
force in public court Luther's recognition of the 
supremacy of the pope on religious matters. Luther 
said, “Unless I am convinced by the testimony of 
the Scriptures or by clear reason . . . I am bound by 
the Scriptures I have quoted” ([1521] 1958, p. 112). 
The ground of Christian faith was the possession 
of neither church nor state. “The word of God,” 
Luther noted, “which teaches full freedom, should 
not and must not be fettered” ({1520a] 1952, p. 
345). Luther concluded that the righteousness 
talked about by St. Paul as the mark of the Chris- 
tian is a relationship of God and man which is 
freely given to all men entirely at the initiative of 
God. The Christian responds to it with personal 
trust and gratitude. A man’s sense of righteousness 
cannot be mediated or assumed as a responsibility 
by any priest or politician. Hence the Reformation 
slogan Solo fide. 

Luther's search for the “real” in religious experi- 
ence drove him to oppose, at first, not the political 
order but the highly organized power structure of 
the Catholic church and the over-elaborated medi- 
eval theology and morality. In his attack on ecclesi- 
asticism and scholasticism Luther hoped to recover 
what were for him simple and precious realities: 
the love of God for man and the world, revealed 
in Christ, and the freedom of the Christian man in 
obedience to the will of God. In Luther’s words: 
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“A Christian man is the most free lord of all, and 
subject to none; a Christian man is the most duti- 
ful servant of all, and subject to everyone” ([1520b] 
1928, p. 6). Venturesome acts that substituted 
justice and mercy for force and threat in hu- 
man relations were to be the grateful response of 
Christians to what God had done in Christ and 
continued to do through the Holy Spirit. [See the 
biography of LUTHER.] 

From these religious convictions, Luther elabo- 
rated ideas of considerable political significance. 
These ideas had to do with the limits of personal 
responsibility and freedom in public actions and 
with the inadmissibility of attempts by ecclesias- 
tical or political authorities to use coercion to 
compel assent. For Luther, agreement so obtained 
is stripped of all spiritual value; only free adher- 
ence qualifies as faith and morality. Of even broader 
cultural significance has been the Lutheran con- 
viction that the personal dimensions of conscience 
and commitment, of guilt and forgiveness, of 
family and kin affections, are a more basic or 
primary reality than political structures of office, 
status, and power. 

These convictions deeply influenced Luther's 
view of the church. When the church relies too 
heavily upon an elaborate system of laws, good 
works, and indulgences, it becomes like the polit- 
ical realm, relying upon controls and restraints 
rather than upon the persuasive power of the Word 
and the loving, spontaneous ministering to the 
needs of others. For Luther, the unity of the church 
was made possible by a religious reality—the spirit 
of Christ and the faith of believers; the unity of 
the political order rested on a natural or secular 
concept—the uses of reason for the achievement 
of order and peace. The Lutheran church was to 
be then, at least in its founder’s hopes, a unique 
and simple fellowship of those people who believed 
they were called to do the will of God; it was not 
a rationally constituted society such as the town 
or nation. In such a fellowship all believers were 
priests in service to one another. The offices of the 
church were formed to meet specific human needs 
and, hence, were subject to periodic revision. The 
holder of such office had no authority but to 
preach the word and administer the sacraments 
in ways which make plain how God intervenes in 
the lives of men, transforming them by his creative 
power. 

This concept of the church as a historical com- 
munity of persons known as Christians directed 
thought about human institutions toward the feel- 
ings, hopes, and commitments that people shared 
in them. But the elements of coercion, discipline, 


and rationality, which Luther eliminated from the 
church, reappeared in his conception of political 
order. The general conclusion of scholars viewing 
the historical development of Lutheran thought 
and organization (Wolin 1960, p. 148) is that 
Luther’s tendencies to minimize the spiritual sig- 
nificance of political power and institutional policy 
encouraged in the Lutheran church support for a 
monopoly of all kinds of temporal power by the 
emerging national states and an uncritical accept- 
ance of political authority in its more negative and 
repressive forms. Luther was a revolutionary spirit; 
his thought reflects the paradoxes and discontin- 
uities of his own age. Because he saw the chief 
source of spiritual oppression in Roman ecclesi- 
asticism, he sought to build up the powers of the 
princes against the emperor and the papacy. In this 
battle he began to evolve a theory of the two realms 
—political and religious, the former to be char- 
acterized by an official, secular, rational ethic of 
the state and the latter to be guided by the authen- 
tic Christian ethic of the Sermon on the Mount. 
Luther’s more pietist and conservative followers 
found that in practice the gulf between duty and 
faith was difficult to bridge; opposition to injustice 
and tyranny was, therefore, postponed until such 
time as the state interfered with private and 
churchly matters of the spirit. 

Lutheran thought has generally been an inte- 
grating and conserving force for the established 
institutions of family, commerce, religion, and 
government. These are, in the Lutheran heritage, 
divinely ordained; they war constantly against the 
demonic powers in the world, Because they are 
divinely ordained, the Christian is to be obedient 
to them. Also, obedience to political rulers, in 
matters over which they properly have jurisdiction, 
is within the divine ordering of human affairs. 
Yet Luther did not assume that the established in- 
stitutions should be the exclusive instruments for 
the accomplishment of God’s will—what he was 
trying to guard against effectively was any effort 
to bring the natural institutions of life entirely 
under the judgment and control of the spiritual 
enterprise. In this respect he is to be distinguished 
from both ecclesiastical Christendom of medieval 
Catholic theory and the experiments in the holy 
commonwealth that were to spring up in the 
“free church” and Calvinist heritages. Although 
scholars have been able to find a place for a posi- 
tive view of the state’s welfare function within 
Luther's thought, it is clear he held no hope for 
the eventual triumph of Christian forces to the 
point where the government represented a pure 
Christendom. Neither the institutions of the church 
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nor politics can, in the Lutheran tradition, become 
substitutes for the gospel, nor can they be entirely 
changed to fit the requirements of the gospel. This 
is the Protestant principle which, perhaps more 
than any other, has characterized the movement's 
political thought. 


Sectarian political thought 


The reformation which Luther began stimulated 
the formation of an enormous number of small 
groups of earnest Christians who developed what 
Ernst Troeltsch ([1912] 1931, p. 691) termed the 
sectarian ideas, implicit in the thought of the great 
reformers, To this heritage belong all the Baptist 
and Anabaptist, Congregational, and “free church” 
revolts against Protestant orthodoxy. This second 
type of Protestant political thought was termed 
“sectarian” because it was representative of groups 
that cut themselves off from the established re- 
ligious institutions to give witness to doctrines 
interpreted in extreme terms, Some of the earliest 
sects (Baptist, Anabaptist, millenarian) retreated 
from attempts to reform or redeem the world and 
the churches, viewing them as incorrigibly corrupt 
and sinful. They therefore formed small voluntary 
communities committed to fulfilling the Sermon on 
the Mount and other scriptural imperatives. The 
extreme sects of the sixteenth century did not hope 
for the reconstruction of the political world but 
for its dissolution. Their members refused to use 
the courts in civil disputes, to serve in the army, 
or to associate in any way with the political order. 
In a few instances the sects turned to violence— 
for example, the followers of Thomas Miinzer 
joined briefly with the German peasants in violent 
revolt against the princes, arguing that the liberty 
of the Christian faith bore with it truly equal rights 
for all in the political order. 

Most of the sects which flowered in the seven- 
teenth and eighteenth centuries (Congregational- 
ists, Levellers, Methodists) were formed in the hope 
that voluntary communities of believers might in- 
fluence the political order by an appeal to all men 
to discipline their self-interests, to be reasonable 
in public debate, and to cherish freedom once they 
experienced it in religion. For sectarian liberalism 
the possibility of regeneration and renewal of in- 
dividual men through the power of the gospel and 
the fellowship of believers became the focus of 
concern, rather than the scope of self-interest 
and the persistence of national and institutional 
loyalties, which preoccupied the Lutherans. None- 
theless, the sects considered themselves as part of 

the Reformation and appealed to its principles of 
the priesthood of all believers, the freedom of Chris- 


tian conscience, openness to the fresh blessings of 
the Holy Spirit, and the possibilities of brotherhood 
in the political order. 

Many scholars (Nichols 1951, p. 3; Bennett 
1958, p. 147) have viewed this sectarian heritage 
as the seedbed of liberal, democratic politics, while 
noting that sectarian ideas of the seventeenth and 
eighteenth centuries are not to be equated with 
Lockean politics or the individualism of nineteenth- 
century American Protestantism. The principle of 
human equality was implied in the sectarian em- 
phasis that all believers are priests capable of 
contributing to the interpretation of God’s will and 
response to it. But this did not mean that all men 
are alike in standards and values. It meant that 
all men are alike in being children of God, that 
each has something incomparable and unique 
about him, a particular purpose to his work and 
life which God will make known. 

Democratic principles of government by consent 
and of constitutional safeguard for freedom of ex- 
pression were formulated by Protestants out of 
their experiences in self-governing, autonomous 
congregations of Independents, Levellers, Quakers, 
etc., in the seventeenth century, These congrega- 
tions developed and put into practice concepts 
which had been implicit in the Church’s teachings 
from the beginning, such as the duty to follow 
one’s conscience or inner light and to have one’s 
acts open to critical discussion by the community. 
In these congregations the Scriptures were not left 
to private interpretation of each member or to the 
authority of the clergy. The will of God for society 
had to commend itself to the whole fellowship of 
believers, To discern God’s will demanded a diver- 
sity of gifts and the liberty of men to agree and 
disagree without resort to violence. Thus, the freely 
negotiated contract was, in economic and political 
life, the characteristic institution of the liberal 
society. 

The liberal sectarian faith reached its greatest 
influence in American Protestantism of the nine- 
teenth century. The optimistic belief of the move- 
ment that the rule of God could be established 
in society by voluntary action became pronounced 
in a nation which had a sense of new beginnings 
in a frontier land. The Methodist church, founded 
by John Wesley, with its stress upon change of 
heart and conversion of the individual as the key 
to social salvation, was the fastest growing denom- 
ination in nineteenth-century United States. In 
American Protestantism the underlying assumption 
of both religion and culture is that the religious 
convictions of individuals freely gathered in 
churches and acting in voluntary associations will 
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permeate the cultural—political order by persuasion 
and example. 


Calvinist political thought 

The political problems which Luther and the 
sects of the Reformation bequeathed to John Calvin 
centered on a developing crisis in the concept of 
order and in the Western traditions of civility. 
Lutheran and sectarian criticism of Roman Catho- 
lic Christianity had focused upon a demand to free 
the individual believer from a mass of institutional 
controls and traditional restraints. This liberation 
had encouraged the development of a conception 
of the religious community as a fellowship bound 
together by ties of faith, love, and the worshiped 
presence of Christ, but it had invited avoidance, 
indifference, even antagonism toward the harsher 
political realities. In this crisis of order, Calvin 
put forward a system of ideas which stemmed the 
flight from civility. Calvin provided Protestantism 
with a fully developed doctrinal and political sys- 
tem as an alternative to Roman Catholic thought 
and organization. While Luther spoke in anguish 
of the great dilemmas of faith—freedom and au- 
thority, anxiety and justification—Calvin belonged 
to the second generation of Protestants, concerned 
with systematic innovation in moral conduct and 
social organization. 

In his “Dedication” to the Institutes, Calvin made 
clear that he belonged to the Reformation, reflect- 
ing Luther's statement of the primary principle of 
Protestantism: “While we make use of their [the 
Church Fathers’) writings, we always remember that 
‘all things are ours’; to serve us, not to have domin- 
ion over us, and that ‘we are Christ's alone,’ and 
owe him universal obedience. He who neglects this 
distinction [from the papal tradition] will have 
nothing decided in religion.” But the whole thrust 
of Calvin's thought is that Christianity constitutes 
far more a faith of discipline and obedience than 
of justification. “And doubtless this is the priest- 
hood of the Christian pastor, that is, to sacrifice 
men, as it were, to God, by bringing them to obey 
the gospel, and not, as the Papists have hitherto 
haughtily vaunted, by offering up Christ to recon- 
cile men to God” ([1540] 1948, p. 527). 

) In Calvinism and the reformed-church tradition 
in Protestantism, Christianity was neither a priestly 
communion with God nor a voluntary fellowship 
cn a dan but rather a social religion. 
PRA E en was an elaboration 
E ple tha e Christian faith would be 

plete without an institutional structure to 
express religious convictions effectively in the soci- 
ety, possessing the power and control needed to 


insure the solidarity of the group. The Calvinists 
viewed the Lutheran church as too vulnerable to 
political interference and the sects too split by the 
internal disorders and confusions of egalitarian 
democracy. The Calvinist church polity aimed at 
self-sufficiency without divorce from political life, 
at active membership and strong leadership without 
papal and clerical domination. Ecclesiastical and 
civil government were to be more analogous than 
antithetical, different in objectives and spirit but 
not in the necessities of law, authority, and power. 
Calvin often spoke of the church as a common- 
wealth, a counterpolity to the state, which cohered 
fundamentally by virtue of a transcendent spirit 
working through the “saints,” who had joined Christ 
to form a corpus mysticum, activated and disci- 
plined enough to “take the earthly kingdom and 
transform it.” [See the biography of CALVIN.) 

The greatest innovations in sixteenth-century 
politics (e.g., the English revolution led by Oliver 
Cromwell) stemmed from this theory of the church. 
The principal political idea of the revolution of the 
saints, as one historian of Puritanism has noted 
(Walzer 1965, p. 1), was that “specially desig- 
nated and organized bands of men might play a 
creative part in the political world, destroying the 
established order and reconstructing society ac- 
cording to the Word of God or the plans of their 
fellows—this idea did not enter at all into the 
thought of Machiavelli, Luther or Bodin.” The 
activism of the Calvinist saints—Genevan, Hugue- 
not, Dutch, Scottish, Puritan—marked the trans- 
formation of politics into a calling or work that 
could over a lifetime express one’s deepest religious 
and moral convictions. 

The Calvinists changed the emphasis of political 
thought from the king and prince, governing a 
traditional society of nonparticipating, inactive 
men, to a band of saints, seeking through inde- 
pendent political action to transform and construct 
a new, better world. The Calvinist theory provided 
the emerging bourgeois, public officials, and intel- 
lectuals with a view of the world in which their 
concern, efficiency, and ideas would be respectable 
and responsible. The medieval-feudal society, 
against which the saints revolted, could be re- 
formed only from above, by changes in the hier- 
oe Sa and monarchs, or from outside, by 
eae Spe Lesser men—magistrates, 

. priests, merchants, etc.—were 
ai = members of an organic, natural 
stablished patterns of hierarchy and 

deference. Not until the Calvinists did a political 
and theological system of thought develop which 
suggested that laity should express their deepest 
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personal, moral, and religious convictions in free, 
methodical, rational association and work in the 
political order. What Calvinists said of the saint, 
other men would in time say of the citizen and his 
civic virtue. 

This kind of political organization and person- 
ality would have been impossible if Calvinism had 
not broken from both the Lutheran teaching that 
government was chiefly a mighty instrument of 
repression and from sectarian insistence that the 
political order must either conform to the law of 
love in Biblical Christianity or be forsaken. Calvin’s 
thought was based on a realistic conception of the 
wickedness of men and their eternal need for re- 
straint and expressed hopefulness in the possibili- 
ties of dedicated men bringing something of the 
divine command for a holy commonwealth into 
the political order. Calvinism had a conception of 
“the common grace” that enabled men to live to- 
gether in civil society and that made it cautious of 
judging non-Christian secular governments as un- 
worthy of the saints’ loyalty and concern. Calvinist 
speculations were rooted in acknowledgment of 
the independent value of the political world; be- 
liever and nonbeliever alike needed civil order 
encouraging peace as well as piety, moderation as 
well as passionate commitment. But the civil order 
also needed, in the eyes of the saints, men who 
took as their standard the compassion and justice 
of the holy commonwealth, who sought to fashion 
a corporate community neither religious nor secu- 
lar but a complex compound of both. 


Contemporary Protestant political thought 


Each type of Protestant political thought—Lu- 
theran, sectarian, and Calvinist—has maintained 
a dynamic and vitality of its own to the present 
time, although some scholars dispute the persist- 
ence and strength of these types in the midst of 
pervasive processes of secularization and urbani- 
zation in the larger society and of movements toward 
ecumenical theology and professionalization of the 
ministry, Some (e.g., Herberg 1955) have seen 
strong tendencies in the American scene toward 
the integration of Protestantism into a cultural— 
national religion in the liberal sectarian tradition. 
Other, empirical studies (Glock & Stark 1965, 
p. 86) reveal significant differences between de- 
nominations originating in the three great tradi- 
tions, Historians and theologians engaged in re- 
study of these traditions have not only evidenced 
a concern with understanding the distinctive in- 
sights of each phase of the Protestant movement 
but have also been engaged in a process of reinter- 
pretation which reflects a heightened consciousness 


of the useful insights in all of these traditions for 
clarifying the Protestant identity and action in the 
contemporary scene and for seeing aspects of a 
given religious tradition previously ignored (H. R. 
Niebuhr 1951), Intensive studies of social and re- 
ligious interaction in political crises involving all 
the churches indicate that the varied traditions are 
not without influence in contemporary communi- 
ties and are increasingly viewed by influential 
clergy and laity as possessing complementary or 
mutually significant values in the work of the whole 
church and in the formation of public policies 
(Underwood 1957). 

The emphasis of the Lutheran Reformation upon 
extricating the church from the world is seen now 
as a continuing need to free the gospel from the 
proprietorship of either church or state and to probe 
the heights and depths of personal faith and con- 
science in culture. The chief impact of neo- 
Reformation theologians such as Karl Barth has 
been to help Protestants view Christianity not sim- 
ply as a human cultural phenomenon called re- 
ligion but as the faithful response of men to Christ, 
revealed in the Scriptures as the living God, a being 
who acts in the totality of history and in a fashion 
transcending the wisdom of all his interpreters and 
the social forms of all his churches. 

In the fragmentary writings of Dietrich Bon- 
hoeffer, the pastor executed in Germany for his 
participation in a plot to assassinate Hitler, are to 
be found perhaps the most radical formulations 
of the implications for Christians of Barth’s fidelity 
to Christ as the final revelation of the word of God 
and as the paradigm by which men may try to 
shape their lives. Bonhoeffer saw himself as re- 
flecting a historical movement, beginning about the 
thirteenth century, “towards the autonomy of man 
[which] has in our time reached a certain comple- 
tion. Man has learned to cope with all questions of 
importance without recourse to God as a working 
hypothesis” ([1951] 1953, p. 145). Bonhoeffer was 
unhappy with all current “metaphysical and in- 
ward” formulations which theologians use as an 
explanation of what is happening in science, 
the arts, politics, and religion. Bonhoeffer particu- 
larly rejected those concepts which focused Chris- 
tian hope upon salvation from “sin and death into 
a better world beyond the grave” (ibid., p. 154) 
and upon Christian acts as being “religious in a 
particular way, to cultivate some particular form 
of asceticism (as a sinner, a penitent or a saint).” 
To be Christian, for Bonhoeffer, was “to be a man. 
It is not some religious act which makes a Chris- 
tian what he is, but participation in the suffering 
of God in the life of the world” (ibid., p. 166). 
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What such participation meant for political thought 
and action is not very clearly developed. Tradi- 
tional Lutheran terms such as “orders” of religion 
and politics were rejected as implying too rigid and 
abstract a view of where the commandments of 
God were present. Bonhoeffer preferred the term 
“mandate” as a “concrete divine commission.” The 
mandate could come to a man in an institution or 
office if these concepts were purged of their con- 
servative overtones of rational—legal authority and 
if they made it possible for committed men to 
act as each new situation required. 

The contemporary restatements of the liberal 
sectarian tradition have noted the way in which 
voluntaristic, individualistic ideas of nineteenth- 
century Protestantism were exaggerated and for- 
malized in ideological rejection of the use of the 
state to balance economic interest for the sake of 
justice, to meet new welfare needs of people in 
urban industrial societies, and to provide basic 
liberties to neglected racial and ethnic minorities. 
The studies and assemblies of the World Council 
of Churches and other ecumenical bodies and in- 
terpretation of these studies by ethicists and theo- 
logians (e.g. Bennett 1958) have related the univer- 
sal concerns of the historical liberal tradition to 
other traditions of a Protestantism no longer char- 
acterized by sects but by denominations represent- 
ing members of a changing spectrum of social and 
religious backgrounds. The basic concern retained 
from tradition has been the acute awareness of 
the insufficiency of outward forms of order and 
justice as political goals. This tradition has kept 
alive the intuition that people may experience 
estrangement and personal disorganization in the 
midst of economic abundance and civil liberty; 
that the public resources are ultimately for the 
nourishment of communities of faith, vision, and 
love; and that law serves not as retribution for its 
own sake, but as a prologue to those redeeming 
acts of vicarious responsibility for public acts ante- 
cedent to the violation of law (Harding 1956, p. 67). 

Since the 1930s the major figure in the contem- 
porary reformulation of Protestant political thought 
has been Reinhold Niebuhr. He criticizes conserva- 
tive, pietistic theology as too pessimistic about the 
human situation to see the possibilities of justice 
and compassion in civil society. Those who express 
such pessimism are so afraid of disorder that they 
will permit public authorities to stifle all minority 
dissent for fear of anarchy. On the other hand, 
Niebuhr criticizes utopian, liberal-democratic the- 
ory for its incapacity to see that the profound 
collective self-regard persists, even in the moral 
ideals advocated by economic, national, or religious 


groups; American illusions of omnipotence and 
moral concern for other nations have as much 
possibility, in his judgment, of shattering nuclear 
peace as the illusions of closed, dogmatic societies. 
Niebuhr’s own Calvinistic realism is epitomized in 
his observation that “man’s capacity for justice 
makes democracy possible; but man’s inclination 
to injustice makes democracy necessary” (1944, 
p. xi). 

Scholars who have attempted to foresee future 
developments in Protestant thought (Meyer 1960; 
West 1958; Williamson 1964) have sensed the 
limits of the ability of political realism to affect the 
wellsprings of personal and cultural renewal or to 
touch the sources of faith and conversion, The 
search is for images and facts which reveal alter- 
natives of concrete political action rather than for 
abstract, idealistic theological formulations. What 
do the historical principles of the faith mean to 
men who have the power and organization to 
create economic growth in new nations, to discover 
areas for mutual cooperation between nations with 
strong ideological differences, to use space and 
technology in cities, with aesthetic and moral 
purpose? 

Protestantism is not likely to be able to view 
itself in any nation in the future as the religious 
expression of the common aspirations of a culture. 
Efforts of Protestants to envision and realize new 
political possibilities and to encourage careers of 
public service may well reflect something of older 
Calvinist hopes, but these efforts must now com- 
mend themselves to non-Protestants as well as to 
heirs of a particular history. In this situation, Prot- 
estant thought is likely to pay greater attention to 
conditions of religious liberty and social creativity 
not dependent upon a people’s being Christian or 
religious but rather upon their being human and 
politically responsible. Protestants can be expected 
to extend their study of the relations of non-Western 
religions to public life and to the ecumenical move- 
ments within the Christian church. 


KENNETH W. UNDERWOOD 


[See also CHRISTIANITY and the biographies of BODIN; 
Cavin; and LUTHER. A guide to other relevant 
material may be found under RELIGION.] 
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PROUDHON, PIERRE JOSEPH 


Pierre Joseph Proudhon (1809-1865), French 
socialist, was born in the city of Besancon. He be- 
longs, with Fourier, Considérant, and Cabet, to that 
group of social reformers of the first half of the 
nineteenth century who came from Burgundy and 
Franche-Comté. 

Proudhon’s father was a local brewer, and his 
family fits into a social category that was then 
widespread—artisans who owned a few patches of 
land and rose to the level of shopkeeping when 
their fortunes improved. But Proudhon's father was 
too honest to succeed; he sold his beer for no more 
than the malt, the hops, and his subsistence cost 
him, and after a very little time he went bankrupt. 
Crushed by burdensome mortgages, the family was 
destitute, Proudhon, a brilliant student, would come 
home from the lycée to find his parents having 
bread and water for supper. He was able to con- 
tinue his studies only by becoming a printer's fore- 
man and proofreader, and he absorbed all the mate- 
rial he handled. The knowledge he acquired in this 
fashion was extremely broad, but ill-organized. He 
was 29 before he passed the baccalaureate exam- 
ination and received a scholarship from the Suard 
fund, awarded by the Académie de Besangon to 
poor, hard-working young men. Thanks to this 
scholarship, Proudhon was able to keep on with 
his studies and to begin to publish. 

This was a difficult period for him, After an 
unsuccessful attempt at being an independent 
printer, he worked for a small shipping firm in 
Lyons. The company sent him to Paris, where he 
met liberals, communists, and socialists, as well as 
German exiles who introduced him to the philos- 
ophy of Kant and Hegel. 

He played only a peripheral role in the revolu- 
tion of 1848. After being defeated in the elections 
to the National Assembly, he was elected in the 
June supplementary elections, when he ran with 
Thiers, Victor Hugo, and Louis-Napoleon Bona- 
parte. He protested in the Assembly when the 
“national workshops” and “right to work” programs 
were discontinued. Primarily, he expounded his 
ideas in the press and published Le représentant du 
peuple, a newspaper whose name was changed 
first to Le peuple and later to La voix du peuple. 


In March 1849 he was sentenced to three years 
in prison for having published articles slandering 
the chief of state. While in prison, he married a 
Parisian working girl. His imprisonment was more 
like an enforced residence, for it did not prevent 
him from publishing. After further trials and con- 
victions, he went into exile in Belgium. He became 
ill in 1856 and endured several years of poverty. 
When in 1862 his hostility to nationalist separa- 
tism provoked sharp criticism in Belgium, he was 
pardoned by the emperor and returned to France. 
He died in 1865, killed by the mental strain and 
hardship of his restless life, 

Differences from other socialists. Some of Proud- 
hon’s ideas departed sharply from those common 
among the socialists of his time. First of all, he 
did not accept the general socialist advocacy of 
free love and common-law marriage. Instead, 
Proudhon had an extremely conventional concep- 
tion of the family; he extolled family happiness 
and advocated the strictest possible discipline 
within the family. He entirely rejected feminism 
and believed that women should confine them- 
selves to performing household duties and raising 
children. 

Second, Proudhon differed from other socialists 
in that his thought had a classical cast. He was 
influenced by the Bible, the ancient classics, and 
the great writings of the seventeenth and eight- 
eenth centuries. He was a typical French craftsman 
rather than a workingman, belonging to the petty 
bourgeoisie. Marx was being nasty when he called 
Proudhon petit bourgeois, but he was near the 
mark: this epithet indeed characterizes both 
Proudhon’s position in society and his moral 
temperament, 

Like other socialists, Proudhon was an atheist, 
but he was a Christian atheist. He was a member 
of the Sincérité, Parfaite Union et Constante Amitié 
lodge of the Masonic order in Besangon; his open 
profession of atheism concluded with this famous 
denunciation: “God is folly and cowardice; God is 
hypocrisy and falsehood; God is tyranny and 
misery; God is evil” (Oeuvres complètes, vol. 2, 
p. 384 in the Riviére edition), However, what 
seemed to Proudhon’s contemporaries to be a total 
and permanent breach with Christianity now ap- 
pears to have been an attempt at dialogue. As 
Lubac put it, Proudhon did not permit the Christian 
to turn away from his worldly tasks but rather 
compelled him to meditate constantly, as did 
Proudhon himself, on eternal problems (1945). 

As Cassou put it (1939), Proudhon embodied all 
the restlessness and strength of the French artisan, 
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with his impassioned and captious thirst for jus- 
tice. Proudhon’s writings are well described by the 
epigraph he himself gave them: Destruam et aedi- 
ficabo. The destruction is anarchism; the new 
edifice, mutualism and federalism. 

Finally, Proudhon was the best writer among 
the socialists. He had a natural eloquence, power- 
ful and caustic, Saint-Beuve was not wrong in re- 
garding Proudhon as one of the great figures of 
nineteenth-century literature. 

Proudhon’s anarchism. Proudhon condemned 
all governments and denounced the very existence 
of the state. Insofar as it is a system of order 
through power, the state derives its authority from 
the family. Appropriate as such authority is to the 
family, it becomes in the state an “artificial system, 
varying with different centuries and climates, yet 
alleged to be natural and necessary to humanity” 
(Oeuvres completes, vol. 1, p. 297 in the Riviére 
edition). This system is in fact intrinsically evil 
and incorrigible. It is vain to struggle against its 
abuses; such a struggle merely aggravates them. 
Proudhon had an especially lively antipathy toward 
democratic reformers because he felt that their 
faith in suffrage only serves to camouflage reality, 
to conceal the impossibility of making the state 
consonant with the people’s wishes. 

Proudhon clearly perceived the void created by 
his condemnations. As reported in La voix du 
peuple on December 29, 1849, when he was asked, 
“What will you set up in place of the State?” he 
answered, “Nothing. Society is perpetual motion. . . . 
It contains its own spring, always wound, as well as 
its own balance wheel” (ibid., vol. 1, pp. 416-417). 
This “motion” derives from the economy, which 
takes precedence over politics. As Proudhon de- 
scribed it in De la capacité politique des classes 
ouvrières, first published in 1865, a few months 
after his death, the relationship between the econ- 
omy and politics is analogous to the physiological 
relationship between the functions of organic life 
and those of spiritual life. It is by virtue of the first, 
of the organic functions, that is, of economic life, 
that a society exists. 

However, Proudhon, unlike the liberals, did not 
expect that once men were free of the state, they 
would spontaneously create a new economic order 
by entering into mutual agreements. Mutualism 
had to be organized. 

Mutualism and federalism. Proudhon proposed 
a mutualist organization of the economy which 
would substitute an egalitarian liberalism, founded 
on free credit (granted by the “people's bank”) 
and natural exchange of services, for the economic 
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irresponsibility of unorganized individualism, Again 
in De la capacité politique, he asserted that society 
must be considered not as a hierarchy of functions 
but as a system in which free forces are in an equi- 
librium based on the enjoyment of equal rights in 
exchange for the discharge of equal obligations 
and the enjoyment of equal advantages in ex- 
change for the performance of equal services. Only 
mutualist institutions freely created in accordance 
with reason and experience will create order in the 
chaotic tangle of existing economic relationships. 

Fundamental economic rights, public rights, im- 
mediately follow from this conception, according 
to Proudhon. Since he rejected government as a 
source of guarantees, the problem is how to substi- 
tute mutual guarantees for those based on au- 
thority. Federalism, that is, mutualism transferred 
into the realm of politics, is the solution. 

Far from living in isolation, individuals will 
form middle-sized and sovereign groups based on 
territorial considerations, as well as larger eco- 
nomic units called agricultural—industrial com- 
munities based on the free exchange of services. 

Proudhon defined the federal contract with par- 
ticular care in his Du principe fédératif of 1863. 
Federation (from the Latin word foedus, or pact, 
contract, treaty, convention, alliance, ete.) is a 
covenant by which several groups of communities 
or states bind themselves reciprocally and equally 
for one or several purposes. 

Unlike the social contract, the federal contract 
involves only limited renunciation of individual 
liberty. In forming the pact, the contracting parties 
retain more personal and real rights than they sur- 
render. Proudhon’s political regime rules out any 
surrender or transfer of rights, which in his opinion 
is the consequence of the classical form of repre- 
sentative government. He asserted that it is im- 
possible to delegate that which is most essential to 
human beings: the right to have one’s Say, to 
negotiate directly, individually, for oneself. 

To the extent that the Proudhonian concept 
tolerates any deputies, they would be like the 
proxies of private law. Proudhon condemned the 
notion in revolutionary public law that a deputy 
represents the abstract “nation” and nothing else. 
Each deputy is, first of all, the man of the district 
or profession that chose him as its representative— 
one of its citizens and its special trustee author- 
ized to defend its particular interest. In order for 
suffrage to be direct, it is not enough that the voter 
directly give his vote to the man elected; it is also 
necessary that the deputy represent, no less 
directly, opinions, rights, interests, and business. 


For a state, a society, is not solely a matter of will; 
it also consists of more material concerns, 

The final result of anarchist federalism is the 
dissolution of the state into multiple autonomous 
groups having their own administration. Under 
these circumstances, the principle of authority 
tends to disappear. The state (res publica) is placed 
on the immutable foundation of law, of corporate 
and individual local liberties. The government 
actually no longer exists. The community will be 
set up as an autonomous administration, for it is 
“in its essence, like man, like the family, like each 
individual and each intelligent collectivity, moral 
and free, a sovereign being” (Oeuvres complètes, 
vol. 4, p. 265 in Riviére edition). It will issue de- 
crees, pronounce decisions, and will even make its 
own laws. There is no middle course. The com- 
munity must be sovereign or subordinate, all or 
nothing. 

Proudhon’s influence. The evolution of states 
in the nineteenth century, moving toward central- 
ization domestically and toward nationalist sov- 
ereignty in foreign affairs, hardly corresponded to 
Proudhon’s expectations. Nor did political social- 
ism develop along Proudhonian lines; not only 
were his long-term views on federal and anarchic 
organization ignored, but so were his short-term 
instructions for boycotting elections. 

On the other hand, over the long run, Proudhon 
has proved to be a prophet. He foresaw “a thousand- 
year purgatory” for mankind if federalism were 
rejected. He also foresaw the importance of labor 
unions, proclaiming that “workshops will replace 
the government.” The substitution has not been 
complete, but the twentieth century has seen the 
development of great collective forces that have 
influenced every aspect of public life. 


The continuing interest in Proudhon’s work vin- 
dicates his view, expressed in De la capacité poli- 
tique, that ideas succeed only if people cling to 
them and turn them into institutions and customs 
which legislators can then turn into laws. 


MARCEL PRELOT 


[For the “historical context of Proudhon’s work, see 
ANARCHISM; SocIALISM; and the biographies of 
Marx and SorEL.] 
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1. THE FIELD 
u. CHILD PSYCHIATRY 
mm. SOCIAL PSYCHIATRY 
Iv. FORENSIC PSYCHIATRY 
v. THE RELIGIO-PSYCHIATRIC 


MOVEMENT Samuel Z. Klausner 


I 
THE FIELD 


For the last hundred years psychiatry has been 
considered a specialty of medicine usually defined 
as the medical practice or applied science of treat- 
ing and preventing mental diseases, or disorders of 
the mind. Today, however, Masserman’s definition 
(1946) is more appropriate; it considers psychi- 
atry broadly as a science that deals with the vari- 
eties of human behavior, their determinants, 
methods of analyzing them, and the techniques for 
coordinating behavior with optimal personal and 
social goals. The disparity between these two defi- 
nitions indicates that profound changes have taken 
place in psychiatry. It is presently a peculiar ad- 
mixture of biology, psychology, and sociology. 


History 

In the late eighteenth century psychiatric pa- 
tients were released from chains and removed from 
dungeons, where they had been treated like ani- 
mals and had died prematurely. Not until about 
1820 (Kraepelin 1917), however, were some 
patients considered curable by medical means and 
no longer considered to be animals with innate 
weakness and baseness, or voluntary pursuers of 
evil whose insanity was contagious to their keepers, 
About the same time, psychiatric textbooks ap- 
peared and medical students were given instruc- 
tion in a wide variety of physical therapies. The 
subject of psychiatry became less a concern of phi- 
losophers and more a medical discipline ( Schneck 
1960). 

Although the strictly biological and medical ap- 
proaches were pursued early, it was late in the 
nineteenth century before hypnosis, introspection, 
psychotherapy, and psychoanalysis placed the em- 
phasis on deeper levels of mentation rather than 
on behavior, and the focus centered on the psy- 
chology of the individual. It is worthy of comment 
that the psychologies and sociologies that took sig- 
nificant hold in psychiatry were those developed 
by psychiatrists themselves. Academic psychologi- 
cal and social sciences had little influence on the 


608 PSYCHIATRY: The Field 


field and, in fact, were more stimulated by, and 
receptive to, psychiatric thinking than they were 
contributory. 

These sciences were also still in the early stages 
of development and concentrated on matters of 
little concern to the psychiatrists, Psychology was 
in the almost exclusive phase of “brass-instrument” 
studies of the mechanisms of behavior and animal 
experimentation. Anthropologists were describing 
customs of primitive populations, and cultural- 
personality conjunctions were still vague. Neither 
psychology nor sociology could assist in the study 
of the mentally healthy or ill. Medical tradition, 
however, directed psychiatrists to reason from the 
abnormal to the normal. Psychopathology furnished 
insight into psychodynamics and into healthy struc- 
ture and function, 

Recent scope. In the last several decades the 
direction of influence has shifted, at least enough 
to indicate that the commerce between psychiatry 
and the social sciences traverses in two directions. 
As dynamic psychiatry has moved from concentra- 
tion on unconscious instinctual representations, 
drives, and id forces to concern with stress re- 
Sponses, coping devices and defenses, signal anxi- 
ety, and reality testing of modern ego psychology, 
it has attempted to incorporate the vast amount of 
information acquired by psychologists about ego 
functions. As psychiatrists have learned that the 
functions and structures of the individual in the 
process of development are derived from experi- 
ences with people, that the major stresses of life 
arise from disturbed or broken relations with peo- 
ple, and that therapeutic processes occur best in a 
social setting, social sciences have become impor- 
tant for the understanding of man and his prob- 
lems. Therefore, these fields have recently fused 
in some areas into new hybrids, social psychology 
and social psychiatry (Rennie & Woodward 1948). 
In fact, the definition of Psychology emanating 
from Harvard University (1947) differs little from 
the ideal definition of psychiatry—a biopsycho- 
social discipline (Murphy 1947; Cameron 1963). 

What is currently included within the field of 
psychiatry actually ranges from biodynamics to 
existentialism, In fact, psychiatry seems to involve 
a study of all human behavior and is as broad as 
life and its social and cultural derivatives, Behavior 
as the central focus of psychiatric research is not 
elucidated by a fractionation of man as a totality 
isolated from his environment of things, people, 
and symbols ( Goldstein 1934). Furthermore, it 
does not ignore the nervous, circulatory, and endo- 
crine systems, the visceral and somatic organs, 
and their constituent cells and fluids. Unfortunately 


there is no unified theory, so that controversy still 
exists between “reductionists,” who hope to explain 
man’s behavior exclusively on a physicochemical or 
cellular basis, and “extensionists” or “humanists,” 
who focus entirely on society and culture. These 
are polarities leading to futile arguments, The bio- 
logical or the social sciences alone cannot explain 
behavior, but psychiatry cannot progress without 
them (Grinker 1964), 


A point of view 


We should approach living human beings as if 
they existed in a total field of multiple transactions, 
thereby avoiding the dichotomies of nature—nurture, 
organic—functional, lower-higher, or reduction— 
extension (Dewey & Bentley 1949), Furthermore, 
in dealing with multivariable problems, we can 
operationally behave as if we really believed in 
multicausality of both healthy and disordered func- 
tion, within a total field whose constituents range 
from physicochemical to symbolic foci. We are then 
able to study each focus or system (for example, 
nervous, endocrine, cognitive, behavioral, and emo- 
tional) in terms of structure, function, integrative 
capacity, and transactions with one or more other 
systems. The specific or exclusive importance of 
one view or system of empirical events becomes 
minimized when we recognize that change in one 
system affects all others. However, one view or one 
technique has only limited usefulness, for although 
each system has considerable commonality in es- 
sence with all others, each also has its specific 
regulatory devices approachable only by special 
techniques (Grinker 1961). Multidisciplinary teams 
are essential for multisystems investigation at the 
present level of sophistication with respect to the 
large number of variables involved in human re- 
search (New York Academy of Medicine . . , 1957). 

If psychiatry is a part of human behavioral sci- 
ences, it is differentiated operationally by its focus 
on the individual in relation to some other person 
or persons, things or situations, even though we 
know he can be isolated only temporarily for in- 
vestigation from other foci in his life, just as the 
physiologist isolates a single organ or organic sys- 
tem for investigation. 

In this temporary isolation the individual may be 
studied through the methods of observation, de- 
scription, experimentation, and quantification in 
transaction with another focus by observers using 
the necessary techniques from a suitable frame of 
reference. Thus some parts—for example, endo- 
crine processes, autonomic neurological activity, 
and muscle tension—may be studied at the same 
time as observable or reported behavior. In another 
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example the behavior of the individual may be 
viewed in relation to the behavior of others. One 
focus or another may be utilized in the transac- 
tional analysis, and step by step the functional 
parts of man can be studied in relation to various 
behaviors until eventually the entire field may be 
covered, When physiology is considered in relation 
to emotion, the social group is absent; when the 
individual is viewed in his group, his component 
parts fade. On the other hand, the social scientist 
loses sight of the individual. To envisage the entire 
field requires a rapid shift of accommodation in 
mental imagery, in which it is necessary (but is 
rarely done) to state explicitly what is of concern. 


Theory in psychiatry 

Although it is well known that theory and inves- 
tigatory methods interact by continuous feedback 
processes, in the field of psychiatry theoretical ab- 
stractions and hypotheses are rarely explicit but 
usually must be extracted from the operational 
procedures in use, Furthermore, because of the 
large number of foci or facets in the total field to 
which various disciplines hold proprietary rights, 
theory is not unified but is partial or fragmented. 
Psychoanalytic theory, because of its enormous in- 
fluence on modern psychiatry and its vast scope, 
has been given high priority. 

Psychoanalysis and its derivations. Horace Kal- 
len (1934), in the Encyclopaedia of the Social Sci- 
ences, stated that psychoanalysis had no quanti- 
ties, meaning systems of measurement, and could 
never be a science, He stated that inferences were 
employed as agents in their own verification. Psy- 
choanalysis was considered a faith and a dogma; 
psychoanalysts were judges of their own truth and 
of the criteria establishing its validity. 

Although Freud’s psychoanalytic science and 
metapsychology were speculative and the methods 
of psychoanalysis may be unreliable, Freud estab- 
lished the principle that psychological processes 
should be studied by psychological methods. Even 
though the data may be difficult to verify, and his- 
torical anecdotes may lead to invalid inferences, 
Freud did develop a new paradigm suitable to man 
as an animal capable of symbolic transformation, 
which in essence was a scientific revolution and a 
generative idea. Unfortunately, Freud was incapa- 
ble of systematic presentation, and psychoanalytic 
theory and its modifications are criss-crossed by 
shifts to the new and reaffirmations of the old, 
written in poetic style interspersed with metaphors 
and confused by circular reasoning, 

There is no use in attempting to formulate the 
totality of psychoanalytic theory. It is even difficult 
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to extract precise declarative sentences suitable for 
the development of testable hypotheses, although 
the attempt has recently been made by Rapaport 
(1959). In general, it can be stated that psycho- 
analysis is not a philosophy and is not yet a science. 

Despite justifiable criticism, psychoanalysis has 
had a profound effect on all of the psychological 
and social sciences (Hendrick 1934). It avoids re- 
ductionism, yet it is based on a theory of instincts 
(life and death), is motivational, and focuses on 
the present personality structure in the light of the 
individual's past (structural), Psychoanalysis has 
a system of allocated mental functions: id, ego, 
and superego (topological), It deals with forces 
and conflicts (dynamic). Finally, it views the re- 
sultant personality structure in adaptational terms 
(economic). 

The several early deviants from Freud's classical 
psychoanalytic theories (Alfred Adler, Carl Gustav 
Jung, and Otto Rank) have had varying effects on 
psychiatry, psychology, and the social sciences 
(Munroe 1955). Adler’s concept of “individual psy- 
chology” formed the basis for a special therapeutic 
approach, which has been so diluted that it is now 
hardly recognizable as a separate school, Never- 
theless, the concept of the “will to power” was the 
precursor of Freud’s later theory of the death in- 
stincts and their mastery through outwardly ex- 
pressed aggression, Jung’s notion of the “collective 
unconscious” became more of a mystique and less 
of a theoretical or operational approach in psychi- 
atry. It has become aligned with existential psy- 
chiatry, Rank’s concept of “birth trauma” led to a 
short-lived therapeutic procedure of establishing a 
patient—therapist relationship and setting a termi- 
nation date well in adva..ce, so that impending 
separation from the therapist might enable the 
patient to work through a new “birth trauma” under 
better conditions. [See PsycuoLocy, article on 
EXISTENTIAL PSYCHOLOGY; and the biographies of 
ADLER; JUNG; RANK.] 

The so-called Neo-Freudians (Harry Stack Sulli- 
van, Frieda Fromm-Reichmann, Erich Fromm, 
Karen Horney, etc.) form a heterogeneous group 
with many divergencies from classical psychoanaly- 
sis, The interpersonal or interactional theory of 
Sullivan gave a great impetus, including the estab- 
lishment of a “school,” to the treatment of schizo- 
phrenic psychoses, The Neo-Freudians attend more 
to the difficulties of present interpersonal processes 
(social) than to lengthy reconstructions of the 
past. [See the biographies of HORNEY; SULLIVAN.] 

Ascendancy of psychoanalysis—a critique. It is 
unfortunate that contemporary psychiatry has 
almost completely adopted the psychoanalytic or 
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psychodynamic model. Psychodynamics is pur- 
ported to be the basic science of psychiatry, It is 
frequently stated that the core process in psychiatry 
is a dyadic “depth” relationship in which diagnosis, 
prognosis, therapy, and research are processed, a 
technique that utilizes a participant—observer of 
uncertain reliability. 

This “dynamic” approach within a dyad has 
superseded and even resulted in a depreciation of 
the basic psychiatric techniques through which so 
much progress was made in the nineteenth cen- 
tury. These are the methods of careful and con- 
trolled observations over time and accurate descrip- 
tions of behavior, including verbalizations but not 
limited to them. 

According to these basic techniques, observation 
of behavior rather than inferences about feelings 
is the keystone of psychiatric research. Behavior, 
in actuality, represents functions allocated to a 
hypothetical ego that filters perceptions on the one 
hand and actions on the other, expresses reportable 
motivations, affects, defenses, and compromises, 
employs symptoms and sublimations, and demon- 
strates integrative capacities and disintegrative 
trends. About any of these effects there need be 
few inferences or interpretations. As observational 
material, they can be coded and rated, repeated 
and replicated, and tracked through time. These 
basic living data can then be interpreted according 
to fruitful theory and placed in juxtaposition to 
any hypothesis, Information gained from “depth” 
interviews may, in addition, account more ade- 
quately for deviances and interindividual varia- 
bility. The behaviors of the participant-observer 
and of the subject whose “internal behaviors” are 
being observed, as in psychoanalytic sessions, re- 
quire better systems of recording and more ade- 
quate subsequent analysis. The acquisition and 
validation of behavioral data are the core opera- 
tions of psychiatry as a science, but this does not 
deny the existence of inner variables expressed by 
verbal behavior. 

Animal study and quantification. As scientists, 
psychiatrists are interested mainly in the develop- 
mental, adaptive, and maladaptive processes that 
human beings experience, no matter how “pure” 
or “basic” their image of science may be. But the 
large number of variables and parameters involved 
in natural existence has inordinately complicated 
human research in increasing degrees as sophisti- 
cation about the timing of responses and feedback 
processes among variables has developed. As a re- 
sult, we are often compelled to approach human 
problems indirectly through animal experimen- 
tation. 

Mathematical and statistical overemphasis has 


thereby become a danger modern psychiatry should 
avoid (Langer 1942). It did, however, succumb to 
a greater danger when the majority of psychiatrists 
became entrapped in the fascination of inferences 
elicited from dyadic observer—participant studies 
and became involved in psychodynamic stereo- 
types. There is still an area of scientific behavior- 
ism that is properly the focus of psychiatry, 
requiring observations or “witness” and subsequent 
quantitative analysis, Advancement of psychologi- 
cal and psychiatric knowledge requires a combina- 
tion of observation, description, depth interviews, 
and experimentation. There is no need for antag- 
onism between the proponents of these complemen- 
tary methods or between behaviorists, psychoana- 
lysts, and phenomenologists (Wann 1964). 


Psychiatric classification 

One of the first great advances in the field of 
psychiatry, as in all sciences, was the classification 
of a chaotic descriptive group into nosological en- 
tities. These carried with them the implication of 
specific causes, a course, and final result of so-called 
diseases of the mind. Necessary though this classi- 
fication was at the beginning of modern psychiatry, 
it has created many difficulties. Although the etio- 
logical factors in most of the classified entities are 
far from known, classifications led to stereotypes, 
especially in terms of malignancy and susceptibility 
to therapy. 

“Process” classification. Gradually over the 
years, psychiatrists tried to make a classification 
that was less rigid and medically oriented and, 
instead, more dynamic and concerned with proc- 
esses. The current official classification is much 
better than the previously fashionable ones but still 
represents disorders of mental functioning in terms 
of disease stereotypes. The neophyte in the field 
considers each entity as a disease and develops a 
mental image of its course and prognosis that is 
far from valid. 

The most ambitious attempt to alter this classi- 
fication has been made by Karl Menninger (Men- 
ninger et al. 1963), who has discussed mental 
disturbances in terms of coping with stress. Coping 
devices are sometimes successful but at other times 
are ineffective. One can conceive of the results as 
a series of dysfunctions in increasing order of dis- 
turbance, It is such a “process” type of classification 
that enables the psychologist and social scientist 
to make some kind of reasonable correlations with 
related variables within their fields. It is also this 
kind of “process” classification that enables us to 
speak of types of psychological treatment, no mat- 
ter from which school they may stem, in terms of 
phases of dyscontrol and attempts at reinstitution 


of coping devices. For some time psychiatrists have 
been less interested in diagnostic labels and more 
concerned with the dynamic processes involving 
the individual which lead to his failure in adapta- 
tion; they have focused on the coping devices and 
the reasons for their lack of success, and in therapy 
they have sought to enable the individual to regain 
a more comfortable equilibrium, 


Etiology 

Dynamic approaches to the etiology of many 
degenerate diseases not caused by trauma or bac- 
terial invasion deal with the unconscious conflicts 
that are not expressed directly through the somatic 
nervous system but that result in physiological dis- 
turbances in organs innervated by the autonomic 
nervous system. Among these are hypertension, 
peptic ulcer, rheumatoid arthritis, migraine, and 
colitis, A promising breakthrough occurred in the 
1920s and 1930s, when a specific repressed emo- 
tional constellation was postulated for each of the 
so-called psychosomatic diseases: peptic ulcer was 
associated with passive dependency, hypertension 
with chronic repressed rage, etc. However, this 
practical application broke down because emotional 
specificity could not be confirmed, Actually, speci- 
ficity is a complex and holistic problem because of 
a large number of variables, including genic, early 
experiential, social, and other factors, In fact, the 
mind-body problem is theoretically at the same 
stage of nonresolution as it was two thousand years 
ago. It may be that a revolutionary concept is re- 
quired regarding processes that link or identify the 
mental with the physical. For lack of better con- 
cepts, we are thrown back to operational research, 
in which more knowledge regarding psychophysio- 
logical processes can be acquired, The result may 
not be an elucidation of disease processes but a 
better understanding of the kinds of information 
communicated between the mental and physical 
process; a higher level of abstraction may be nec- 
essary before these basic systems of communication 
can be understood. 


Fields of psychiatry 


Developmental study. While psychoanalytic psy- 
chiatry, which focuses on internal dynamics, has 
attempted to reconstruct the libidinal experiences 
and the vicissitudes of the instincts in reconstruc- 
tions of childhood experiences, Piaget (1937) has 
attempted to observe, describe, and classify actual 
childhood behavior in the process of development. 
Psychoanalysis has not developed a learning theory 
(Rapaport 1959) and, therefore, leans heavily on 
vaguely expressed concepts of conditioning (Pavlov 
1873-1935). Piaget has stimulated research on 
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child development to the point that it is becoming 
an independent discipline. Its application to psy- 
chiatry is still nebulous. There have been studies of 
transactions between mothers and infants in nat- 
ural homes, and adult-child transactions in found- 
ling homes and hospitals have been studied. Engel 
and Reichsman (1956) have been able to observe 
in a child with a gastrostomy changes in behavior 
and gastric acidity on the approach and departure 
of friendly or strange adults. The field of child 
development will become basic to our understand- 
ing of normal and pathological personality devel- 
opment. [See DEVELOPMENTAL PSYCHOLOGY; PER- 
SONALITY, article on PERSONALITY DEVELOPMENT. ] 

Concepts of normality. In the area of develop- 
ment, the absence of a substantive concept for defi- 
nitions of normality presents considerable difficulty. 
The various concepts utilized in sociology, psychol- 
ogy, and psychiatry to define normality have been 
classified into four groups: normality as an aver- 
age, normality as an ideal, normality as health, 
and normality as a process through stages of a 
lifetime. 

Biological psychiatry. The field of biological 
psychiatry has recently been revived many years 
after it was buried by the failure of neuropathology 
to link psychiatric syndromes with structural 
changes in the nerve cells of the brain. Constitu- 
tional differences have always been given lip serv- 
ice—for example, neurotic constitution, neuras- 
thenia, Freud’s varying strength of the instincts, 
body types, and body inferiority. But these were 
static concepts lacking explanatory vitality and evi- 
dences of correlations. More recently, correlations 
between increased gastric secretion of pepsinogen 
and later development of peptic ulcer opened up 
a vast area of functional rather than structural 
constitutional differences, 

Genetics. The development of scientific ge- 
netics with the breakthrough in the understanding 
of the genic codes transmitted by RNA revealed a 
wide range of genic differences. The suspicion has 
increased that schizophrenia is basically a genic 
enzymatic disorder, secondarily producing charac- 
teristic mental symptoms. The mental symptoms 
secondary to phenylketonuria have become the 
prototype of a constitutional psychiatry, just as 
syphilis of the brain was a prototype of a neuro- 
pathological psychiatry. [See MENTAL DISORDERS, 
articles on GENETIC ASPECTS and BIOLOGICAL 
ASPECTS.] 

Neural mechanisms. Discovery of a reticular 
activating system and the connections of the limbic 
or visceral brain focused interest on the higher 
levels of emotional control previously considered 
to reside in the hypothalamus, A succession of in- 
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vestigations revealed the role of the constituent 
parts of the visceral brain in emotional control and 
release. The new neuroanatomy and neurophysi- 
ology changed our concepts of brain function, ex- 
tending them even to discoveries of sleep mech- 
anisms and the functions of dreams. At the same 
time, advances in steroid chemistry have enabled 
us to observe the relations between brain physiology 
and the endocrine systems that respond to or 
are antecedent to anxiety, panic, anger, depres- 
sion, and emotional confusion. [See NERVOUS sys- 
TEM; see also Interdisciplinary Research . . . 1962.] 

Psychopharmacology. The field of psychophar- 
macology constitutes a bridge between the biologi- 
cal and psychological approaches to healthy and 
disturbed behavior. Since the discovery of the tran- 
quilizing and later the energizing drugs utilizable 
for the treatment of psychoses, the custodial state 
hospitals have been revitalized and chronic “un- 
treatable” patients have been given appropriate 
attention, resulting in an increased rate of dis- 
charge and prolonged maintenance in the com- 
munity of previously institutionalized psychotics 
(Uhr & Miller 1960), 

The primary gain, however, has been an added 
stimulus to the study of brain chemistry and endo- 
crinological and metabolic factors involved in be- 
havior of individuals in group living, The bridging 
function of psychopharmacology is in the process 
of intensive investigation. [See Drucs, article on 
PSYCHOPHARMACOLOGY. | 

Model psychoses, Attempts have been made to 
utilize the effects of psychedelic drugs, such as 
LSD-25, as model psychoses. Similarly, the behav- 
ioral changes temporarily achieved by isolation and 
sensory deprivation resemble natural appearing 
disturbances. By far the most informative models 
appear in animal research dealing with prevention 
or early interruption of the symbiotic relationship 
of infant and mother, Prevention induces behavior 
similar to human schizophrenic “symbiotic” psy- 
choses; interruption induces behavior resembling 
that of “borderline” patients, who, although not 
psychotic, have extremely attenuated affective re- 
lationships with human objects (Harlow & Zimmer- 
mann 1959), 

The Pavlovian theory of conditioned reflexes and 
their influence on behavior, emotions, and thinking 
has extended beyond the Soviet Union and provides 
another such model (Pavlov 1873-1935). Facilita- 
tion, extinction, reinforcement and conflict are 
terms that can be translated into the language of 
psychoanalysis. More recently, psychiatrists have 
been utilizing reinforcement techniques in an at- 
tempt to help human patients overcome their inhi- 
bitions and anxieties (Gantt 1958), 


Social psychiatry. Social psychiatry has recently 
blossomed into prominence through the interest of 
sociologists (Parsons & Bales 1955) and anthro- 
pologists in health and mental illness. Since psy- 
chiatry, in general, is concerned with persons and 
their interactions and considers that a person is 
the resultant of physiological, psychological, and 
situational factors, social psychiatry is derived 
from fractionating the total field. 

Social psychiatry includes studies of the family, 
group, community, society, and culture, In general 
the field attempts to explore social systems and 
their effects on psychiatric processes—how the so- 
cial matrix contributes to the cause, course, and 
results of mental illness. It is also the study of the 
effects of mental illness on social systems of vari- 
ous sizes. 

Impressive diagnostic and therapeutic research 
on the family is being conducted. Emphasis is shift- 
ing from intrapsychic processes to a group-oriented 
approach. Both normal and disturbed families. are 
being studied in the interpersonal rather than per- 
sonal context, and families are being studied as 
systems rather than as conglomerations of indi- 
viduals. Such research, however, is still utilizing a 
patchwork of systems theory, cybernetics, and in- 
formation theory, 

Group studies have been conducted in therapeu- 
tic settings, but group therapy is a vague and poorly 
defined ubiquitous term applied to any process ob- 
servable in groups. Only lately have the principles 
of small-group dynamics been applied. 

Community psychiatry. Community psychiatry 
is a relatively new venture that crystallized rather 
suddenly with both the increased cooperation be- 
tween hospitals, clinics, and halfway houses and 
the merging of community aspects of prevention 
and rehabilitation (Bellak 1964), Communities are 
poorly defined as “clusters of people having a com- 
mon destiny.” Community psychiatry deals with a 
wide variety of relationships among community 
social agencies, resources, clinics, and hospitals, 
and studies a wide variety of administrative, thera- 
peutic, and research personnel, However, it lacks 
both theory and operational methods, There is no 
specific discipline and certainly no community 
psychiatrist knowledgeable enough to encompass 
the whole sector, including a broad network of 
agencies. At present, community psychiatry is an 
organizational concept. 


Since psychiatric phenomena are social in that 
they arise from a social matrix, are precipitated, 
and flourish or decline under various social condi- 
tions (that is, have an epidemiology), and since 
therapy depends on social and cultural factors, 
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interest in contributions to etiology from the social 
sciences, although late and naive, is now flourish- 
ing. If we ever have anything to contribute to 
prevention, it will be through widespread social 
education at all levels. If therapeutic efforts are not 
to be limited to a few, there will have to be an 
increased interest in methods of understanding and 
eventually in treatment of family, group, and soci- 
eties, based on our understanding of the individual 
but not a direct extension of it, For these reasons, 
the education of psychiatrists will require extensive 
broadening, if not to widen the extent of their 
functions, at least to facilitate the cooperation of 
psychiatry with psychology, sociology, and anthro- 
pology. 

Roy R. GRINKER, SR. 
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LLI 
CHILD PSYCHIATRY 


Child psychiatry is a subspecialty of medicine, 
and, as in other subspecialties of medicine, its 
practitioners are trained in the “parent” specialty 
to which it belongs. To be officially certified in 
child psychiatry (in accordance with the policy of 
the American Medical Association, the American 
Psychiatric Association, and the American Acad- 
emy of Child Psychiatry), the candidate, after re- 
ceiving his M.D. and completing his general hos- 
pital internship, must have two years of training 
in adult psychiatry—all preliminary to training in 
child psychiatry. Furthermore, the teaching staff 
of the institution at which the candidate is trained 
in child psychiatry must include psychologists and 
psychiatric social workers who work in collabora- 
tion with the child psychiatrists on the staff. Special 
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qualifying examinations—a recent requirement— 
are held yearly by the American Board of Psychi- 
atry and Neurology; the first examinations were 
given in 1959. 

The history of this combined staff of psychi- 
atrists, psychologists, and social workers, a dis- 
tinctly American contribution to the field of child 
psychiatry, will form the major portion of this 
discussion. Its development in the United States 
is intimately related to the mental-hygiene and 
the child guidance movements, both American in- 
novations, and to the juvenile court. Although child 
psychiatry was bound to develop into a special field 
of its own, both in the United States and abroad, 
it would probably have done so at a much slower 
pace without the impetus of events in the United 
States, beginning in 1909 with the study and treat- 
ment of delinquents in Chicago by William Healy 
and with the emergence of the mental-hygiene 
movement. 


Early history 


Until the nineteenth century the presence of 
children, however few, in mental hospitals seemed 
to excite little interest on the part of physicians, 
to judge by the small number of articles on this 
subject. Benjamin Rush—probably the first Amer- 
ican psychiatrist and certainly the first one to write 
a standard reference book on the subject, Medical 
Inquiries and Observations Upon the Diseases of 
the Mind (1812)—made no mention of children 
(see also Whitehorn 1944). Rush’s work was based 
largely on observations made over the years on 
psychotic patients who were confined to a separate 
section (opened in 1752) in the Pennsylvania Hos- 
pital. He described a few cases of psychoses in 
children that he attributed to disease of the brain. 

Hospitalization of the insane was started in the 
United States in 1751, and that is regarded as the 
beginning of psychiatry as a medical specialty in 
the United States. This was three to four decades 
before Rush or Philippe Pinel (of France) or Peter 
Frank (of Germany) lectured or wrote on “diseases 
of the mind.” It is surprising that so many years 
were to elapse before child psychiatry evolved. 

In the first 45 years of publication of the Amer- 
ican Journal of Insanity, 1844-1889, not one ar- 
ticle that appeared had any reference to children. 
A review in 1883 of all the previous literature on 
mental illness in children contained 55 references 
(see Kanner 1961), 

The paucity of studies by psychiatrists of mental 
aberrations in children, a lack which lasted for so 
long a time, requires some explanation. Although 
lectures on mental disorders were first given to 
medical students in this country in 1791 by Rush 


at the University of Pennsylvania, a regular course 
of study in psychiatry as part of the medical cur- 
riculum was not given in the United States until 
1863, when one was offered by the Harvard Medi- 
cal School. 

In general, mental hospitals were very large, 
and, according to Adolf Meyer, the psychiatrist’s 
job “was then as it still is in many places an insti- 
tutional and legal task” (1928). There was little 
opportunity for intensive studies. Children were 
not accepted in mental hospitals unless they were 
diagnosed as psychotic. There were few such cases 
in mental hospitals then, and there are still but 
few today. According to Malzberg (1959), in 1950 
first admissions of children under 15 years of age 
to all mental hospitals in the United States ac- 
counted for only about one per cent of all such 
admissions. (Most of the children in special wards 
today are not psychotic. ) Gillespie (1939) also 
found that out of 21,000 first admissions to a num- 
ber of mental hospitals in different parts of the 
world, about one per cent were children diagnosed 
as psychotic. Nevertheless, the finding of such 
cases (largely schizophrenic and manic-—depres- 
sive) led to a growing interest in the psychopathol- 
ogy of childhood, and although this interest was 
strongly influenced by an organic bias, at that time 
so firmly held by hospital psychiatrists, it advanced 
neurological research in child psychiatry in a valu- 
able way. 

As Gillespie noted, the incidence of childhood 
psychoses is very likely larger than statistics show, 
since many possibly psychotic alterations of mood 
and behavior in children are regarded as normal by 
their parents, Besides, a child is less likely to cause 
as much trouble through psychotic behavior as an 
adult is, and hence the child is less likely to be 
referred to a hospital. He is also more likely, in the 
case of a dementia, to be regarded as retarded 
rather than psychotic. It should be added that the 
dominance of a purely organic, largely a neurologi- 
cal, point of view was aided by the severity of the 
pathology seen, whether in mental deficiency or 
in psychosis. [See MENTAL DISORDERS, article on 
EPIDEMIOLOGY.] 

In the literature of medicine before the nine- 
teenth century there are a number of case records 
and essays that present several ideas thought to be 
of much more recent origin. No doubt historical 
research will discover many more. A number of 
these are discussed by Alexander Walk (1964), 
among them a study of fear in dreams, a study of 
sleep disturbances (one by C. Rollins in the fifteenth 
century, the other by Thomas Phaer in the six- 
teenth century), a study of the psychotherapy of 
stuttering by “driving out” the emotion of fear 


(written by Hieronymus Mercurialis in the six- 
teenth century ), an essay on “the medical education 
of children” in which the problem of the jealousy 
of infants is discussed (written by Brouzet in 
1754), and a day-to-day account of a childhood 
psychosis and an etiological classification of men- 
tal aberrations that includes, besides organic causes, 
psychological ones like “reproof in school” and any 
incident causing loss of temper (written by William 
Perfect in 1803). 

Starting in the mid-nineteenth century there was 
a spurt in publication (see Kanner 1948). Books 
were written on the psychiatry of childhood, and 
in the latter half of the nineteenth century numer- 
ous studies were published about child neurology, 
mental deficiency, child development, mental meas- 
urement, and education—all subjects relevant to 
child psychiatry. 


Remedial education 


Edouard Seguin, a French psychiatrist dealing 
primarily with problems of mental deficiency and 
education, had a significant influence upon child 
psychiatry. He studied with J. M. G. Itard, who 
originated the method of “sensory training” still 
utilized in the Montessori schools. Itard is best 
known for his book Wild Boy of Aveyron (1801), 
in which his method of “sensory training” was 
described fully in the discussion of his attempt to 
educate a boy, mute and apparently “wild,” who 
was captured in the woods by a party of hunters. 
After escaping from a home, the young savage was 
placed in a hospital and in 1800 was transferred 
for scientific study to Itard, then a physician at the 
institution for deaf mutes in Paris, Seguin further 
developed the method of sensory training. 

Seguin also studied under the psychiatrist Félix 
Voisin at the Bicétre in Paris. Voisin was far ad- 
vanced for his day, He worked out a method of 
“medical education” that he called orthophrenic 
treatment, which was to be adapted to four cate- 
gories of children: feebleminded children, children 
with difficulties attributable to faulty education, 
children with character difficulties since early 
childhood, and children born of insane parents and 
hence considered to be predisposed to nervous mal- 
adies. He founded a private institution that closed 
after a decade, and he later become physician to 
the annex of the Bicétre in Paris dealing with 
feebleminded children. Seguin was on his staff 
there. Through the influence of Voisin and Seguin, 
doctors and teachers worked together in institu- 
tions for the feebleminded. 

In Austria, J. D. Georgen, physician and teacher, 
applied Voisin’s method, under the name Heil- 
pädagogik (see Walk 1964), to psychotic children 
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and to a large variety of disturbed children, besides 
defectives. The development of “remedial training” 
remains the contribution of the field of education 
to child psychiatry in Austria and Germany. 

Through the invitation of Samuel G. Howe, who 
founded in Massachusetts the first experimental 
school for the feebleminded in the United States, 
Seguin came to America in 1848. There he intro- 
duced the method derived from Itard and Voisin 
(see Lowrey 1944). By 1880 there were 15 such 
institutions in the United States. Howe was also 
familiar with the methods used by the Swiss phy- 
sician Jean-Louis Guggenbiihl in the training of 
cretin idiots. 


Pathological conceptions 

In 1867 Henry Maudsley published The Physi- 
ology and Pathology of the Mind, In a later edition 
he was the first, according to Alexander Walk, to 
include in a book on mental disease a chapter on 
the insanity that may occur in early life. In this 
chapter he noted the importance of the instincts 
and of childhood aggression and sexuality. 

Emminghaus’ Die psychischen Stérungen des 
Kindesalters (1887; “Psychic Disturbances of Child- 
hood”) includes psychological causes besides the 
usual pathological ones. It includes also methods 
of psychotherapy, Emminghaus urged a collabora- 
tion of psychiatry and pediatrics. 


Outpatient clinics 

The development of outpatient clinics was an 
important factor in the early development of child 
psychiatry (and also of neurology and general psy- 
chiatry). It meant expanding psychiatric service 
and including cases that were not extreme. The 
first outpatient clinic was established at the Phila- 
delphia Hospital for Orthopedics in 1867 and in- 
cluded defective children among its patients. The 
first outpatient clinic at a mental hospital was 
opened in 1891, under the direction of Howe. The 
patients were mostly children. Howe's successor 
was Walter Fernald, a leader in the study of men- 
tal retardation. Fernald’s outpatient clinic accepted 
many children whose problems were not limited to 
mental retardation; indeed, many whose problems 
are typical of referrals to child guidance clinics 
today (Fernald 1920; Stevenson 1944; Abbot 
1920). 


G. Stanley Hall 

G. Stanley Hall, an American psychologist, 
started the first journal of child psychology in 1891 
(Pedagogical Seminary) and also the first society 
of child study, in 1894. Some studies made at this 
time, especially those of European origin (for ex- 
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ample, those by Sully of England and Perez of 
France), including some biographical accounts of 
infants, remain of value today [see the biography 
of HALL; see also Hall 1907; Gesell 1930]. 

Thus, it is clear that child psychiatry began to 
develop long before child guidance came into being 
in the United States or elsewhere. However, the 
child guidance movement gave a great impetus to 
the development of child psychiatry, especially in 
the United States (Glueck 1930; Crutcher 1943: 
Healy 1915; Bunker 1944; Levy 1947; Lewis 1959; 
Stevenson & Smith 1934; Lowrey 1944; Kanner 
1948). 


Healy and the child guidance movement 


The child guidance movement began with Wil- 
liam Healys Juvenile Psychopathic Institute 
(Healy & Dummer 1909; Healy & Bronner 1948). 
In his account of the early years of the movement, 
Healy states: 


- .. there was one central and often disastrous weak- 
ness in the proceedings [of the juvenile and municipal 
courts]: The individual before the court, given at most 
only a physical examination, was not in the least 
known with regard to his essential nature, Nor was 
anything known as to why he was a delinquent. Never- 
theless, judgment was passed and the type of penalty 
or supposed treatment that a court can order was pre- 
scribed. How, these observers argued, can a judge or 
anyone else, laboring under such a handicap, possibly 
prescribe wisely for the individual who, if placed on 
probation, needs some special form of treatment or 
incentive to help him mend his ways; or should have 
some particular type of institutional training or segre- 
gation? 

In a neurological clinic I had long been finding 
that the symptoms of numerous young patients as 
given by relatives included complaints of conduct diffi- 
culties. Many of these—head injury cases, choreics, 
epileptics, or hysteria—were reported over-restless, ag- 
gressive disturbers of the peace at home or at school, 
truants, runaways, or involved in stealing or sexual 
misbehavior. Some were already on probation from 
the juvenile court and others were soon taken there 
charged with delinquencies that led them to be sent 
to the Parental School or to what was , . . [euphemis- 
tically] termed a training school. To me in that clinic 
it seemed that the handling of these young people 
largely by punitive measures, whether in the home, 
school, or court, was so far removed from common 
sense application of scientific knowledge to the treat- 
ment of human beings that it savored of the dark ages 
of man’s dealing with his fellowmen. And on occasion 
I publicly said as much. 

- relatives would come to the detention home 
and from interviews with them we could ascertain 
many things that we needed to know—heredity, de- 
velopmental histories, personality peculiarities, family 
interrelationships, and much else. Then, besides the 
possibilities of studying the youngsters themselves dur- 


ing their period of detention, we could obtain facts 
from the school about their progress and conduct in 
that setting. . . . If resources for treatment were avail- 
able, with better knowledge of them something better 
might be done to check delinquent behavior tendencies 
which, as everyone already knew, all too frequently 
led to criminal careers. . . . 

No such clinic existed. The only places I found 
where even psychological testing was added to a physi- 
cal examination were Witmer’s clinic for retarded 
children at the University of Pennsylvania, and God- 
dard’s laboratory for mental measurement of the feeble- 
minded at Vineland. Interestingly enough, the various 
theories offered in possible explanation of delinquency 
and crime were none of them based on actual study 
of cases. Some original methodological suggestions for 
us were made, particularly by Thorndike. As for any 
scientific studies of the bases of children’s behavior 
tendencies, it was said that we in Chicago would have 
to blaze a new trail... . 

The first psychologist with me was Dr. Grace M 
Fernald, We two and our secretary comprised the paid 
staff. At that time and until years later the psychiatric 
social worker did not exist, but in all fairness it must 
be stated that some of the social workers who helped 
us to study cases and even do some treatment with the 
families had developed appreciation of psychiatric 
implications. . . . Our social work, whatever it amounted 
to, was done then by certain workers from other agen- 
cies and by well-trained probation officers. Relatives 
came with surprising willingness for interviews with 
me; indeed some of the earliest requests for our serv- 
ices came from parents with a child in court, and even 
lawyers asked the judge to continue a case in order 
that the new clinic might find out what was the matter 
with their client’s child that he should be behaving in 
a delinquent fashion. 

It was our early decision to confine our research to 
delinquents who had committed offenses repeatedly. 
Rightly or wrongly these seemed to offer the greatest 
challenge for treatment by the court or in any other 
way. First offenders might have committed casual 
offenses that had little relationship to their nature or 
needs, and little significance for their future conduct. 
We took such cases as we could within our working 
limits, cases referred by the judge or probation officers, 
by families, and occasionally a policeman would bring 
a boy tous. (Healy & Bronner 1948, pp. 4-6) 


In the rest of his account Healy showed how he 
disproved the theories of Lombroso and others that 
ascribed delinquency and crime to heredity (based 
on anatomical “stigmata of degeneracy”) and to 
causes such as enlarged tonsils and adenoids; focal 
infections; uncorrected refractive errors; periph- 
eral irritations—for example, tight foreskin, im- 
pacted teeth. Healy began to look down on all 
theories of organic “causes” of delinquencies 
(1915). In time he admitted overlooking some 
cases in which organic factors were paramount, 
as in epidemic encephalitis, in conditions causing 
partial asphyxiation, and (in one case) the path- 


ae a 


ological effects on the personality of severe burns 
in early childhood. [See the biography of Lom- 
BROSO.] 

Healy had been well trained in neurology; he 
was an associate professor of nervous and mental 
diseases at the Chicago Polyclinic for 13 years. The 
physical examination was a routine procedure in 
his work with children. As time went on he paid 
less attention to neurological studies, being more 
absorbed with the social environment, the patient’s 
“own story,” psychological testing—to which he 
made important contributions (especially Healy's 
Picture Completion 11)—and psychoanalysis. In 
time he recognized anew the importance of the 
neurological examination as neurology and neuro- 
physiology became more and more relevant to child 
psychiatry. 

In Healy's early years, the important names in 
psychiatry were Adolf Meyer, Alfred Binet, and 
Sigmund Freud (Levy 1947; Whitehorn 1957; 
Kanner 1961). That is the order, I think, of their 
impact chronologically in the field later to be known 
as “child guidance.” Through Meyer, psychiatry had 
become liberated from the purely symptomatic ap- 
proach to the study of man. The patient was no 
longer a collection of symptoms. Meyer presented 
to psychiatry a conception of man as an integrated, 
purposeful human being of whom the psychiatrist 
had previously lost sight. Healy benefited from 
Meyer’s restoration of man’s place in psychiatry. 
[See the biography of MEYER.] 

When Healy started his work in Chicago, Alfred 
Binet had already opened up the new field of mental 
measurement through his studies of school children 
in the city of Paris (Pressey & Pressey 1922; Ter- 
man 1919). Methods of quantitatively estimating 
intellectual capacities were to develop into a new 
science. Of importance in relation to Meyer's con- 
cept was a further accentuation of the patient as 
the primary point of orientation. The determina- 
tion of a child’s intelligence, which came to mean 
his capacity for schooling, was based on the actual 
study of the child himself by means of a standard- 
ized procedure, rather than on pronouncements 
by authorities derived from clinical impressions. 
Healy started at a period when testing devices, to 
which he contributed greatly, were already ap- 
propriate tools of psychiatric study. The enormous 
influence of the tests of Binet and Simon and also 
Seguin’s pedagogical ideas are clearly delineated 
in Benjamin Baker's bulletin “History of the Care 
of the Feeble-minded.” [See INTELLIGENCE AND IN- 
TELLIGENCE TESTING; and the biography of BINET.] 

Freud’s contribution was the energizing force in 
psychiatry. It marked the beginning of significant 
motivational studies. In Healy's early work their 
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impact was revealed in his book Mental Conflicts 
and Misconduct (1917). In The Individual Delin- 
quent (1915), the classic book in the field, Healy 
wrote that he found the following psychoanalytic 
tenets of value: (1) that conduct is unconsciously 
motivated; (2) that “much that is formative of 
character does not appear above the surface”; 
(3) that “experiences which come to the individual 
with a great deal of emotional context are likely to 
cause the greatest amount of reaction”; (4) that 
“experiences . . - related to sex life [caused the] 
‘strongest and subtlest reactions’ ”; and (5) that 
psychic traumata experienced most frequently in 
young children may be unknown to the parents 
(pp. 119-120). 

The particular traits of Healy’s personality that 
gave shape to the new field were his receptivity and 
democratic nature (Levy 1947). He was receptive 
to knowledge from any source. He was free from 
the typical prejudices (so common in the psychi- 
atrists of those days) against psychological tests 
and against anything that could be labeled Freud- 
ian, He was free also of the authoritarian attitude 
of the doctor that makes teamwork with co-workers 
in allied fields impossible. Actually, out of his alli- 
ance with the clinical psychologists and the social 
workers, a new group emerged. For the first time 
the psychiatrist, the psychologist, and the social 
worker worked together as an authentic unit, and 
problems of personal prestige were forgotten. 

Starting with Healy, psychiatrists who worked 
in the field of delinquency and crime became in- 
creasingly aware that they were in a new field, one 
in which social problems and social agencies were 
of primary concern. The Institute for Juvenile Re- 
search in Chicago, the heir of Healy’s Juvenile 
Psychopathic Institute, became part of a large state 
organization involving penal, correctional, and 
other institutions, all under Herman Adler, the 
state criminologist (Adler 1926). Karl Menninger 
was the first to interest Herman Adler and Fran- 
wood Williams, then director of the National Com- 
mittee for Mental Hygiene, in forming an associa- 
tion of the representatives of the neuropsychiatric 
or medical view of crime that would include the 
neuropsychiatrists primarily interested in what 
might be called medical criminology or disciplinary 
psychiatry or orthopsychics. 


Orthopsychiatry 

In January 1924 a committee of psychiatrists 
that was to become the American Orthopsychiatric 
Association was formed at the Institute for Juvenile 
Research in Chicago (Lowrey 1948). Of the names 
suggested for this committee “Social Psychiatry” 
was favored and then dropped. This was due to the 
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belief that sociologists and social workers would 
regard such a field as belonging to them, The 
word “orthopsychiatry” was also chosen in prefer- 
ence to “social psychiatry” since the latter term, 
it was thought, stresses the idea of disease which 
was regarded as incidental to the general problem. 

The other fields to which orthopsychiatry was 
closely related—clinical psychology, sociology, and 
social work—were thought of as sources of assist- 
ance. However, there were many clinics in which 
psychiatrists, social workers, and psychologists 
were then operating as equals. Furthermore, the 
original stress on delinquency and criminology gave 
way over a period of time to an increased emphasis 
upon the relationship between abnormal behavior 
and broader patterns of social behavior. The first 
annual convention of the American Orthopsychi- 
atric Association was held at the Institute for 
Juvenile Research on June 10, 1924. William Healy 
was elected president. It is interesting that of the 
nine papers presented at that meeting only two 
dealt specifically with crime and delinquency. 

The problem of the status of nonmedical people 
was an early issue, Those who held out for control 
by psychiatrists finally gave way, Amendments to 
the constitution of the society made provisions for 
other classes of members with voting privileges. 

The fifth annual meeting of the association was 
held in New York City. There were 540 members 
registered. They represented 112 organizations 
from outside the city, There were registrants from 
20 states and 5 foreign countries. The journal of 
the society was first issued in 1930 under the 
editorship of Lawson G. Lowrey, The 1965 meeting 
had eight thousand registrants. Of these, twelve 
hundred were members. 


The mental-hygiene movement 


In 1909, the year in which Healy began his work 
in Chicago, the mental-hygiene movement began 
(Deutsch 1937; 1944), It followed the publication 
(1908) of Clifford Beers's book A Mind That Found 
Itself, an exposé of the brutal experiences suffered 
by the author when he was a patient in three dif- 
ferent mental hospitals. Beers was an unusual com- 
bination of writer, organizer, and crusader, Through 
his initiative, the National Committee for Mental 
Hygiene came into being. Its goal, improving the 
care of the insane, soon expanded into the general 
field of preventive psychiatry. Later, through the 
aid of the Commonwealth Fund, it furthered the 
development of child guidance clinics. These were 
modeled after Healy's Chicago Juvenile Psycho- 
pathic Institute and the Institute for Juvenile Re- 
search. A five-year demonstration of child guid- 
ance clinics started in 1922 in St. Louis. After each 


year’s demonstration it was hoped that the com- 
munity would take over. In this sense no demon- 
stration was ever more successful. By 1934 hun- 
dreds of child guidance clinics were established in 
the United States. 


Training 

After the first five years of the demonstration 
clinics it had become all too clear that a special 
national teaching institute was necessary to train 
personnel—psychiatrists, psychologists, and psy- 
chiatric social workers—whose services could not 
be supplied in sufficient numbers by the clinics 
previously established. The Institute for Child 
Guidance was established in New York City in 
1927 for the purpose of training and research 
(Lowrey & Smith 1933). In its six years of exist- 
ence 50 psychiatrists and 289 psychiatric social 
workers went through a program of training there. 
The psychiatric social work students came chiefly 
from the Smith College School for Psychiatric Social 
Workers, started in 1918 as the first school of its 
kind, and the New York (now the Columbia) 
School of Social Work, Training of social workers 
in psychiatry began as an innovation of Adolf 
Meyer's, at the turn of the century, It was brought 
into prominence by Elmer Southard of the Boston 
Psychopathic Hospital (see Southard & Jarrett 
1922). The initial idea for such a school came 
from Southard, who at first was concerned to find 
social workers to serve as aides to psychiatrists 
in war service. The aides were able to schedule 
appointments and take histories. They could visit 
homes and prepare reports on the families of pa- 
tients and, in general, about the child’s environ- 
ment. While the psychiatrist worked directly with 
the patient—that is, the child—the social worker 
dealt with the mother and with other members of 
the child’s family. In this role he acted presumably 
under supervision of the psychiatrists. He gave 
counsel, information, and insight. When such meth- 
ods were ineffective, he utilized various forms of 
psychotherapy. The role of social worker as ther- 
apist, especially as therapist in private practice, 
remains today a controversial issue (Lowrey & 
Smith 1933; Levy 1937; Witmer 1946; Group for 
the Advancement of Psychiatry 1948). 


Child guidance and child psychiatry 

The differentiation of child guidance and child 
psychiatry was easily made at first by the fact that 
a guidance staff always included a psychiatrist, 
a psychologist, and a social worker. Furthermore, 
child guidance clinics received their patients from 
juvenile courts and were never connected with hos- 
Pitals. In contrast, child psychiatry was easily 
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demarcated as a medical specialty, in the same 
manner as adult psychiatry. Healy's work began 
as a search for psychopathology. Many of his case 
studies, however, were not in that field; they were 
examples not of psychosis or neurosis but of “mal- 
adjustment” as a result of the influence of the 
social environment, especially the child's family and 
his companions. This environment produced cer- 
tain antisocial attitudes. As Kanner has rightly em- 
phasized, the special American contribution to child 
psychiatry is the recognition and study of attitudes 
(Kanner 1932; Glueck 1930; MacCalman 1939). 
This was probably facilitated by the fact that pa- 
tients who showed evidence of mental deficiency 
or any form of severe psychopathology were not 
accepted in many child guidance clinics. Even 
when the field expanded beyond delinquency to all 
forms of “behavior problems” it was thought wise 
to limit patients to the milder cases. 


The role of social agencies 


Child guidance clinics like Healy’s Juvenile Psy- 
chopathic Institute had close relations with social 
agencies. According to Stevenson and Smith (1934), 
who were both closely involved with the develop- 
ment of the clinics, their functions were threefold: 
(1) to study and treat patients; (2) to stimulate 
the interest of other community agencies in pre- 
vention and treatment of behavior and personality 
disorders in children; and (3) to bring to the at- 
tention of the community the unmet needs of 
children. 

Put in this way, the psychiatrist who, according 
to the original plan, was always the director of the 
clinic was not only a practitioner but was also cast 
in the role of community planner and leader. Steven- 
son and Smith stated also that the clinics should 
attempt child guidance only at those points where 
other agencies were unable to meet the child’s emo- 
tional needs. Presumably this involved a close rela- 
tionship with other agencies and, as it turned out, 
the agency's being able to select for guidance those 
children regarded as beyond its competence. This 
at least sharpened the problem for the agency in 
regard to selection of cases most suitable for the 
Kind of service appropriate to its special skills. 
There was always some kind of adaptation to local 
problems, to differences in the personnel of social 
agencies, to their standards, and to their willing- 
ness to utilize and cooperate with the available 
child guidance clinics; there was also a testing 
period in which the usefulness of the new relation- 
ship was tried out. 

Meanwhile, for various reasons, including eco- 
nomic ones, a number of social agencies took on 
the function of a psychiatric clinic without having 
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a single psychiatrist as a permanent member of the 
staff. They utilized psychiatrists as lecturers or 
consultants who were useful in fulfilling their roles 
especially when a medical diagnosis was legally 
necessary for the admission of a child to a mental 
hospital. In some agencies the rule that the psychi- 
atric social worker always be supervised in her 
case work by a psychiatrist was either withdrawn 
or complied with at the discretion of the worker. 
At times, psychiatrists also violated their function 
by making diagnoses without examining or even 
seeing the child, relying entirely on the psychiatric 
social worker's case record. 


Evaluation of child guidance clinics 


Hardcastle, a British psychiatrist who studied 
the child guidance clinics in the United States from 
1930 to 1933 and had been a student at the Insti- 
tute for Child Guidance, found the widest varia- 
tions in the original model and in methods of 
therapy (Hardcastle 1933). Besides visiting a num- 
ber of clinics he received replies to a questionnaire 
from 12 clinic directors. He found that the “off- 
spring” of psychiatry had taken many new direc- 
tions and that “each year psychologists and other 
non-medical workers are gradually wresting it from 
the field of psychiatry.” The therapy was of all 
sorts, at one extreme psychoanalytic treatment of 
the child and each parent, at the other, purely 
didactic instruction. 

Of the criticisms of child guidance clinics the 
most frequent are (1) its growing distance from 
the hospital and medical clinic, and (2) its rigidity 
(Bowman 1944; Levy 1952; Markey 1963; Murphy 
1941). 

As child guidance expanded it drew farther away 
from the hospital and into closer collaboration with 
many social agencies. Case referrals, as traced by 
Hardcastle for a representative clinic, were largely 
from the courts before 1920 (538 out of its first 
600 cases); several years later the majority were 
from 31 other agencies (308 of its third list of 
600 cases). 


Hospital settings 


Child psychiatry as developed in hospitals started, 
according to Karl M. Bowman, for many years 
director of psychiatry at Bellevue Hospital in New 
York City, with the “dictum that a child is not to 
appear before a conference.” Bowman, after de- 
scribing both the advantages of hospital settings 
for training in child psychiatry (1944), particu- 
larly at Bellevue Hospital, and the bed wards avail- 
able for children and adolescents, was especially 
critical of the “stereotyped” requirements of child 
guidance clinics. Psychiatrists at Bellevue, however 


620 PSYCHIATRY: Child Psychiatry 


competent in child psychiatry, were not permitted 
to serve in child guidance clinics because a fully 
trained psychiatric social worker was not a mem- 
ber of the Bellevue teaching staff. 

A number of referrals by psychiatrists in child 
guidance clinics had been made to psychiatrists in 
Bellevue in order to obtain a diagnosis, This is 
simply an illustration of the fact that the child 
guidance clinic dealt with a different variety of 
cases than those found in hospital wards. In the 
past, training for child guidance took place within 
the child guidance clinics. The more specifically 
Psychiatric difficulties, for example, childhood 
schizophrenia, autism, dementia, severe emotional 
difficulties, severe retardation, anorexia nervosa, 
convulsive states, tics, and phobias, are relatively 
less commonly seen in a child guidance clinic. In 
the latter the more frequently seen cases are delin- 
quency and other varieties of “behavior problems,” 
family relationship problems (for example, mater- 
nal overprotection, maternal rejection, sibling ri- 
valry), so-called rearing problems, including “the 
everyday problems of the everyday child,” a special 
concern of Douglas Thom’s Habit Clinic (Thom 
1927), 

Leo Kanner, the first person to develop a full- 
time psychiatric service in a department of pedi- 
atrics (at Johns Hopkins Hospital in 1930), re- 
garded “the cultivated estrangement from medicine” 
on the part of child guidance clinics and their 
“self-imposed limitation” in the selection of cases 
with favor because of “the great benefit derived 
from making certain types of children’s behavior 
problems a matter of community concern” (Kan- 
ner 1932), He regarded as the contribution of child 
guidance to psychiatry the broadening of the range 
of inquiry to include, in addition to the study of 
the home and the school, all influences outside the 
child, including the significant people in his en- 
vironment—generally “interpersonal relationships.” 

A second basic contribution is “multidisciplinary” 
collaboration. This influence was Strongly felt by 
Kanner in his own clinic, in which, however, the 
selection of cases was not restricted to any particu- 
lar type of difficulty. 


Research 


Research in American child psychiatry may be 
traced in articles in the American Journal of Or- 
thopsychiatry, which began in 1930, The contribu- 
tors are mainly psychiatrists, psychologists, and 
Psychiatric social workers, and the contributions 
have become more allied to psychiatry than to child 
guidance and tend to resemble such European jour- 
nals as the French Revue de neuropsychiatrie in- 
fantile; the Swiss Zeitschrift für Kinderpsychi- 


atrie (founded by M. Tramer in 1934 and now 
called Acta paedopsychiatrica); and the Ameri- 
can Journal of Child Psychiatry, which was 
founded in 1962. 

The trend of research is seen generally in a re- 
finement and extension of the earlier Studies on 
the influence of infantile experience on later 
growth, and in the study of every possible influ- 
ence on the patient of all his human contacts, in 
single and composite form, as in family and cul- 
tural constellations. Clinical studies of syndromes 
and disease entities still continue in the historical 
tradition of medicine, from severe pathology to the 
mild “everyday problem.” In recent years there has 
been special interest in the biochemistry of behav- 
jor, brain physiology, neurology, drug therapy, fam- 
ily and group therapy, developmental psychology, 
projective tests, and the field of communication. 
There has also been an increase in the study of 
mental deficiency, childhood schizophrenia, resi- 
dential treatment, and education therapy. [See 
Drues, article on PSYCHOPHARMACOLOGY; MENTAL 
DISORDERS, article on BIOLOGICAL ASPECTS, MEN- 
TAL DISORDERS, TREATMENT OF, article on SOMATIC 
TREATMENT; MENTAL RETARDATION; NERVOUS SYS- 
TEM; SCHIZOPHRENIA. | 


Since about 1950 there has been an increasing 
number of child psychiatry clinics in hospitals and 
medical schools and an increasing collaboration 
with pediatrics and neurology. Child guidance clin- 
ics are becoming better grounded in psychiatry and 
psychodynamics, They remain, in keeping with 
their history and tradition, community-oriented, 
maintaining a special interest in problems of delin- 
quency and school retardation, and, in general, in 
the psychiatric, psychological, and social aspects 
of community problems. The distribution of thera- 
peutic functions among the members of the team 
remains a problem. 

Davin M. Levy 


[Directly related are the entries CLINICAL PSYCHOLOGY; 
MENTAL DISORDERS, article on CHILDHOOD MENTAL 
DISORDERS; MENTAL DISORDERS, TREATMENT OF; 
MENTAL RETARDATION, SOCIAL WORK, Other relevant 
material may be found in DeLinqueNcy, article on 
PSYCHOLOGICAL ASPECTS; DEVELOPMENTAL PSY- 
CHOLOGY; INFANCY; INTELLECTUAL DEVELOPMENT; 
INTELLIGENCE AND INTELLIGENCE TESTING; LAN- 
GUAGE, article on SPEECH PATHOLOGY, PERSONALITY. 
article on PERSONALITY DEVELOPMENT; PHOBIAS; 
READING Disanitities; and in the biographies of 
Binet; MEYER; PINEL; Rusa.) 
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SOCIAL PSYCHIATRY 


Although the phrase “social psychiatry” is re- 
cent, the topic is not. Man has suspected for a long 
time that he could derange himself by his style of 
living and that the emotions engendered by inter- 
personal relations could lead to illness, including 
organic illness, The expressions “You will drive 


chiatric knowledge to the social sciences (A. H 
Leighton 1960), 

Concern with people in numbers, Social psy- 
chiatry focuses on problems of psychiatric dis- 
order in populations. Questions are asked about 
frequency and distribution of people with dis- 
orders, about the causes of variation in these dis- 
tributions, and about methods of control. 

This perspective contrasts somewhat with that 
of clinical psychiatry, which places much greater 
emphasis on the individual patient and the prob- 
lems of his particular diagnosis and treatment, But 
like clinical psychiatry, and like the physiological 
and chemical approaches to psychiatric disorder, 
there is in social psychiatry an ultimate concern 
for understanding the cause and prevention of 
disorder in persons, The individual is the basic 
unit, and social psychiatry as a field does not imply 
values that underestimate the importance of the 
individual. Rather, the importance of the indi- 
vidual is magnified by the number who exist in a 
state of mental and emotional difficulty. 

By and large, those working in social psychiatry 
conceive of their primary responsibility as some- 
what different from that of a practicing psychia- 
trist, There is less emphasis on a contractual type 
of understanding with particular individuals and 
more emphasis on an obligation to serve the wel- 
fare of a population or a subgroup within a popu- 
lation, such as a local community, an industry, a 
military unit, a school, or a family. The focus, 
however, remains on real people in real situations 
and not on an abstract entity such as the state. 

The importance attached to people in numbers 
does not mean individual psychotherapy is played 
down, but it adds a corollary set of considerations: 
the possibility and desirability of altering those 
circumstances which can be shown to be a re- 
current source of psychological disturbance. 

Concern with sociocultural systems. Social psy- 
chiatry gives attention to understanding the pat- 
terns of values and sentiments, symbols of com- 
munication, interaction, work division, and social 


and how its molding, directing, containing, and 
changing influences operate, Equally a 
various situations of stress, such as those occur- 
ring regularly and continuously in the lower socio- 
economic levels of a society or those occurring 
more briefly but with greater intensity in disasters. 
Adaptation of social science knowledge. As a 
consequence of its concern with sociocultural sys- 
tems social psychiatry has an interest in the rele- 
vant discoveries and theories of the social or be- 
havioral sciences for the purpose of applying them 
to the understanding and treatment of psychiatric 
disorders, to their prevention, and to the improve 
ment of mental health generally, Finding himself 

confronted with social phenomena 
whether his approach is primarily action or pri- 
the 


necessary, 

At thie point. the reader may be inlined t 
who the people are who are engaged in 
chiatry, Are they the usual clinical 
chiatrist, psychologist, and social 
added training in the social sciences? 
social scientists working in the field 
health? The answer is, of. course, that they are 
both types. There are some interesting trends in 
the interrelationships of the disciplines, which will 
be discussed later, 
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edge that go beyond the limits implied by its focus 
on individual health, Although this point of view 
has at times had wide acceptance, strong doubts 
about it have also been expressed. It seems worth- 
while, therefore, to attempt to clarify the matter 
by stating a few core points, 

The first point is that by definition the psychia- 
ama. is an authority on psychiatric disorders and 

and training has concepts and 
— for distinguishing such disorders from 
generally disagreeable behaviors and feelings. 
Consequently, he should have something to offer 
wherever psychiatric phenomena are relevant to 
theory building and research in the social sciences. 

The second core point is that the psychiatrist 
has by virtue of his contact with patients a par- 
ticular kind of access to the understanding of hu- 
man motivation, both conscious and unconscious, 
While the claim can be overemphasized, and while 
there are certainly dangers in going too far with 
“psychiatrizing” normal behavior, it is also im- 
portant not to minimize this potential, Major con- 
tributions to behavioral theory and to the framing 
of experimental tests have emerged from this 
source in the past and can be expected to continue 
in the future. 

Third, the psychiatrist is oriented toward and 
experienced in dealing with human beings as 
whole entities, The demands of his work put be- 
fore him both the opportunity and the obligation 
aa hyd individual as an ongoing integration of 


(1955) says that the field is concerned not only 
with the mentally il] but also with the problem of 
adjustment of all persons in society and with the 
factors that tend to damage or enhance their adap» 
tive capacitios. It assumes, he says, that most per- 
sons possess potentialities that have never been 
realized because of personal, emotional, or social 
interferences. Thus, social psychiatry attacks the 
whole social framework of contemporary living; it 
seeks to understand the dynamics of individuals 
seen in their total setting. 


Having presented, 
area covered by “social psychiatry,” we may com- 
sider next some of the more limited ways in which 


i 
if 
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“social psychiatry” in the second decade of the 
present century (Southard 1919). It seems likely 
that he was the first to use the words in English, 
and one must add that some of his remarks make 
it clear that he was implying sociology as well as 
social work. 

Most psychiatrists are apt to assume that social 
psychiatry is, despite its breadth and depth in sub- 
ject matter, nothing other than a branch of psy- 
chiatry. It is often considered to be parallel to 
clinical psychiatry and what may be called “physio- 
logical psychiatry’—that is, the study of the or- 
ganic processes underlying psychiatric disorders. 

As might be expected, this view is not shared 
by many social scientists. They prefer to regard 
the field as a branch of a discipline, such as soci- 
ology, that is concerned with deviance and the 
effects of deviance on social process, In this orien- 
tation, social psychiatry is seen as a component 
in or a parallel to studies of delinquency, the use 
of narcotics, prostitution, suicide, and so on. In 
anthropology it may be looked upon as a way of 
Studying cultural influences with particular refer- 
ence to the interplay of personality and culture and 
to crosscultural comparisons on certain topics of 
interest, such as curing roles and rituals or cul- 
tural integration. 

According to these views, then, “social psychia- 
try” is a name for a field that has autonomous 
Status among the social sciences, Psychiatry is 
visualized as contributing some but by no means 
all of what is needed in terms of theory, method, 
and criteria for the study of the phenomena in 
question. In both research and action programs, 
the place of the psychiatrist is, in this framework, 
that of a specialist, one among several with tech- 
nical skills to contribute. 

This brings us plainly to issues of interdisci- 
plinary rivalry as well as to matters of concept, 
theory, and method. Since such questions as “Who 
is entitled to head programs?” and “Should there 
be chairs of social psychiatry outside medical 
schools?” are apt to be of real concern, it is likely 
that divergent orientations of this kind will be 
with us for some time, 

Community psychiatry. It has been advocated 
that “social psychiatry” be limited to describe re- 
search endeavors and that “community psychiatry” 
be used for enterprises concerned with action and 
the training of professionals for action (Bellak 
1964; Goldston 1965). This division has been ad- 
vocated in the hope of resolving some conflicts by 
using persons with clinical training to head mental 
health services and other activities aimed at pre- 
vention, while—depending on the nature of the 


problem, the institutional setting, and the particu- 
lars of individual training and ability—social sci- 
entists, psychiatrists, or clinical psychologists are 
used to direct research. 

The usage outlined aboye has been vigorously 
attacked for drawing a line between action and re- 
search that is stultifying and unworkable, It is said 
that “community psychiatry” is merely a name for 
one of many areas of interest for the social psy- 
chiatrist. As such, however, the area embraces not 
only action and service but also teaching and re- 
search, in the manner customary for any subdi- 
vision of disciplinary subject matter. It is further 
pointed out that no reason exists for emphasizing 
the clinician as the director in community psy- 
chiatry action programs; on the contrary, social 
work, political science, or public administration 
may offer far better preparation and training for 
imaginative and effective development of the field. 

Whether it lasts or not, the use of “community 
psychiatry” for action programs and “social psy- 
chiatry” for epidemiology, cultural comparisons, 
and other types of basic research has at present 
some currency. [See MENTAL HEALTH.] 

British and French definitions. In Great Britain 
there is also a lack of uniformity regarding the 
meaning of “social psychiatry,” but generally the 
term refers to social amelioration in the same sense 
as “social medicine” and is thus part of a broad 
philosophy regarding public responsibility for the 
handling of welfare problems. The emphasis is 
pragmatic, and action arises mainly from within 
the discipline of psychiatry, Research is, of course, 
performed, and social scientists have been drawn 
in, but the orientation is dominantly one of pro- 
viding information that will help guide those pro- 
grams concerned with such matters as rehabilita- 
tion, the employment of schizophrenics, and the 
analysis of “therapeutic communities.” It may be 
concluded, therefore, that what is called “social 
psychiatry” in Britain has more in common with 
what is called “community psychiatry” in the 
United States than it has with the definitions of 
social psychiatry discussed above (Jones 1952; 
Rapoport et al. 1961). 

In France the term “socio-psychiatrie” appears 
to be used in a manner that approximates the 
more restricted definition of social psychiatry. The 
center of interest is research into the possibility 
that social factors play a major etiological role in 
the appearance of psychiatric disorder. Most of the 
French researchers are psychiatrists, but a great 
deal of work is nonetheless being conducted by 
social scientists in closely related areas (Bastide 
1965; Encyclopédie Médico-Chirurgicale 1955). 


An intervention framework. What has been 
said thus far is an attempt to summarize the 
present state of a complex emerging field, which 
has many crosscurrents and countercurrents. It 
depicts how things are rather than how a logical 
mind might arrange them. In the course of dis- 
cussing these matters, R. N., Rapoport, an anthro- 
pologist with long experience in social psychiatry, 
has suggested (in an unpublished personal com- 
munication) the following: 


The most fruitful way to look at social psychiatry is 
as an intervention framework. The “iatry” suggests 
intervention, and social psychiatry is the professional 
practitioners’ field that aims its intervention efforts at 
the level of social structure. Like individual clinical 
psychiatry and organic psychiatry, its aim is to heal 
ill individuals, to prevent others from becoming ill, 
and to assist all toward the realization of positive 
mental health. Clinical psychiatry seeks to do this [in] 
the direct one-to-one relationship [by] persuasion, 
transference interpretations, [and] interpersonal influ- 
ence in the two-person situation. Organic psychiatry 
seeks to do this by altering biochemical substructures, 
etc. 

Social psychiatry is supported by the basic sciences 
of the social science spectrum (sociology, anthropol- 
ogy, social psychology, economics, history, political 
science). Clinical psychiatry is supported by the psy- 
chological sciences (the various approaches to psychol- 
ogy, from behaviorism through symbolic interaction- 
ism). Organic psychiatry is underpinned by the basic 
sciences of biology and the physical sciences directly 
geared to biology. 

When individual psychiatrists, of whatever persua- 
sion, do research, they are participating in the basic 
science field that underpins their intervention specialty. 
One would call such individuals research social psy- 
chiatrists, or research clinical psychiatrists, or research 
organic psychiatrists to distinguish them from their 
colleagues who are mainly concerned with intervention 
as a focal professional activity. To the extent that the 
individual does not interest himself in interventions 
aimed at treating individuals, either as a professional 
practitioner or in terms of the goals in view for his 
research, . . . I would not think of him as a psychiatrist 
at all, but as a person who had psychiatric training 
but who was , . . something else—a biochemist, a 50- 
ciologist, or a psychologist. The problem of commit- 
ment to a type of goal is crucial in defining a person's 
professional identity. 


Research 


As in medical research, the central targets in 
social psychiatry are treatment, prevention, and 
the discovery of causes. The topics are placed in 
that order because, contrary to what is sometimes 
supposed, successful methods for treatment and 
prevention have frequently been known long be- 
fore the relevant causes were discovered—vaccina- 
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tion for smallpox being a notable example. In fact, 
much research is not strictly causal in its orienta- 
tion but rather is concerned with probing the 
efficacy of steps intended as corrective. 

Nevertheless, cause remains a central, although 
difficult, focus. This difficulty is manifest in the 
frequency with which research questions are posed 
that have answers already determined through 
built-in assumptions. It would seem that in psy- 
chodynamic issues, one man’s theory is another 
man’s fact. At the present stage of knowledge, 
therefore, much research effort must still be di- 
rected toward the primary task of observing, study- 
ing, and sorting phenomena in order to produce 
questions that can be answered with demonstrable 
public evidence and replicable procedures. As part 
of this process it is important for research workers 
to visualize the causes of psychiatric disorder not 
only in terms of multiple origins that range from 
the psychological to the organic but also in terms 
of factors that precipitate overt impairment and 
favor its perpetuation. From some points of view 
the critical question may be less “What are the 
psychodynamic roots of a neurosis?” and more 
“What are the factors which evoke certain patterns 
of feeling and behavior to such an extent that these 
are seriously disturbing to the person, to others, 
or to both?” 

Thinking along this line has led in recent years 
to the formulation of such concepts as the “social 
breakdown syndrome” (Gruenberg 1966), the “cul- 
ture of poverty” (Lewis 1961), and “sentiment pat- 
terns characteristic of sociocultural disintegration” 
(Hughes et al. 1960; D. Leighton et al. 1963). 
The social breakdown syndrome, in which with- 
drawal and hostility in patients are prominent 
features, is considered the result of social processes 
in large hospitals for chronic patients and to be 
superimposed on the schizophrenic, depressed, or 
senile condition or whatever it is that has led to 
hospitalization. The syndrome is regarded as seri- 
ous because it is believed to compound disorder, 
prevent recovery, and cause severe deterioration. 
The culture of poverty and sentiments character- 
istic of sociocultural disintegration refer to very 
similar states of feeling and behavior in nonpa- 
tients living in deprived social systems of natural 
communities rather than in the back wards of 
hospitals. 

The congruence of these independently devel- 
oped concepts is very striking, and it raises a num- 
ber of important questions. For example, to what 
extent is the epidemiological finding of high preva- 
lence of psychiatric disorder in low socioeconomic 
groups a manifestation of the social breakdown 
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syndrome? And further, since large chronic mental 
hospitals draw disproportionately from low socio- 
economic groups, to what extent is the presence 
of the social breakdown syndrome in these institu- 
tions due to many patients already having the 
syndrome at the time they enter? 

Looking at the matter from a different point of 
view, it may be asked whether one can transfer 
to communities what is learned experimentally in 
hospitals about the kinds of social processes that 
produce and prevent the social breakdown syn- 
drome. This concern obviously marks a confluence 
point of the processes at work in both individual 
and societal functions. 

Confluence points of this kind are characteristic 
of research in social psychiatry. The range of the 
whole field has already been outlined, but it may 
be further illustrated by listing more specific topics, 
such as epidemiology; transcultural and crosscul- 
tural psychiatry; culture and personality; social 
processes in psychiatric hospitals and other re- 
lated therapeutic institutions; mental health and 
morale in the armed services, industry, and other 
nontherapeutic organizations; small-group studies, 
particularly those concerned with group psycho- 
therapy; the relationships of various social roles 
to mental health; family patterning and therapy 
of families as groups; experimental studies of 
mothers and young among animals and of animal 
populations considered as social systems. It is of 
course not possible to deal with all of these in the 
space of this article, but four may be selected as 
examples for discussion: epidemiology, transcul- 
tural and crosscultural psychiatry, social process in 
hospitals, and small-group studies. 

Epidemiology. Epidemiology is one of the main 
avenues by which social psychiatry has sought 
clarification of its questions (Lin & Standley 1962; 
Hoch & Zubin 1953; Hollingshead & Redlich 1958; 
Hughes et al. 1960; Jaco 1960; The Midtown Man- 
hattan Study 1962-1963; D. Leighton et al. 1963; 
Rennie 1955). Mapping the distribution, frequency, 
and types of disorders is basic to the understand- 
ing of environmental influences. It is of interest, 
therefore, to find that virtually all studies show 
marked variation in disorder prevalence in relation 
to demographic and social characteristics such as 
age, sex, marital status, socioeconomic level, and 
degree of sociocultural disintegration. In other 
words, numbers of relationships have emerged 
that suggest targets for more crucial investiga- 
tion aimed at distinguishing antecedents from con- 
sequents. 

To do this one must define and measure varia- 
tion in the sociocultural systems in which these 
clusterings of psychiatric disorder occur. While 


some workers have been concerned with socioeco- 
nomic class and others with states of “integration” 
or the functional cohesiveness of social systems, 
such efforts are just a beginning. What must come 
now is more emphasis on the adaptation and de- 
velopment of theories from social psychology, so- 
ciology, and anthropology bearing on small-group 
and large-group processes as these are relevant 
to mental health research in general and psychi- 
atric epidemiology in particular. The necessity for 
method development, is of course, concomitant. 

By such steps it should be possible to narrow a 
gap that still exists in psychiatric epidemiology, 
despite much interdisciplinary cooperation, be- 
tween those who think in terms of individuals and 
statistics about individuals and those who think in 
terms of social structure and function. I believe 
that without this linking emphasis, psychiatric epi- 
demiology will bypass major opportunities for the 
advance of etiological understanding. 

The case-counting aspect of psychiatric epidemi- 
ology has a number of desiderata for development 
that may be summarized as: a more precise and 
standardized linking of terms with phenomena 
(Glueck et al. 1964; Hoch & Zubin 1953; D. Leigh- 
ton et al, 1963; Ward et al. 1962); the establish- 
ment of comparable methods of keeping hospital 
and other relevant cumulative records; the expan- 
sion of case registries (Bahn 1962; 1965); the 
achievement of greater reliability and validity in 
the survey methods of making estimates of preva- 
lence and incidence; and the speeding up of the 
steps involved in ratings and statistical analyses 
through the use of computers. [See MENTAL DIS- 
ORDERS, article on EPIDEMIOLOGY; see also Dale 
1964; Overall et al. 1964; Pearson et al. 1964; 
Swenson et al. 1965; Smith 1967.] 

Surveys. Because the survey technique is rela- 
tively new and has promise of being one of the 
major advances in the field, it deserves an addi- 
tional word of comment. The method essentially 
consists of drawing a representative sample from 
a general population and conducting a systematic 
interview with each individual in the sample by 
means of a prepared schedule of questions. The 
results of the interviews are evaluated, usually by 
psychiatrists, and a score is assigned to every re- 
spondent. The aim is to estimate the true preva- 
lence of psychiatric disorder in a chosen population 
independently of the amount of treatment provided 
and accepted. 

The survey procedure suffers at present from 
being both overvalued and undervalued. On the 
one hand, results are naively accepted without ade- 
quate understanding of the way in which they were 
obtained. On the other, they are naively rejected 
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as unpsychiatric, if not unscientific, and it is some- 
times asserted that they have no known relation- 
ship to what a clinician means when he makes a 
diagnosis. 

In point of fact, considerable work has been 
done on reliability (in the sense of agreement be- 
tween psychiatrists who conduct their ratings in- 
dependently) and on validity (in the sense of 
agreement between survey and independent clini- 
cal judgment based on direct examination of indi- 
viduals), While serving to highlight certain weak- 
nesses and to point up priorities for advancement, 
such investigations support, on the whole, both the 
survey as an approach and most of the main find- 
ings obtained so far. [See INTERVIEWING; PSYCHO- 
METRICS; SURVEY ANALYSIS.] 

Transcultural and crosscultural psychiatry. An 
overlap of transcultural and crosscultural psychiatry 
with epidemiology exists but each subsumes inde- 
pendent spheres as well (Murphy & Leighton 
1965). Examples of such overlap are the compara- 
tive analysis of the incidence of a clinical entity 
(e.g., schizophrenia) in two or more cultures and 
the attempt to probe deeply into cultural varia- 
tion in underlying psychological processes through 
dream analysis, projective tests, and the produc- 
tions of psychiatrically disordered persons. In the 
past there has been concern with culturally de- 
termined basic personality and the effect this 
might have on the generation of disorder (see, e.g., 
Holmberg 1950). 

There is a fairly large literature on syndromes 
such as piblokto, latah, koro, and susto, which have 
been thought to be peculiar to one or another cul- 
tural group. In most instances these disorders 
appear to be local variations of conditions such as 
hysteria or anxiety rather than fundamentally new 
types of illness that are otherwise unknown. 

An important area that is progressively receiving 
more and more attention is the comparative ex- 
amination and analysis of treatment methods in 
various cultures. There are apparently no cultures 
that fail to recognize psychiatric disorders and 
none that do not provide some sort of treatment. 
[See CULTURE AND PERSONALITY.] 

Social process in hospitals. A third example of 
a research area views the psychiatric hospital as 
a societal system and is concerned with the rela- 
tions between the way the system functions and 
the consequences for the patients’ mental health 
(Caudill 1958; Stanton & Schwartz 1954; Levinson 
& Gallagher 1964). The behavioral science tech- 
niques utilized include ethnographic description, 
participant observation, the recording of interac- 
tions, systematic interviews, sociometric mappings, 
and psychological tests. Although some of the most 
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important investigations have been observational 
and comparative, experimentation has also been 
done through such methods as introducing an in- 
novation in one part of a hospital while using other 
parts as controls. There have also been what might 
be called “trial studies” in which a new way of 
structuring an entire hospital is put into effect and 
the results studied by independent observers. Max- 
well Jones's “therapeutic community” (1952) and 
Rapoport’s analysis of it is a notable example [Rapo- 
port et al. 1961; see also MENTAL DISORDERS, 
TREATMENT OF, article on THE THERAPEUTIC 
COMMUNITY]. 

The most difficult technical problem in much 
of this research is that of measuring change in 
the dependent variable, namely the patient's men- 
tal health, Although the task of the hospital is the 
improvement of the patients’ health, and practical 
decisions have to be made every day as to whether 
improvement has or has not occurred in specific 
patients, objective standards remain elusive and a 
handicap to rigorous hypothesis testing. This is 
of course a difficulty common to studies of all types 
of psychiatric therapy, including the evaluation of 
drug effects. 

Small-group research. Small-group studies con- 
stitute our final research illustration. As the term 
is commonly employed, “small group” refers to a re- 
current gathering of individuals who are collec- 
tively trying to accomplish something. The number 
involved is generally between 5 and 15, such as 
might assemble around a table. Characteristically, 
the group has a longer life than the presence of 
any of its individual members. Groups of this kind 
are a world-wide phenomenon and of major im- 
portance in societal processes. 

The observational and theoretical reasons for 
maintaining that the small group can have an in- 
fluence on psychiatric patients are numerous and 
complex, but a few may be selected for illustra- 
tion. One is the fact that strong emotions are apt 
to be engendered among the members, emotions 
having to do with acceptance and admiration or 
rejection and scorn. It is much more difficult to be 
an isolated, indifferent, or preoccupied bystander 
in a small group than in a large one. Fear, anger, 
anxiety, depression, and euphoria commonly suc- 
ceed each other during the meetings at various 
levels of intensity and are transacted through a 
variety of interpersonal dramas. There is thus po- 
tential for arousing strong motivation of some kind 
in the individual members. 

Another characteristic of small groups is that 
each is made up of a number of roles which 
provide opportunities and enjoin limitations and 
which are taught to every new member. When the 


628 PSYCHIATRY: Social Psychiatry 


new member is a patient and the group has ob- 
jectives relevant to therapy, this process can pos- 
sibly reduce the patient’s symptoms and increase 
his social capacities. The small group has there- 
fore molding as well as motivating potential. 

Numbers of industrial and other studies have 
shown that the morale and productive ability of 
small groups is increased when the group as a 
whole has a measure of self-determination. Each 
member then has an opportunity of exerting some 
influence and experiencing a sense of worth and 
belonging—of being somebody who matters. The 
greater the cohesiveness of the group, the greater 
this effect is apt to be. When these forces are act- 
ing on a patient in a group of his peers, the effect 
can be one of mitigating depressive and inferiority 
feelings or of shearing away the idiosyncratic and 
asocial aspects of schizophrenic behavior. 

Other therapeutic properties of the small group 
include opportunity for catharsis, self-expression 
through dramatization, and the relinquishing of 
old defense patterns and the trying out of new 
ways of dealing with one’s self and others in a rela- 
tively safe context. The sense of safety is perhaps 
more a feature of therapeutic groups than of other 
kinds because it is deliberately fostered, but it is 
also part of the sense of belonging and so is a 
component in the emotional climate of any co- 
hesive group. [See COHESION, SOCIAL; GROUPS; 
MENTAL DISORDERS, TREATMENT OF, article on 
GROUP PSYCHOTHERAPY.] 

The critical question about all these various 
possibilities is, do they work? Can group forces 
be harnessed in the service of treatment? The 
answers to these questions have not yet been 
found, but one can safely predict that when they 
are they will not be in any simple form. It is prob- 
able that some kinds of group processes will be 
helpful with some kinds of disorders, while other 
combinations will turn out to make no difference 
or will be actively harmful. In the meantime we 
can say that there is an enormous amount of evi- 
dence pointing to the effectiveness of group proc- 
esses and that the outlook is promising; however, 
no studies known to this author are as yet de- 
finitive. 

One study of exceptional interest has addressed 
itself to the possibility of forming autonomous 
problem-solving groups among hospital patients in 
order to see whether such groups have any positive 
effect on the patient’s mental health (Fairweather 
1964). The design for the experiment was a two- 
by-four analysis of variance, factorial type, with 
two treatment and four diagnostic categories. The 
two treatment groups were made up of two wards 
of male patients matched to an extraordinary de- 


gree; their treatments differed only in that one ran 
according to traditional lines while the other had 
in addition the requirement that every patient join 
for two hours a day, five days a week, in a task- 
oriented, decision-making small group. The experi- 
ment, involving a total of 195 patients, lasted 27 
weeks with a six-month follow-up and included an 
unusually large number and variety of observa- 
tions designed to detect and measure results. 
Among the findings were the following: the pa- 
tients who spent some of their time in small groups 
showed in their general behavior more physical 
activity than did those who experienced only the 
traditional ward program. More important, they 
showed greater social participation, more affability, 
and more eagerness to communicate with their 
peers in complex social relationships. As a whole, 
the ward containing the small groups displayed 
greater feeling of unity and social attraction among 
its members. Although the patients found the 
small-group ward taxing and many wished they 
were on a different one, many of them felt to a 
greater extent than did patients on the other ward 
that their treatment program had helped them. On 
the average, the small-group patients had 40 fewer 
days of hospitalization. In contrast to this, the six- 
month follow-up showed no statistically significant 
difference between the two wards in the percentage 
of patients returning again to the hospital. This 
suggests that while 40 fewer days of hospitaliza- 
tion is a gain in that it is not neutralized by early 
return, the problems of posthospital adjustment 
are not helped by the small-group treatment in its 
present form. This indicates the need for experi- 
mentation regarding the possibility of continuing 
this kind of small-group participation among pa- 
tients after they leave the hospital. 


ALEXANDER H. LEIGHTON 


[Directly related are the entries MENTAL DISORDERS; 
MENTAL DISORDERS, TREATMENT OF. Other relevant 
material may be found in CLINICAL PSYCHOLOGY; 
SCHIZOPHRENIA; SOCIAL work; and in the biography 
of SULLIVAN.] 


BIBLIOGRAPHY 

AMERICAN PUBLIC HEALTH ASSOCIATION, TECHNICAL DE- 
VELOPMENT BOARD, PROGRAM AREA COMMITTEE ON 
MeNTAL HEALTH 1962 Mental Disorders: A Guide 
to Control Methods. New York: The Association. 

Baun, Anita K. 1962 Psychiatric Case Register Confer- 
ence: 1962. Public Health Reports 77:1071-1076. 

Bann, Anita K. 1965 Experience and Philosophy With 
Regard to Case Registers in Health and Welfare. Com- 
munity Mental Health Journal 1:245-250. 

BAsTIDE, Rocer 1965 Sociologie des maladies mentales. 
Paris: Flammarion. 

BELLAK, LeEoPoLD (editor) 1964 Handbook of Commu- 
nity Psychiatry and Community Mental Health. New 
York: Grune & Stratton. 


CALHOUN, Jonn B. 1962 Population Density and Social 
Pathology. Scientific American 206, no, 32:139-148. 

CaupiLL, WILLIAM 1958 The Psychiatric Hospital as a 
Small Society, Cambridge, Mass.: Harvard Univ, Press. 

Date, P, W. 1964 Preliminaries to Programming a Com- 
puter for Psychiatric Diagnosis. Methods of Informa- 
tion in Medicine 3, no. 1:33-34. 

ENCYCLOPÉDIE Mépico-CumurcicatE 1955 Psychiatrie. 
Edited by Henry Ey. 3 vols. Paris: The Encyclopédie. 
—> See especially the subsection “Socio-psychiatrie.” 

FAIRWEATHER, GEORGE 1964 Social Psychology in Treat- 
ing Mental Illness: An Experimental Approach. New 
York; Wiley. 

GARDNER, ELMER A. et al. 1963 All Psychiatric Experi- 
ence in a Community. Archives of General Psychiatry 
9:369-378. 

Guueck, B. C. et al. 1964 The Quantitative Assessment 
of Personality, Comprehensive Psychiatry 5: 15-23, 

GOLDSTON, STEPHEN E. (editor) 1965 Concepts of Com- 
munity Psychiatry: A Framework for Training. US. 
Department of Health, Education and Welfare, Public 
Health Service, Publication No. 1319, Washington: 
Government Printing Office. 

GREENBLATT, MILTON et al. (editors) 1957 The Patient 
and the Mental Hospital: Contributions of Research 
in the Science of Social Behavior. Glencoe, Ill.: Free 
Press. 

GRUENBERG, ERNEST (editor) 1966 Evaluating the Ef- 
fectiveness of Mental Health Services. Milbank Memo- 
rial Fund Quarterly 44, no. 1, part 2. 

Hocu, PauL H.; and ZUBIN, JOSEPH 1953 Current Prob- 
lems in Psychiatric Diagnoses. New York: Grune. 
HorLNcsHeEaD, Aucust B.; and REDLICH, FREDRICK c. 
1958 Social Class and Mental Illness: A Community 

Study. New York: Wiley. 

Houmperc, ALLAN R. 1950 Nomads of the Long Bow: 
The Siriono of Eastern Bolivia. Smithsonian Institu- 
tion, Institute of Social Anthropology, Publications, 
No. 10, Washington: Government Printing Office. 

Hucues, Cnarzes et al. 1960 People of Cove and Wood- 
lot: Communities From the Viewpoint of Social Psy- 
chiatry. The Stirling County Study of Psychiatric Dis- 
order and Sociocultural Environment, Vol. 2. New 
York: Basic Books. 

Jaco, E. Garry 1960 The Social Epidemiology of Men- 
tal Disorders: A Psychiatric Survey of Texas. New 
York: Russell Sage Foundation. 

Jones, MaxwELL et al. (1952) 1953 The Therapeutic 
Community: A New Treatment Method in Psychiatry. 
New York: Basic Books. > First published as Social 
Psychiatry: A Study of Therapeutic Com munities. 

LEIGHTON, ALEXANDER H, 1959 My Name Is Legion: 
Foundations for a Theory of Man in Relation to Cul- 
ture. The Stirling County Study of Psychiatric Dis- 
order and Sociocultural Environment, Vol. 1. New 
York: Basic Books. 

LEIGHTON, ALEXANDER H. 1960 An Introduction to So- 
cial Psychiatry. Springfield, Ill.: Thomas. 

LrIGHTON, DororHea et al. 1963 The Character of 
Danger. The Stirling County Study of Psychiatric Dis- 
order and Sociocultural Environment, Vol. 3. New 
York: Basic Books. 

Levinson, DANIEL; and GALLAGHER, EUGENE 1964 Pa- 
tienthood in a Psychiatric Hospital: An Analysis of 
Role, Personality, and Social Structure. Boston: 
Houghton Mifflin. 

Lewis, Oscar 1961 The Children of Sánchez: Autobiog- 
raphy of a Mexican Family. New York: Random 
House. 


PSYCHIATRY: Forensic Psychiatry 629 


Lin, Tsune-v1; and STANDLEY, C. C. 1962 The Scope 
of Epidemiology in Psychiatry. Public Health Papers, 
No. 16. Geneva: World Health Organization. 

The Midtown Manhattan Study. 2 vols, 1962-1963 New 
York: McGraw-Hill. > Volume 1: Mental Health in 
the Metropolis, by L. Srole et al. Volume 2: Life Stress 
and Mental Health, by S. T. Langner and S. T. 
Michael. Volume 2 was published by the Free Press. 

Murpuy, JANE M.; and LEIGHTON, ALEXANDER H. (editors) 
1965 Approaches to Cross-cultural Psychiatry. Ithaca, 
N.Y.: Cornell Univ. Press. 

Overatt, J. E; Hovtister, K.; and HoLLISTER, L. E. 
1964 Computer Procedures for Psychiatric Classifi- 
cation, Journal of the American Medical Association 
187:583-588. 

Pearson, J. S. et al. 1964 Further Experience With the 
Automated Minnesota Multiphasic Personality Inven- 
tory. Mayo Clinic, Rochester, Minn., Proceedings of 
the Staff Meetings 39:823-829. 

Rapoport, Rozert N, et al. 1961 Community as Doctor: 
New Perspectives on a Therapeutic Community. 
Springfield, Ill.: Thomas. 

Rennie, THomas A. C. 1955 Social Psychiatry: A Defi- 
nition. International Journal of Social Psychiatry 1, 
no. 1:5-13. 

SourHarp, E. E. 1919 Cross-sections of Mental Hygiene, 
1844, 1869, 1894. American Journal of Insanity 76: 
91-111. 

STANTON, ALFRED H.; and SCHWARTZ, M. S. 1954 The 
Mental Hospital: A Study of Institutional Participation 
in Psychiatric Illness and Treatment. New York: Basic 
Books. 

Swenson, WENDELL M. et al. 1965 A Totally Auto- 
mated Psychological Test: Experience in a Medical 
Center. Journal of the American Medical Association 
191:925-927. 

Warp, C. H. et al, 1962 The Psychiatric Nomenclature. 
Archives of General Psychiatry 7, no. 9: 198-205. 


Iv 
FORENSIC PSYCHIATRY 


In Western civilization there has long been an 
interest in the mental state of persons coming 
before the courts. It is to this branch of psychiatry 
that we apply the adjective “forensic.” The criteria 
for establishing mental states vary, depending on 
whether the immediate concern be testamentary 
capacity, compulsory hospitalization (commitment), 
the appointment of guardians, the validity of deeds 
or contracts, the relation of emotional states to va- 
rious symptoms (tort law), annulment or divorce, 
credibility of witnesses, fitness for trial, or criminal 
responsibility. Furthermore, the criteria vary from 
jurisdiction to jurisdiction; in France, for instance, 
an act is simply considered not to be a crime where 
there is a condition of mental disorder (démence), 
whereas the Anglo-Saxon courts have presumed 
to set up definitions, of greater or lesser clarity, of 
the symptoms and types of mental disorder which 
will entitle the accused to a verdict of “not guilty 
by reason of insanity.” 

It may be assumed that these “tests,” set up 
at varying times, are far from being in line with 
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modern psychiatric standards. After all, the law 
cannot be expected to lead; it must, rather, follow 
accepted standards. The chief difficulty is that the 
law changes in a very leisurely fashion. Both law 
and psychiatry are concerned with human be- 
havior, but the viewpoints are in some respects far 
apart. To the psychiatrist, the patient is an indi- 
vidual, with problems largely of emotional origin, 
whereas the law deals with him as a willful of- 
fender against a code of generalized rules which 
apply to the average man. That is, the law assumes 
that the person possesses free will, whereas psy- 
chiatry recognizes the potency of the unconscious 
in the motivation of the individual. At the same 
time, the law does recognize some mitigating or 
nullifying factors, such as “heat of blood.” 

The usual layman's concept of forensic psy- 
chiatry is that it deals with criminal cases—the 
defense of insanity, mental fitness for trial, and 
so forth. As a matter of fact, however, criminal 
actions occupy but a small proportion of the fo- 
rensic psychiatrist's attention. For general pur- 
poses, we will be concerned primarily with civil 
rather than criminal law. 

Tort law. One type of civil action has to do with 
what one person has inflicted on another, either 
by negligence or by intent. For example, suits based 
on injury attributed to automobile accidents come 
in this category. 

Emotional shock is often alleged as the cause 
of various symptoms, and sometimes unduly gen- 
erous verdicts are allowed by the courts, apparently 
on the erroneous theory that traumatic neurosis is 
incurable. There are many types of action that fall 
under this heading, such as malpractice suits 
against physicians, but many of these have not yet 
attracted much psychiatric interest. 

This is an area in which the law has been some- 
what laggard in the extent to which it is guided by 
psychiatric concepts. It must be added, however, 
that many law schools are providing instruction in 
psychiatric concepts. The American Bar Associa- 
tion has worked in conjunction with the American 
Psychiatric Association; the American Law In- 
stitute gave much attention to this area in the 
recent formulation of its Model Penal Code (U.S. 
v. Currens, 290 F. 2nd 751, 1961). The Ameri- 
can Bar Foundation has not only published a 
compendium of the laws relating to mental dis- 
order and the law (American Bar Foundation 
1961), but also is making extensive field studies 
of the actual operation of those laws. Further still, 
the law, by statute and decision, has materially 
extended the scope of the definition of mental dis- 
order to include not only “insanity” in its old sense 
but psychoneurosis, sexual deviation, drug addic- 


tion, alcoholism, and (on occasion) juvenile delin- 
quency, psychopathic or sociopathic personality, 
mental defect, and mental disorder of degree, to 
different extents in the various jurisdictions. 

Wills. In most cases, the person who is the 
subject of litigation in the matter of a will is not 
available for examination. In the event that the 
validity of a will is disputed on psychiatric grounds, 
if the psychiatrist has known the testator and is 
not inhibited by a statute of privilege, he may test- 
ify on the basis of his own knowledge. Otherwise, 
he must answer a “hypothetical question” about the 
testator’s mental state, on the assumption that 
some (or all) of the facts pertaining to the mental 
state of the testator testified to are true. 

A will may be disallowed if the testator is found 
generally incompetent mentally or the victim of 
“undue influence.” If a delusion had an obvious 
influence on a provision of the will, it may be dis- 
allowed on that ground. The legal definition of a 
delusion contains elements, however, with which 
the psychiatrist can hardly agree; briefly, if any 
factual basis existed, then there is considered to be 
no delusion. Even the most florid delusion may 
have a kernel of factual basis, 

Contracts and deeds. The validity of a contract 
or deed may occasionally be questioned on the 
grounds that the contractor was mentally incompe- 
tent. In general, it may be said that the law requires 
somewhat more mental acuity for a valid contract 
or deed than is required in the making of a will 
(Overholser 1959). Very few cases of this sort 
arise. 

Annulment and divorce. There has long been 
provision for a judicial finding that a marriage is 
invalid if there existed in one of the partners an 
extreme mental disorder or defect, or a state of 
drunkenness, at the time the ceremony was per- 
formed. A number of states have provided for 
divorce on the ground of “incurable insanity” 
where one of the partners has spent a certain num- 
ber of years of continuous confinement in a mental 
hospital. 

Guardianship. There are various reasons why a 
person is unable to act in his own behalf. In such a 
case the court may appoint a guardian or commit- 
tee to act for him. This process has no necessary 
connection with compulsory institutionalization, 
although in some jurisdictions the two proceedings 
are as one, that is, the order of commitment is an 
automatic adjudication of incompetency. 

Hospitalization. Until very recently, nearly all 
admissions to psychiatric hospitals resulted from 
court order, after a finding (usually) that the pa- 
tient is, by reason of mental disorder, “dangerous 
to himself or others.” The order authorized his de- 
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tention until, in the opinion of the staff, he could 
safely be released (subject, of course, to court 
order). In general, release was a medical matter. 
More recently, with the advent of a therapeutic, 
rather than custodial, atmosphere, voluntary ad- 
missions are encouraged, and efforts are being 
made to make the process of admission as informal 
as it is in general hospitals. In fact, most general 
hospitals now provide psychiatric facilities. The 
advantages of this procedure are obvious as steps 
in removing the stigma of mental disorder which 
still prevails, although to a diminishing extent. The 
educational campaigns carried out by the National 
Association for Mental Health and by local mental 
health associations are bearing fruit. Much more 
might be said on the subject of mental hospitals, 
such as their use as receptacles for the so-called 
sexual psychopath or those offenders acquitted by 
reason of insanity, but space does not permit. 

Criminal law and procedure. So far we have 
considered actions between persons. In criminal 
actions, law deals with actions against society. The 
accused is prosecuted in the name of the state or 
county or country, as the case may be. Let us reit- 
erate that although criminal cases for various rea- 
sons attract wide attention, they are far fewer in 
number than other types of legal action. The ac- 
cused, too, is presumed to be innocent until proved 
guilty and has certain rights which the prosecution 
must respect. 

One of the elements of most crimes is what is 
known as criminal intent, or mens rea. Until 1843 
only the most “furiously mad” were thought to be 
sufficiently deranged to meet the test of lunacy, or 
whatever it might be termed then. In that year, in 
the case of Daniel M’Naghten (M’Naghten’s Case, 
10 Cl. & Fin. 200, 1843), the judges of England 
attempted to lay down the criteria of insanity— 
basically the “right and wrong” test. The MNaghten 
test is strictly a cognitive one, yet it is in force in 
most of the English and American courts today. 
It maintains that a person can be excused from 
liability in criminal proceedings only if he did not 
understand the nature of his act or that it was 
wrong. Shortly before (State v. Matthew Thomp- 
son, Wright’s Ohio Rep. 617, 1834), an American 
court had recognized the importance of emotional 
factors and had enunciated the so-called irresistible 
impulse test, now recognized in about twenty 
states. In 1871 the New Hampshire Supreme Court 
threw out all tests, holding that “insanity” is a fact 
to be determined by the jury (State v. Jones, 50 
N.H. 369, 1871). 

In 1954 (Durham v. United States, 214 F. 2nd 
862, 1954) the District of Columbia followed suit 
in the famous Durham decision, the opinion stating 
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that the question to be decided by the jury is 
whether the act is the product of mental disease or 
defect. Several variants have been proposed 
(McDonald v. United States, 284 F. 2nd 232, 1960; 
Lynch v. Overholser, 369 U.S. 705, 1962), but the 
Durham decision still remains a pioneer. 

The evidence on mental status is given in an 
atmosphere where adversaries oppose each other— 
hardly a situation that appeals to a physician. 

Among other functions of the psychiatric expert 
is the determination whether witnesses are fit to 
testify. Courts, however, do not avail themselves 
of this opportunity very often and depend, rather, 
on the judge’s psychiatric insight. 

Remedies—attempted and possible. It should 
not be thought that psychiatrists and lawyers are 
content with the status quo of definitions and pro- 
cedures. Various groups have been at work on the 
problems, including the American Law Institute, 
the American Bar Foundation, and the American 
Psychiatric Association. Space, unfortunately, does 
not permit a discussion of the problem, For the 
long pull, however, there is some reason, despite 
the well-known hesitancy of the law to change, to 
be guardedly optimistic. 

WINFRED OVERHOLSER 


[See also Law. Other relevant material may be found 
under CRIME; CRIMINAL LAW; CRIMINOLOGY; DE- 
LINQUENCY; INTERNMENT AND CUSTODY; MENTAL 
DISORDERS; PENOLOGY; PSYCHOPATHIC PERSONAL- 


iry.] 
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v 
THE RELIGIO-PSYCHIATRIC MOVEMENT 


Religio-psychiatry is a twentieth-century move- 
ment whose participants are concerned with the 
relation between religious and scientific approaches 
to mental, emotional, or spiritual healing. The par- 
ticipants are, by and large, clergymen and psychia- 
trists who, disenchanted with their respective in- 
stitutional traditions of healing, introduce a form 
of healing which combines religious and psychia- 
tric means to relieve emotional suffering, and the 
clients who seek help from these innovators. Some 
traditionalists in the ministry and in psychiatry, 
who attack the innovators and engage them in 
ideological debate, enter the action as antagonists. 
The movement is manifest in a literature which 
debates and publicizes the ideology; in professional 
associations through which its protagonists share 
ideas; in clinics for mental, emotional, and spir- 
itual healing; and in training facilities for recruit- 
ing and socializing new participants. 

Historical background. Since classical times, 
science and religion have become institutionally 
differentiated. Science has come increasingly to 
shoulder primary responsibility for investigating 
and explaining natural events. Religion has re- 
tained a responsibility for guiding human attitudes 
toward those events. Mental and emotional heal- 
ing, once the concern of what may be termed, 
anachronistically, religio-science, continues to con- 
cern religious pastors as well as scientific psychia- 
trists. 

Chemists, physicists, and biologists who special- 
ized in applying the knowledge of their respective 
disciplines to the problem of healing prepared the 
ground for a medical specialty dealing with mental 
and emotional disorders. Psychiatric specialists 
appeared toward the close of the eighteenth cen- 
tury. Their scientifically influenced approach is 
characterized by differential diagnosis of the dis- 
ease in terms of the malfunctioning of the organ- 
ism, by the empirical discovery of propositions 
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relating therapeutic techniques to those malfunc- 
tions, and by the application of the indicated tech- 
niques in an effort to restore normal functioning 
(Zilboorg 1941). 

Pastoral responsibility for emotional healing has 
traditionally fallen to the same clergyman who has 
acted as priest, teacher, and prophet. Western re- 
ligion has termed this the “cure of souls” because 
the emotional manifestations have been considered 
symptomatic of a fundamental derangement of life 
values and of a distortion in the relation between 
man and the transcendental. A few medieval and 
early modern Christian clergymen wrote peniten- 
tials as a way of enacting their roles as spiritual 
directors. A specialized ministry to those suffering 
mental and emotional distress awaited the twen- 
tieth century. Religious healing is concerned with 
rectifying the sufferer’s distorted relation to nature, 
man, the cosmos, or the “ground of meaning,” 
through the mediation of a personal relation with 
the pastor (McNeill 1951). 

Thus, pastoral and psychiatric roles have 
emerged side by side in Western society. They 
represent the applied specialization within the re- 
ligious and scientific institutions, respectively, to 
meet the human exigency manifested by emotional 
distress. The religio-psychiatric movement is the 
product of an attempt by some pastors and psychia- 
trists to integrate their means and their goals. 

Emotional and spiritual problems. There is 
little agreement among participants about criteria 
for discriminating between emotional and spiritual 
problems and, consequently, about criteria for as- 
signing a problem to a medical or a religious 
specialist. The same symptomatology supports 
either diagnosis. Some differentiate between illu- 
sory problems (a concern of psychiatry) and real 
problems (a concern of the ministry). The psychia- 
trist would thus be concerned with guilt feelings 
when there is no objective basis for the feeling, 
while real guilt—for example, following a trans- 
gression against a fellow man—would be a relig- 
ious problem. Roman Catholic priests are cau- 
tioned to discriminate between “scrupulosity,” a 
compulsive neurosis leading to detailed confessions 
of partly imagined misdeeds, and the appropriate 
confession of sin, a religious concern (Corcoran 
1957). 

Despite the paucity of criteria for distinguishing 
emotional from spiritual problems, there is some 
anxiety among ministers that psychiatrists may 
reduce the religious concern of a patient to its 
psychological component and so render disservice 
to the patient and to the church. Conversely, psy- 
chiatrists, both in and out of the movement, fear 


633 


that ministers may not recognize such phenomena 
as the hallucinatory elements in a religious ex- 
perience. 

Research evidence for the effectiveness of psy- 
chiatric treatment is far from conclusive (Berelson 
& Steiner 1964). Systematic empirical evidence for 
the effectiveness of religious healing or of religio— 
psychiatric healing is almost totally lacking. Pro- 
tagonists of the movement cite medical studies 
such as those of healing at Lourdes (West 1957); 
anthropological reports of healing in primitive so- 
cieties through the intervention of charismatic 
“Jeeches” or healers (Rivers 1924); the power of 
religio-hypnotic phenomena such as voodoo heal- 
ing and death (Métraux 1958); and an oral and 
written tradition of testimonials, 

Given both the difficulty in distinguishing emo- 
tional from spiritual problems and the limited evi- 
dence for the effectiveness of religio—psychiatric 
therapy, the growth and persistence of the religio— 
psychiatric movement must be understood in terms 
of other social, psychological, and cultural func- 
tions it may subserve. 

Conditions promoting the movement. Protago- 
nists of the movement argue that religio—psychiatry 
responds to a demand for pastoral services where 
psychiatric personnel and facilities are limited. The 
geographical location of the movement, however, 
shows a simple ecological correlation. Religio- 
psychiatry arises in precisely those environments 
having the most psychiatric facilities. The religio— 
psychiatric movement is found in the urban centers 
of Western countries. Over two-thirds of its publi- 
cations are in English, and most of these are pub- 
lished in the United States. France and Germany 
are secondary publication centers. The movement 
is centered in industrialized and scientifically ori- 
ented milieus (Klausner 1964a). 

Religio-psychiatry is the product of the same 
cultural conditions that produced the mental health 
movement. This wing of the mental health move- 
ment has, however, its own particular characteris- 
tics. It seems to draw ministers and psychiatrists 
who are serious about both their religious and sci- 
entific commitments and who feel that these insti- 
tutions impose conflicting behavicral demands. 
Ministers with advanced secular education (77 per 
cent of the ministerial authors hold doctorates) 
who serve denominations of higher socioeconomic 
status tend to have a greater affinity for scien- 
tific culture, which values empirically assessable 
achievements. These ministers are also committed 
to religion’s stress upon purpose, meaning, and the 
attainment—through personal relationship—of 
nonempirical ends. They accept both of these cul- 
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tural systems as relevant to the same social role, 
that of counselor to the emotionally distressed. 

Certain psychiatrists are exposed to this dual 
cultural demand. Some have been personally reli- 
gious since their youth; others have come to reli- 
gion through wartime experiences; and others— 
because they are members of, and receive referrals 
from, hierarchically structured churches, such as 
the Roman Catholic and Episcopal—are responsive 
to a demand for a more religiously oriented 
therapy. 

Growth of the movement. Klausner (1964a; 
1964b) examined 1,853 books and articles pub- 
lished between the turn of the century and the end 
of 1962, in the field of religion and psychiatry. The 
publication rate may reflect developmental patterns 
of the religio-psychiatric movement. Growth, as 
measured by number of publications per year, was 
quite slow between the turn of the century and the 
beginning of World War 11; the rate of growth 
thereafter, however, increased rapidly. During the 
1950s, though each year witnessed a greater num- 
ber of publications than the preceding one, the 
rate of increase began to decline. It is conceivable 
that the movement, as reflected in its literature, 
will attain its maximum size by the late 1960s or 
early 1970s, Thereafter, barring new factors and 
generalizing from the growth pattern of other so- 
cial movements and assuming a correlation be- 
tween literary and organizational aspects, religio— 
psychiatry may be expected to begin its decline, 

Stages in the growth of the religio—psychiatric 
movement are reflected in shifting thematic em- 
phases in its literature. Comparison of religious 
and scientific conceptions of man, writ large, was 
the overriding theme prior to World War 1. This 
theme reflects an early interest in exploring the 
possibility of relating the two institutions. After 
the war, interest shifted to comparing conceptions 
of emotional and mental deviance. The issues, 
while still theoretical, were more narrowly defined. 
Practical questions of counseling the emotionally 
disturbed and questions arising around relations 
between ministers and psychiatrists drew relatively 
more interest by the time of World War 11. This 
may reflect the crystallization of ideology in organ- 
izational form in the establishment of clinics and 
training centers. Recently, the training of ministers 
for a psychological-counseling role has drawn rela- 
tively greater attention in the literature. Concern 
with training may reflect consolidation of a new 
religio—psychiatric role. 

Characteristics of participants. What are the 
characteristics of the ministers and psychiatrists 
who constitute the movement? This question may 


be answered in part from information available 
about writers in the field. 

Ministers and psychiatrists are not the sole con- 
tributors to the literature of the movement, as 
shown in Table 1, but they are the major contrib- 
utors, Psychiatrists, however, have had more influ- 
ence on the ideology, especially in the movement's 
early stages. This may be because the movement 
occurs within scientifically oriented societies, be- 
cause minister-participants actively seek scientific 
knowledge, and because the ministers in the move- 
ment have tended to be younger than the psychia- 
trists. Minister-authors have been, modally, in their 
thirties and forties at the time of writing, while the 
psychiatrists have been, modally, ten to twenty 
years older. The age factor and the scientific orien- 
tation of the societies in which the movement 
emerges conspire to place leadership in psychiatric 
hands. 

When the numbers of ministers and psychiatrists 
in the movement are compared with their respec- 
tive numbers in the society as a whole, it appears 


Table 1 — Characteristics of the authors and country of 
publication of books and articles in the field 
of religio—psychiatry® 


Author's occupation: 


Minister 55% 
Psychiatrist 19 
Psychologist 8 
Other 4 
Unknown 14 


Author's religion® 


Protestant 51% 
Episcopalian 8 
Presbyterian 5 
Methodist 5 
Congregational 4 
Lutheran 4 
Baptist 4 
Other denominations and denomination unknown 21 

Roman Catholic 28 

Jewish 8 

Religion unknown 13 

100% 

Country of publication: 

United States and Canada 72 

Germany (and German Switzerland) u 

England and other Commonwealth 7 

France (and French Switzerland) 6 

Southern Europe (Spain, Greece, Italy) 3 

Other 1 

100% 


a. Total number of publications = 1,853. 

b. Since the table is based on a total of publications rather than a total of 
individvals, the percentages accompanying each characteristic refer to the 
Proportion of items written by individuals with that characteristic rather 
than to the proportion of all the authors who possess that characteristic. 


Source: Adopted from Klausner 1964a, p. 65. 
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that a higher proportion of all psychiatrists than 
of ministers enters this movement. The counseling 
role is, of course, central to the psychiatric profes- 
sion, while the pastoral is but one role—and usu- 
ally not the central role—of the ministry. 

Table 1 also shows this to be predominantly a 
Protestant movement, Among Protestant members 
are found many Episcopalians, Presbyterians, 
Methodists, Congregationalists, and Baptists—but 
very few fundamentalists. Few ministers serving 
poorer parishes participate in the movement. Since 
the religio—psychiatric movement is centered in the 
United States, where Methodists and Baptists far 
outnumber the wealthier Episcopalians, Presbyte- 
rians, and Congregationalists, one may infer that 
ministers and psychiatrists belonging to these latter 
denominations are more likely to be drawn to the 
movement. 

In the United States, the proportion of Roman 
Catholics in the movement has reflected roughly 
the proportion in the population. Catholic interest 
in the movement increased during the 1950s fol- 
lowing the declaration of Pope Pius xm (1953) that 
the techniques, but not the philosophy, of psycho- 
analysis are acceptable to the church. In the 1960s 
the proportion of Catholic priests, but not of Cath- 
olic psychiatrists, participating in the movement 
began to decline. 

Jewish clerical interest has been meager. Histor- 
ically, the rabbinate has more often played an 
educational role than a pastoral one. Rabbis of 
Reformed Judaism, responsive to new role defini- 
tions among the Christian clergy, have accounted 
for most of the Jewish clerical interest. Jewish psy- 
chiatrists, particularly psychoanalysts, tend to be 
prominent contributors to general psychotherapeu- 
tic literature, and their writings are used in train- 
ing clergy of all faiths. 

Protestant and Roman Catholic concern tends, 
at times, to be with a search for a Christian psycho- 
therapy as an alternative to that so strongly 
influenced by Jewish psychiatrists or by nonreli- 
gious psychoanalysis. Between the 1930s and 1950s 
the movement became popular in Germany, Eng- 
land, and the United States, successively. This 
popularity paralleled that of the psychoanalytic 
movement in each of these countries. Religio- 
psychiatry seems to have emerged, in part, to 
countervail psychoanalysis. Nevertheless, Freudian 
influence on religio—psychiatric writings has in- 
creased steadily. Prior to World War 1, about one- 
third of all writings could be identified as primarily 
Freudian in orientation. By the 1960s about seven 
out of eight publications had a Freudian orienta- 
tion (Klausner 1964a). 
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New counseling norms. Ministers and psychia- 
trists, by participating in the movement, attempt 
to resolve the conflicting demands which they feel 
religious and scientific culture makes upon them. 
Their conflicts are revealed through complaints 
they voice against their respective institutions. 
These ministers and psychiatrists are critical of 
colleagues who argue that religious and scientific 
values are inconsistent. Ministers in this movement 
complain that contemporary religion fails to realize 
its ideal. Psychiatrists complain that lack of value 
commitment constitutes a gap in psychiatric prac- 
tice. Ministers observe that religion is ineffective 
in helping parishioners with their emotional prob- 
lems. Psychiatrists observe that traditional therapy 
is insufficiently effective. Some ministers feel that 
their lack of psychological training hampers their 
effectiveness in helping parishioners. This self- 
doubt finds little parallel among the psychiatrists, 
who feel little need for additional schooling in reli- 
gious concepts. Both the ministers and the psychia- 
trists assert that a combination of religious and 
psychiatric elements generates a superior, more 
effective psychotherapy. 

These ministers introduce psychiatric methods 
into their pastorate, while these psychiatrists intro- 
duce religious orientations into their counseling. 
Ministers advocate greater emphasis on techniques 
of counseling, on improving the psychological 
functioning of the counselee, and on maintaining 
an objective, detached stance. Psychiatrists advo- 
cate greater emphasis on personal relationships, 
on helping the patient to evolve a meaningful atti- 
tude to life, and on permitting their feeling about 
the patient to impinge upon the therapeutic rela- 
tionship. Some ministers and psychiatrists evolve 
a form of counseling which combines psychiatric 
techniques with goals conceived in religious value 
terms. At the extreme, some psychiatrists provide 
a religious type of counseling, and some ministers 
are hardly distinguishable from psychologists in 
either the means or goals of counseling. ‘ 

Both ministerial and psychiatric protagonists 
suffer attacks for advocating change in the coun- 
seling norms institutionalized in their respective 
groups. Ministers are criticized for distorting reli- 
gion by church leaders who assume the role of 
the movement's antagonists. The principal accusa- 
tions revolve about the introduction of what the 
traditionalists argue are hedonistic, deterministic, 
materialistic, and pansexualistic elements. Psychia- 
trists are criticized by members of the medical pro- 
fession for consorting with lay therapists and, by 
a few colleagues, for failing to deal with religion 
as symptomatic of an obsessional neurosis. 
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Legitimation. Ministers and psychiatrists in 
the religio—-psychiatric movement appeal to the in- 
stitutions to which they belong and to their clients 
to accept and approve their new activity. They 
offer four principal types of rationales or bases of 
legitimation. These are consolidating rationales, 
which argue that religion and science are really 
identical, that apparent differences are merely lin- 
guistic—for instance, that “revelation” in religion 
is the same as “insight” in psychiatry or that the 
psychiatric notion of “emotional security” has the 
same referent as the religious notion of “faith”; 
complementing rationales, which argue that reli- 
gion and science are separate but complementary 
—that, for instance, psychiatry deals with func- 
tioning and the stresses of life, while religion deals 
with purpose, and that a total therapy attends to 
both; harbinger rationales, which argue that the 
contributions of one profession may help toward 
the goals of the other—that, for instance, the solu- 
tion of emotional problems prepares a person to 
accept religious doctrine; and social rationales, 
which point to leading members of the group who 
introduced similar new norms—for example, the 
contention that Jesus was a practicing psychologist. 

Clinical services, New relations of ministers 
and psychiatrists to their institutions and to their 
counselees take concrete form in religio—psychiatric 
or pastoral counseling clinics. These clinics have 
been established mostly in the United States, but 
there are some in England, France, Germany, 
Switzerland, and Belgium. An early organizational 
attempt to coordinate ministerial and psychiatric 
work was that of the Emmanuel Church in Boston, 
in the first decade of the twentieth century. Those 
active around this clinic—especially Ellwood 
Worcester, a priest, and Isador Coriat, a psychia- 
trist—became known as members of the Emmanuel 
movement. 

During World War 1 a significant number of 
clergymen and psychiatrists were brought together 
by the military. The American military chaplaincy 
introduced parish preachers and teachers to a situa- 
tion built around the pastoral ministry. Following 
the war, hospitals provided another institutional 
setting for ministerial and psychiatric cooperation. 
Traditionally, parish ministers visited parishioners 
in hospitals. Returning military chaplains estab- 
lished a model for a specialized hospital chaplaincy, 
drawing upon a tradition developed in the religious 
orders and applying it in a medical context. The 
mental-hospital chaplaincy followed. Its early de- 
velopment in the United States is associated with 
the name of Anton Boisen, who interpreted his own 
psychotic episode as a religious struggle and com- 


mitted himself to the service of the mentally ill 
(1936). 

Probably most pastoral counseling takes place 
in the study of the parish clergyman, just as its 
counterpart, religiously oriented psychiatry, takes 
place in a private office. Where individual service 
gives way to a clinical staff, questions of sponsor- 
ship and allocation of tasks appear. Some pastoral 
clinics are sponsored by a church and have clerical 
staffs. These staffs consult with and refer to psy- 
chiatrists as the need arises. Others are sponsored 
by a denomination or group of denominations or 
by educational institutions. Religious institutions 
may sponsor traditional psychiatric clinics with no 
clergymen on the staff. Independent clinics may be 
established with clergymen, social workers, psy- 
chologists, and psychiatrists cooperating as part of 
the regular staff (McCann 1962), 

Clinical training. In the past, most chaplains 
entered upon their duties with no specialized 
clinical training. Some of them also had limited 
confidence in the worth of traditional religious ap- 
proaches to the cure of souls. Bexley Hall Episco- 
pal Seminary was probably the first seminary to 
provide formal clinical pastoral education by found- 
ing, in 1923, the Cincinnati Summer School in 
Social Work for Theological Students and Junior 
Clergy. Other Protestant, Roman Catholic, and 
Jewish seminaries followed with programs which 
generally combined classroom instruction in clin- 
ical psychology with internships in hospitals or 
church-affiliated clinics. 

Clinical training programs have involved psy- 
chiatrists in the training of ministers. These pro- 
grams have not been limited solely to chaplaincy 
training: they have also prepared parish clergymen 
to recognize psychological disturbances and to 
make appropriate referrals. Counseling training 
has been provided in a few instances. The relation- 
ship of continuing advice and referral has done 
much to sustain contact between ministers and 
psychiatrists in local communities. 

Literature. The religio—psychiatric movement 
has produced its own characteristic literature. It 
consists of books and articles published in special- 
ized, as well as general, theological and scientific 
journals. Perhaps 2,500 such items by psychiatrists 
about the role of religion in psychotherapy and by 
ministers about the place of psychiatric and psy- 
chological concepts in pastoral work were pub- 
lished between the turn of the century and 1962 
(Klausner 1964b). 

During the first decade of the twentieth century, 
a journal entitled Psychotherapy provided a com- 
mon platform for ministers and psychiatrists in the 
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United States. This was followed by Religion und 
Seelenleiden in Germany and, later, by the Journal 
of Psychotherapy as a Religious Process and the 
Journal of Religion and Health in the United States. 
This last is the organ of the Academy of Religion 
and Mental Health, the principal professional or- 
ganization of religiously oriented psychiatrists and 
psychiatrically oriented ministers. In the United 
States Pastoral Psychology and in Belgium Lumen 
vitae, which also publishes in the general area of 
the psychology of religion, are more specifically 
directed to clergymen, 

Titles of a few of the movement's more signifi- 
cant books may illustrate some types of concern. 
Among the protagonists’ texts is Carl Gustav Jung’s 
Modern Man in Search of a Soul (1922-1931), 
which deals with religious symbols in mental proc- 
esses, in general, and in psychotherapy, in partic- 
ular, Less widely known are the writings of Oskar 
Pfister, such as his Religionswissenschaft und 
Psychoanalyse (1927). Pfister, a Swiss Lutheran 
pastor, corresponded with Freud about psycho- 
analysis and Christianity. While psychoanalysis 
was under attack by some authorities of the Roman 
Catholic church, Roland Dalbiez in France wrote 
Psychoanalytical Method and the Doctrine of Freud 
(1936), discussing the possibility of a reconcilia- 
tion with psychoanalysis and Freudian thought. 
This reconciliation was actively pursued by Agostino 
Gemelli in Italy and is reflected in his Psychoanal- 
ysis Today (1953). Religio—psychiatric conferences 
under the leadership of Wilhelm Bitter, which be- 
gan in Germany during the 1920s, are reported in 
Psychotherapie und Seelsorge (1952). The Angli- 
can prelate Leslie D, Weatherhead, in Psychology 
in Service of the Soul (1930), sees counseling con- 
tributing to a fuller religious life. Boisen’s The Ex- 
ploration of the Inner World: A Study of Mental 
Disorder and Religious Experience (1936) provided 
an early impetus to the clinical pastoral movement 
in the United States. Religious existentialism has 
been a road into the movement for some psychia- 
trists. Viktor E. Frankl, in The Doctor and the Soul 
(1946), describes some limits to a psychotherapy 
which, in not aiming at a reconstruction of values, 
does not deal with a holistic conception of man. 

Among the antagonists’ writings is Thomas H. 
Hughes's “Freudianism and Religion” (1934), which 
describes Freudianism as a menace to Christianity. 
Rudolph Allers, an American Roman Catholic pro- 
fessor of philosophy, in The Successful Error 
(1940), examines psychoanalytic thought from a 
Thomistic point of view. Ernest Jones, the English 
psychoanalyst, in his Essays in Applied Psycho- 
analysis (1923), implicitly warns psychiatrists 
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away from the religio—psychiatric movement by 
arguing that religion represents a crude solution of 
the Oedipus complex. David Forsyth, in his Psy- 
chology and Religion (1935), calls upon psycho- 
therapists, among others, to cease dissipating their 
energies in imaginary pursuits and accept the dis- 
cipline of science. 

Ramifications. Religio-psychiatry has implica- 
tions for counseling, in that it coordinates two 
counseling traditions. The movement also has im- 
plications beyond the counseling situation. It serves 
the religious and psychiatric institutions involved, 
as a way of controlling the extent of normative 
deviance on the part of their leaderships. Ministers 
and psychiatrists who might break more completely 
with their institutidns may find a middle ground. 
While the movement is simultaneously a variant 
of both traditional religion and traditional psychia- 


-try, it still remains within both institutions. 


For the broader society, the movement serves as 
a bridge between religion and science. The counsel- 
ing situation provides a meeting place, an institu- 
tional boundary, across which religious and scien- 
tific conceptions having wider ramifications may 
be exchanged. In the past, the movement has also 
served to integrate the religious groups of a plural- 
istic society. Protestants, Roman Catholics, and 
Jews have been able to meet and share thoughts 
and problems on grounds which are doctrinally 
relatively neutral. 
SAMUEL Z. KLAUSNER 


[See COUNSELING PSYCHOLOGY; MENTAL DISORDERS, 
TREATMENT OF, especially the article on PSYCHO- 
LOGICAL TREATMENT; PSYCHOLOGY, article on EXIS- 
TENTIAL PSYCHOLOGY; RELIGION, especially the 
article on PSYCHOLOGICAL STUDY; and the biog- 
raphy of JunG.} 
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